[SPARK-15480][UI][Streaming]show missed InputInfo in streaming UI #13259
Test build #59124 has finished for PR 13259 at commit
cc @zsxwing
inputInfoMissedTimes.foreach (time => {
  val streamIdToInputInfos = inputInfoTracker.getInfo(time)
  val fakeJobSet = JobSet(time, Seq(), streamIdToInputInfos)
  listenerBus.post(StreamingListenerBatchCompleted(fakeJobSet.toBatchInfo))
Will posting a fake StreamingListenerBatchCompleted change the semantics? If users' code relies on this event, will it break their assumptions?
Good point, it would be a breaking change in that case. Given that some information is indeed missing now, we need to either send more events or add additional fields to the current event. The latter might be better and more correct semantically, although it can still break current implementations of users' custom listeners. I also noticed that the listener interface is annotated as @DeveloperApi, so it's expected that the API might change. What do you think?
I suggest we change |
@zsxwing Thank you for the suggestion, but I have two concerns:
|
How about adding a field |
1. revert previous change
2. add batchTimesWithNoJob Set to record the batches with no job
3. add aggregate method of InputInfo
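Steps 2 and 3 can be pictured with a small plain-Scala sketch. It uses simplified stand-ins, not Spark's actual classes, and is not taken from the patch: `SimpleInputInfo`, `NoJobTracker`, `aggregate`, `report`, `markNoJob`, and `collectFor` are hypothetical names chosen for illustration; only `batchTimesWithNoJob` appears in the commit message.

```scala
import scala.collection.mutable

// Simplified stand-in for a per-stream input record count (hypothetical, not Spark's class).
final case class SimpleInputInfo(streamId: Int, numRecords: Long)

object SimpleInputInfo {
  // Merge several per-batch infos into one total per input stream.
  def aggregate(infos: Seq[SimpleInputInfo]): Map[Int, SimpleInputInfo] =
    infos.groupBy(_.streamId).map { case (id, group) =>
      id -> SimpleInputInfo(id, group.map(_.numRecords).sum)
    }
}

// Remembers batches that produced no job so that their input can be
// folded into the next batch that does run one.
final class NoJobTracker {
  private val batchTimesWithNoJob = mutable.Set.empty[Long]
  private val infoByTime = mutable.Map.empty[Long, Seq[SimpleInputInfo]]

  // Record input received for a batch time.
  def report(time: Long, info: SimpleInputInfo): Unit =
    infoByTime(time) = infoByTime.getOrElse(time, Seq.empty) :+ info

  // Note that the batch at `time` generated no job.
  def markNoJob(time: Long): Unit = batchTimesWithNoJob += time

  // Input for `time` plus every earlier job-less batch; clears what it returns.
  def collectFor(time: Long): Map[Int, SimpleInputInfo] = {
    val times = batchTimesWithNoJob.filter(_ <= time).toSet + time
    batchTimesWithNoJob --= times
    val infos = times.toSeq.flatMap(t => infoByTime.remove(t).getOrElse(Seq.empty))
    SimpleInputInfo.aggregate(infos)
  }
}
```

In this sketch, a batch that runs a job would call `collectFor` with its own time and report the aggregated counts, so input that arrived during job-less batches is not lost.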
@zsxwing Thanks for your suggestion. I have re-implemented this PR accordingly. But instead of maintaining state I ... According to my manual test, the new code works. Could you help review it again?
Test build #59730 has finished for PR 13259 at commit
@mwws Sorry for the delay. Could you add a unit test, please?
@zsxwing Yes sir, a unit test has been added.
Test build #60095 has finished for PR 13259 at commit
@zsxwing Could you take a look at this PR? It would be good to merge this.
@zsxwing This patch has been pending for a long time. Could you help review it?
This PR proposes to close stale PRs. Currently, we have 400+ open PRs, and some of them are stale PRs whose JIRA tickets have already been closed or whose JIRA tickets do not exist (also, they do not seem to be minor issues).

// Open PRs whose JIRA tickets have already been closed
Closes apache#11785 Closes apache#13027 Closes apache#13614 Closes apache#13761 Closes apache#15197 Closes apache#14006 Closes apache#12576 Closes apache#15447 Closes apache#13259 Closes apache#15616 Closes apache#14473 Closes apache#16638 Closes apache#16146 Closes apache#17269 Closes apache#17313 Closes apache#17418 Closes apache#17485 Closes apache#17551 Closes apache#17463 Closes apache#17625

// Open PRs whose JIRA tickets do not exist and which are not minor issues
Closes apache#10739 Closes apache#15193 Closes apache#15344 Closes apache#14804 Closes apache#16993 Closes apache#17040 Closes apache#15180 Closes apache#17238

N/A

Author: Takeshi Yamamuro <yamamuro@apache.org>

Closes apache#17734 from maropu/resolved_pr.

Change-Id: Id2e590aa7283fe5ac01424d30a40df06da6098b5
What changes were proposed in this pull request?
This fixes a bug in the Streaming UI. If the batch duration is changed by a window operation, the InputInfo for batch times that have no output operations is not shown in the streaming UI.
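To make the effect concrete, here is a toy model in plain Scala. It is not Spark code and not the author's reproduction. It assumes a 1-second batch interval, a window that slides every `slide` seconds, and the simplification that a batch runs a job only when its time is a multiple of the slide. The object name, method names, and per-batch record counts are made up for illustration.

```scala
object MissedInputInfoModel {
  // Records received in each 1-second batch as (batch time in seconds, count).
  // Made-up data: one record in each of the first six batches.
  val recordsPerBatch: Seq[(Int, Long)] =
    (1 to 10).map(t => t -> (if (t <= 6) 1L else 0L))

  // Simplifying assumption: only batch times that are multiples of the
  // slide duration have output operations to run.
  def hasJob(time: Int, slide: Int): Boolean = time % slide == 0

  // Total shown if only batches with a job are reported, each with its own input.
  def shownWithoutAggregation(slide: Int): Long =
    recordsPerBatch.collect { case (t, n) if hasJob(t, slide) => n }.sum

  // Total shown if each batch with a job also reports the input of the
  // job-less batches since the previous job.
  def shownWithAggregation(slide: Int): Long =
    recordsPerBatch.collect { case (jobTime, _) if hasJob(jobTime, slide) =>
      recordsPerBatch.collect {
        case (t, n) if t > jobTime - slide && t <= jobTime => n
      }.sum
    }.sum
}
```

With a slide of 10, `shownWithoutAggregation(10)` yields 0 and `shownWithAggregation(10)` yields 6 under this model, which mirrors the 0 versus 6 input records described in the manual test.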
Here is a simple example to reproduce:
How was this patch tested?
Manually tested.
I manually sent 6 records through Kafka. According to the output, 6 records were indeed processed,
[Screenshot: output]
but according to the web UI, the input records show 0.
[Screenshot: original]
And here is the screenshot after my change:
[Screenshot: newresult]