-
Notifications
You must be signed in to change notification settings - Fork 78
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make the Dataset equality inequality messages better #68
Comments
@carlsverre @nvander1 @gorros - Can you please take a look and provide thoughts on the best error message we can provide users for DataFrame inequality comparisons? Thanks! |
See here for the utest output that doesn't have all the info warnings: #64 |
I agree. |
I like this but I would add spaces between the values:
Also consider outputting strings wrapped with |
@gorros @carlsverre - Here's a PR to migrate spark-fast-tests back to Scalatest (it's currently using utest): #69 I think it'll be easier to develop the optimal Scalatest output if this repo is actually using Scalatest ;) Let me know your thoughts! |
Here's the current DataFrame comparison message: Here's the new message (added in this PR): @carlsverre @gorros @snithish - can you please take a look and let me know if this output looks better / you have any suggestions. Some specific points to note:
|
Good catch - ScalaTest is almost certainly running trim on the string before printing it which will remove all leading/trailing whitespace. I guess a null byte is also considered part of that... If you can get blue to work I think that's better - dark grey can be set to be very similar to the shell bg color in some colorschemes. This looks good to me though - love the new format! |
@MrPowers : Re: brackets around the values, I would recommend keeping those, as it helps to avoid subtle issues around spaces and tabs and such that can affect inequality but be hidden without such delimiters, e.g.
I agree the alignment is definitely a plus. I agree with @carlsverre that blue would be better than dark grey in terms of colors for equality. |
I would love to see something that shows what column values are different. This is especially important for larger data frames that may have 50 columns. |
Here's the current content inequality message:
I think it'd be better to align this output. It'd also be better to put "Actual Content | Expected Content" on a newline.
It'd be really nice to suppress all the info warnings, but not sure if that's possible with Scalatest.
Should we get rid of the square brackets for each row of data too?
The text was updated successfully, but these errors were encountered: