-
Notifications
You must be signed in to change notification settings - Fork 6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Datasets] [Docs] Update map_batches
documentation
#28435
[Datasets] [Docs] Update map_batches
documentation
#28435
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice, big improvement!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a massive improvement to this docstring! Only small nits left, so this looks good to merge after the default batch PR is merged and this is rebased.
Call this function to iterate over batches of data. | ||
|
||
:meth:`~Dataset.default_batch_format` | ||
Call this function to determine the default batch type. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: to learn? User cannot choose default type.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think "determine" is okay in this context.
This is one of the Google definitions:
ascertain or establish exactly, typically as a result of research or calculation.
"the point of our study was to determine what is true, not what is practicable"
- Users aren't aware of BatchMapper - Users can't figure out how to write UDFs, and there's no link to the relevant user guide - Users get confused by the examples Signed-off-by: Weichen Xu <weichen.xu@databricks.com>
Signed-off-by: Balaji Veeramani balaji@anyscale.com
Depends on:
Dataset.default_batch_format
#28434Why are these changes needed?
BatchMapper
Related issue number
Checks
git commit -s
) in this PR.scripts/format.sh
to lint the changes in this PR.