downcast before saving batch #700

Merged
merged 1 commit into from
Jun 29, 2023

Conversation

Ben-Epstein
Contributor

We've done this in a few places before uploading to Galileo, but we still write the batches to disk without downcasting. This can really add up (f64 embeddings are huge).

Always downcast safely before writing to disk.
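
For readers skimming the thread, here is a rough sketch of the idea, not the code in this PR. It assumes a batch is a plain dict of NumPy arrays; `downcast_batch` and the `"emb"`/`"id"` keys are hypothetical names used only for illustration. Float64 arrays are stored as float32 only when every finite value fits in the float32 range, and everything else is left untouched.

```python
import numpy as np


def downcast_batch(data: dict) -> dict:
    """Return a shallow copy of `data` with float64 arrays stored as float32.

    Hypothetical helper for illustration; not the dataquality implementation.
    """
    f32_max = np.finfo(np.float32).max
    out = {}
    for key, value in data.items():
        if isinstance(value, np.ndarray) and value.dtype == np.float64:
            # Only downcast when all finite values fit in the float32 range,
            # so large magnitudes are not silently turned into inf.
            finite = value[np.isfinite(value)]
            if finite.size == 0 or np.abs(finite).max() <= f32_max:
                value = value.astype(np.float32)
        out[key] = value
    return out


rng = np.random.default_rng(0)
batch = {"emb": rng.random((1000, 768)), "id": np.arange(1000)}
small = downcast_batch(batch)

# The embedding array takes half the bytes after the downcast.
print(batch["emb"].nbytes, small["emb"].nbytes)
```

The range check is what "safely" is taken to mean in this sketch; precision beyond roughly seven significant digits is still lost, which is usually acceptable for embeddings.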

@Ben-Epstein Ben-Epstein requested a review from a team as a code owner June 29, 2023 16:19
@Ben-Epstein Ben-Epstein self-assigned this Jun 29, 2023
@Ben-Epstein Ben-Epstein requested a review from dcaustin33 as a code owner June 29, 2023 16:19
Member

@setu4993 setu4993 left a comment

💯

@codecov-commenter

codecov-commenter commented Jun 29, 2023

Codecov Report

Merging #700 (e2b84b9) into main (deff792) will decrease coverage by 0.04%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main     #700      +/-   ##
==========================================
- Coverage   89.50%   89.47%   -0.04%     
==========================================
  Files         166      166              
  Lines       13241    13250       +9     
==========================================
+ Hits        11851    11855       +4     
- Misses       1390     1395       +5     
| Impacted Files | Coverage Δ |
|---|---|
| ...aquality/loggers/model_logger/base_model_logger.py | 91.07% <100.00%> (+0.78%) ⬆️ |

... and 3 files with indirect coverage changes

Contributor

@elboy3 elboy3 left a comment

👨‍🍳 💋

@Ben-Epstein Ben-Epstein merged commit ed5d29e into main Jun 29, 2023
@Ben-Epstein Ben-Epstein deleted the downcast-data-dict branch June 29, 2023 16:29
bogdan-galileo pushed a commit that referenced this pull request Jul 10, 2023
4 participants