[aot] Fix aot quantization for weight only quantization #2079

Merged

tosterberg
Contributor

Description

Fixes the quantization path used during AOT partitioning for weight-only quantization strategies, which do not require any AOT model changes.

  • If this change is a backward incompatible change, why must this change be made?
  • Interesting edge cases to note here
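
As a rough illustration of the behavior described above, the sketch below shows one way a partitioning routine might skip the model-modification step for weight-only strategies. Every name here (`WEIGHT_ONLY_STRATEGIES`, `plan_aot_partition`, the strategy string `weight_only_int8`, and the step labels) is hypothetical and chosen for illustration; none of it is taken from the djl-serving source or from this PR's diff.

```python
from typing import List, Optional

# Hypothetical strategy names; the real option values are not stated in this PR.
WEIGHT_ONLY_STRATEGIES = frozenset({"weight_only_int8"})


def plan_aot_partition(quantize: Optional[str]) -> List[str]:
    """Return the ordered steps an (assumed) AOT partitioning run would take."""
    steps: List[str] = []
    if quantize is not None and quantize not in WEIGHT_ONLY_STRATEGIES:
        # Other strategies are assumed to need the model altered ahead of time.
        steps.append("apply_aot_model_changes")
    # Weight-only strategies fall through: no AOT model changes are required.
    steps.append("compile_and_save")
    return steps


if __name__ == "__main__":
    print(plan_aot_partition("weight_only_int8"))
    print(plan_aot_partition("some_other_strategy"))
```

The point of the sketch is only the branch: a weight-only strategy goes straight to compilation, while any other strategy is assumed to need an extra modification step first.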

@tosterberg tosterberg requested review from zachgk, frankfliu and a team as code owners June 17, 2024 23:29
@sindhuvahinis sindhuvahinis merged commit 00f7412 into deepjavalibrary:master Jun 17, 2024
8 checks passed
@tosterberg tosterberg deleted the fix-neo-quant-neuron branch June 17, 2024 23:42
sindhuvahinis pushed a commit to sindhuvahinis/djl-serving that referenced this pull request Jun 17, 2024
tosterberg added a commit that referenced this pull request Jun 17, 2024
2 participants