-
Notifications
You must be signed in to change notification settings - Fork 21
Issues: ServiceNow/Fast-LLM
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Implement .generate (greedy decoding only)
enhancement
New feature or request
#217
opened Apr 1, 2025 by
tscholak
4 tasks done
Reset Number of Steps in WandB to the Latest Saved Checkpoint or Implement Distinguishable Experiment Run Logging in WandB
enhancement
New feature or request
need update
#215
opened Mar 31, 2025 by
bigximik
4 tasks
End-to-end knowledge distillation
enhancement
New feature or request
need update
#214
opened Mar 30, 2025 by
tscholak
2 of 4 tasks
Move Config Validations (e.g., Dataset Usage vs. Definitions) to New feature or request
need update
_validate
for Dry Run Checks
enhancement
#213
opened Mar 28, 2025 by
bigximik
1 of 4 tasks
Frozen reference model support for DPO, distillation, etc.
enhancement
New feature or request
#212
opened Mar 27, 2025 by
tscholak
3 of 4 tasks
Direct Preference Optimization (DPO) support
enhancement
New feature or request
#209
opened Mar 26, 2025 by
tscholak
2 of 4 tasks
Prototype masked diffusion modeling support
enhancement
New feature or request
#208
opened Mar 26, 2025 by
tscholak
2 of 4 tasks
Add converter for Hugging Face peft (LoRA)
enhancement
New feature or request
#204
opened Mar 25, 2025 by
jlamypoirier
1 of 4 tasks
Discussion about dataset preparation speed
enhancement
New feature or request
need update
#202
opened Mar 25, 2025 by
bigximik
Support easy concatenation of datasets
enhancement
New feature or request
need update
#201
opened Mar 24, 2025 by
tscholak
1 of 4 tasks
Online dataset mixing based on validation metrics
enhancement
New feature or request
need update
#200
opened Mar 24, 2025 by
bigximik
1 of 4 tasks
Multi-Dataset Validation with Generative Benchmarks
enhancement
New feature or request
#199
opened Mar 24, 2025 by
bigximik
1 of 4 tasks
Nemotron-H support
enhancement
New feature or request
need update
#198
opened Mar 23, 2025 by
tscholak
2 of 4 tasks
Llamba support
enhancement
New feature or request
need update
#197
opened Mar 23, 2025 by
tscholak
3 of 4 tasks
[bug] 16 unit tests fail on main with custom install
bug
Something isn't working
#196
opened Mar 20, 2025 by
bigximik
Option to avoid truncations while packing
enhancement
New feature or request
#192
opened Mar 19, 2025 by
sohamparikh
4 tasks
Per-example & Phatgoose routing
enhancement
New feature or request
#181
opened Mar 10, 2025 by
oleksost
3 of 4 tasks
[bug] Failing to run training with concatenation of file datasets
bug
Something isn't working
#176
opened Mar 7, 2025 by
oleksost
Add end-to-end test for dataset preparation
enhancement
New feature or request
#175
opened Mar 7, 2025 by
jlamypoirier
4 tasks
Make the model config override the pretrained config
enhancement
New feature or request
#170
opened Mar 6, 2025 by
jlamypoirier
4 tasks done
Missing configuration when converting from HF model config json
bug
Something isn't working
#166
opened Feb 27, 2025 by
tscholak
Expert Parallelism for MoEs
enhancement
New feature or request
need update
#165
opened Feb 27, 2025 by
tscholak
4 tasks
Add features for Qwen2_moe support
enhancement
New feature or request
#164
opened Feb 27, 2025 by
bigximik
4 tasks
Muon Optimizer Integration
enhancement
New feature or request
need update
#159
opened Feb 24, 2025 by
tscholak
4 tasks
Option to vary configuration parameters across layers
enhancement
New feature or request
#155
opened Feb 19, 2025 by
jlamypoirier
4 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2025-03-29.