Skip to content

llama: Add attention and final logit soft-capping, update scaling fac… #800

llama: Add attention and final logit soft-capping, update scaling fac…

llama: Add attention and final logit soft-capping, update scaling fac… #800