[mistral] Support passing head_dim through config (and do not require head_dim * num_heads == hidden_size) #32050
Merged
xenova merged 4 commits into main from mistral-head_dim on Jul 18, 2024
+6 -7
Commits
Commits on Jul 17, 2024
Commits on Jul 18, 2024
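The change named in the title can be illustrated with a minimal sketch. It is not the code from this PR: `SketchConfig` and `projection_shapes` are hypothetical names, and the fallback to `hidden_size // num_attention_heads` when `head_dim` is omitted is an assumption about the intended behavior. The sketch shows how the attention projection shapes follow from `head_dim` once it is no longer tied to `hidden_size`.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass
class SketchConfig:
    """Hypothetical stand-in for a model config; not the library's class."""

    hidden_size: int = 4096
    num_attention_heads: int = 32
    num_key_value_heads: int = 8
    head_dim: Optional[int] = None

    def __post_init__(self) -> None:
        # Assumed fallback: use the conventional value only when
        # head_dim is not passed explicitly.
        if self.head_dim is None:
            self.head_dim = self.hidden_size // self.num_attention_heads


def projection_shapes(cfg: SketchConfig) -> Dict[str, Tuple[int, int]]:
    """Return (in_features, out_features) for each attention projection."""
    q_out = cfg.num_attention_heads * cfg.head_dim
    kv_out = cfg.num_key_value_heads * cfg.head_dim
    return {
        "q_proj": (cfg.hidden_size, q_out),
        "k_proj": (cfg.hidden_size, kv_out),
        "v_proj": (cfg.hidden_size, kv_out),
        # o_proj maps the concatenated heads back to hidden_size, so its
        # input width is num_heads * head_dim rather than hidden_size.
        "o_proj": (q_out, cfg.hidden_size),
    }


if __name__ == "__main__":
    # Illustrative values where head_dim * num_heads != hidden_size.
    cfg = SketchConfig(hidden_size=5120, num_attention_heads=32, head_dim=128)
    for name, shape in projection_shapes(cfg).items():
        print(name, shape)
```

With `head_dim` decoupled, the query projection's output width (`num_attention_heads * head_dim`) can differ from `hidden_size`, so any reshape of the attention output has to use that product instead of `hidden_size`.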