Skip to content

[mistral] Support passing head_dim through config (and do not require head_dim * num_heads == hidden_size)#32050

Merged
xenova merged 4 commits intomainfrom mistral-head_dimJul 18, 2024

Commits

Commits on Jul 17, 2024

Commits on Jul 18, 2024