
Support Mixtral #259

Closed
pseudotensor opened this issue Dec 14, 2023 · 5 comments



pseudotensor commented Dec 14, 2023

Related?
https://huggingface.co/casperhansen/mixtral-instruct-awq-it1/tree/main
9c3dfa0

Also see:
https://huggingface.co/ybelkada/Mixtral-8x7B-Instruct-v0.1-AWQ
huggingface/transformers#27950

Right now I get:

  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/awq/models/auto.py", line 50, in from_quantized
    model_type = check_and_get_model_type(quant_path, trust_remote_code)
  File "/home/jon/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/awq/models/auto.py", line 25, in check_and_get_model_type
    raise TypeError(f"{config.model_type} isn't supported yet.")
TypeError: mixtral isn't supported yet.

Or is this something that has to be done only in transformers? Sorry, I get confused about what lives in transformers vs. here.
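For context, the error in the traceback above comes from a registry lookup: AutoAWQ keeps a map of supported `config.model_type` values and raises for anything else. The following is a minimal, hypothetical sketch of that check; the set contents and function name are illustrative, not AutoAWQ's actual code.

```python
# Illustrative stand-in for the model-type check in awq/models/auto.py.
# The real code reads config.model_type via transformers' AutoConfig and
# looks it up in a model-class registry; "mixtral" was absent at the time.
SUPPORTED_MODEL_TYPES = {"llama", "mistral", "opt", "falcon"}  # illustrative subset

def check_model_type(model_type: str) -> str:
    """Return the model type if supported, else raise like AutoAWQ does."""
    if model_type not in SUPPORTED_MODEL_TYPES:
        raise TypeError(f"{model_type} isn't supported yet.")
    return model_type
```

With `"mixtral"` missing from the registry, the lookup raises `TypeError: mixtral isn't supported yet.`, which matches the traceback.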

@casper-hansen
Owner

Hi @pseudotensor, support is coming! #251 Before the model can be supported, we are figuring out an effective scaling of its layers.

@pseudotensor
Author

Great, looking forward to it. I love the AWQ stuff; it always works better than llama.cpp for me, especially for heavy use in vLLM.

@casper-hansen
Owner

That’s why I keep working on it! Admittedly, this is a very time-consuming task:

  1. Make tweak to code
  2. Quantize model (wait 40 minutes)
  3. Measure perplexity
  4. Repeat
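One iteration of the loop above can be sketched with AutoAWQ's public API. The model paths are placeholders and the `quant_config` mirrors AutoAWQ's documented 4-bit GEMM settings; the heavy call is left commented out since quantizing Mixtral takes roughly 40 minutes and a large GPU.

```python
# Typical AutoAWQ 4-bit GEMM settings, per the project's README examples.
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

def quantize_once(model_path: str, quant_path: str) -> None:
    """One 'quantize model' step of the loop: load fp16 weights, run AWQ
    calibration, and write the 4-bit checkpoint to quant_path."""
    from awq import AutoAWQForCausalLM
    from transformers import AutoTokenizer

    model = AutoAWQForCausalLM.from_pretrained(model_path)
    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
    model.quantize(tokenizer, quant_config=quant_config)
    model.save_quantized(quant_path)
    tokenizer.save_pretrained(quant_path)

# Usage (heavy step; paths are placeholders):
# quantize_once("mistralai/Mixtral-8x7B-Instruct-v0.1", "mixtral-instruct-awq")
# ...then measure perplexity on the saved checkpoint, tweak, and repeat.
```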

@casper-hansen
Owner

Mixtral support is on main now.

@pseudotensor
Author

pseudotensor commented Feb 14, 2024

Howdy @casper-hansen. Curious what you did in your AWQ process for Mixtral. Do you have your steps documented/coded? I'm asking about https://huggingface.co/casperhansen/mixtral-instruct-awq.

I ask because we checked out many AWQ Mixtrals, e.g.

  • TheBloke/dolphin-2.7-mixtral-8x7b-AWQ : Bad repetition
  • TheBloke/Mixtral-8x7B-Instruct-v0.1-AWQ : Doesn't even start to generate, as others have complained
  • ybelkada/Mixtral-8x7B-Instruct-v0.1-AWQ : Bad repetition
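A quick way to reproduce a repetition check like the ones above is to load a checkpoint in vLLM with AWQ quantization enabled. This is a sketch under assumptions: the prompt is illustrative, and the generation call is commented out because it downloads the model and needs a large GPU.

```python
# Smoke-test sketch for an AWQ Mixtral checkpoint in vLLM.
MODEL = "casperhansen/mixtral-instruct-awq"

def smoke_test(prompt: str) -> str:
    """Generate once from the AWQ checkpoint and return the text,
    so repetition or empty output is easy to eyeball."""
    from vllm import LLM, SamplingParams

    llm = LLM(model=MODEL, quantization="awq", dtype="float16")
    params = SamplingParams(temperature=0.7, max_tokens=128)
    outputs = llm.generate([prompt], params)
    return outputs[0].outputs[0].text

# Usage (heavy step; downloads the checkpoint):
# print(smoke_test("[INST] Write one sentence about quantization. [/INST]"))
```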
