awq-py : remove #5768

ggerganov · 2024-02-28T14:29:47Z

I think this functionality never ended up being used, and GGUF support was at some point implemented in the OG repo, so there is no reason to keep the code around.

wilderfield · 2024-03-05T23:51:27Z

@ggerganov it looks like the functionality was referenced by convert.py?
https://github.com/ggerganov/llama.cpp/blob/652ca2bded3c818320d92c70d2b67f64bdbff5e5/convert.py#L1395-L1407

I'm a bit confused now. Does this repo support quantizing models with AWQ?

ggerganov · 2024-03-06T07:17:16Z

I just removed the leftovers from convert.py

llama.cpp does not provide AWQ quantization functionality. I believe there was work in the AWQ repos to add support for generating GGUF files directly, which in turn are compatible with llama.cpp.

wilderfield · 2024-03-06T18:17:44Z

Thanks @ggerganov. PS: I really want to know what's in your .vimrc.

ggerganov · 2024-03-06T19:36:13Z

It's a bit of a mess, but here it is: https://github.com/ggerganov/ggterm

awq-py : remove

3921ff5

ggerganov merged commit 78aacf3 into master Feb 28, 2024
24 checks passed

ggerganov deleted the gg/remove-awq branch February 28, 2024 15:36

ggerganov added a commit that referenced this pull request Mar 6, 2024

convert : remove AWQ remnants (#5768)

1e35d61

hazelnutcloud pushed a commit to hazelnutcloud/llama.cpp that referenced this pull request Mar 10, 2024

convert : remove AWQ remnants (ggml-org#5768)

1f745cd

NeoZhangJianyu pushed a commit to NeoZhangJianyu/llama.cpp that referenced this pull request Mar 12, 2024

convert : remove AWQ remnants (ggml-org#5768)

4a1d950

jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024

awq-py : remove (ggml-org#5768)

2ad8da2

jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024

convert : remove AWQ remnants (ggml-org#5768)

c441445

hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024

awq-py : remove (ggml-org#5768)

02a44b4

hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024

convert : remove AWQ remnants (ggml-org#5768)

98b7b26

hanasay mentioned this pull request Jul 10, 2024

awqint4 to gguf ,ModuleNotFoundError: No module named 'awq.apply_awq' casper-hansen/AutoAWQ#502

Open
