FEAT: add llava to autoawq #250
Conversation
This PR is in a draft state; I will ping you once it is ready. |
The PR is now ready for review! |
Looking forward to LLaVa being added. Can we also run inference in AutoAWQ after this PR? |
@casper-hansen I will try it out and let you know |
I can confirm this script worked fine for me:

```python
import requests
import torch
from PIL import Image

from awq import AutoAWQForCausalLM
from transformers import AutoProcessor

quant_path = "ybelkada/llava-1.5-7b-hf-awq"

# Load model
model = AutoAWQForCausalLM.from_quantized(quant_path, safetensors=True, device_map={"": 0})
processor = AutoProcessor.from_pretrained(quant_path)

prompt = "USER: <image>\nWhat are these?\nASSISTANT:"
image_file = "http://images.cocodataset.org/val2017/000000039769.jpg"
raw_image = Image.open(requests.get(image_file, stream=True).raw)
inputs = processor(prompt, raw_image, return_tensors='pt').to(0, torch.float16)

# Generate output
generation_output = model.generate(
    **inputs,
    max_new_tokens=512
)

print(processor.decode(generation_output[0], skip_special_tokens=True))
```

Let me know if I should modify anything else |
I tried running the LLaVa quantization and generation examples. They both work, but there is one problem with the quantization example: it seems we are not saving preprocessor_config.json, so if you run the quant example and then the generation example afterwards, it fails because that config file is missing. It seems that

EDIT: Seems this could work.

EDIT 2: Fixed this! |
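For reference, a minimal sketch of the kind of fix described above, assuming the usual AutoAWQ quantization flow; the checkpoint names and quant_config values are illustrative, not taken from this PR:

```python
# Hypothetical sketch: persist the processor next to the quantized model so
# preprocessor_config.json is available when the generation example loads it.
from awq import AutoAWQForCausalLM
from transformers import AutoProcessor

model_path = "llava-hf/llava-1.5-7b-hf"  # assumed base checkpoint
quant_path = "llava-1.5-7b-hf-awq"       # assumed output directory
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

model = AutoAWQForCausalLM.from_pretrained(model_path)
processor = AutoProcessor.from_pretrained(model_path)

# Quantize with the tokenizer that ships inside the LLaVa processor
model.quantize(processor.tokenizer, quant_config=quant_config)

# Saving both writes the model weights *and* preprocessor_config.json,
# so AutoProcessor.from_pretrained(quant_path) works afterwards.
model.save_quantized(quant_path)
processor.save_pretrained(quant_path)
```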
LLaVa is a new and exciting multi-modal architecture that has recently been integrated into HF transformers.
This PR adds LLaVa support to AutoAWQ.
With huggingface/transformers#27950 you can load the converted LLaVa weights in 4-bit:
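A minimal loading sketch, assuming the transformers AWQ integration; it reuses the quantized checkpoint name from the conversation above:

```python
import torch
from transformers import AutoProcessor, LlavaForConditionalGeneration

quant_path = "ybelkada/llava-1.5-7b-hf-awq"

# The AWQ checkpoint carries its quantization_config, so a plain
# from_pretrained call is enough to load the 4-bit weights.
model = LlavaForConditionalGeneration.from_pretrained(
    quant_path,
    torch_dtype=torch.float16,
    device_map="cuda:0",
)
processor = AutoProcessor.from_pretrained(quant_path)
```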
cc @casper-hansen