OpenAI GPT-3.5-turbo Streaming API with FastAPI

This project demonstrates how to create a real-time conversational AI by streaming responses from OpenAI's GPT-3.5-turbo model. It uses FastAPI to create a web server that accepts user inputs and streams generated responses back to the user.

Running the Project

Clone the repository.
Install Python (Python 3.7+ is recommended).
Update the value of OpenAI API key in .env.example and rename file into .env
Install necessary libraries. This project uses FastAPI, uvicorn, LangChain, among others. You can install them with pip: pip install fastapi uvicorn langchain.
Add your OpenAI API key to the .env file.
Start the FastAPI server by running uvicorn main:app --port 7000 in the terminal.
Access the application by opening your web browser and navigating to localhost:7000.

In the index.html file, you need to update the http into 127.0.0.1:7000 instead of localhost:7000 -> CORS problem

Serving the html file

python3 -m http.server 8888

Access the web URL via: http://127.0.0.1:8888

Note: Ensure the appropriate CORS settings if you're not serving the frontend and the API from the same origin.

Web

Define the async function sendMessage()
reader.read().then(function processResult(result) { ... }): This line initiates the reading of data from a stream using the reader.read() method. The then() method is used to handle the asynchronous result of the read() operation. The processResult function is the callback function that will be executed when the read() operation completes.
if (result.done) return;: This checks if the stream has finished reading all the data, indicated by the result.done property being true. If the stream has finished reading, the function simply returns, effectively terminating the current iteration of the loop.
The recursive call ensures that the function continues to read and process data from the stream until it is completely read.

Project Overview

The project uses an HTML interface for user input. The user's input is sent to a FastAPI server, which forwards it to the GPT-3.5-turbo model. The generated response is streamed back to the user, simulating a real-time conversation.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
index.html		index.html
main.py		main.py
test.py		test.py
test_stream.py		test_stream.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenAI GPT-3.5-turbo Streaming API with FastAPI

Running the Project

Web

Project Overview

About

Releases

Packages

Languages

leviethung2103/langchain-fastapi-streaming

Folders and files

Latest commit

History

Repository files navigation

OpenAI GPT-3.5-turbo Streaming API with FastAPI

Running the Project

Web

Project Overview

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages