The MultiPDF Chat App is a Python application that allows you to upload multiple PDF documents and chat with the data. You can ask questions about the PDFs using natural language, and the app will provide relevant responses based on the content of the documents.
- Upload multiple PDF documents
- Chat with the content of the PDFs
- Ask natural language questions
- Receive responses grounded in the PDF content
- PDF Loading: The app reads multiple PDF documents and extracts their text content.
- Text Chunking: The extracted text is divided into smaller chunks for efficient processing.
- Embeddings: An embedding model (OpenAI Embeddings) generates vector representations of the text chunks.
- Similarity Matching: The app compares your question with text chunks and identifies the most semantically similar ones.
- Response Generation: The selected chunks are passed to the language model, which generates a response based on the PDF content.
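The chunking and similarity-matching steps above can be illustrated with a minimal, dependency-free sketch. It is not the app's actual code: the function names (`chunk_text`, `embed`, `cosine`, `top_chunks`) and the default sizes are hypothetical, and a toy bag-of-words vector stands in for OpenAI Embeddings and the FAISS index so the example runs without an API key.

```python
import math
import re
from collections import Counter


def chunk_text(text, chunk_size=200, overlap=50):
    """Split text into overlapping character windows (sizes are illustrative)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]


def embed(text):
    """Toy embedding: word counts. A stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_chunks(question, chunks, k=2):
    """Return the k chunks most similar to the question."""
    q_vec = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q_vec, embed(c)), reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    chunks = [
        "The invoice total is 420 dollars.",
        "Cats sleep most of the day.",
        "Payment is due within 30 days of the invoice date.",
    ]
    print(top_chunks("When is the invoice payment due?", chunks, k=1))
```

In the real pipeline, the selected chunks would then be passed to the language model together with the question. The overlap between chunks is a common way to avoid cutting a sentence's context at a chunk boundary.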
- Python
- Streamlit
- LangChain
- OpenAI Embeddings
- FAISS
- PyPDF2
Clone this repository:
git clone https://github.com/avr-varshan/multipdf-chat-app.git