This project uses Retrieval-Augmented Generation (RAG) with Streamlit and a range of NLP models to provide insights from your uploaded documents. It integrates multiple document loaders and a vector store to create an interactive platform for querying document content.
Retrieval-Augmented Generation (RAG) is a method that combines retrieval-based techniques with generative models. It enhances the ability to generate accurate and contextually relevant responses by retrieving relevant information from a large corpus of documents and feeding it into a generative model.
- Document Retrieval: The system retrieves relevant chunks of text from a document database based on the user's query.
- Response Generation: The retrieved text is then used to generate a detailed and contextually accurate response using advanced language models.
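The two steps above can be sketched in plain Python. This is a toy illustration, not the project's actual pipeline: a word-overlap score stands in for real embeddings, and `build_prompt` shows the context that would be passed to a generative model.

```python
def score(query, chunk):
    """Toy relevance score: fraction of query words that appear in the chunk."""
    q, c = set(query.lower().split()), set(chunk.lower().split())
    return len(q & c) / len(q) if q else 0.0

def retrieve(query, chunks, k=2):
    """Document Retrieval: return the k chunks most relevant to the query."""
    return sorted(chunks, key=lambda ch: score(query, ch), reverse=True)[:k]

def build_prompt(query, context_chunks):
    """Response Generation: the retrieved text is supplied to a language
    model as context alongside the user's question."""
    context = "\n---\n".join(context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

chunks = [
    "Chroma stores embeddings for fast similarity search.",
    "Streamlit renders the chat interface in the browser.",
    "RAG retrieves relevant chunks before generating an answer.",
]
query = "How does RAG retrieve relevant chunks?"
prompt = build_prompt(query, retrieve(query, chunks))
```

In the real application, the scoring step is replaced by vector similarity over embeddings, and the prompt is sent to an OpenAI or Cohere model.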
- Multi-Format Document Support: Upload PDFs, CSVs, and text files.
- Efficient Text Processing: Documents are split into manageable text chunks for better processing.
- Robust Vector Store Creation: Create and update vector stores for efficient document retrieval.
- Conversational AI: Ask questions and get insights from your documents using advanced NLP models.
- Upload Documents: Use the sidebar to upload your documents.
- Process Documents: Click the "Process" button to parse and store document content.
- Ask Questions: Enter your queries in the main interface to get insights from your documents.
- Streamlit: For the interactive web interface.
- Langchain: For creating document chains and managing conversational AI.
- Chroma: For vector store creation and retrieval.
- OpenAI & Cohere: For embeddings and language models.
- Clone the Repository

  ```shell
  git clone https://github.com/pritamgouda11/Retrieval-Augmented-Generation.git
  cd Retrieval-Augmented-Generation
  ```
- Install Dependencies

  ```shell
  pip install -r requirements.txt
  ```
- Set Up Environment Variables

  Create a `.env` file in the root directory and add the following:

  ```
  PERSIST_DIRECTORY=path_to_persist_directory
  SOURCE_DIRECTORY=source_documents
  TARGET_SOURCE_CHUNKS=5
  EMBEDDINGS_MODEL_NAME=openai
  MODEL_TYPE=OpenAI
  ```
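A minimal sketch of how the application might consume these settings at startup (the variable names come from the `.env` above; the `os.getenv` fallbacks and the default `db` directory are assumptions, and the real app may load the file with python-dotenv instead):

```python
import os

# Read the settings documented in .env, with assumed defaults
# so the sketch also runs when the variables are unset.
persist_directory = os.getenv("PERSIST_DIRECTORY", "db")
source_directory = os.getenv("SOURCE_DIRECTORY", "source_documents")
target_source_chunks = int(os.getenv("TARGET_SOURCE_CHUNKS", "5"))
embeddings_model_name = os.getenv("EMBEDDINGS_MODEL_NAME", "openai")
model_type = os.getenv("MODEL_TYPE", "OpenAI")
```

Note that environment variables are always strings, so numeric settings such as `TARGET_SOURCE_CHUNKS` need an explicit conversion.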
- Run the Application

  ```shell
  streamlit run app.py
  ```
- Upload Documents
  - Supported formats: PDF, CSV, Text
  - Upload documents via the sidebar.
- Process Documents
  - Click the "Process" button to parse and vectorize the documents.
- Ask Questions
  - Use the text input box to ask questions about the uploaded documents.
  - The system will retrieve relevant information and generate insightful responses.
- Embeddings and Models: Customize the embeddings model and the conversational AI model by modifying the environment variables.
- Chunk Size and Overlap: Adjust the text chunk size and overlap parameters to optimize processing.
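The chunk size and overlap trade-off can be illustrated with a minimal character-level splitter. LangChain's text splitters are more sophisticated (they try to break on separators); this sketch only shows the core idea that consecutive chunks share `chunk_overlap` characters so content cut at a boundary survives intact in at least one chunk.

```python
def split_text(text, chunk_size=20, chunk_overlap=5):
    """Split text into fixed-size chunks where neighbouring chunks
    overlap by `chunk_overlap` characters."""
    if chunk_overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk size")
    step = chunk_size - chunk_overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

# 52 distinct-ish characters so the overlap is visible.
sample = "abcdefghijklmnopqrstuvwxyz" * 2
chunks = split_text(sample, chunk_size=20, chunk_overlap=5)
```

Larger chunks give the model more context per retrieval but make matches less precise; more overlap reduces boundary loss at the cost of storing redundant text.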
- OpenAI for their powerful language models.
- Streamlit for making data apps simple and accessible.
- Langchain and Chroma for their amazing tools and libraries.