Insurance Documents RAG QA Chatbot

Effortless Question Answering on Insurance Documents Powered by LlamaIndex and OpenAI Models

✨ About the Project

The Insurance Documents RAG QA Chatbot is a smart solution for answering queries related to insurance documents. It simplifies the interpretation of dense and complex policies by integrating retrieval and generation techniques to provide accurate and contextual answers in real time.

🔍 Key Features

🌟 Precise Responses: Employs a Retrieval-Augmented Generation (RAG) pipeline for highly relevant answers.
⚡ Efficient Embedding Management: Utilizes ChromaDB for scalable and fast embedding storage and retrieval.
🧠 AI-Driven Answers: Combines LlamaIndex retrieval with OpenAI GPT-4o-mini/GPT-4o for user-centric response generation.
🛠️ Caching Layers:
- Embedding Cache: Prevents re-embedding of identical documents for optimal performance.
- Query Cache: Skips redundant searching for previously answered queries.
📄 Dynamic Document Processing: Splits documents into smaller chunks for efficient query-based retrieval.
🤖 Interactive User Experience: Seamlessly retrieves relevant information and generates concise answers.

🛠️ Tech Stack

Language: Python
Frameworks/Libraries: LlamaIndex, ChromaDB, DiskCache
APIs/Models:
- OpenAI's Embedding Model for vector creation
- LlamaIndex for document ingestion and querying
- GPT-4o-mini/GPT-4o for final response generation

🧪 Example Use Cases

"What is covered under this insurance policy?"
"What is the claim settlement process for my policy?"

📸 Sample Output

1. Sample Code Output

2. Sample Code Output Cache Response

🚀 Getting Started

Prerequisites

Ensure you have the following installed:

Python 3.8+
Docker (optional, for containerized deployment)

Installation

Clone the repo: git clone https://github.com/SandeepGitGuy/Insurance_Documents_QA_Chatbot_RAG_LlamaIndex.git
Navigate to the project directory: cd Insurance_Documents_QA_Chatbot_RAG_LlamaIndex
Install the required dependencies: pip install -r requirements.txt

Please note: OpenAI API keys are required for the project to function.

Run the main file from Jupyter environment: "Insurance_Doc_llamaindex_RAG.ipynb"

🛠️ Challenges/Issues Addressed

Optimized PDF Parsing: Enhanced extraction and chunking using Llamaindex's Ingestion pipeline for seamless data ingestion.
Embedding Efficiency: Added an embedding cache to reduce redundant embeddings in ChromaDB.
Query Optimization: Integrated a query cache to prevent duplicate searches and improve response times.
Enhanced Passage Ranking: Introduced a reranker for better relevance in retrieved sections.
Dynamic Embedding Updates: Leveraged OpenAI's embedding model for high-quality vector representation.

🚀 Future Scope

Extend support for multi-lingual document processing and querying.
Add compatibility for non-PDF formats like Word or Excel files.
Integrate additional generative models like Claude AI for diverse response generation.

📖 Documentation

No documentation will be made available for this project since this project only uses technologies that already have their own documentation. Please refer to the following links for more information:

🛡️ Conclusion

The Insurance Documents RAG QA Chatbot bridges the gap between users and complex insurance policies by delivering precise, contextual responses. With robust caching, efficient embeddings, and intelligent document querying, it ensures a seamless experience for policyholders and professionals alike.

🛡️ License

Distributed under the MIT License. See LICENSE for more information.

💬 Contact

For any queries or feedback, reach out via:

Email: sandy974278@gmail.com
GitHub: https://github.com/SandeepGitGuy
LinkedIn: www.linkedin.com/in/sandeepgowda24a319192

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Dataset_folder		Dataset_folder
Sample Code output Screenshots		Sample Code output Screenshots
cache		cache
chroma.db		chroma.db
CHANGELOG.md		CHANGELOG.md
Insurance_Doc_llamaindex_RAG.ipynb		Insurance_Doc_llamaindex_RAG.ipynb
Insurance_Documents_AQ_Chatbot_RAG_LlamaIndex.pdf		Insurance_Documents_AQ_Chatbot_RAG_LlamaIndex.pdf
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Insurance Documents RAG QA Chatbot

✨ About the Project

🔍 Key Features

🛠️ Tech Stack

🧪 Example Use Cases

📸 Sample Output

1. Sample Code Output

2. Sample Code Output Cache Response

🚀 Getting Started

Prerequisites

Installation

🛠️ Challenges/Issues Addressed

🚀 Future Scope

📖 Documentation

🛡️ Conclusion

🛡️ License

💬 Contact

About

Languages

License

SandeepGitGuy/Insurance_Documents_QA_Chatbot_RAG_LlamaIndex

Folders and files

Latest commit

History

Repository files navigation

Insurance Documents RAG QA Chatbot

✨ About the Project

🔍 Key Features

🛠️ Tech Stack

🧪 Example Use Cases

📸 Sample Output

1. Sample Code Output

2. Sample Code Output Cache Response

🚀 Getting Started

Prerequisites

Installation

🛠️ Challenges/Issues Addressed

🚀 Future Scope

📖 Documentation

🛡️ Conclusion

🛡️ License

💬 Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Languages