Spaces:

MerveA
/

InsightRAG_Chatbot

Runtime error

App Files Files Community

InsightRAG_Chatbot / README.md

MerveA

Fix langchain dependency for HF Space

27cfd4d about 2 months ago

preview code

raw

history blame contribute delete

11.9 kB

A newer version of the Gradio SDK is available: 6.1.0

Upgrade

metadata

title: 'RAG Chatbot: ML/AI Assistant'
emoji: 🤖
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 3.35.0
app_file: app.py
pinned: false

<<<<<<< HEAD

🤖 InsightRAG Chatbot: ML/AI Knowledge Assistant

A fully functional Retrieval-Augmented Generation (RAG) chatbot that provides comprehensive information about machine learning, deep learning, AI, and related topics. Built with modern AI technologies and ready for deployment.

🎯 Project Purpose

This RAG chatbot serves as an intelligent knowledge assistant specializing in machine learning, deep learning, and artificial intelligence topics. The chatbot leverages a sophisticated retrieval-augmented generation pipeline to provide accurate, contextual answers by combining:

Knowledge Retrieval: Accessing relevant information from a curated ML/AI knowledge base
Contextual Generation: Using Google Gemini 2.5 Flash to generate comprehensive responses
Interactive Learning: Enabling users to explore complex AI concepts through natural conversation

The primary goal is to make AI and machine learning knowledge accessible through an intuitive, conversational interface that can handle both basic concepts and advanced technical questions.

📚 Dataset Information

Dataset Source

Primary Dataset: The Pile (EleutherAI/the_pile) from Hugging Face
Access Method: Hugging Face Datasets API (no local downloads required)
Content Type: Text-only data (no tables, images, or PDFs)

Dataset Structure

The dataset contains diverse text content filtered specifically for ML/AI relevance:

Content Filtering: Text samples are filtered using ML/AI keywords including:
- Machine learning, deep learning, neural networks
- Artificial intelligence, algorithms, models
- Training, data, features, classification
- Regression, clustering, optimization, gradient, tensor
Text Processing:
- Content is cleaned and preprocessed
- Text is chunked into manageable pieces (500 words with 50-word overlap)
- Only substantial chunks (100-2000 characters) are retained
- Text is embedded using sentence transformers for vector search
Storage: Processed text chunks are stored in Chroma vector database for efficient similarity search

Usage in RAG Pipeline

The dataset serves as the knowledge base for the RAG system, enabling:

Semantic search for relevant context
Contextual answer generation
Comprehensive coverage of ML/AI topics

🔧 Methods Used

RAG Pipeline Architecture

The chatbot implements a sophisticated Retrieval-Augmented Generation pipeline:

1. Data Processing Pipeline

Raw Text → Filtering → Chunking → Embedding → Vector Storage

Text Filtering: ML/AI keyword-based content selection
Chunking: Intelligent text segmentation with overlap
Embedding: Sentence transformer-based vectorization
Storage: Chroma vector database for efficient retrieval

2. Retrieval System

Embedding Model: all-MiniLM-L6-v2 (sentence-transformers)
Vector Database: Chroma with persistent storage
Similarity Search: Cosine similarity for document retrieval
Context Assembly: Top-k relevant documents combined

3. Generation System

Language Model: Google Gemini 2.5 Flash
Temperature: 0.7 for balanced creativity and accuracy
Context Integration: Retrieved documents used as context
Response Formatting: Markdown support for rich text

4. Technical Stack

RAG Framework: LangChain for pipeline orchestration
Vector Database: Chroma for embedding storage and retrieval
Embeddings: Sentence Transformers for text vectorization
LLM: Google Gemini 2.5 Flash for response generation
Interface: Streamlit for web-based chat interface

📊 Results Summary

The RAG chatbot successfully provides comprehensive answers across multiple ML/AI domains:

Answer Quality

Contextual Accuracy: Responses are grounded in retrieved knowledge
Comprehensive Coverage: Handles both basic and advanced topics
Structured Output: Well-formatted responses with examples
Technical Depth: Can explain complex algorithms and concepts

Performance Metrics

Response Time: Fast retrieval and generation (< 5 seconds)
Relevance: High-quality context retrieval from knowledge base
Coverage: Extensive ML/AI topic coverage
Usability: Intuitive conversational interface

Capabilities Demonstrated

Explains fundamental ML/AI concepts
Provides algorithm explanations with examples
Offers practical implementation guidance
Covers current trends and advanced topics
Handles both theoretical and applied questions

💡 Example Questions

The chatbot can answer a comprehensive range of questions across multiple categories:

Basic Concepts

What is the difference between AI, machine learning, and deep learning?
Can you explain supervised, unsupervised, and reinforcement learning?
What are features and labels in a dataset?
Explain overfitting vs underfitting.

Algorithms & Models

How does a neural network learn?
What is gradient descent and how does it work?
Explain decision trees and random forests.
What are convolutional neural networks (CNNs) used for?
How does a transformer model like GPT work?

Practical Applications

How to preprocess data for machine learning?
How can I use AI for image recognition?
Give an example of AI in healthcare.
What are common pitfalls when training deep learning models?

Technical Details

What is backpropagation?
How does regularization prevent overfitting?
Explain embedding vectors and similarity search.
What are activation functions and why are they important?

Performance & Optimization

How to improve model accuracy?
What is cross-validation and why is it used?
Explain hyperparameter tuning.
What is transfer learning?

Trends & Advanced

Explain reinforcement learning with examples.
What are large language models and how do they work?
How is generative AI different from predictive AI?
What is the future of AI in finance/medicine?

🚀 Quick Start

Option 1: Google Colab (Recommended)

Open the notebook: Click the Colab badge above or upload rag_notebook.ipynb to Google Colab
Set up API key: Add your Gemini API key to Colab secrets
Run all cells: Execute the notebook to build the RAG system
Test the system: Try the sample questions provided

Option 2: Local Development

Create virtual environment:

python -m venv rag_chatbot_env
source rag_chatbot_env/bin/activate  # On Windows: rag_chatbot_env\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```

Set up environment:

export GOOGLE_API_KEY="your_gemini_api_key_here"

Run the Streamlit app:
```
streamlit run app.py
```
Access the interface: Open http://localhost:8501 in your browser

🔑 API Key Setup

Google Colab

Go to the key icon (🔑) in the left sidebar
Add a new secret with key GEMINI_API_KEY and your API key as value
Restart the runtime and run the notebook

Local/Hugging Face Spaces

Get your API key from Google AI Studio
Set it as an environment variable: GOOGLE_API_KEY
Or enter it directly in the Streamlit interface

🏗️ Solution Architecture

Problem Statement

Traditional chatbots often provide generic responses without access to specific domain knowledge. This project solves the challenge of creating an AI assistant that can provide accurate, contextual information about machine learning and AI topics.

Technology Stack

Frontend: Streamlit for web interface
Backend: Python with LangChain framework
Vector Database: Chroma for embedding storage
Embeddings: Sentence Transformers
LLM: Google Gemini 2.5 Flash
Data Source: The Pile dataset via Hugging Face

Architecture Benefits

Scalable: Can handle multiple users simultaneously
Accurate: Grounded responses using retrieved context
Flexible: Easy to extend with additional knowledge sources
Efficient: Fast retrieval and generation pipeline

🌐 Web Interface & Deployment

Local Testing

Run streamlit run app.py
Open http://localhost:8501
Enter your Gemini API key
Initialize the RAG system
Start chatting!

Web Deployment

Deploy to Hugging Face Spaces - Add your deployment link here

Interface Features

Chat Interface: Clean, responsive design
Real-time Responses: Instant AI-generated answers
Context Display: Shows retrieved documents and similarity scores
Sample Questions: Quick-start buttons for common queries
System Status: Real-time monitoring of RAG system health

📁 Project Structure

Chatbot_Project/
├── rag_notebook.ipynb      # Complete Colab notebook with RAG pipeline
├── app.py                  # Streamlit web application
├── requirements.txt        # Python dependencies
├── README.md              # This documentation
└── chroma_db/             # Vector database (created during execution)

🔧 Configuration Options

The system can be customized through various parameters:

# RAG Pipeline Configuration
EMBEDDING_MODEL = 'all-MiniLM-L6-v2'
GEMINI_MODEL = 'gemini-2.0-flash-exp'
TEMPERATURE = 0.7
MAX_OUTPUT_TOKENS = 1024
N_RETRIEVAL_RESULTS = 5
CHUNK_SIZE = 500
CHUNK_OVERLAP = 50

🐛 Troubleshooting

Common Issues

API Key Error: Ensure your Gemini API key is correctly set
Memory Issues: Reduce the number of documents processed in Colab
Chroma Connection: Check if the vector database directory exists
Model Loading: Ensure all dependencies are installed correctly

Solutions

Restart Runtime: In Colab, use Runtime → Restart Runtime
Check Logs: Look for error messages in the console
Verify Dependencies: Run pip list to check installed packages
Test Components: Use the test functions in the notebook

🤝 Contributing

Contributions are welcome! Please feel free to:

Fork the repository
Create a feature branch
Make your changes
Submit a pull request

📄 License

This project is open source and available under the MIT License.

🙏 Acknowledgments

EleutherAI for The Pile dataset
Google for Gemini API
LangChain for RAG framework
Chroma for vector database
Streamlit for web interface
Hugging Face for dataset access and deployment platform

📞 Support

If you encounter any issues or have questions:

Check the troubleshooting section above
Review the notebook comments and documentation
Open an issue in the repository
Contact the development team

🚀 Ready to explore the world of AI with our RAG chatbot!

Built with ❤️ using modern AI technologies

title: InsightRAG Chatbot emoji: 🏆 colorFrom: green colorTo: pink sdk: gradio sdk_version: 5.49.1 app_file: app.py pinned: false

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

22938a84be4affe70a5e5035544417eff395bd6e