This example demonstrates Multi-Head RAG built from scratch, without a supporting framework such as LangChain or LlamaIndex.
The Multi-Head RAG concept was introduced in the following paper.
Paper - https://arxiv.org/pdf/2406.05085
Multi-Head RAG (MRAG) is designed for queries that need multiple, semantically diverse documents to answer. By retrieving from several embedding spaces instead of one, it improves retrieval accuracy on such complex queries.
Steps:
- Read the document and split it with a recursive text splitter
- Set up three embedding-space schemas with the LanceDB Embedding API
- Insert the chunks into all three embedding spaces (see the indexing sketch after this list)
- Run a semantic search over every embedding space with the query and collect the context (see the query sketch further below)
- Use the collected context to generate the answer
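A minimal sketch of the indexing side (the first three steps), assuming LanceDB's embedding-function registry exposes the openai and ollama providers, and using a hypothetical data.txt as the source document:

```python
import lancedb
from lancedb.embeddings import get_registry
from lancedb.pydantic import LanceModel, Vector

# Simple recursive character splitter: try the biggest separator first,
# fall back to smaller ones until every chunk fits the size limit.
def recursive_split(text, chunk_size=512, seps=("\n\n", "\n", ". ", " ")):
    if len(text) <= chunk_size or not seps:
        return [text]
    sep, rest = seps[0], seps[1:]
    chunks, buf = [], ""
    for piece in text.split(sep):
        candidate = buf + sep + piece if buf else piece
        if len(candidate) <= chunk_size:
            buf = candidate
        else:
            if buf:
                chunks.append(buf)
            buf = ""
            if len(piece) > chunk_size:
                chunks.extend(recursive_split(piece, chunk_size, rest))
            else:
                buf = piece
    if buf:
        chunks.append(buf)
    return chunks

db = lancedb.connect("./mrag-db")  # assumed local DB path

# One embedding function per space: OpenAI plus two local Ollama models.
embedders = {
    "openai_space": get_registry().get("openai").create(name="text-embedding-ada-002"),
    "llama3_space": get_registry().get("ollama").create(name="llama3"),
    "mistral_space": get_registry().get("ollama").create(name="mistral"),
}

chunks = recursive_split(open("data.txt").read())  # hypothetical input file

for space, embed in embedders.items():
    # SourceField/VectorField pairing: LanceDB embeds the `text` column
    # automatically on insert, so we only supply raw chunks.
    class Schema(LanceModel):
        text: str = embed.SourceField()
        vector: Vector(embed.ndims()) = embed.VectorField()

    table = db.create_table(space, schema=Schema, mode="overwrite")
    table.add([{"text": c} for c in chunks])
```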
This example uses OpenAI's text-embedding-ada-002 model plus the default llama3 and mistral Ollama models to create the three embedding spaces.
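With the three spaces populated, the query side (the last two steps) could look like the sketch below. The table names match the indexing sketch above, and the gpt-4o-mini generation model is an illustrative assumption, not what main.py necessarily uses:

```python
import lancedb
from openai import OpenAI

db = lancedb.connect("./mrag-db")  # same assumed path as the indexing sketch
client = OpenAI()                  # reads OPENAI_API_KEY from the environment

def multi_head_search(query, k=3):
    # Query every embedding space; LanceDB embeds the query string with
    # each table's own embedding function, then we merge the hits.
    context = []
    for space in ("openai_space", "llama3_space", "mistral_space"):
        hits = db.open_table(space).search(query).limit(k).to_list()
        context.extend(h["text"] for h in hits)
    # Deduplicate while preserving retrieval order.
    return "\n---\n".join(dict.fromkeys(context))

def answer(query):
    context = multi_head_search(query)
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # assumed generation model
        messages=[
            {"role": "system", "content": "Answer using only the given context."},
            {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {query}"},
        ],
    )
    return resp.choices[0].message.content

print(answer("What does the document say about retrieval?"))
```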
Install all the dependencies:
pip install -r requirements.txt
Install Ollama and pull the two local models:
curl -fsSL https://ollama.com/install.sh | sh
ollama pull llama3
ollama pull mistral
Set your OpenAI API key and run the example:
export OPENAI_API_KEY=sk-...
python3 main.py
CAUTION: MRAG takes considerably longer than single-space RAG, both to build the index and to answer a query, because every chunk and every query must be embedded three times (once per embedding space), roughly tripling the end-to-end time.