I develop a compact RAG (Retrieval-Augmented Generation) system that runs on a Raspberry Pi. As the database for the RAG, I adopt SQLite and implement the vector DB with the sqlite-vec extension.
- Develop a compact RAG that runs on a Raspberry Pi, supporting hybrid RAG with both an SQL DB and a vector DB.
- The RAG also works as an API server for my other projects: virtual-showroom and node-red-ai-agents.
- OpenAI API key
- LLM model: gpt-4o-mini
- Embeddings model: text-embedding-3-small
- Raspberry Pi
                                  Brain
                           [OpenAI API service]
 Unity app                          |
[VirtualShowroom]-----+             |
                      |             |
 Web apps             |    Compact RAG (app.py)
[Web Browser]---------+-------[Raspberry Pi]---+---USB---[Camera with mic]
                      |             |          |
 AI Agents            |         SQLite DB      +---USB---[Speaker]
[Node-RED]------------+
$ git clone https://github.com/asg017/sqlite-vec
$ cd sqlite-vec
$ sudo apt-get install libsqlite3-dev
$ make loadable
Find "vec0.so" in ./dist directory.
- Step 1. Generating chunks: I run this notebook on my Mac.
- Step 2. Calculating embeddings: I run this script on my Raspberry Pi 3 (a rough sketch of this step follows below).
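For reference, the overall flow of Step 2 looks roughly like the sketch below. This is not the actual notebook or script: the file names, table names, and vec0 path are assumptions. Chunk texts go into a plain SQLite table, their embeddings into a vec0 virtual table, and a KNN query joins the two at retrieval time, which is the hybrid SQL-plus-vector setup mentioned above.

import json
import sqlite3
from openai import OpenAI

client = OpenAI()                        # reads OPENAI_API_KEY from the environment
EMBED_MODEL = "text-embedding-3-small"   # 1536-dimensional embeddings

db = sqlite3.connect("rag.db")           # assumed DB file name
db.enable_load_extension(True)
db.load_extension("./dist/vec0")         # assumed path to vec0.so
db.enable_load_extension(False)

# SQL side: plain table for the chunk texts. Vector side: vec0 virtual table.
db.execute("CREATE TABLE IF NOT EXISTS chunks(id INTEGER PRIMARY KEY, body TEXT)")
db.execute("CREATE VIRTUAL TABLE IF NOT EXISTS vec_chunks USING vec0(embedding float[1536])")

def embed(text):
    return client.embeddings.create(model=EMBED_MODEL, input=text).data[0].embedding

# Store the chunks produced in Step 1 (chunks.json is a hypothetical file name).
with open("chunks.json") as f:
    for i, body in enumerate(json.load(f)):
        db.execute("INSERT INTO chunks(id, body) VALUES (?, ?)", (i, body))
        db.execute("INSERT INTO vec_chunks(rowid, embedding) VALUES (?, ?)",
                   (i, json.dumps(embed(body))))
db.commit()

# Retrieval: the three nearest chunks to a question, joined back to their texts.
question = json.dumps(embed("What does the virtual showroom display?"))
rows = db.execute(
    """SELECT chunks.body, knn.distance
       FROM (SELECT rowid, distance FROM vec_chunks
             WHERE embedding MATCH ? AND k = 3
             ORDER BY distance) AS knn
       JOIN chunks ON chunks.id = knn.rowid""",
    (question,)).fetchall()
for body, distance in rows:
    print(round(distance, 3), body[:80])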
- cx package ... Python package
- API server ... API server
$ cd app
$ python app.py
The API server provides simple web apps. Access "http://<IP address of the API server>:5050" with a web browser.
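For a quick check without a browser, the same address can also be hit from another machine (the root path is an assumption; adjust it to the actual routes of app.py):

$ curl http://<IP address of the API server>:5050/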
virtual-showroom uses this API server to access the OpenAI API service.
Refer to this article to start the API server automatically at boot.
A sample service file looks like this:
[Unit]
Description=Python Generative AI API server
After=network.target
[Service]
ExecStart=/usr/bin/python3 -m app --directory <Path to "app" folder>
WorkingDirectory=<Path to "app" folder>
Restart=always
RestartSec=10
User=<Your user name>
Group=users
Environment=PYTHONPATH=<Path to this repo on Raspberry Pi>
Environment=OPENAI_API_KEY=<OpenAI API key>
[Install]
WantedBy=multi-user.target
After creating the service file (e.g., /etc/systemd/system/gen_ai.service), reload systemd, enable the service so that it starts at boot, and start it:
$ sudo systemctl daemon-reload
$ sudo systemctl enable gen_ai.service
$ sudo systemctl start gen_ai.service
Confirm that the daemon process is running:
$ sudo systemctl status gen_ai.service
If something goes wrong, check the syslog:
$ tail /var/log/syslog
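The service's own output can also be followed with journalctl, using the unit name from above:

$ journalctl -u gen_ai.service -e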