Stars
High performance self-hosted photo and video management solution.
Easily compute clip embeddings and build a clip retrieval system with them
A concise but complete implementation of CLIP with various experimental improvements from recent papers
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.
A curated list of retrieval-augmented generation (RAG) in large language models
A modular graph-based Retrieval-Augmented Generation (RAG) system
This folder of code contains code and notebooks to supplement the "Vision Transformers Explained" series published on Towards Data Science written by Skylar Callis.
Video+code lecture on building nanoGPT from scratch
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Lecture notes, projects and other materials for Course 'CS205 C/C++ Program Design' at Southern University of Science and Technology.
A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
llama3 implementation one matrix multiplication at a time
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
OCR, layout analysis, reading order, table recognition in 90+ languages
LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.
👮♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基于 DFA 算法实现的高性能 java 敏感词过滤工具框架。内置支持单词标签分类分级。请勿发布涉及政治、广告、营销、翻墙、违反国家法律法规等内容。高性能敏感词检测过滤组件,附带繁体简体互换,支持全角半角互换,汉字转拼音,模糊搜索等功能。)
DeepSeek-VL: Towards Real-World Vision-Language Understanding