Stars
Python client library for Google Maps API Web Services
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning
Open-Sora: Democratizing Efficient Video Production for All
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…
[CVPR2023] Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
Tracking and collecting papers/projects/others related to Segment Anything.
Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such as Classification, Semantic Segmentation and Monocular dept…
PyTorch code and models for the DINOv2 self-supervised learning method.
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
Instruct-tune LLaMA on consumer hardware
骆驼(Luotuo): Open Sourced Chinese Language Models. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
billjie1 / Chinese-CLIP
Forked from OFA-Sys/Chinese-CLIPChinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
OpenMMLab Detection Toolbox and Benchmark
A codebase and a curated list of awesome deep long-tailed learning (TPAMI 2023).
Official implementation of Deep Burst Super-Resolution
Simple implementation of OpenAI CLIP model in PyTorch.
OpenAI CLIP text encoders for multiple languages!