Skip to content

Navigation Menu

FoundationVision

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

FoundationVision

Overview
Repositories
Projects
Packages
People

More

Overview
Repositories
Projects
Packages
People

README.md

Hi there 👋

This is FoundationVision official website repo

Popular repositories Loading

VAR VAR Public

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Jupyter Notebook 6.5k 436
LlamaGen LlamaGen Public

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1.4k 57
GLEE GLEE Public

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1.1k 86
Infinity Infinity Public

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 656 17
Groma Groma Public

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 589 61
OmniTokenizer OmniTokenizer Public

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 279 7

Repositories

Loading

Type

Select type

All Public Sources Forks Archived Mirrors Templates

Language

Select language

All HTML Jupyter Notebook Python

Sort

Select order

Last updated Name Stars

Showing 10 of 12 repositories

Infinity Public
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

FoundationVision/Infinity’s past year of commit activity

Python 656 MIT 17 6 0 Updated Dec 30, 2024
infinity.project Public

FoundationVision/infinity.project’s past year of commit activity

HTML 0 0 0 0 Updated Dec 24, 2024
VAR Public
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

FoundationVision/VAR’s past year of commit activity

Jupyter Notebook 6,540 MIT 436 35 0 Updated Dec 23, 2024
Liquid Public
Liquid: Language Models are Scalable Multi-modal Generators

FoundationVision/Liquid’s past year of commit activity

55 MIT 0 2 0 Updated Dec 12, 2024
GLEE Public
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

FoundationVision/GLEE’s past year of commit activity

Python 1,124 MIT 86 40 2 Updated Oct 21, 2024
LlamaGen Public
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

FoundationVision/LlamaGen’s past year of commit activity

Python 1,432 MIT 57 52 0 Updated Aug 15, 2024
OmniTokenizer Public
[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

FoundationVision/OmniTokenizer’s past year of commit activity

Python 279 MIT 7 8 0 Updated Jul 9, 2024
vaex Public
🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook

FoundationVision/vaex’s past year of commit activity

Python 69 MIT 4 3 0 Updated Jun 23, 2024
Groma Public
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

FoundationVision/Groma’s past year of commit activity

Python 589 Apache-2.0 61 9 1 Updated Jun 7, 2024
GenerateU Public
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

FoundationVision/GenerateU’s past year of commit activity

Python 151 6 15 0 Updated Mar 25, 2024

View all repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Python HTML Jupyter Notebook

Most used topics

Loading…

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.