🔭 I'm currently working on video understanding and large multimodal models
📫 How to reach me: [email protected]
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.