Spaces:
Running
Running
metadata
title: README
emoji: ⚡
colorFrom: purple
colorTo: gray
sdk: static
pinned: false
OpenGVLab
Welcome to OpenGVLab! We are a research group from Shanghai AI Lab focused on Vision-Centric AI research. The GV in our name, OpenGVLab, means general vision, a general understanding of vision, so little effort is needed to adapt to new vision-based tasks.
Models
- InternVL: a pioneering open-source alternative to GPT-4V.
- InternImage: a large-scale vision foundation models with deformable convolutions.
- InternVideo: large-scale video foundation models for multimodal understanding.
- VideoChat: an end-to-end chat assistant for video comprehension.
- All Seeing:
- All Seeing V2:
Datasets
- ShareGPT4o:
- InternVid: a large-scale video-text dataset for multimodal understanding and generation.
Benchmarks
- MVBench: a comprehensive benchmark for multimodal video understanding.