OpenGVLab

community

https://github.com/opengvlab

opengvlab

OpenGVLab

Request to join this org

AI & ML interests

Computer Vision

Organization Card

Community About org cards

OpenGVLab

Welcome to OpenGVLab! We are a research group from Shanghai AI Lab focused on Vision-Centric AI research. The GV in our name, OpenGVLab, means general vision, a general understanding of vision, so little effort is needed to adapt to new vision-based tasks.

Models

InternVL: a pioneering open-source alternative to GPT-4V.
InternImage: a large-scale vision foundation models with deformable convolutions.
InternVideo: large-scale video foundation models for multimodal understanding.
VideoChat: an end-to-end chat assistant for video comprehension.
All-Seeing-Project: towards panoptic visual recognition and understanding of the open world.

Datasets

ShareGPT4o: a groundbreaking large-scale resource that we plan to open-source with 200K meticulously annotated images, 10K videos with highly descriptive captions, and 10K audio files with detailed descriptions.
InternVid: a large-scale video-text dataset for multimodal understanding and generation.

Benchmarks

MVBench: a comprehensive benchmark for multimodal video understanding.

Collections 11

spaces 10

InternVideo2 Chat 8B HD

MVBench Leaderboard

ControlLLM

InternVL

Running on Zero

VideoMamba

VideoChat2

models 88

OpenGVLab/VideoChat2_HD_stage4_Mistral_7B_hf

Updated 10 days ago • 71

OpenGVLab/InternVL2-8B

Image-Text-to-Text • Updated 11 days ago • 59.9k • 123

OpenGVLab/InternVL2-4B

Image-Text-to-Text • Updated 12 days ago • 19.3k • 34

OpenGVLab/InternVL2-Llama3-76B-AWQ

Image-Text-to-Text • Updated 12 days ago • 1.23k • 19

OpenGVLab/InternVL2-40B-AWQ

Image-Text-to-Text • Updated 12 days ago • 1.39k • 14

OpenGVLab/InternVL2-26B-AWQ

Image-Text-to-Text • Updated 12 days ago • 558 • 14

OpenGVLab/InternVL2-8B-AWQ

Image-Text-to-Text • Updated 12 days ago • 1.48k • 10

OpenGVLab/InternVL2-2B-AWQ

Image-Text-to-Text • Updated 12 days ago • 11.1k • 13

OpenGVLab/InternVL2-Llama3-76B

Image-Text-to-Text • Updated 12 days ago • 162k • 190

OpenGVLab/InternVL2-40B

Image-Text-to-Text • Updated 12 days ago • 22.2k • 87

datasets 25

OpenGVLab/GMAI-MMBench

Preview • Updated 4 days ago • 1 • 10

OpenGVLab/InternVL-SA-1B-Caption

Viewer • Updated 15 days ago • 8.63M • 6

OpenGVLab/InternVL-Chat-V1-2-SFT-Data

Viewer • Updated 16 days ago • 573k • 82 • 10

OpenGVLab/InternVL-LaionCOCO-OCR

Updated 16 days ago

OpenGVLab/InternVL-WuKong-OCR

Updated 16 days ago

OpenGVLab/GUI-Odyssey

Viewer • Updated 22 days ago • 7.74k • 4 • 6

OpenGVLab/ScaleVLN

Updated 25 days ago

OpenGVLab/OmniCorpus-CC-210M

Viewer • Updated Aug 30 • 208M • 28 • 11

OpenGVLab/ShareGPT-4o

Viewer • Updated Aug 17 • 59.4k • 58 • 130

OpenGVLab/MVBench

Viewer • Updated Aug 14 • 4k • 115k • 22