VLM2Vec

community

https://github.com/TIGER-AI-Lab/VLM2Vec

AI & ML interests

Multimodal Embeddings and Retrieval.

Recent Activity

memray authored a paper 11 days ago

XGen-7B Technical Report

memray authored a paper 11 days ago

Exploring the Integration Strategies of Retriever and Large Language Models

memray authored a paper 11 days ago

General-to-Specific Transfer Labeling for Domain Adaptable Keyphrase Generation

View all activity

Organization Card

Community About org cards

VLM2Vec & MMEB: Benchmarking multimodal embeddings and adapting state-of-the-art multimodal large language models into embedding models.

Website - https://tiger-ai-lab.github.io/VLM2Vec/
Github https://github.com/TIGER-AI-Lab/VLM2Vec

List of Our Papers

Main VLM2Vec / MMEB Series

VLM2Vec / MMEB – Image embedding benchmarking and models. (ICLR2025)
VLM2Vec-V2 / MMEB-V2 – Extension of our previous work to video and visual document tasks. (TMLR2026)

Other Related Papers from Our Team

GAE-Retriever – Benchmark and model for trajectory modeling in GUI environments. (Computer-use Agents@ICML 2025)
B3 – A novel batch mining strategy for contrastive learning. (Neurips2025)

models 1

VLM2Vec/VLM2Vec-V2.0

Image-Text-to-Text • Updated Jul 13, 2025 • 2.89k • 29

datasets 45

VLM2Vec/MMEB-V3

Preview • Updated 13 days ago • 514 • 2

VLM2Vec/GAE-Mind2Web

Viewer • Updated Feb 11 • 12.1k • 94

VLM2Vec/GAE-GUIAct

Viewer • Updated Feb 11 • 74.3k • 36

VLM2Vec/Video_Caption_HN

Viewer • Updated Dec 20, 2025 • 302k • 33

VLM2Vec/MMLongBench-page-fixed

Viewer • Updated Nov 4, 2025 • 8.91k • 2.8k

VLM2Vec/ViDoSeek-page-fixed

Viewer • Updated Nov 4, 2025 • 8.78k • 1.1k

VLM2Vec/MMEB-V2

Updated Sep 24, 2025 • 532 • 2

VLM2Vec/B3-7b

Viewer • Updated Aug 29, 2025 • 1.03M • 10 • 1

VLM2Vec/B3-2b

Viewer • Updated Aug 29, 2025 • 1.03M • 20

VLM2Vec/MVBench

Viewer • Updated Aug 15, 2025 • 4k • 1.46k

View 45 datasets