AI & ML interests

Multimodal Embeddings and Retrieval.

Recent Activity

ziyjiang  updated a Space about 21 hours ago
VLM2Vec/README
ziyjiang  updated a dataset 28 days ago
VLM2Vec/Video_Caption_HN
ziyjiang  published a dataset 28 days ago
VLM2Vec/Video_Caption_HN
View all activity

VLM2Vec & MMEB: Benchmarking multimodal embeddings and adapting state-of-the-art multimodal large language models into embedding models.

List of Our Papers

Main VLM2Vec / MMEB Series

  • VLM2Vec / MMEB – Image embedding benchmarking and models. (ICLR2025)
  • VLM2Vec-V2 / MMEB-V2 – Extension of our previous work to video and visual document tasks. (TMLR2026)

Other Related Papers from Our Team

  • GAE-Retriever – Benchmark and model for trajectory modeling in GUI environments. (Computer-use Agents@ICML 2025)
  • B3 – A novel batch mining strategy for contrastive learning. (Neurips2025)