水上一只鹅
水上一只鹅
Home
Publications
Experience
Projects
Wiki
CV
Blog
Light
Dark
Automatic
Vision-Language Pre-training
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE (AAAI 2024)
Building scalable vision-language models to learn from diverse, multimodal data remains an open challenge. In this paper, we introduce …
Junyi Chen
,
Longteng Guo
,
Jia Sun
,
Shuai Shao
,
Zehuan Yuan
,
Liang Lin
,
Dongyu Zhang
Paper
Code
Cite
×