MMRL: Multi-Modal Representation Learning for Vision-Language Models
Published in CVPR2025, 2025
This article aims to address the generalization challenge on new categories after efficient transfer learning for vision-language models.
Recommended citation: Guo Y, Gu X. Mmrl: Multi-modal representation learning for vision-language models[C]//Proceedings of the Computer Vision and Pattern Recognition Conference. 2025: 25015-25025.
Download Paper Download Code
