Distill compact multilingual vision-text embeddings from large multimodal teachers for real-world deployment.
Mentors Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Pume Tuchinda
Mentees (4) Ashvanth S, Faiz Assabil Firdaus, Ilma Aliya Fiddien, Puja Ahmad Habibi