- vision
- language
- time series
- multimodal
- embedded
- internship
- thesis
•
•
•
•
•
•
-
Investigating the Modality Gap in Multimodal Large Language Models
This thesis involves investigating modality gap in MLLMs over a chosen multimodal task.
-
Adaptive Granularity Retrieval for Retrieval-Augmented Generation
This thesis explores adaptive retrieval for Retrieval-Augmented Generation, developing a system that dynamically adjusts the granularity of retrieved context (document, section, or passage) based on query intent. The goal is to improve both precision for fine-grained questions and coherence for broad, open-ended queries.
-
Enhancing Multimodal RAG Systems through Cross-Modal Retrieval and Reranking
This thesis involves the implementation of a cross-modal retrieval and reranking pipeline.
-
Refactoring a Web Application for generating paintings from music
Refactoring a Web Application for generating paintings from music