- vision
- language
- time series
- multimodal
- embedded
- internship
- thesis
•
•
•
•
•
•
-
Classifying Multimodal Post Content through Multimodal Large Language Models
This thesis involves specialize MLLMs to Multimodal Post Content Classification.
-
Investigating the Modality Gap in Multimodal Large Language Models
This thesis involves investigating modality gap in MLLMs over a chosen multimodal task.
-
Development of a Foosball ELO System
Design and implementation of a complete Foosball ELO rating system, including backend services, frontend UI and database architecture
-
Object Detection and Tracking for automated video analysis of padel matches and generation of game statistics
Develop a machine learning algorithm able to track players and balls in padel matches footage and to gnerate game statistics
-
Adaptive Granularity Retrieval for Retrieval-Augmented Generation
This thesis explores adaptive retrieval for Retrieval-Augmented Generation, developing a system that dynamically adjusts the granularity of retrieved context (document, section, or passage) based on query intent. The goal is to improve both precision for fine-grained questions and coherence for broad, open-ended queries.