- vision
- language
- time series
- multimodal
- embedded
- internship
- thesis
•
•
•
•
•
•
-
Beyond the Canvas: A Systematic Review of Generative AI for Image Synthesis and Editing
Providing a comprehensive review of state-of-the-art image generative models, exploring architectural evolutions from GANs to Diffusion Models and hybrid systems, while analyzing evaluation paradigms and ethical challenges.
-
Classifying Multimodal Post Content through Multimodal Large Language Models
This thesis involves specialize MLLMs to Multimodal Post Content Classification.
-
Adaptive Granularity Retrieval for Retrieval-Augmented Generation
This thesis explores adaptive retrieval for Retrieval-Augmented Generation, developing a system that dynamically adjusts the granularity of retrieved context (document, section, or passage) based on query intent. The goal is to improve both precision for fine-grained questions and coherence for broad, open-ended queries.
-
Development of a PyTorch-Based Framework for Semantic Segmentation on Point Clouds
A short description of the proposal.
-
Refactoring a Web Application for generating paintings from music
Refactoring a Web Application for generating paintings from music