- vision
- language
- time series
- multimodal
- embedded
- internship
- thesis
•
•
•
•
•
•
-
Deep Learning for wildfire spread modeling
The thesis aims at developing a Deep Learning model to predict wildfire spread using multimodal and multivariate data.
-
Beyond the Canvas: A Systematic Review of Generative AI for Image Synthesis and Editing
Providing a comprehensive review of state-of-the-art image generative models, exploring architectural evolutions from GANs to Diffusion Models and hybrid systems, while analyzing evaluation paradigms and ethical challenges.
-
3D Urban Scene Synthesis from Multi-View Satellite Imagery
Synthesizing real-time, navigable 3D urban environments from multi-view satellite imagery using 3D Gaussian Splatting and generative refinement, with a focus on a case study in Turin.
-
Classifying Multimodal Post Content through Multimodal Large Language Models
This thesis involves specialize MLLMs to Multimodal Post Content Classification.
-
Investigating the Modality Gap in Multimodal Large Language Models
This thesis involves investigating modality gap in MLLMs over a chosen multimodal task.