Roberto Amoroso
Roberto Amoroso
Home
News
Experience
Awards
Publications
Activities
Contact
Light
Dark
Automatic
Visual-Semantic Understanding
Multimodal Attentive Deep Learning Architectures for Visual-Semantic Understanding
PhD thesis investigating critical challenges associated with multimodal attentive architectures, including improving semantic segmentation accuracy, enabling open-vocabulary segmentation, and advancing video question answering.
Roberto Amoroso
PDF
Cite
Cite
×