FREE VERSION DOWNLOADWork published in open access form and licensed under Creative Commons Attribution – NonCommercial – NoDerivatives 3.0 Italy (CC BY-NC-ND 3.0 IT).Opera distribuita con licenza Creative Commons Attribuzione – Non commerciale – Non opere derivate 3.0 Italia e diffusa in modalità open access (CC BY-NC-ND 3.0 IT).
Winner of the Competition “Prize for PhD Thesis 2020” arranged by Sapienza University Press.
Despite being the leading paradigm in computer vision, deep neural networks are inherently limited by the visual and semantic information contained in their training set. In this thesis, we aim to design deep models operating with previously unseen visual domains and semantic concepts. We first describe different solutions for generalizing to new visual domains, applying variants of normalization layers to multiple challenging settings e.g. where new domain data is not available but arrives online or is described by metadata. In the second part, we incorporate new semantic concepts into pretrained deep models. We propose specific solutions for different problems such as multi-task/incremental learning and open-world recognition. Finally, we merge the two challenges: given images of multiple domains and categories, can we recognize unseen concepts in unseen domains? We propose an approach that is the first, promising step, towards solving this problem.
eBook disponibile anche su Google Play, Google Libri, MLOL
© Sapienza Università Editrice - Piazzale Aldo Moro 5, 00185 Roma - CF 80209930587 PI 02133771002 - firstname.lastname@example.org