Mapping the black box. Visual investigation of a diffusion model’s latent space

dc.contributor.authorOliva, Octavio
dc.contributor.mentorOplatek, Jiri
dc.contributor.mentorSchubbach, Arno
dc.contributor.mentorReymond, Claire
dc.date.accessioned2025-02-14T06:23:52Z
dc.date.issued2024
dc.description.abstractAlthough the translation from text to images has been a long-standing aspect of human visual expression, generative AI models add a new way to perform these translations based on textual prompts. This new possibility makes the generative models’ internal logic and decision-making processes become central. The research explores the Midjourney v6.0-mediated translation from text to images through three types of experiments, with a particular focus on the correlation between generated images and specific prompt variations. The proposed methods prove to be a successful strategy to investigate the model’s latent space and decision-making processes, and the analysis of the generated image series reveals intriguing insights about the AI’s ‘black box’ structure and its internal latent representations.
dc.description.urihttps://hdl.handle.net/20.500.11806/next/IDCE_20240013
dc.identifier.urihttps://irf.fhnw.ch/handle/11654/50381
dc.language.isoen
dc.publisherHochschule für Gestaltung und Kunst Basel FHNW
dc.spatialBasel
dc.subjectKünstliche Intelligenz
dc.subjectHalluzinationen
dc.subjectBildgenerierung
dc.subjectPrompting
dc.subjectModell
dc.subject.ddc700 - Künste und Unterhaltung
dc.titleMapping the black box. Visual investigation of a diffusion model’s latent space
dc.type11 - Studentische Arbeit
dspace.entity.typePublication
fhnw.InventedHereYes
fhnw.StudentsWorkTypeMaster
fhnw.affiliation.hochschuleHochschule für Gestaltung und Kunst Basel FHNWde_CH
fhnw.affiliation.institutInstitute of Digital Communication Environmentsde_CH
fhnw.studyProgramMaster of Arts FHNW in Digital Communication Environments
relation.isMentorOfPublication21160a48-7cc5-4342-860a-9a6d56c8a504
relation.isMentorOfPublication56b6cc42-54af-4abb-aa33-255de7f9746e
relation.isMentorOfPublicationd8615942-7666-4969-9070-e39957c805b2
relation.isMentorOfPublication.latestForDiscovery21160a48-7cc5-4342-860a-9a6d56c8a504
Dateien