Latent space representation and manipulation of StyleGANs
dc.contributor.advisor | Oliveira Neto, Manuel Menezes de | pt_BR |
dc.contributor.author | Dick, João Atz | pt_BR |
dc.date.accessioned | 2024-08-15T06:30:21Z | pt_BR |
dc.date.issued | 2022 | pt_BR |
dc.identifier.uri | http://hdl.handle.net/10183/277344 | pt_BR |
dc.description.abstract | StyleGAN models are a new paradigm in artificial image generation. Initially proposed to generate fake facial images, these deep generative models can be used to edit real photographs with the aid of GAN Inversion and Latent Manipulation algorithms. GAN Inversion techniques embed real images into StyleGAN’s latent space, yielding a latent point used to generate an artificial image as close as possible to the original one. Latent Manipulation operations work on top of the resulting latent point to perform semantic-oriented edits, aiming to preserve the image’s remaining characteristics. Recent research initiatives attempt to better understand and model the rich latent space of StyleGANs, focusing on how to invert and edit real images with minimum distortion and maximum editing efficiency. Although significant advances have been made in the past few years, GAN Inversion and Latent Manipulation methods still face difficulties when trying to disentangle semantic features in latent spaces. A better understanding of basic latent-space arithmetic is needed to assess the entanglement of StyleGAN’s semantic features. This undergraduate thesis describes the state of the art in this field, defines basic latent arithmetic operations, and performs a variety of latent arithmetic experiments. The experimental results are used to develop a better understanding of StyleGAN’s latent space, setting a theoretical basis for future research directions in GAN Inversion and Latent Manipulation. | en |
dc.format.mimetype | application/pdf | pt_BR |
dc.language.iso | eng | pt_BR |
dc.rights | Open Access | en |
dc.subject | StyleGAN | en |
dc.subject | Image | pt_BR |
dc.subject | Image generation | en |
dc.subject | Photography | pt_BR |
dc.subject | Latent space | en |
dc.subject | GAN inversion | en |
dc.subject | Latent manipulation | en |
dc.title | Latent space representation and manipulation of StyleGANs | pt_BR |
dc.type | Undergraduate thesis | pt_BR |
dc.identifier.nrb | 001162398 | pt_BR |
dc.degree.grantor | Universidade Federal do Rio Grande do Sul | pt_BR |
dc.degree.department | Instituto de Informática | pt_BR |
dc.degree.local | Porto Alegre, BR-RS | pt_BR |
dc.degree.date | 2022 | pt_BR |
dc.degree.graduation | Computer Science: Emphasis in Computer Science: Bachelor's degree | pt_BR |
dc.degree.level | undergraduate | pt_BR |
This item is licensed under a Creative Commons License
