Bimodal Neural Style Transfer for Image Generation Based on Text Prompts
Keywords: Generative models, Creative AI, Image generation
Abstract
Neural networks have become one of the essential areas in Artificial Intelligence due to their extraordinary capacity to address problems in different domains. This ability led to the proposal of novel architectures and models to tackle challenging tasks such as neural style transfer. We propose a novel methodology for bimodal style transfer using text as input. We initially retrieve one image and a short descriptive text, which are mapped into a multimodal common latent space. Then, a new image is retrieved using an image retrieval engine. Finally, we use a generative model, which allows us to create artistic images by combining content and style. The proposed system can retrieve semantically similar images concerning a descriptive text (prompt), achieving great precision rates in image retrieval applied to the SemArt dataset. The transfer style neural model also preserves the imageâs high quality, combining style and content.
Más información
| Título según WOS: | Bimodal Neural Style Transfer for Image Generation Based on Text Prompts |
| Título según SCOPUS: | Bimodal Neural Style Transfer for Image Generation Based on Text Prompts |
| Título de la Revista: | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
| Volumen: | 14035 |
| Editorial: | Springer Science and Business Media Deutschland GmbH |
| Fecha de publicación: | 2023 |
| Página de inicio: | 379 |
| Página final: | 390 |
| Idioma: | English |
| DOI: |
10.1007/978-3-031-34732-0_29 |
| Notas: | ISI, SCOPUS |