420273

Advancing Creativity: A Comprehensive Review of AI-Driven Text-to-Image Generation and Its Applications

Article

Last updated: 09 Apr 2025

Subjects

-

Tags

Software Engineering.

Abstract

The field of AI-driven text-to-image generation has emerged as a transformative intersection of technology and creativity, enabling the automatic synthesis of visuals from textual descriptions. This capability has profound implications for diverse applications, from storytelling and education to digital art and design. By automating the translation of textual content into visually rich representations, text-to-image generation bridges the gap between linguistic and visual modalities, fostering novel opportunities for innovation and exploration. This review explores the state-of-the-art advancements in text-to-image synthesis, emphasizing the technological evolution from Generative Adversarial Networks (GANs) to diffusion models and transformer-based architectures. It highlights how these models, including tools like DALL-E-2, Midjourney, and Stable Diffusion, have advanced in generating semantically aligned, visually coherent, and aesthetically appealing images. Despite notable progress, significant challenges remain. These include maintaining contextual coherence across sequences, adhering to artistic and compositional principles, and addressing the dependency on detailed textual prompts. Moreover, the limitations of existing evaluation metrics, such as the Inception Score (IS) and Fréchet Inception Distance (FID), are critically analyzed, underscoring the need for metrics that account for semantic fidelity, emotional resonance, and user-centric perspectives. The review synthesizes insights from recent studies to identify key areas for innovation, such as enhanced context management, integration of 3D modeling capabilities, and real-time user interaction mechanisms. Finally, the paper outlines future directions to address current limitations, promote interdisciplinary collaboration, and establish ethical guidelines for responsible AI deployment. By doing so, this work aims to provide a comprehensive foundation for advancing generative AI and its applications across creative industries

DOI

10.21608/astj.2025.343418.1018

Keywords

Book Illustration Context Transition Generative AI Narrative Coherence Prompt Engineering Text, Image Synthesis Text, to, Image Illustration User Experience

Authors

First Name

Noha

Last Name

Hussen

MiddleName

-

Affiliation

Software Engineering Department, Faculty of Engineering and Technology, Egyptian Chinese University, Cairo, Egypt

Email

noha.hussen@ecu.edu.eg

City

-

Orcid

0009-0003-4364-5247

First Name

Ahmed

Last Name

Samir

MiddleName

-

Affiliation

Software Engineering Department, Faculty of Engineering and Technology, Egyptian Chinese University, Cairo, Egypt

Email

ahmedalayan815@gmail.com

City

-

Orcid

-

First Name

Aliaa

Last Name

Adel

MiddleName

-

Affiliation

Software Engineering Department, Faculty of Engineering and Technology, Egyptian Chinese University, Cairo, Egypt

Email

aliaaadel2000@gmail.com

City

-

Orcid

-

First Name

Abdelrahman

Last Name

Gaber

MiddleName

-

Affiliation

Software Engineering Department, Faculty of Engineering and Technology, Egyptian Chinese University, Cairo, Egypt

Email

abdelrahman@gmail.com

City

-

Orcid

-

First Name

Mommen

Last Name

Attaia

MiddleName

-

Affiliation

Software Engineering Department, Faculty of Engineering and Technology, Egyptian Chinese University, Cairo, Egypt

Email

mommenatia3@gmail.com

City

-

Orcid

-

First Name

Ahmed

Last Name

Mohamed

MiddleName

-

Affiliation

Software Engineering Department, Faculty of Engineering and Technology, Egyptian Chinese University, Cairo, Egypt

Email

ahm112029@gmail.com

City

-

Orcid

-

Volume

2

Article Issue

2

Related Issue

53297

Issue Date

2025-12-01

Receive Date

2024-12-10

Publish Date

2025-12-01

Page Start

1

Page End

17

Online ISSN

3009-7614

Link

https://astj.journals.ekb.eg/article_420273.html

Detail API

http://journals.ekb.eg?_action=service&article_code=420273

Order

420,273

Type

Review Article

Type Code

3,383

Publication Type

Journal

Publication Title

Advanced Sciences and Technology Journal

Publication Link

https://astj.journals.ekb.eg/

MainTitle

Advancing Creativity: A Comprehensive Review of AI-Driven Text-to-Image Generation and Its Applications

Details

Type

Article

Created At

09 Apr 2025