Persistent Homology Analysis of AI-Generated Fractal Patterns: A Mathematical Framework for Evaluating Geometric Authenticity

Lee, Minhyeok; Lee, Soyeon

doi:10.3390/fractalfract8120731

Open AccessArticle

Persistent Homology Analysis of AI-Generated Fractal Patterns: A Mathematical Framework for Evaluating Geometric Authenticity

by

Minhyeok Lee

^1,2,*

and

Soyeon Lee

²

¹

School of Electrical and Electronics Engineering, Chung-Ang University, Seoul 06974, Republic of Korea

²

Department of Intelligent Semiconductor Engineering, Chung-Ang University, Seoul 06974, Republic of Korea

^*

Author to whom correspondence should be addressed.

Fractal Fract. 2024, 8(12), 731; https://doi.org/10.3390/fractalfract8120731

Submission received: 16 November 2024 / Revised: 9 December 2024 / Accepted: 11 December 2024 / Published: 13 December 2024

(This article belongs to the Special Issue Advances in Pattern Recognition—Image and Time Series Analyses—through Fractal Geometry and Complexity Theory)

Download

Browse Figures

Versions Notes

Abstract

We present a mathematical framework for analyzing fractal patterns in AI-generated images using persistent homology. Given a text-to-image mapping

M : T \to I

, we demonstrate that the persistent homology groups

H_{k} (t)

of sublevel set filtrations

{f^{- 1} ((- \infty, t])}_{t \in R}

characterize multi-scale geometric structures, where

f : M (p) \to R

is the grayscale intensity function of a generated image. The primary challenge lies in quantifying self-similarity in scales, which we address by analyzing birth–death pairs

(b_{i}, d_{i})

in the persistence diagram

P D (M (p))

. Our contribution extends beyond applying the stability theorem to AI-generated fractals; we establish how the self-similarity inherent in fractal patterns manifests in the persistence diagrams of generated images. We validate our approach using the Stable Diffusion 3.5 model for four fractal categories: ferns, trees, spirals, and crystals. An analysis of guidance scale effects

γ \in [4.0, 8.0]

reveals monotonic relationships between model parameters and topological features. Stability testing confirms robustness under noise perturbations

η \leq 0.2

, with feature count variations

Δ μ_{f} < 0.5

. Our framework provides a foundation for enhancing generative models and evaluating their geometric fidelity in fractal pattern synthesis.

Keywords:

fractal dimension analysis; persistent homology; topological data analysis; text-to-image synthesis; generative models; box-counting dimension; pattern synthesis; computational topology

1. Introduction

The mathematical analysis of generative models presents a fundamental challenge in computational topology [1], particularly in the context of text-to-image synthesis [2,3], where geometric characterization remains an open problem. Let

(T, d_{T})

and

(I, d_{I})

be metric spaces representing the text and image domains, respectively. Recent advances in latent diffusion models [4,5] have established continuous mappings

M : T \to I

that generate high-fidelity visual content [6,7,8]. Of particular mathematical interest are natural fractal patterns characterized by self-similarity in multiple scales [9,10]. These patterns arise from iterated function systems

{f_{i} : R^{n} \to R^{n}}_{i = 1}^{m}

satisfying the fixed-point equation

F = ⋃_{i = 1}^{m} f_{i} (F)

for some compact set

F \subset R^{n}

. The analysis of such patterns through their topological invariants [11,12] provides a framework for evaluating the geometric fidelity of generative models since their recursive structure admits mathematical characterization through persistent homology [13,14].

The importance of this research lies in its potential to enhance our understanding of the geometric properties of AI-generated images [15], particularly those exhibiting fractal-like structures. By developing robust mathematical tools for analyzing these patterns [16,17], we can improve the evaluation and optimization of generative models [18], leading to more accurate and controllable image synthesis. This work is relevant not only to computer vision and machine learning but also to fields such as computational geometry, topological data analysis [19,20], and applied mathematics [21,22,23].

Practical applications of evaluating the geometric authenticity of fractal-like structures arise in domains such as remote sensing, where accurate characterization of landscape patterns can guide land-use planning [24,25,26], or in biomedical imaging, where distinguishing fractal tumor growth patterns from benign structures can inform diagnostic decisions [27]. Furthermore, fields such as computer graphics and virtual reality benefit from improved fractal synthesis to improve the realism of generated environments [28]. Building on established topological analysis approaches [29], our framework may assist in verifying the authenticity of patterns in heritage preservation imaging or to scrutinize the fidelity of AI-generated art, ensuring that generative models produce visually coherent and scientifically consistent fractal motifs.

Our goal is to establish a comprehensive mathematical framework for characterizing fractal patterns in AI-generated images using persistent homology. This framework aims to provide quantitative measures that capture the multi-scale geometric properties of generated patterns, overcoming the limitations of traditional metrics based on pixel-wise comparisons or perceptual similarity functions [30]. The challenge lies in developing stability guarantees for these geometric measurements, given the high-dimensional nature of the generation process

M (p)

for

p \in T

.

We address these challenges through a systematic application of persistent homology to AI-generated fractal patterns. For a generated image

g = M (p)

, we analyze its grayscale intensity function

f : g \to R

through the lens of persistent homology, examining the filtration

{f^{- 1} ((- \infty, t])}_{t \in R}

and its associated homology groups

H_{k} (t)

. This approach enables the characterization of geometric features on multiple scales using the persistence diagram

P D (g)

, which records the birth and death times

(b_{i}, d_{i})

of topological features. Our main theoretical contribution lies in establishing how the self-similarity inherent in fractal patterns manifests itself in the persistence diagrams of AI-generated images. We demonstrate that under certain conditions, the scaling behavior of topological features in the persistence diagrams corresponds to the self-similar structure of the fractals.

To validate our solution, we performed extensive experiments using the Stable Diffusion 3.5 model [31] in four fractal categories: ferns, trees, spirals, and crystals. Our results demonstrate a statistically significant differentiation of pattern types through dimension 1 persistence features. This work establishes persistent homology as a mathematical tool for analyzing geometric properties of AI-generated patterns, with implications for model evaluation and optimization. The framework enables systematic assessment of geometric fidelity through well-defined topological measurements, complementing existing evaluation protocols for synthetic imagery.

In this work, we present a mathematical framework that leverages persistent homology to characterize fractal patterns in AI-generated images. Our key contributions are as follows: (1) We establish a direct connection between self-similarity in fractal structures and their topological signatures captured via persistence diagrams. (2) We provide a stability analysis that ensures the robustness of these topological features under variations in model parameters and noise. (3) We demonstrate how textual prompts influence fractal complexity, illustrating that linguistic nuances can significantly alter the geometric authenticity of generated patterns. (4) We introduce a methodology that can serve as a topological diagnostic tool for the broader AI and computational geometry communities, offering a new perspective on evaluating generative models and their outputs.

2. Related Work

2.1. Topological Data Analysis in Image Processing

Topological data analysis (TDA) has emerged as a powerful tool for analyzing complex data structures, with particular applications in image processing. The stability theory of persistence diagrams, established by Cohen-Steiner et al. [32], provides fundamental guarantees for the robustness of topological features. For tame functions

f, g : X \to R

on a topological space X, the stability theorem bounds the bottleneck distance between persistence diagrams:

d_{B} (Dgm (f), Dgm (g)) \leq {∥ f - g ∥}_{\infty}

(1)

This result has been extended to

L_{p}

-stability [33], broadening the framework for comparing persistence diagrams. Further generalizations by Patel [34] have expanded the applicability of these concepts in computational topology. Although these works provide a solid foundation for topological analysis, they do not specifically address the challenges of analyzing AI-generated fractal patterns, which is the focus of our study.

In image analysis, persistent homology has been applied to various tasks, including segmentation. Clough et al. [35] introduced a topological loss function

L_{topo}

incorporating persistence-based features:

L_{topo} = \sum_{k = 0}^{d} \sum_{(b, d) \in {Dgm}_{k} (f)} ϕ (d - b)

(2)

where

ϕ

weights long-lived topological features. In contrast to this general approach, our framework specifically addresses the characterization of self-similarity in scales in fractal patterns, providing a novel perspective on the analysis of AI-generated images.

2.2. Fractal Analysis of Natural Patterns

The mathematical foundations of fractal geometry, pioneered by Mandelbrot [36], have revealed the ubiquity of fractal structures in nature. The fractal dimension D of a set F, defined through the box-counting method, is given by:

D = lim_{ϵ \to 0} \frac{log N (ϵ)}{log (1 / ϵ)}

(3)

where

N (ϵ)

is the number of boxes of side length

ϵ

needed to cover F. This concept has been applied to analyze various natural phenomena, from non-linear time series [37] to texture patterns. However, these traditional approaches often lack the ability to capture the multi-scale topological features inherent in fractal structures.

Our work extends these approaches by establishing a connection between the fractal dimensions and persistent homology. We provide a multi-scale topological perspective on fractal patterns, relating the persistence diagrams of generated fractals to their theoretical scaling properties. This approach bridges the gap between fractal geometry and topological data analysis, offering a novel framework for characterizing complex geometric structures in AI-generated images.

2.3. Text-to-Image Generation Models

Recent advancements in text-to-image synthesis have led to sophisticated architectures based on diffusion models. The generation process can be formalized as a mapping

G : T \times Z \to X

, where

T

,

Z

, and

X

denote the text embedding, latent, and image spaces, respectively.

Podell et al. [38] introduced a multi-scale framework with dual UNet backbones

U_{1}, U_{2} : X \times T \to X

operating at different resolutions. The composition

U_{2} \circ U_{1}

enables high-fidelity synthesis through cascaded refinements. This approach was extended to video generation by Blattmann et al. [39], generalizing the process to temporal evolution

G_{t} : T \times Z \times [0, T] \to X

. While these models have shown impressive results in generating complex images, they lack a mathematical framework to analyze the geometric properties of the generated patterns.

Efficiency-focused architectures, such as those proposed by Xie et al. [40], have reformulated the generation process using linear operators

L : X \to X

that approximate non-linear diffusion steps:

x_{t - 1} = L (x_{t}, t) + σ_{t} ϵ, ϵ \sim N (0, I)

(4)

where

σ_{t}

is a time-dependent noise scale. In contrast, Gu et al. [41] introduced nested denoising autoencoders

{E_{k} \circ D_{k}}_{k = 1}^{K}

operating at multiple scales simultaneously, allowing for hierarchical generation of fractal-like structures. These approaches, while efficient, do not provide a direct means of quantifying the geometric authenticity of the generated patterns.

In addition to topological and fractal-based analyses, other image synthesis and evaluation frameworks incorporate domain-specific properties. For example, recent work in underwater image generation leverages special optical characteristics of water to improve the fidelity of generated scenes [42]. Similarly, in satellite imagery, self-training methods grounded in geometrical constraints have been proposed to bridge domain gaps, facilitating improved pose estimation and reducing annotation costs [43]. While these approaches focus on specific physical or geometrical constraints, our method offers a more general, topological perspective, complementing such domain-tailored solutions by providing a universal framework to assess the inherent geometric authenticity of AI-generated fractal patterns.

Our work complements these generative approaches by providing a topological framework for analyzing the geometric properties of generated patterns. We focus particularly on fractal characteristics and multi-scale structure, offering a novel perspective on the evaluation and understanding of text-to-image generation models.

2.4. Persistent Homology in Machine Learning

The application of persistent homology to machine learning tasks has gained traction in recent years. Horak et al. [44] proposed a topology-based approach to evaluate generative adversarial networks (GANs), using persistence diagrams to compare the topological structure of real and generated data distributions. Their method, while innovative, does not specifically address the challenges of fractal pattern analysis in AI-generated images.

Alipourjeddi and Miri [45] extended this line of research by developing a topological approach to evaluate GANs, focusing on the stability and quality of the generated samples. However, their work does not delve into the specific challenges posed by fractal patterns or the multi-scale nature of such structures.

Our approach differs from those of these previous works in several key aspects. First, we focus specifically on the analysis of fractal patterns in AI-generated images, leveraging the self-similarity properties inherent in these structures. Second, we establish a direct connection between the persistence diagrams and the theoretical scaling properties of fractals, providing a more comprehensive framework for evaluating geometric authenticity. Lastly, our method is tailored to the unique challenges posed by text-to-image generation models, offering insights into how textual descriptions influence the topological features of the resulting images.

3. Background

3.1. Text-to-Image Generation Models

Text-to-image generation models operate on probability spaces

(T, Σ_{T}, μ_{T})

and

(I, Σ_{I}, μ_{I})

representing the prompt and image spaces, respectively. The generation process is characterized by a measurable mapping

M : T \to I

that preserves the underlying probabilistic structure. We focus on the mapping

M : T \to I

as it pertains to generating images for analysis via persistent homology.

Definition 1.

A forward diffusion process on the image space

I \subseteq R^{n}

is defined as a sequence of latent variables

{x_{t}}_{t = 0}^{T}

where

x_{0} \sim q (x_{0})

(the data distribution) and for

t = 1, \dots, T

:

q (x_{t} | x_{t - 1}) = N (x_{t}; \sqrt{1 - β_{t}} x_{t - 1}, β_{t} I)

(5)

Here,

β_{t}

is a predefined variance schedule, and

N

denotes the normal distribution.

3.2. Persistent Homology

Persistent homology provides a framework for analyzing topological features on multiple scales. We formalize key concepts in this context.

Definition 2.

A filtration of a topological space X is a family

{X_{t}}_{t \in R}

of subspaces such that

X_{s} \subseteq X_{t}

for all

s \leq t

. For a continuous function,

f : X \to R

, the sublevel set filtration is defined as

X_{t} = f^{- 1} ((- \infty, t])

.

Definition 3.

A persistence module is a family of vector spaces

{V_{t}}_{t \in R}

with homomorphisms

v_{s}^{t} : V_{s} \to V_{t}

for

s \leq t

satisfying:

v_{t}^{u} \circ v_{s}^{t} = v_{s}^{u} for all s \leq t \leq u

(6)

Theorem 1

(Structure Theorem). Every pointwise finite-dimensional persistence module V decomposes uniquely as a direct sum of interval modules:

V ≅ ⨁_{i \in I} I [b_{i}, d_{i}]

(7)

where

I [b, d]

denotes the interval module supported on

[b, d]

.

Definition 4.

For a continuous function

f : X \to R

on a topological space X, the persistence diagram

P D (f)

is the multiset of points

(b, d) \in {\bar{R}}^{2}

corresponding to the interval summands in the decomposition of the persistence module

H_{*} {(f^{- 1} ((- \infty, t]))}_{t \in R}

.

3.3. Fractal Geometry and Natural Patterns

Fractal geometry provides the mathematical foundation for analyzing patterns exhibiting self-similarity on scales, which is characteristic of many natural phenomena. An iterated function system (IFS) is a method for constructing such fractals by repeatedly applying a set of contractive mappings. The Hutchinson–Barnsley theorem offers a way to calculate the fractal (Hausdorff) dimension of the set generated by an IFS, which is essential in understanding the complexity of fractal structures.

Definition 5.

An iterated function system (IFS) is a finite set of contractive mappings

{f_{i}}_{i = 1}^{m}

on a complete metric space

(X, d)

. The attractor of an IFS is the unique compact set F satisfying

F = ⋃_{i = 1}^{m} f_{i} (F)

.

Theorem 2

(Hutchinson–Barnsley). For an IFS

{f_{i}}_{i = 1}^{m}

with contraction ratios

{r_{i}}_{i = 1}^{m}

, the Hausdorff dimension D of the attractor F satisfies:

\sum_{i = 1}^{m} r_{i}^{D} = 1

(8)

Definition 6.

For a bounded set

F \subset R^{n}

, let

N_{ϵ} (F)

be the minimum number of boxes of side length ϵ needed to cover F. The box-counting dimension is:

{dim}_{B} (F) = lim_{ϵ \to 0} \frac{log N_{ϵ} (F)}{- log ϵ}

(9)

when this limit exists.

4. Method

4.1. Theoretical Framework

Persistent homology can be understood as a tool that tracks the formation and disappearance of features (e.g., distinct “islands” of intensity or “holes”) within an image as we gradually adjust a grayscale threshold. Imagine starting from a completely dark image and slowly “flooding” it with brightness: connected bright regions emerge, merge, and sometimes enclose voids. Persistent homology quantifies these changes, encoding the life-span of topological features in different intensity levels. This perspective allows non-specialist readers to grasp the core idea without being overwhelmed by formal definitions and equations.

We begin by formalizing the text-to-image generation process in the context of fractal pattern analysis. Let T denote the space of text prompts and I the space of images. We consider a text-to-image mapping

M : T \to I

that generates images from textual descriptions.

Assumption 1.

For any prompt

p \in T

that describes natural fractal patterns, the generated image

g = M (p)

admits a grayscale intensity function

f : g \to R

.

This assumption allows us to analyze the topological properties of the generated images through their grayscale representations. While we reduce each generated color image to a grayscale intensity function for topological analysis, this simplification may overlook structural information encoded in color channels. Complex fractal patterns could manifest differently in hue or saturation, and subtle chromatic variations might not translate to grayscale intensity. Consequently, certain geometrically meaningful features that are discernible only through color contrasts might remain undetected. Future work could incorporate multi-channel or color-based filtrations to capture richer geometric information and overcome this limitation. We construct a filtration based on the sublevel sets of f:

Definition 7.

Let

F_{t} = f^{- 1} ((- \infty, t])

denote the sublevel sets of f. The filtration

{F_{t}}_{t \in R}

induces a sequence of homology groups:

H_{k} (F_{t_{1}}) \to H_{k} (F_{t_{2}}) \to \dots \to H_{k} (F_{t_{n}})

(10)

where

H_{k}

denotes the k-th homology group.

This filtration captures the evolution of topological features as we vary the threshold t, providing a multi-scale perspective on the image’s structure.

4.2. Main Results and Proofs

Our main result establishes a connection between the fractal nature of the generated images and their persistence diagrams. We begin with a key assumption and lemma that characterize the behavior of topological features under scaling transformations.

Assumption 2.

Let

F \subset R^{n}

be a self-similar fractal set generated by an iterated function system (IFS)

{f_{i}}_{i = 1}^{m}

that satisfies the open set condition with a uniform contraction ratio

r \in (0, 1)

. We assume that for each

ϵ > 0

, the sublevel sets

U_{ϵ} = {x \in R^{n} ∣ d_{F} (x) \leq ϵ}

satisfy the following homological self-similarity property:

H_{k} (U_{ϵ}) ≅ H_{k} (U_{r ϵ}) for all k \geq 0,

(11)

where

H_{k}

denotes the k-th homology group, and ≅ denotes isomorphism. This implies that the homological features of

U_{ϵ}

replicate at scales related by powers of r, reflecting the fractal’s self-similar structure.

Lemma 1.

Let

F \subset R^{n}

be a self-similar fractal set generated by an iterated function system (IFS)

{f_{i}}_{i = 1}^{m}

that satisfies the open set condition with a uniform contraction ratio

r \in (0, 1)

. Let

d_{F} : R^{n} \to R_{\geq 0}

be the distance function defined by

d_{F} (x) = {inf}_{y \in F} ∥ x - y ∥

. Then:

The function $d_{F}$ is 1-Lipschitz continuous.
The k-th persistence diagram ${PD}_{k} (d_{F})$ exhibits patterns corresponding to the self-similarity of F, with features appearing at scales related by powers of r.

Proof.

For any

x, y \in R^{n}

:

| d_{F} (x) - d_{F} (y) | = |inf_{z \in F} ∥ x - z ∥ - inf_{w \in F} ∥ y - w ∥| \leq sup_{z \in F} | ∥ x - z ∥ - ∥ y - z ∥ | \leq ∥ x - y ∥ .

(12)

This inequality establishes that

d_{F}

is 1-Lipschitz continuous.

Consider the sublevel sets

U_{ϵ} = {x \in R^{n} ∣ d_{F} (x) \leq ϵ}

. Due to self-similarity,

U_{ϵ}

contains scaled copies of

U_{ϵ / r^{m}}

for

m \in N

. As

ϵ

varies, the topological features (e.g., connected components, holes) in

U_{ϵ}

replicate at scales

ϵ, r ϵ, r^{2} ϵ, \dots

. This scaling manifests itself in the persistence diagram

{PD}_{k} (d_{F})

as repeated patterns of birth and death times at logarithmically spaced intervals, reflecting the fractal’s self-similarity. □

Building on this lemma, we can now state our main theorem characterizing the persistence structure of fractal patterns in AI-generated images.

Assumption 3.

Assume there exists a continuous, bijective mapping

ϕ : dom (f_{g}) \to dom (f_{F})

such that the pullback function

f_{g}^{ϕ} = f_{g} \circ ϕ^{- 1}

is well-defined on the domain of

f_{F}

. Furthermore, we assume that the supremum norm of the difference between

f_{g}^{ϕ}

and

f_{F}

is bounded by a constant

δ > 0

, i.e.,

∥ f_{g}^{ϕ} - f_{F} ∥_{\infty} \leq δ .

(13)

This allows us to consider

f_{g}^{ϕ}

and

f_{F}

as functions in a common domain

X \subset R^{n}

, facilitating the application of the stability theorem for persistence diagrams.

Theorem 3.

Let

p \in T

be a prompt that describes a natural fractal pattern, and let

g = M (p)

be the image generated by the text-to-image model M. Let

f_{g} : g \to R

be the grayscale intensity function of g, and let

f_{F} : F \to R

represent a similar function for the ideal fractal

F \subset R^{n}

. Suppose

f_{g}

and

f_{F}

are extended to functions in a common domain

X \subset R^{n}

by embedding both the generated image g and the ideal fractal F in a shared coordinate space. If

∥ f_{g} - f_{F} ∥_{\infty} \leq δ

for some

δ > 0

, then for each

k \geq 0

, the bottleneck distance between their persistence diagrams satisfies:

d_{B} ({PD}_{k} (f_{g}), {PD}_{k} (f_{F})) \leq δ .

(14)

Proof.

Given that

∥ f_{g} - f_{F} ∥_{\infty} \leq δ

, it follows directly from the stability theorem for persistence diagrams (Theorem 4) that:

d_{B} ({PD}_{k} (f_{g}), {PD}_{k} (f_{F})) \leq {∥ f_{g} - f_{F} ∥}_{\infty} \leq δ .

(15)

This inequality holds because the bottleneck distance between the persistence diagrams of two functions is bounded above by the

L^{\infty}

norm of their difference, provided that the functions are defined on the same domain and meet the necessary conditions of tameness and continuity. □

4.3. Algorithmic Implementation

The computation of persistence diagrams from generated images involves transforming discrete grayscale data into a filtered simplicial complex suitable for topological analysis. The process consists of the following steps:

The grayscale image

f : [0, h] \times [0, w] \to [0, 1]

is represented as a 2D grid of pixels with intensity values

f_{i, j}

, where

i = 1, \dots, h

and

j = 1, \dots, w

.

Each pixel corresponds to a vertex

v_{i, j}

in a simplicial complex K. The vertices inherit the grayscale intensity values from the pixels, i.e.,

f_{K} (v_{i, j}) = f_{i, j}

.

Edges are formed between adjacent pixels (horizontally and vertically), potentially including diagonals if 8-connectivity is desired. For example, an edge

e = [v_{i, j}, v_{i + 1, j}]

is included if

| f_{i, j} - f_{i + 1, j} |

is below a threshold or unconditionally to capture adjacency.

Higher-dimensional simplices (triangles and tetrahedra) can be formed by connecting three or more adjacent vertices. The function

f_{K}

is extended to simplices by taking the maximum of the vertex values:

f_{K} (σ) = max_{v \in σ} f_{K} (v) .

(16)

The simplicial complex K is filtered by the function

f_{K}

. Filtration

{K_{t}}_{t \in [0, 1]}

is defined by including all simplices

σ

with

f_{K} (σ) \leq t

:

K_{t} = {σ \in K ∣ f_{K} (σ) \leq t} .

(17)

This filtration captures how topological features emerge and disappear as the intensity threshold t increases from 0 to 1.

Persistent homology is computed using standard algorithms, such as matrix reduction methods on finite fields (typically

Z_{2}

). Software libraries such as Ripser or GUDHI can efficiently handle these computations. The persistence pairs

(b_{i}, d_{i})

are extracted from the reduced boundary matrix, representing the birth and death times of the topological features.

The resulting persistence diagrams

{PD}_{k} (f)

for

k = 0, 1

provide insight into the topological structure of the image at different intensity thresholds. The features in

{PD}_{0} (f)

correspond to connected components, while those in

{PD}_{1} (f)

represent loops or holes.

4.4. Stability Analysis

The stability of persistence diagrams is crucial for the robustness of our analysis. To apply the stability theorem effectively, we must verify that our functions satisfy the required conditions.

Definition 8.

A function

f : X \to R

defined on a triangulable compact metric space X is termed tame if:

It is continuous.
It has a finite number of critical values.
For all $t \in R$ , the sublevel sets $X_{t} = f^{- 1} ((- \infty, t])$ have finite-dimensional homology groups.

In the context of this paper, the space X corresponds to the pixel grid of the image, which is compact and can be triangulated using simplices formed from adjacent pixels. The grayscale intensity function f is inherently discrete due to the finite resolution of digital images, making it continuous in a discrete sense.

Although f is defined on a discrete grid, we can consider it as a piecewise constant function extended over each pixel, which is a small square in

R^{2}

. Within each pixel, f is constant, and in adjacent pixels, the change in f is finite. This allows us to treat f as a continuous function on X.

Since the grayscale values are quantized (typically in 256 levels for an 8-bit image), f takes on a finite number of values. Therefore, the number of critical values that determine the intensity levels at which topological changes occur is finite. In each sublevel set

X_{t}

, the simplicial complex constructed from pixels with intensity less than or equal to t is finite, as there are a finite number of pixels. Consequently, the homology groups

H_{k} (X_{t})

are finite-dimensional for all k.

Given that f satisfies these conditions, we can apply the stability theorem for persistence diagrams:

Theorem 4

(Stability Theorem). For tame functions

f, g : X \to R

, the bottleneck distance between their k-th persistence diagrams satisfies:

d_{B} ({PD}_{k} (f), {PD}_{k} (g)) \leq {∥ f - g ∥}_{\infty} .

(18)

This theorem ensures that small changes in the function f (measured by the

L^{\infty}

norm) lead to small changes in persistence diagrams. In our analysis, this means that perturbations in the grayscale intensity due to noise or variations in the generative process will not significantly affect the topological features captured by persistent homology, provided that the perturbations are bounded.

We provide here a brief summary of its central assumptions and reasoning. The theorem assumes that f and g are tame functions defined on a common domain, ensuring finite and well-behaved persistence diagrams. Under these conditions, the bottleneck distance between the diagrams cannot exceed the

L^{\infty}

norm of

f - g

. Intuitively, this result follows from the idea that small pointwise perturbations in the function lead to correspondingly small shifts in the birth and death times of topological features. While the full proof involves careful measure-theoretic arguments and stability properties of sublevel sets, the key takeaway is that topological summaries remain stable under bounded perturbations, providing a foundational guarantee for our analysis.

Theorem 5.

Let

f, g : X \to R

be tame Lipschitz functions defined on a compact metric space X. For each

k \geq 0

, the difference in the total persistence

μ_{k}

of their k-th persistence diagrams satisfies:

| μ_{k} ({PD}_{k} (f)) - μ_{k} ({PD}_{k} (g)) | \leq 2 \cdot d_{B} ({PD}_{k} (f), {PD}_{k} (g)) \cdot N_{k}

(19)

where

N_{k}

is the number of points (including multiplicities) in

{PD}_{k} (f)

or

{PD}_{k} (g)

, whichever is greater.

Proof.

Let

γ : {PD}_{k} (f) \to {PD}_{k} (g)

be an optimal match that realizes the bottleneck distance

d_{B} ({PD}_{k} (f), {PD}_{k} (g))

. For any

(b_{i}, d_{i}) \in {PD}_{k} (f)

matched to

(b_{i}^{'}, d_{i}^{'}) = γ (b_{i}, d_{i})

:

| (d_{i} - b_{i}) - (d_{i}^{'} - b_{i}^{'}) | \leq | d_{i} - d_{i}^{'} | + | b_{i} - b_{i}^{'} | \leq 2 d_{B} ({PD}_{k} (f), {PD}_{k} (g))

(20)

The total persistence difference can be bounded:

| μ_{k} ({PD}_{k} (f)) - μ_{k} ({PD}_{k} (g)) | = |\sum_{(b_{i}, d_{i})} (d_{i} - b_{i}) - \sum_{(b_{i}^{'}, d_{i}^{'})} (d_{i}^{'} - b_{i}^{'})|

(21)

\leq \sum_{(b_{i}, d_{i})} | (d_{i} - b_{i}) - (d_{i}^{'} - b_{i}^{'}) | \leq 2 d_{B} ({PD}_{k} (f), {PD}_{k} (g)) \cdot N_{k}

(22)

□

These stability results ensure that our analysis is robust to small perturbations in the input images, providing a solid foundation for comparing and analyzing persistence diagrams of AI-generated fractal patterns.

5. Experimental Settings

Our experimental framework quantifies the topological properties of AI-generated fractal patterns using the Stable Diffusion 3.5 model [31]. Let

C

= fern, tree, spiral, crystal denote our set of fractal categories, where each category

c \in C

represents natural patterns with inherent self-similarity properties. For each category

c \in C

, we define a prompt set

P_{c} = {p_{1}^{c}, \dots, p_{4}^{c}}

where each

p_{i}^{c}

is constructed to capture specific mathematical aspects of fractal geometry. For instance,

p_{1}^{fern} = “ detailed fern leaf with intricate fractal patterns ”

and

p_{2}^{fern} = “ mathematical fern fractal with precise structure ”

. The prompt space

P = ⋃_{c \in C} P_{c}

forms a discrete metric space under the Levenshtein distance

d_{L} : P \times P \to R_{\geq 0}

, enabling quantitative comparison of textual descriptions.

Our study utilizes the Stable Diffusion 3.5 model obtained through the Hugging Face Diffusers library, which provides a convenient interface for text-to-image generation. This model is a multimodal diffusion transformer trained on large-scale image-text datasets and equipped with multiple text encoders (such as CLIP and T5). By conditioning the latent diffusion process on textual prompts, the model produces high-fidelity images that can reflect complex concepts and stylistic nuances. Although details of the full architecture and training protocols are proprietary to Stability AI, the stable-diffusion-3.5-medium checkpoint incorporates improvements over previous versions, including enhanced multi-resolution capabilities and QK-normalization for training stability. We used StableDiffusion3Pipeline from Hugging Face Diffusers to load the model and run the generation locally on GPU hardware.

The experimental pipeline formalizes the image generation and analysis process as follows. Let

M : P \to I

denote our text-to-image mapping from the prompt space P to the image space

I \subset {[0, 1]}^{h \times w \times 3}

, where

h, w \in {128, 256, 512}

specify the image dimensions. For each prompt

p \in P

, we generate an image

g = M (p)

with guidance scale

γ \in Γ = {4.0, 5.0, 6.0, 7.0, 8.0}

. To ensure reproducibility, we employ a deterministic seed mechanism

s \in S = {1, \dots, 50}

, which produces the generation function

G : P \times S \times Γ \to I

defined by:

G (p, s, γ) = M_{γ} (p; ξ_{s})

(23)

where

ξ_{s}

denotes the random state initialized by seed s. Each generated image undergoes a transformation

T : I \to {[0, 1]}^{h \times w}

to a grayscale intensity function via the rgb2gray operator:

T {(g)}_{i, j} = 0.2989 g_{i, j, 1} + 0.5870 g_{i, j, 2} + 0.1140 g_{i, j, 3}

(24)

Topological analysis employs the Ripser library for computing persistence diagrams. For each grayscale image f, we compute its persistence diagram

P D (f)

containing birth-death pairs

(b_{i}, d_{i})

that characterize the multi-scale topological features. The computation is performed in dimensions 0 and 1, capturing connected components and loops, respectively. We define three key statistical measures for each diagram

P D (f)

:

μ_{f} (P D (f)) = | P D (f) | (feature count)

(25)

μ_{p} (P D (f)) = \frac{1}{| P D (f) |} \sum_{(b_{i}, d_{i}) \in P D (f)} (d_{i} - b_{i}) (mean persistence)

(26)

σ_{p} (P D (f)) = \sqrt{\frac{1}{| P D (f) |} \sum_{(b_{i}, d_{i}) \in P D (f)} {((d_{i} - b_{i}) - μ_{p} (P D (f)))}^{2}} (std)

(27)

Each category incorporates four distinct prompt variations designed to explore different aspects of fractal generation. The variations range from mathematical descriptions (e.g., “Koch snowflake fractal”) to natural interpretations (e.g., “Natural fern with self-similar structure”), allowing us to evaluate how different textual formulations affect the topological characteristics of the generated patterns. These prompts were selected to maintain consistent geometric properties while varying the linguistic approach to description.

In creating our dataset, we thoughtfully chose four types of fractals (ferns, trees, spirals, and crystals) to capture a wide range of self-similar structures that match the theoretical framework outlined in Section 3 and Section 4. Each type includes prompts aimed at highlighting fractal traits such as recursive branching, scale invariance, and geometric complexity. By ensuring that text descriptions explicitly evoke fractal concepts, we construct a dataset that meets the mathematical criteria required for persistent homology analysis. This clear link between prompt design and theoretical fractal traits enables us to effectively use our topological approach and verify that the observed persistent features are deliberate and arise from the planned fractal elements of the generated images. (See Table 1).

Our experimental design incorporates three key analyses: impact on the guidance scale, noise robustness, and effects of prompt variation. The guidance scale analysis examines the values

γ \in {4.0, 5.0, 6.0, 7.0, 8.0}

to understand how the conditioning of the model affects the topological features. The noise robustness study introduces Gaussian noise levels

η \in {0.0, 0.05, 0.1, 0.15, 0.2}

to assess the stability of the characteristics. The prompt variation analysis explores four distinct prompt formulations per category to evaluate the consistency of topological signatures in different textual descriptions. This comprehensive framework enables a systematic evaluation of the relationship between the parameters of the generative model and the resulting topological features of fractal patterns.

As illustrated in Figure 1, the process of applying persistent homology to AI-generated fractal patterns involves several key steps, beginning with a text prompt that is transformed into a generated image by a text-to-image model. The resulting image is then converted to a grayscale intensity function and filtered to form sublevel sets, providing a structured input for topological analysis. After computing the persistent homology and extracting the associated persistence diagrams, we performed a detailed evaluation of the geometric authenticity and fractal properties inherent in the generated patterns.

6. Results

6.1. Preliminary Topological Analysis

Preliminary experimental analysis consisted of generating 20 images (5 each) in four fractal categories using the text-to-image model, with fixed image dimensions of 256 × 256 pixels. Figure 2 presents a representative subset of the generated patterns. The persistence diagrams were computed in dimensions 0 and 1, capturing the evolution of connected components and loops, respectively. The dimension 0 persistence analysis revealed consistent feature counts (

μ_{f} = 256.0

) in all categories, with infinite mean persistence and undefined standard deviation. This uniformity is attributed to the digital image representation, where each pixel forms a distinct connected component at the lowest intensity threshold. In contrast, the characteristics of dimension 1 exhibited statistically significant variations between categories, as evidenced by the measurements in Table 2.

The mean persistence values in dimension 1 provided quantitative support for our theoretical framework with respect to the complexity of the generated fractal patterns. Crystal structures exhibited the highest mean persistence (

μ_{p} = 19.33

), followed closely by ferns (

μ_{p} = 17.59

), while trees (

μ_{p} = 11.53

) and spirals (

μ_{p} = 7.37

) showed lower values. The standard deviation of persistence values revealed varying degrees of structural consistency, with spirals showing remarkable consistency (

σ_{p} = 1.07

) and trees exhibiting the highest variability (

σ_{p} = 4.37

).

These measurements align with the theoretical prediction that more complex fractal patterns generate more persistent topological features while also providing a quantitative measure of structural regularity. However, we acknowledge that factors such as model biases, prompt specificity, and image resolution may influence these outcomes.

6.2. Extended Study on Prompt Variations

An extended study was conducted to analyze the impact of prompt variations on topological features, with 160 images generated per category using four distinct prompt formulations. This comprehensive analysis revealed significant insights into the relationship between textual descriptions and the resulting topological structures of generated fractals. Table 3 presents the results for the fern category, illustrating how different prompt formulations affect the topological features.

The results demonstrate that prompt variations significantly influence the topological characteristics of the generated fractals. For instance, in the fern category, the “Barnsley” prompt yielded the highest mean persistence (

μ_{p} = 22.64

) and standard deviation (

σ_{p} = 8.74

), indicating more complex and variable structures. In contrast, the “Natural” prompt produced the lowest feature count (

μ_{f} = 51.58

) and mean persistence (

μ_{p} = 17.94

), suggesting simpler and more uniform patterns.

Similar patterns were observed in other categories. In the spiral category, logarithmic spiral prompts produced markedly different features (

μ_{f} = 175.75

,

μ_{p} = 32.18

) compared to other variations (

μ_{f} \in [40.28, 78.28]

,

μ_{p} \in [8.25, 13.17]

), suggesting a strong sensitivity to the specific mathematical terminology in the prompts. The Pythagoras tree prompts similarly generated distinctive features (

μ_{f} = 108.33

,

μ_{p} = 25.55

) compared to other tree variations.

These results provide empirical support for our theoretical framework, demonstrating that persistent homology effectively captures the underlying geometric structure of AI-generated fractal patterns while being sensitive to variations in textual descriptions. The significant differences observed in prompt variations highlight the importance of language in guiding the generation of specific fractal structures.

6.3. Impact of Guidance Scale

The analysis of guidance scale effects on topological features reveals systematic patterns in the range

γ \in {4.0, 5.0, 6.0, 7.0, 8.0}

. Figure 3 illustrates the relationship between the guidance scale and the mean persistence for each fractal category.

For the crystal category, the mean persistence exhibits monotonic growth from

μ_{p} = 15.49

at

γ = 4.0

to

μ_{p} = 21.43

at

γ = 8.0

, while feature counts decrease from

μ_{f} = 85.5

to

μ_{f} = 73.7

. The tree category demonstrates similar behavior, with mean persistence increasing from

μ_{p} = 13.71

to

μ_{p} = 18.19

as feature counts decrease from

μ_{f} = 106.8

to

μ_{f} = 98.0

. The spiral category maintains a consistent feature reduction from

μ_{f} = 97.1

to

μ_{f} = 79.1

with increasing persistence from

μ_{p} = 12.19

to

μ_{p} = 14.40

. The fern category exhibits the most stable response to guidance scaling, with persistence values ranging from

μ_{p} = 18.63

to

μ_{p} = 22.42

and feature counts varying between

μ_{f} = 82.2

and

μ_{f} = 74.6

.

These results suggest that higher guidance scales generally lead to more pronounced and persistent topological features, potentially at the cost of reduced feature diversity. This trade-off between feature persistence and count provides insight into the model’s behavior under different levels of prompt adherence. However, we note that the generative model’s interpretation of prompts and inherent limitations could impact the fidelity of the generated fractal patterns. (See Figure 4).

6.4. Noise Robustness Analysis

Noise robustness analysis quantifies the stability of topological features under Gaussian noise perturbations

η \in {0.0, 0.05, 0.1, 0.15, 0.2}

. Table 4 summarizes the results for each category at different noise levels.

The fern category maintains exceptional stability, with mean feature counts that vary only between

μ_{f} = 69.73

and

μ_{f} = 70.03

and persistence values that remain consistent at

μ_{p} = 20.11 \pm 0.01

at all noise levels. The crystal category exhibits similar robustness, with feature counts varying from

μ_{f} = 78.47

to

μ_{f} = 78.70

and persistence values that maintain

μ_{p} = 19.87 \pm 0.01

. The tree and spiral categories show comparable stability, with minimal variations in feature counts and persistence values across noise levels.

These quantitative results establish the robustness of persistent homology as a reliable tool for analyzing the geometric properties of AI-generated fractal patterns under varying noise conditions. The stability of topological features across different noise levels supports the theoretical predictions of the stability theorem for persistence diagrams, as discussed in Section 4.4.

Figure 5 provides a concrete visual example of the influence of prompt variations on the fractal-like structures generated by the model. By comparing multiple images across categories and prompt formulations, we observe that certain textual cues lead to more pronounced self-similar patterns or increased feature complexity. This visual evidence supports our quantitative findings, reinforcing the conclusion that the interplay between language and topological characteristics is pivotal in shaping the geometric authenticity of AI-generated fractal patterns.

7. Conclusions

Our analysis demonstrates that persistent homology effectively detects and quantifies fractal characteristics in AI-generated images, thus confirming the presence of self-similar structures. By examining variations in the guidance scale, we identify a monotonic relationship between the model’s parameters and the resulting topological complexity, indicating that parameter tuning can systematically influence geometric authenticity. Furthermore, noise robustness tests verify that the topological features remain stable under perturbations, underscoring the reliability and broad applicability of our approach. Finally, our comparative examination of different textual prompts reveals that persistent homology is sensitive to linguistic specificity, which substantially affects the complexity of the generated fractal patterns.

This work establishes a framework for analyzing fractal patterns in AI-generated images using persistent homology. We have shown that for a text-to-image mapping,

M : T \to I

, the persistent homology groups

H_{k} (t)

of sublevel set filtrations characterize the multi-scale structure in generated patterns. Our analysis demonstrates that for a fractal prompt

p \in T

, the persistence diagram

P D (M (p))

contains birth–death pairs

(b_{i}, d_{i})

that reflect the geometric properties of synthetic patterns.

Our experimental validation using the text-to-image model in four fractal categories (ferns, trees, spirals, and crystals) supports our theoretical framework. The analysis of the guidance scale

γ \in [4.0, 8.0]

reveals a relationship between

γ

and the mean persistence

μ_{p}

, with crystal patterns showing an increase from

μ_{p} = 15.49

at

γ = 4.0

to

μ_{p} = 21.43

at

γ = 8.0

. Concurrently, feature counts for crystals decrease from

μ_{f} = 85.5

to

μ_{f} = 73.7

. These results suggest that higher guidance scales generally lead to more pronounced but fewer topological features in generated fractals.

The stability of our approach is supported by a noise perturbation analysis, where feature counts remain

Δ μ_{f} < 0.5

at all noise levels

η \in [0, 0.2]

. Our analysis of prompt variations shows that topological signatures are sensitive to specific mathematical terminology. For example, logarithmic spiral prompts produce features with

μ_{f} = 175.75

and

μ_{p} = 32.18

, differing significantly from other variations (

μ_{f} \in [40.28, 78.28]

,

μ_{p} \in [8.25, 13.17]

). These findings suggest persistent homology as a potential metric for evaluating generative models in the context of geometric pattern synthesis.

We recognize the limitations inherent in our methodology, particularly the assumption of a close approximation between generated and ideal fractals, which may not consistently hold true. Furthermore, the current implementation is restricted to dimensions 0 and 1 of persistent homology, suggesting opportunities for investigation into higher-dimensional characteristics. Variables such as model biases, specificity of prompts, and image resolution might impact our results, necessitating future efforts to disentangle the effects of the fractal structure from these influences.

Beyond the immediate context of fractal image synthesis, our framework holds practical implications for the broader AI and computational geometry communities. In AI-driven content generation, persistent homology can serve as a diagnostic tool, allowing practitioners to verify whether the generated imagery adheres to the specified geometric or structural constraints. This topological perspective can complement existing evaluation metrics, offering a more intrinsic shape-based assessment. In computational geometry and related fields, our approach opens avenues for analyzing complex data through topology-based features, potentially informing shape analysis, 3D reconstruction tasks, and graph-based pattern recognition. By establishing a connection between theoretical fractal principles and empirical topological signatures, our method provides a valuable methodological template for others aiming to integrate topological data analysis into their workflows.

Future research directions include extending our framework to analyze temporal persistence in video generation models and developing topology-aware loss functions. Investigating prompt-geometry mappings and implementing parallel computation methods for higher-dimensional persistence modules could reveal more complex topological structures in generated fractals. Further studies could also explore the relationship between persistence diagrams and traditional fractal dimension measures, potentially leading to new insights into the geometric properties of AI-generated patterns.

Our future research plan includes several specific objectives. We aim to (1) investigate higher-dimensional homological features and their sensitivity to fractal prompt design, (2) integrate temporal analysis to examine how fractal patterns evolve over video sequences generated by advanced diffusion models, and (3) incorporate color-based or multi-channel filtrations to capture structural nuances absent in grayscale simplifications. Methodologically, we will explore topology-aware loss functions to guide generative models, experiment with prompt geometry mappings to link linguistic attributes with geometric outcomes and employ parallel computing strategies to handle the computational complexity of larger datasets and higher-dimensional features. We anticipate that these efforts will yield improved control over fractal authenticity, more interpretable topological descriptors, and a broader understanding of the interplay between generative modeling, language, and geometry.

Author Contributions

Conceptualization, M.L.; methodology, M.L.; software, S.L.; writing—original draft preparation, M.L.; writing—review and editing, M.L. and S.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) (RS-2024-00337250).

Data Availability Statement

We utilized the Stable Diffusion 3.5 medium model provided by Stability AI in our research. The model was employed for specific tasks related to text-to-image generation within the study. The data and models used, including the Stable Diffusion 3.5 medium model, are publicly available through Hugging Face at https://huggingface.co/stabilityai/stable-diffusion-3.5-medium (accessed on 1 November 2024).

Acknowledgments

For transparency and compliance with publisher guidelines, we acknowledge the use of an AI-powered grammar-checking tool for language refinement and ChatGPT 4o for code debugging purposes only. These tools were used strictly for technical assistance without any influence on the manuscript’s original content or conceptual framework.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Chen, M.; Wang, D.; Feng, S.; Zhang, Y. Topological Regularization for Representation Learning via Persistent Homology. Mathematics 2023, 11, 1008. [Google Scholar] [CrossRef]
Alhabeeb, S.K.; Al-Shargabi, A.A. Text-to-Image Synthesis with Generative Models: Methods, Datasets, Performance Metrics, Challenges, and Future Direction. IEEE Access 2024, 12, 24412–24427. [Google Scholar] [CrossRef]
Rauniyar, A.; Raj, A.; Kumar, A.; Kandu, A.K.; Singh, A.; Gupta, A. Text to image generator with latent diffusion models. In Proceedings of the 2023 International Conference on Computational Intelligence, Communication Technology and Networking (CICTN), Ghaziabad, India, 20–21 April 2023; pp. 144–148. [Google Scholar]
Rombach, R.; Blattmann, A.; Lorenz, D.; Esser, P.; Ommer, B. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 10684–10695. [Google Scholar]
Singh, J.; Gould, S.; Zheng, L. High-fidelity guided image synthesis with latent diffusion models. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17–24 June 2023; pp. 5997–6006. [Google Scholar]
Wang, W.; Jing, M.; Fan, Y.; Weng, W. PixRevive: Latent Feature Diffusion Model for Compressed Video Quality Enhancement. Sensors 2024, 24, 1907. [Google Scholar] [CrossRef]
Leiñena, J.; Saiz, F.A.; Barandiaran, I. Latent Diffusion Models to Enhance the Performance of Visual Defect Segmentation Networks in Steel Surface Inspection. Sensors 2024, 24, 6016. [Google Scholar] [CrossRef] [PubMed]
Osorio, P.; Jimenez-Perez, G.; Montalt-Tordera, J.; Hooge, J.; Duran-Ballester, G.; Singh, S.; Radbruch, M.; Bach, U.; Schroeder, S.; Siudak, K.; et al. Latent diffusion models with image-derived annotations for enhanced AI-assisted cancer diagnosis in histopathology. Diagnostics 2024, 14, 1442. [Google Scholar] [CrossRef]
Lee, M. Fractal Self-Similarity in Semantic Convergence: Gradient of Embedding Similarity across Transformer Layers. Fractal Fract. 2024, 8, 552. [Google Scholar] [CrossRef]
Lee, M. Fractal Analysis of GPT-2 Token Embedding Spaces: Stability and Evolution of Correlation Dimension. Fractal Fract. 2024, 8, 603. [Google Scholar] [CrossRef]
Oner, D.; Garin, A.; Koziński, M.; Hess, K.; Fua, P. Persistent homology with improved locality information for more effective delineation. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 10588–10595. [Google Scholar] [CrossRef]
Iuricich, F. Persistence cycles for visual exploration of persistent homology. IEEE Trans. Vis. Comput. Graph. 2021, 28, 4966–4979. [Google Scholar] [CrossRef]
Malott, N.O.; Wilsey, P.A. Fast computation of persistent homology with data reduction and data partitioning. In Proceedings of the 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 9–12 December 2019; pp. 880–889. [Google Scholar]
Hajij, M.; Wang, B.; Scheidegger, C.; Rosen, P. Visual detection of structural changes in time-varying graphs using persistent homology. In Proceedings of the 2018 IEEE Pacific Visualization Symposium (Pacificvis), Kobe, Japan, 10–13 April 2018; pp. 125–134. [Google Scholar]
Chu, D.T.; Bai, L.Y.; Huang, J.N.; Fang, Z.L.; Zhang, P.; Kang, W.; Ling, H.F. Enhanced Safety in Autonomous Driving: Integrating a Latent State Diffusion Model for End-to-End Navigation. Sensors 2024, 24, 5514. [Google Scholar] [CrossRef]
Lütgehetmann, D.; Govc, D.; Smith, J.P.; Levi, R. Computing persistent homology of directed flag complexes. Algorithms 2020, 13, 19. [Google Scholar] [CrossRef]
de Rose, S.; Meyer, P.; Bertrand, F. Human Body Shapes Anomaly Detection and Classification Using Persistent Homology. Algorithms 2023, 16, 161. [Google Scholar] [CrossRef]
Lee, M. Emergence of Self-Identity in AI: A Mathematical Framework and Empirical Study with Generative Large Language Models. arXiv 2024, arXiv:2411.18530. [Google Scholar]
Asaad, A.; Ali, D.; Majeed, T.; Rashid, R. Persistent homology for breast tumor classification using mammogram scans. Mathematics 2022, 10, 4039. [Google Scholar] [CrossRef]
Maršálek, R.; Zedka, R.; Zöchmann, E.; Vychodil, J.; Závorka, R.; Ghiaasi, G.; Blumenstein, J. Persistent homology approach for human presence detection from 60 GHz OTFS transmissions. Sensors 2023, 23, 2224. [Google Scholar] [CrossRef]
Branco, S.; Carvalho, J.G.; Reis, M.S.; Lopes, N.V.; Cabral, J. 0-Dimensional Persistent Homology Analysis Implementation in Resource-Scarce Embedded Systems. Sensors 2022, 22, 3657. [Google Scholar] [CrossRef]
Malek, A.A.; Alias, M.A.; Razak, F.A.; Noorani, M.S.M.; Mahmud, R.; Zulkepli, N.F.S. Persistent Homology-Based Machine Learning Method for Filtering and Classifying Mammographic Microcalcification Images in Early Cancer Detection. Cancers 2023, 15, 2606. [Google Scholar] [CrossRef]
Choe, S.; Ramanna, S. Cubical homology-based machine learning: An application in image classification. Axioms 2022, 11, 112. [Google Scholar] [CrossRef]
Wang, X.; Yuan, B.; Li, Z.; Wang, H. A Fractal Curve-Inspired Framework for Enhanced Semantic Segmentation of Remote Sensing Images. Sensors 2024, 24, 7159. [Google Scholar] [CrossRef]
Ji, Z.; Ziyu, L.; Angsheng, W.; Peng, C. An approach to extracting fractal in remote sensing image. Wuhan Univ. J. Nat. Sci. 2006, 11, 606–610. [Google Scholar] [CrossRef]
Sun, W.; Xu, G.; Gong, P.; Liang, S. Fractal analysis of remotely sensed images: A review of methods and applications. Int. J. Remote Sens. 2006, 27, 4963–4990. [Google Scholar] [CrossRef]
Kato, C.N.; Barra, S.G.; Tavares, N.P.; Amaral, T.M.; Brasileiro, C.B.; Mesquita, R.A.; Abreu, L.G. Use of fractal analysis in dental images: A systematic review. Dentomaxillofac. Radiol. 2020, 49, 20180457. [Google Scholar] [CrossRef] [PubMed]
Huang, J.; Zhou, Y.; Luo, Y.; Liu, G.; Guo, H.; Yang, G. Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures. In Proceedings of the 18th European Conference on Computer Vision, Milan, Italy, 29 September–4 October 2024; pp. 143–160. [Google Scholar]
Wasserman, L. Topological data analysis. Annu. Rev. Stat. Appl. 2018, 5, 501–532. [Google Scholar] [CrossRef]
Abdullahi, M.S.; Suratanee, A.; Piro, R.M.; Plaimas, K. Persistent Homology Identifies Pathways Associated with Hepatocellular Carcinoma from Peripheral Blood Samples. Mathematics 2024, 12, 725. [Google Scholar] [CrossRef]
Esser, P.; Kulal, S.; Blattmann, A.; Entezari, R.; Müller, J.; Saini, H.; Levi, Y.; Lorenz, D.; Sauer, A.; Boesel, F.; et al. Scaling rectified flow transformers for high-resolution image synthesis. In Proceedings of the Forty-First International Conference on Machine Learning, Vienna, Austria, 21–27 July 2024. [Google Scholar]
Cohen-Steiner, D.; Edelsbrunner, H.; Harer, J. Stability of Persistence Diagrams. Discret. Comput. Geom. 2005, 37, 103–120. [Google Scholar] [CrossRef]
Cohen-Steiner, D.; Edelsbrunner, H.; Harer, J.; Mileyko, Y. Lipschitz Functions Have Lp-Stable Persistence. Found. Comput. Math. 2010, 10, 127–139. [Google Scholar] [CrossRef]
Patel, A. Generalized persistence diagrams. J. Appl. Comput. Topol. 2016, 1, 397–419. [Google Scholar] [CrossRef]
Clough, J.; Byrne, N.; Oksuz, I.; Zimmer, V.; Schnabel, J.; King, A. A Topological Loss Function for Deep-Learning Based Image Segmentation Using Persistent Homology. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 44, 8766–8778. [Google Scholar] [CrossRef]
Mandelbrot, B.B.; Van Ness, J.W. Fractional Brownian Motions, Fractional Noises and Applications. SIAM Rev. 1968, 10, 422–437. [Google Scholar] [CrossRef]
la Torre, F.L.; GonzÃ¡lez-Trejo, J.; Real-RamÃ rez, C.; Hoyos-Reyes, L.F. Fractal dimension algorithms and their application to time series associated with natural phenomena. J. Phys. Conf. Ser. 2013, 475, 012002. [Google Scholar] [CrossRef]
Podell, D.; English, Z.; Lacey, K.; Blattmann, A.; Dockhorn, T.; Muller, J.; Penna, J.; Rombach, R. SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis. arXiv 2023, arXiv:2307.01952. [Google Scholar]
Blattmann, A.; Dockhorn, T.; Kulal, S.; Mendelevitch, D.; Kilian, M.; Lorenz, D. Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets. arXiv 2023, arXiv:2311.15127. [Google Scholar]
Xie, E.; Chen, J.; Chen, J.; Cai, H.; Lin, Y.; Zhang, Z.; Li, M.; Lu, Y.; Han, S. SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers. arXiv 2024, arXiv:2410.10629. [Google Scholar]
Gu, J.; Zhai, S.; Zhang, Y.; Susskind, J.M.; Jaitly, N. Matryoshka Diffusion Models. arXiv 2023, arXiv:2310.15111. [Google Scholar]
Yao, F.; Zhang, H.; Gong, Y. Difsg2-ccl: Image reconstruction based on special optical properties of water body. IEEE Photonics Technol. Lett. 2024, 36, 1417–1420. [Google Scholar] [CrossRef]
Wang, Z.; Chen, M.; Guo, Y.; Li, Z.; Yu, Q. Bridging the domain gap in satellite pose estimation: A self-training approach based on geometrical constraints. IEEE Trans. Aerosp. Electron. Syst. 2023, 60, 2500–2514. [Google Scholar] [CrossRef]
Horak, D.; Yu, S.; Salimi-Khorshidi, G. Topology distance: A topology-based approach for evaluating generative adversarial networks. Proc. AAAI Conf. Artif. Intell. 2021, 35, 7721–7728. [Google Scholar] [CrossRef]
Alipourjeddi, N.; Miri, A. Evaluating Generative Adversarial Networks: A Topological Approach. In Proceedings of the 2023 International Conference on Computing, Networking and Communications (ICNC), Honolulu, HI, USA, 20–22 February 2023; pp. 202–206. [Google Scholar]

$Fractalfract 08 00731 g001$

Figure 1. A conceptual flow diagram illustrating the steps for applying persistent homology to analyze AI-generated fractal patterns. Starting from text prompts, images are generated, converted to grayscale intensity functions, filtered to form a simplicial complex, and then analyzed to produce persistence diagrams.

$Fractalfract 08 00731 g001$

$Fractalfract 08 00731 g002$

Figure 2. Generated fractal patterns for the different categories. Each row represents one category (fern, tree, spiral, crystal) with four different examples, demonstrating the variety and consistency of the generated patterns. All images are 256 × 256 pixels.

$Fractalfract 08 00731 g002$

$Fractalfract 08 00731 g003$

Figure 3. Impact of guidance scale on topological features. The plot shows mean persistence values (dimension 1) in different guidance scales for each fractal category, with error bands indicating standard deviation.

$Fractalfract 08 00731 g003$

$Fractalfract 08 00731 g004$

Figure 4. Comparison of topological features in prompt variations. Bar heights represent mean persistence values for different prompt formulations within each category, with error bars showing standard deviation.

$Fractalfract 08 00731 g004$

$Fractalfract 08 00731 g005$

Figure 5. Visual comparison of generated fractal patterns. Each cell represents a single AI-generated image, arranged according to category, prompt index (

p_{n}

), and seed (

s_{n}

). The displayed grid (here an 8×8 subset) illustrates the variability and complexity across different textual descriptions, confirming that subtle linguistic changes influence the geometric authenticity and fractal complexity of the resulting images.

Figure 5. Visual comparison of generated fractal patterns. Each cell represents a single AI-generated image, arranged according to category, prompt index (

p_{n}

), and seed (

s_{n}

). The displayed grid (here an 8×8 subset) illustrates the variability and complexity across different textual descriptions, confirming that subtle linguistic changes influence the geometric authenticity and fractal complexity of the resulting images.

$Fractalfract 08 00731 g005$

Table 1. Prompt variations by fractal category.

Category	Prompt Variations
Fern	1. Detailed fern leaf with intricate fractal patterns 2. Mathematical Barnsley fern fractal 3. Natural fern with self-similar structure 4. Recursive fern pattern with multiple iterations
Tree	1. Fractal binary tree structure 2. Mathematical tree with recursive branching 3. Pythagoras tree fractal 4. Binary branching pattern with self-similarity
Spiral	1. Golden spiral with fractal elements 2. Logarithmic spiral pattern 3. Nautilus shell with recursive structure 4. Fibonacci spiral with self-similar details
Crystal	1. Snowflake with hexagonal symmetry 2. Ice crystal with recursive branching 3. Koch snowflake fractal 4. Crystalline growth pattern

Table 2. Dimension 1 features from preliminary analysis.

Category	Mean Features	Mean Persistence	Std Persistence
Fern	63.8	17.59	3.51
Tree	79.2	11.53	4.37
Spiral	37.4	7.37	1.07
Crystal	71.2	19.33	2.87

Table 3. Dimension 1 features by prompt variation—fern category.

Prompt	Mean Features	Mean Persistence	Std Persistence
Detailed	62.75	18.35	6.09
Mathematical	73.85	19.66	6.94
Natural	51.58	17.94	6.53
Barnsley	69.65	22.64	8.74

Table 4. Noise robustness analysis (dimension 1 features).

Category	Noise 0.00	Noise 0.05	Noise 0.10	Noise 0.15	Noise 0.20
Fern	69.73	69.80	69.87	70.03	69.87
Tree	97.93	97.97	97.70	98.23	98.17
Spiral	76.37	76.27	76.60	76.60	76.27
Crystal	78.47	78.30	78.33	78.43	78.70

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lee, M.; Lee, S. Persistent Homology Analysis of AI-Generated Fractal Patterns: A Mathematical Framework for Evaluating Geometric Authenticity. Fractal Fract. 2024, 8, 731. https://doi.org/10.3390/fractalfract8120731

AMA Style

Lee M, Lee S. Persistent Homology Analysis of AI-Generated Fractal Patterns: A Mathematical Framework for Evaluating Geometric Authenticity. Fractal and Fractional. 2024; 8(12):731. https://doi.org/10.3390/fractalfract8120731

Chicago/Turabian Style

Lee, Minhyeok, and Soyeon Lee. 2024. "Persistent Homology Analysis of AI-Generated Fractal Patterns: A Mathematical Framework for Evaluating Geometric Authenticity" Fractal and Fractional 8, no. 12: 731. https://doi.org/10.3390/fractalfract8120731

APA Style

Lee, M., & Lee, S. (2024). Persistent Homology Analysis of AI-Generated Fractal Patterns: A Mathematical Framework for Evaluating Geometric Authenticity. Fractal and Fractional, 8(12), 731. https://doi.org/10.3390/fractalfract8120731

Article Menu

Persistent Homology Analysis of AI-Generated Fractal Patterns: A Mathematical Framework for Evaluating Geometric Authenticity

Abstract

1. Introduction

2. Related Work

2.1. Topological Data Analysis in Image Processing

2.2. Fractal Analysis of Natural Patterns

2.3. Text-to-Image Generation Models

2.4. Persistent Homology in Machine Learning

3. Background

3.1. Text-to-Image Generation Models

3.2. Persistent Homology

3.3. Fractal Geometry and Natural Patterns

4. Method

4.1. Theoretical Framework

4.2. Main Results and Proofs

4.3. Algorithmic Implementation

4.4. Stability Analysis

5. Experimental Settings

6. Results

6.1. Preliminary Topological Analysis

6.2. Extended Study on Prompt Variations

6.3. Impact of Guidance Scale

6.4. Noise Robustness Analysis

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI