Next Article in Journal
Research on the Aerodynamic–Propulsion Coupling Characteristics of a Distributed Propulsion System
Previous Article in Journal
A Lightweight Received Signal Strength Indicator Estimation Model for Low-Power Internet of Things Devices in Constrained Indoor Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Fast Fourier Asymmetric Context Aggregation Network: A Controllable Photo-Realistic Clothing Image Synthesis Method Using Asymmetric Context Aggregation Mechanism

by
Haopeng Lei
1,†,
Ying Hu
1,†,
Mingwen Wang
1,*,
Meihai Ding
1,
Zhen Li
1 and
Guoliang Luo
2
1
The School of Computer Information Engineering, JiangXi Normal University, Nanchang 330022, China
2
Virtual Reality and Interactive Techniques Institute, East China Jiaotong University, 808 Changbei Shuanggang East Street, Nanchang 330013, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Appl. Sci. 2025, 15(7), 3534; https://doi.org/10.3390/app15073534
Submission received: 19 January 2025 / Revised: 4 March 2025 / Accepted: 5 March 2025 / Published: 24 March 2025
(This article belongs to the Section Computing and Artificial Intelligence)

Abstract

Clothing image synthesis has emerged as a crucial technology in the fashion domain, enabling designers to rapidly transform creative concepts into realistic visual representations. However, the existing methods struggle to effectively integrate multiple guiding information sources, such as sketches and texture patches, limiting their ability to precisely control the generated content. This often results in issues such as semantic inconsistencies and the loss of fine-grained texture details, which significantly hinders the advancement of this technology. To address these issues, we propose the Fast Fourier Asymmetric Context Aggregation Network (FCAN), a novel image generation network designed to achieve controllable clothing image synthesis guided by design sketches and texture patches. In the FCAN, we introduce the Asymmetric Context Aggregation Mechanism (ACAM), which leverages multi-scale and multi-stage heterogeneous features to achieve efficient global visual context modeling, significantly enhancing the model’s ability to integrate guiding information. Complementing this, the FCAN also incorporates a Fast Fourier Channel Dual Residual Block (FF-CDRB), which utilizes the frequency-domain properties of Fast Fourier Convolution to enhance fine-grained content inference while maintaining computational efficiency. We evaluate the FCAN on the newly constructed SKFashion dataset and the publicly available VITON-HD and Fashion-Gen datasets. The experimental results demonstrate that the FCAN consistently generates high-quality clothing images aligned with the design intentions while outperforming the baseline methods across multiple performance metrics. Furthermore, the FCAN demonstrates superior robustness to varying texture conditions compared to the existing methods, highlighting its adaptability to diverse real-world scenarios. These findings underscore the potential of the FCAN to advance this technology by enabling controllable and high-quality image generation.
Keywords: artificial intelligence; image synthesis; visual modeling; computer vision artificial intelligence; image synthesis; visual modeling; computer vision

Share and Cite

MDPI and ACS Style

Lei, H.; Hu, Y.; Wang, M.; Ding, M.; Li, Z.; Luo, G. Fast Fourier Asymmetric Context Aggregation Network: A Controllable Photo-Realistic Clothing Image Synthesis Method Using Asymmetric Context Aggregation Mechanism. Appl. Sci. 2025, 15, 3534. https://doi.org/10.3390/app15073534

AMA Style

Lei H, Hu Y, Wang M, Ding M, Li Z, Luo G. Fast Fourier Asymmetric Context Aggregation Network: A Controllable Photo-Realistic Clothing Image Synthesis Method Using Asymmetric Context Aggregation Mechanism. Applied Sciences. 2025; 15(7):3534. https://doi.org/10.3390/app15073534

Chicago/Turabian Style

Lei, Haopeng, Ying Hu, Mingwen Wang, Meihai Ding, Zhen Li, and Guoliang Luo. 2025. "Fast Fourier Asymmetric Context Aggregation Network: A Controllable Photo-Realistic Clothing Image Synthesis Method Using Asymmetric Context Aggregation Mechanism" Applied Sciences 15, no. 7: 3534. https://doi.org/10.3390/app15073534

APA Style

Lei, H., Hu, Y., Wang, M., Ding, M., Li, Z., & Luo, G. (2025). Fast Fourier Asymmetric Context Aggregation Network: A Controllable Photo-Realistic Clothing Image Synthesis Method Using Asymmetric Context Aggregation Mechanism. Applied Sciences, 15(7), 3534. https://doi.org/10.3390/app15073534

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop