Next Article in Journal
Modeling of Chemical Vapor Infiltration for Fiber-Reinforced Silicon Carbide Composites Using Meshless Method of Fundamental Solutions
Next Article in Special Issue
Induction of Convolutional Decision Trees with Success-History-Based Adaptive Differential Evolution for Semantic Segmentation
Previous Article in Journal
A Coupled Finite-Boundary Element Method for Efficient Dynamic Structure-Soil-Structure Interaction Modeling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

M5GP: Parallel Multidimensional Genetic Programming with Multidimensional Populations for Symbolic Regression

by
Luis Cárdenas Florido
1,2,3,
Leonardo Trujillo
1,4,*,
Daniel E. Hernandez
1 and
Jose Manuel Muñoz Contreras
1
1
Departamento de Ingeniería Eléctrica Electrónica, Posgrado en Ciencias de la Ingeniería, Tecnológico Nacional de México/IT de Tijuana, Tijuana 22430, Mexico
2
División de Estudios de Posgrado, Maestría en Sistemas Computacionales, Tecnológico Nacional de México/IT de La Paz, La Paz 23080, Mexico
3
Departamento de Sistemas y Computación, Tecnológico Nacional de México/IT de Ensenada, Ensenada 22780, Mexico
4
LASIGE, Department of Informatics, Faculty of Sciences, University of Lisbon, 1749-016 Lisboa, Portugal
*
Author to whom correspondence should be addressed.
Math. Comput. Appl. 2024, 29(2), 25; https://doi.org/10.3390/mca29020025
Submission received: 22 January 2024 / Revised: 12 March 2024 / Accepted: 13 March 2024 / Published: 18 March 2024
(This article belongs to the Special Issue New Trends in Computational Intelligence and Applications 2023)

Abstract

Machine learning and artificial intelligence are growing in popularity thanks to their ability to produce models that exhibit unprecedented performance in domains that include computer vision, natural language processing and code generation. However, such models tend to be very large and complex and impossible to understand using traditional analysis or human scrutiny. Conversely, Symbolic Regression methods attempt to produce models that are relatively small and (potentially) human-readable. In this domain, Genetic Programming (GP) has proven to be a powerful search strategy that achieves state-of-the-art performance. This paper presents a new GP-based feature transformation method called M5GP, which is hybridized with multiple linear regression to produce linear models, implemented to exploit parallel processing on graphical processing units for efficient computation. M5GP is the most recent variant from a family of feature transformation methods (M2GP, M3GP and M4GP) that have proven to be powerful tools for both classification and regression tasks applied to tabular data. The proposed method was evaluated on SRBench v2.0, the current standard benchmarking suite for Symbolic Regression. Results show that M5GP achieves performance that is competitive with the state-of-the-art, achieving a top-three rank on the most difficult subset of black-box problems. Moreover, it achieves the lowest computation time when compared to other GP-based methods that have similar accuracy scores.
Keywords: Genetic Programming; M3GP; M4GP; graphical processing units; regression Genetic Programming; M3GP; M4GP; graphical processing units; regression

Share and Cite

MDPI and ACS Style

Cárdenas Florido, L.; Trujillo, L.; Hernandez, D.E.; Muñoz Contreras, J.M. M5GP: Parallel Multidimensional Genetic Programming with Multidimensional Populations for Symbolic Regression. Math. Comput. Appl. 2024, 29, 25. https://doi.org/10.3390/mca29020025

AMA Style

Cárdenas Florido L, Trujillo L, Hernandez DE, Muñoz Contreras JM. M5GP: Parallel Multidimensional Genetic Programming with Multidimensional Populations for Symbolic Regression. Mathematical and Computational Applications. 2024; 29(2):25. https://doi.org/10.3390/mca29020025

Chicago/Turabian Style

Cárdenas Florido, Luis, Leonardo Trujillo, Daniel E. Hernandez, and Jose Manuel Muñoz Contreras. 2024. "M5GP: Parallel Multidimensional Genetic Programming with Multidimensional Populations for Symbolic Regression" Mathematical and Computational Applications 29, no. 2: 25. https://doi.org/10.3390/mca29020025

APA Style

Cárdenas Florido, L., Trujillo, L., Hernandez, D. E., & Muñoz Contreras, J. M. (2024). M5GP: Parallel Multidimensional Genetic Programming with Multidimensional Populations for Symbolic Regression. Mathematical and Computational Applications, 29(2), 25. https://doi.org/10.3390/mca29020025

Article Metrics

Back to TopTop