Evaluating Packaging Design Relative Feature Importance Using an Artificial Neural Network (ANN)
Abstract
:1. Introduction
2. Methods
2.1. ANN Neuron Number Determination in the Hidden Layer
2.2. Methods for Evaluating Input Feature Importance
2.2.1. Connection Weights Method
- —The standard deviation of the ith input is equal to the ith input; subtract the mean of input features and divide it by the standard deviation of inputs features.
- —The ith input’s importance.
- —The number of hidden nodes in the first layer.
- —The weight connecting the ith input to the jth hidden node in the first layer.
2.2.2. Gradient-Based Method
- X + ,
- X − .
2.2.3. Permutation Method
- ij—Importance for feature fj.
- s—Reference score of the model m on dataset D.
- K—Repetition times for randomly shuffling column j of the dataset D to generate a corrupted version of the data named Dkj.
- skj—Score of the model on corrupted data Dkj.
2.2.4. SHAP Values
- p—The total number of features.
- N\{j}—A set of all possible combinations of features excluding Xj.
- S—A feature set in N\{j}.
- f(S)—The model prediction with features in S.
- f(S∪{j})—The model prediction with features in S plus feature Xj.
2.3. Final Relative Feature Importance Determination
3. Results
3.1. Case Study 1: Relative Feature Importance of the Synthetic Dataset
- ECT—Edge crush strength (lb/in).
- Thickness—Thickness of the corrugated board (in).
- P—Perimeter of the box (in).
3.1.1. ANN Training Using the Synthetic Dataset
3.1.2. Relative Feature Importance Analysis of the Synthetic Dataset
3.2. Case Study 2: Relative Feature Importance of the Real Dataset
- ECT—Edge crush strength (lb/in).
- EIx, EIy—Flexural stiffness in the machine direction and cross-machine directions of the corrugated board (lb×in).
- P—Perimeter of the box (in).
3.2.1. ANN Training Using the Real Dataset
3.2.2. Relative Feature Importance Analysis of the Real Dataset
4. Discussion
- Expanding Input Features: Incorporating additional packaging parameters, such as material properties, environmental factors, and manufacturing processes, could improve model accuracy and provide a more comprehensive understanding of factors influencing BCS. These aspects were not fully explored in this study but could significantly impact packaging performance.
- Enhancing Model Generalizability: Broadening the range of package types and materials in the dataset would improve the ANN model’s applicability across various industries, making it more versatile and widely usable.
- Improving Data Quality and Expanding Data: Utilizing numerical methods to generate higher-quality synthetic data or collecting real-world data from diverse sources, such as physical tested data from different companies, would enhance the model’s reliability and robustness.
- Exploring Hybrid Models: Combining ANN-based approaches with traditional simulation could yield more robust results, particularly in cases where input feature quality or dataset representativeness is a limiting factor.
- Integrating Real-Time Data and Machine Learning: Implementing real-time data collection and machine learning techniques would enable adaptive and dynamic packaging optimization, allowing designs to evolve based on continuous performance feedback.
- Incorporating Advanced Machine Learning Techniques: Utilizing advanced machine learning methods, such as reinforcement learning or other deep learning techniques, could further refine predictive accuracy and optimization, enabling a more automated and scalable approach to packaging engineering.
- Overfitting monitoring: Overfitting can lead to misleadingly high accuracy on training data but poor performance on test or validation data. In this study, overfitting was monitored and mitigated by identifying an early stopping point during training. However, future studies could incorporate additional techniques such as regularization, dropout, and cross-validation to further reduce the risk of overfitting and enhance the model’s real-world performance.
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Zhang, L.; Lu, J.; Cao, G.; Miao, H. Research on packaging evaluation system of fast moving consumer goods based on analytical hierarchy process method. In Advanced Graphic Communications and Media Technologies; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
- Pérez, A.; Villena, J.; Matuk, D.; Luna, A.; Chong, M. An analytical hierarchy approach applied in the packaging supply chain. In Supply Chain Management and Logistics in Emerging Markets: Selected Papers from the 2018 MIT SCALE Latin America Conference; Emerald Publishing Limited: Warrington, UK, 2020. [Google Scholar]
- Fadiji, T.; Coetzee, C.J.; Berry, T.M.; Ambaw, A.; Opara, U.L. The efficacy of finite element analysis (FEA) as a design tool for food packaging: A review. Biosyst. Eng. 2018, 174, 20–40. [Google Scholar] [CrossRef]
- Hicks, B.J.; Mullineux, G.; Sirkett, D. A finite element-based approach for whole-system simulation of packaging systems for their improved design and operation. Packag. Technol. Sci. Int. J. 2009, 22, 209–227. [Google Scholar] [CrossRef]
- Park, J.; Park, M.; Choi, D.S.; Jung, H.M.; Hwang, S.W. Finite element-based simulation for edgewise compression behavior of corrugated paperboard for packaging of agricultural products. Appl. Sci. 2020, 10, 6716. [Google Scholar] [CrossRef]
- Nefeslioglu, H.A.; Sezer, E.A.; Gokceoglu, C.; Ayas, Z. A modified analytical hierarchy process (M-AHP) approach for decision support systems in natural hazard assessments. Comput. Geosci. 2013, 59, 1–8. [Google Scholar] [CrossRef]
- Nygårds, M.; Hallbäck, N.; Just, M.; Tryding, J. A finite element model for simulations of creasing and folding of paperboard. In Abaqus Users’ Conference; ABAQUS, Inc.: Stockholm, Sweden, 2005. [Google Scholar]
- Garbowski, T.; Gajewski, T.; Grabski, J.K. Estimation of the compressive strength of corrugated cardboard boxes with various perforations. Energies 2021, 14, 1095. [Google Scholar] [CrossRef]
- Alper, M.M.; Sezen, B.; Balcıoğlu, Y.S. Machine Learning Applications in Packaging Industry Optimization: A Systematic Literature Review and Future Research Directions. In International Newyork Congress Conference; BZT TURAN Publishing House: Baku, Azerbaijan, 2025. [Google Scholar]
- Kazeruni, M.; Ince, A. Data-driven artificial neural network for elastic plastic stress and strain computation for notched bodies. Theor. Appl. Fract. Mech. 2023, 125, 103917. [Google Scholar] [CrossRef]
- Tsaur, S.-H.; Chiu, Y.-C.; Huang, C.-H. Determinants of guest loyalty to international tourist hotels—A neural network approach. Tour. Manag. 2002, 23, 397–405. [Google Scholar] [CrossRef]
- Iqbal, J.; Saeed, A.; Khan, R.A. The relative importance of textual indexes in predicting the future performance of banks: A connection weight approach. Borsa Istanb. Rev. 2023, 23, 240–253. [Google Scholar] [CrossRef]
- Goh, A.T. Back-propagation neural networks for modeling complex systems. Artif. Intell. Eng. 1995, 9, 143–151. [Google Scholar] [CrossRef]
- Ombres, L.; Aiello, M.A.; Cascardi, A.; Verre, S. Modeling of steel-reinforced grout composite system-to-concrete bond capacity using artificial neural networks. J. Compos. Constr. 2024, 28, 04024034. [Google Scholar] [CrossRef]
- John, S.K.; Cascardi, A.; Nadir, Y.; Aiello, M.A.; Girija, K. A New Artificial Neural Network Model for the Prediction of the Effect of Molar Ratios on Compressive Strength of Fly Ash-Slag Geopolymer Mortar. Adv. Civ. Eng. 2021, 2021, 6662347. [Google Scholar] [CrossRef]
- Tavar, M.; Rabbani, H.; Gholami, R.; Ahmadi, E.; Kurtulmus, F. Investigating the Effect of Packaging Conditions on the Properties of Peeled Garlic by Using Artificial Neural Network (ANN). Packag. Technol. Sci. 2024, 37, 755–767. [Google Scholar] [CrossRef]
- Ahmed, A.R.; Aleid, S.M.; Mohammed, M. Impact of Modified Atmosphere Packaging Conditions on Quality of Dates: Experimental Study and Predictive Analysis Using Artificial Neural Networks. Foods 2023, 12, 3811. [Google Scholar] [CrossRef] [PubMed]
- El-Mesery, H.S.; Adelusi, O.A.; Ghashi, S.; Njobeh, P.B.; Hu, Z.; Kun, W. Effects of storage conditions and packaging materials on the postharvest quality of fresh Chinese tomatoes and the optimization of the tomatoes’ physiochemical properties using machine learning techniques. LWT 2024, 201, 116280. [Google Scholar] [CrossRef]
- Brazolin, I.F.; Sousa, F.M.M.; Dos Santos, J.W.S.; Concha, V.O.C.; Da Silva, F.V.; Yoshida, C.M.P. A method for pH food dependent shelf-life prediction of intelligent sustainable packaging device using artificial neural networks. J. Appl. Polym. Sci. 2024, 141, e55646. [Google Scholar] [CrossRef]
- Gajewski, T.; Grabski, J.K.; Cornaggia, A.; Garbowski, T. On the use of artificial intelligence in predicting the compressive strength of various cardboard packaging. Packag. Technol. Sci. 2024, 37, 97–105. [Google Scholar] [CrossRef]
- Malasri, S.; Rayapati, P.; Kondeti, D. Predicting corrugated box compression strength using an artificial neural network. Int. J. 2016, 4, 169–176. [Google Scholar] [CrossRef]
- Da Costa, N.L.; De Lima, M.D.; Barbosa, R. Evaluation of feature selection methods based on artificial neural network weights. Expert Syst. Appl. 2021, 168, 114312. [Google Scholar] [CrossRef]
- Zobel, C.W.; Cook, D.F. Evaluation of neural network variable influence measures for process control. Eng. Appl. Artif. Intell. 2011, 24, 803–812. [Google Scholar] [CrossRef]
- Garson, G.D. Interpreting neural-network connection weights. AI Expert 1991, 6, 46–51. [Google Scholar]
- Hill, A.; Lucet, E.; Lenain, R. A novel gradient feature importance method for neural networks: An application to controller gain tuning for mobile robots. In Proceedings of the International Conference on Informatics in Control, Automation and Robotics, Online, 7–9 July 2020; Springer: Cham, Germany, 2020. [Google Scholar]
- Fukumizu, K.; Leng, C. Gradient-based kernel method for feature extraction and variable selection. Adv. Neural Inf. Process. Syst. 2012, 25, 2114–2122. [Google Scholar]
- Iooss, B.; Saltelli, A. Introduction to sensitivity analysis. In Handbook of Uncertainty Quantification; Springer: Cham, Germany, 2017; pp. 1103–1122. [Google Scholar]
- Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst. 2017, 30, 4768–4777. [Google Scholar]
- Mandler, H.; Weigand, B. Feature importance in neural networks as a means of interpretation for data-driven turbulence models. Comput. Fluids 2023, 265, 105993. [Google Scholar] [CrossRef]
- Altmann, A.; Toloşi, L.; Sander, O.; Lengauer, T. Permutation importance: A corrected feature importance measure. Bioinformatics 2010, 26, 1340–1347. [Google Scholar] [CrossRef] [PubMed]
- Li, Z. Extracting spatial effects from machine learning model using local interpretation method: An example of SHAP and XGBoost. Comput. Environ. Urban Syst. 2022, 96, 101845. [Google Scholar] [CrossRef]
- Urbanik, T.J.; Frank, B. Box compression analysis of world-wide data spanning 46 years. Wood Fiber Sci. 2006, 38, 399–416. [Google Scholar]
- Frazier, P.I. A tutorial on Bayesian optimization. arXiv 2018, arXiv:1807.02811. [Google Scholar]
- Will, K. A Conceptual Explanation of Bayesian Hyperparameter Optimization for Machine Learning. Towards Data Sci. 2018, 24. Available online: https://medium.com/towards-data-science/a-conceptual-explanation-of-bayesian-model-based-hyperparameter-optimization-for-machine-learning-b8172278050f (accessed on 13 March 2025).
- Akaike, H. Information theory and an extension of the maximum likelihood principle. In Selected Papers of Hirotugu Akaike; Springer: New York, NY, USA, 1998; pp. 199–213. [Google Scholar]
- Schwarz, G. Estimating the Dimension of a Model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
- Hebbian Theory. 11 January 2025 (UTC). Available online: https://en.wikipedia.org/wiki/Hebbian_theory (accessed on 20 December 2024).
- Lecun, Y.; Denker, J.; Solla, S. Optimal brain damage. In Proceedings of the 3rd International Conference on Neural Information Processing Systems; MIT Press: Cambridge, MA, USA, 1989; pp. 598–605. [Google Scholar]
- Jeon, G.; Jeong, H.; Choi, J. Distilled gradient aggregation: Purify features for input attribution in the deep neural network. Adv. Neural Inf. Process. Syst. 2022, 35, 26478–26491. [Google Scholar]
- Azmat, M.; Alessio, A.M. Feature Importance Estimation Using Gradient Based Method for Multimodal Fused Neural Networks. In Proceedings of the 2022 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC), Milano, Italy, 5–12 November 2022. [Google Scholar]
- Van Den Broeck, G.; Lykov, A.; Schleich, M.; Suciu, D. On the tractability of SHAP explanations. J. Artif. Intell. Res. 2022, 74, 851–886. [Google Scholar] [CrossRef]
- Štrumbelj, E.; Kononenko, I. Explaining prediction models and individual predictions with feature contributions. Knowl. Inf. Syst. 2014, 41, 647–665. [Google Scholar] [CrossRef]
- Gu, J.; Frank, B.; Lee, E. A comparative analysis of artificial neural network (ANN) architectures for box compression strength estimation. Korean J. Packag. Sci. Technol. 2023, 29, 163–174. [Google Scholar] [CrossRef]
- Mckee, R.; Gander, J.; Wachuta, J. Compression strength formula for corrugated boxes. Paperboard Packag. 1963, 48, 149–159. [Google Scholar]
- Pyr’yev, Y.; Kibirkštis, E.; Gegeckienė, L.; Vaitasius, K.; Venytė, I. Empirical models for prediction compression strength of paperboard carton. Wood Fiber Sci. 2022, 54, 60–73. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Gu, J.; Lee, E. Evaluating Packaging Design Relative Feature Importance Using an Artificial Neural Network (ANN). Appl. Sci. 2025, 15, 3261. https://doi.org/10.3390/app15063261
Gu J, Lee E. Evaluating Packaging Design Relative Feature Importance Using an Artificial Neural Network (ANN). Applied Sciences. 2025; 15(6):3261. https://doi.org/10.3390/app15063261
Chicago/Turabian StyleGu, Juan, and Euihark Lee. 2025. "Evaluating Packaging Design Relative Feature Importance Using an Artificial Neural Network (ANN)" Applied Sciences 15, no. 6: 3261. https://doi.org/10.3390/app15063261
APA StyleGu, J., & Lee, E. (2025). Evaluating Packaging Design Relative Feature Importance Using an Artificial Neural Network (ANN). Applied Sciences, 15(6), 3261. https://doi.org/10.3390/app15063261