SynergyAI: A Human–AI Pair Programming Tool Based on Dataflow †
Abstract
:1. Introduction
- Scatter Plot Matrices and Correlation Analysis: assists in feature selection by highlighting relationships between variables.
- Automated Hyperparameter Tuning: suggests optimal configurations to enhance model efficiency.
- Dataflow Optimization Suggestions: recommends adjustments to improve model accuracy and stability.
2. AI Programming and Its Difficulties
2.1. Related Work
- (A)
- Human Programming Model
- (B)
- Human–AI Programming Model
- (C)
- Comparison with Different Tools
2.2. Difficulties in AI Programming
- (A)
- Black-Box Problem
- (B)
- Trial and Error
- (C)
- Lack of Human–AI Cooperative Ability
3. SynergyAI Concept
3.1. DataFlow
3.2. Frontend Development
3.3. Backend Development
3.4. Dataflow Prediction
- All nodes must have assigned data.
- Each node must have at least one input or output.
- Each dataflow must start from an input node, traverse through AI nodes, and end at an output node, as shown in Figure 3.
3.5. Dataflow Modularity
Algorithm 1 Comprehensive prediction algorithm |
Require: User input data, AI node , Function , Function
|
3.6. Ensemble Learning in SynergyAI
4. AI Assistant in SynergyAI
4.1. Explainability of AI Model
4.2. Dataflow with AI Advice
- (A)
- Advice 1: Data Selection
- (B)
- Advice 2: Optimize the Structure of Dataflow
- (C)
- Advice 3: Optimize the Decision Tree Model
5. Application Examples
5.1. Chocolate Factory Machinery Failure Prediction
- Dataset Introduction
- Sensor Data: data from sensors on factory machinery, including temperature, rotational speed, time, etc.
- Quality Inspection Data: data from the inspection of finished chocolates, including glossiness, sweetness, qualification status, etc.
- Machinery Failure Data: records of actual mechanical inspections, indicating whether the machinery failed.
- Create Dataflow Model with AI Assistant
- Select only sensor data.
- Preferentially select sensor data from the target machine and other machines that have strong operational dependencies on that machine.
- Use the selected data as input for the second layer.
- Select the data with the highest correlation with the second-layer variables from the AI assistant’s suggestions and establish connections accordingly.
- Retain sensor data connections related to the target machine (Sugar Mixer in this case) since the goal is to predict its failure.
- Train a decision tree model and analyze its structure. If certain input features are not utilized in decision-making, remove their corresponding connections to simplify the model.
5.2. Evaluation
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
- The following abbreviations are used in this manuscript:
AI | Artificial Intelligence |
IoT | Internet of Things |
DT | Decision Tree |
RF | Random Forest |
DL | Deep Learning |
References
- Esteva, A.; Kuprel, B.; Novoa, R.; Ko, J.; Swetter, S.; Blau, H.; Thrun, S. Dermatologist-level classification of skin cancer with deep neural networks. Nature 2017, 542, 115–118. [Google Scholar] [CrossRef] [PubMed]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. arXiv 2023, arXiv:1706.03762. [Google Scholar]
- Shin, C.; Moon, J. A Gradient Based Adversarial Example Method Using Attention Map Against Text Transformer Model. J. Digit. Contents Soc. 2021, 22, 2019–2026. [Google Scholar] [CrossRef]
- Chen, C.; Shu, K. Combating misinformation in the age of LLMs: Opportunities and challenges. AI Mag. 2024, 45, 354–368. [Google Scholar] [CrossRef]
- Goodall, N. Machine Ethics and Automated Vehicles. In Machine Ethics and Automated Vehicles; Springer: Berlin/Heidelberg, Germany, 2014; pp. 93–102. [Google Scholar]
- Storey, V.; Lukyanenko, R.; Maass, W.; Parsons, J. Explainable AI. Commun. ACM 2022, 65, 27–29. [Google Scholar] [CrossRef]
- Zhou, Z.-H.; Wu, J.; Tang, W. Ensembling neural networks: Many could be better than all. Artif. Intell. 2002, 137, 239–263. [Google Scholar] [CrossRef]
- Molnar, C. Interpretable Machine Learning: A Guide for Making Black Box Models Explainable; Lulu Press: Raleigh, NC, USA, 2022. [Google Scholar]
- Liang, J.; Yang, C.; Myers, B. A Large-Scale Survey on the Usability of AI Programming Assistants: Successes and Challenges. In Proceedings of the 46th IEEE/ACM International Conference on Software Engineering (ICSE), Ottawa, ON, Canada, 26 April–3 May 2023; pp. 616–628. [Google Scholar]
- Zhang, D.; Bin Ahmadon, M.; Yamaguchi, S. Human-AI Pair Programming by Data Stream and Its Application Example. In Proceedings of the 2022 IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia), Yeosu, Republic of Korea, 26–28 October 2022; pp. 1–4. [Google Scholar]
- Kumar, A.; Cuccuru, G.; Grüning, B.; Backofen, R. An accessible infrastructure for artificial intelligence using a Docker-based JupyterLab in Galaxy. GigaScience 2023, 12, giad028. [Google Scholar] [CrossRef] [PubMed]
- Krizhevsky, A.; Sutskever, I.; Hinton, G. ImageNet Classification with Deep Convolutional Neural Networks. In Proceedings of the 26th Annual Conference on Neural Information Processing Systems, Lake Tahoe, NV, USA, 3–6 December 2012. [Google Scholar]
- Lundberg, S.M.; Lee, S.I. A unified approach to interpreting model predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; Volume 10, pp. 4765–4777. [Google Scholar]
- Tang, X.; Li, X.; Ding, Y.; Song, M.; Bu, Y. The pace of artificial intelligence innovations: Speed, talent, and trial-and-error. J. Inf. 2020, 14, 101094. Available online: https://www.sciencedirect.com/science/article/pii/S1751157720301991 (accessed on 23 February 2025). [CrossRef]
- Liu, X.; Song, H.; Liu, A. Intelligent UAVs Trajectory Optimization from Space-Time for Data Collection in Social Networks. IEEE Trans. Netw. Sci. Eng. 2021, 8, 853–864. [Google Scholar] [CrossRef]
- Scibilia, A.; Pedrocchi, N.; Fortuna, L. Modeling Nonlinear Dynamics in Human–Machine Interaction. IEEE Access 2023, 11, 58664–58678. [Google Scholar] [CrossRef]
- Jiang, L.; Bin Ahmadon, M.; Zhang, D.; Fukada, T.; Yamaguchi, S. SynergyAI: An Human-AI Pair Programming Design Tool Based on Program Net. In Proceedings of the 2023 11th International Conference on Information and Education Technology (ICIET), Fujisawa, Japan, 18–20 March 2023; pp. 210–214. [Google Scholar]
- Abadi, M.; Barham, P.; Chen, J.; Chen, Z.; Davis, A.; Dean, J.; Devin, M.; Ghemawat, S.; Irving, G.; Isard, M.; et al. TensorFlow: A system for large-scale machine learning. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), Savannah, GA, USA, 2–4 November 2016. [Google Scholar]
- PyCaret. PyCaret: An Open-Source, Low-Code Machine Learning Library in Python. Available online: https://github.com/pycaret/pycaret (accessed on 16 July 2024).
- National Instruments. LabVIEW: Graphical System Design Software. 2023. Available online: https://www.ni.com/en-us/shop/labview.html (accessed on 12 February 2025).
- PTC Inc. ThingWorx: Industrial IoT Solutions. 2023. Available online: https://www.ptc.com/en/products/thingworx (accessed on 12 February 2025).
- MathWorks. Simulink. 2023. Available online: https://www.mathworks.com/products/simulink.html (accessed on 12 February 2025).
- Bin Ahmadon, M.A.; Yamaguchi, S.; Mahamad, A.K.; Saon, S. Physical Device Compatibility Support for Implementation of IoT Services with Design Once, Provide Anywhere Concept. Information 2021, 12, 30. [Google Scholar] [CrossRef]
- Doshi-Velez, F.; Kim, B. Towards a rigorous science of interpretable machine learning. arXiv 2017, arXiv:1702.08608. [Google Scholar]
- Topol, E. High-performance medicine: The convergence of human and artificial intelligence. Nat. Med. 2019, 25, 44–56. [Google Scholar] [CrossRef] [PubMed]
- Davenport, T.; Ronanki, R. Artificial intelligence for the real world. Harv. Bus. Rev. 2018, 96, 108–116. [Google Scholar]
- Luther, T.; Kimmerle, J.; Cress, U. Teaming Up with an AI: Exploring Human–AI Collaboration in a Writing Scenario with ChatGPT. AI 2024, 5, 1357–1376. [Google Scholar] [CrossRef]
- Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.; Wang, J.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef]
- Franz, M.; Lopes, C.; Huck, G.; Dong, Y.; Sumer, O.; Bader, G. Cytoscape.js: A graph theory library for visualisation and analysis. Bioinformatics 2016, 32, 309–311. [Google Scholar] [CrossRef]
- Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 2019, 1, 206–215. [Google Scholar] [CrossRef]
- Zhou, Z. Ensemble Methods: Foundations and Algorithms; Chapman: Orange, CA, USA, 2012. [Google Scholar]
- Lee, Z.-J.; Cai, J.-X.; Wang, L.-H.; Yang, M.-R. Ensemble Algorithm Based on Gene Selection, Data Augmentation, and Boosting Approaches for Ovarian Cancer Classification. Diagnostics 2024, 14, 2772. [Google Scholar] [CrossRef]
- Skaria, A.P. The economic and societal burden of Alzheimer disease: Managed care considerations. Am. J. Manag. Care 2022, 28 (Suppl. S10), S188–S196. [Google Scholar] [CrossRef] [PubMed]
- Yatusiro. Human-AI Pair Programming Tool Based on Dataflow. GitHub Repository. Available online: https://github.com/yatusiro/SynergyAI241216 (accessed on 16 December 2024).
- Yamaguchi. Chocolate Factory Dataset. (Kaggle, 2024). Available online: https://www.kaggle.com/datasets/yamaguchi1/chocolate-factory-dataset (accessed on 23 February 2025).
Dataflow | White Box | AI Assistant | Data Selection | Code Assistance | |
---|---|---|---|---|---|
TensorFlow | ✓ | × | × | Human only | × |
PyCaret | ✓ | ✓ | × | Human only | × |
GitHub Copilot | × | × | ✓ | × | ✓ |
Tabnine | × | × | ✓ | × | ✓ |
DeepCode | × | × | ✓ | × | ✓ |
SynergyAI | ✓ | ✓ | ✓ | Human + AI | × |
Node Name | Input | Output |
---|---|---|
Input Node | None | AI Node |
AI Node | Input Node or AI Node | AI Node or Output Node |
Output Node | AI Node | None |
Pump3_Pressure | Conching_Time | Sweetness | Passed | Roaster_Failure | Sugar_Mixer_Failure |
---|---|---|---|---|---|
63 | 224 | 1 | 0 | 0 | 0 |
58 | 182 | 2 | 0 | 0 | 0 |
64 | 249 | 2 | 0 | 0 | 0 |
64 | 255 | 3 | 0 | 0 | 0 |
61 | 192 | 3 | 0 | 0 | 0 |
72 | 359 | 3 | 1 | 0 | 0 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Jiang, L.; Yamaguchi, S.; Bin Ahmadon, M.A. SynergyAI: A Human–AI Pair Programming Tool Based on Dataflow. Information 2025, 16, 178. https://doi.org/10.3390/info16030178
Jiang L, Yamaguchi S, Bin Ahmadon MA. SynergyAI: A Human–AI Pair Programming Tool Based on Dataflow. Information. 2025; 16(3):178. https://doi.org/10.3390/info16030178
Chicago/Turabian StyleJiang, Le, Shingo Yamaguchi, and Mohd Anuaruddin Bin Ahmadon. 2025. "SynergyAI: A Human–AI Pair Programming Tool Based on Dataflow" Information 16, no. 3: 178. https://doi.org/10.3390/info16030178
APA StyleJiang, L., Yamaguchi, S., & Bin Ahmadon, M. A. (2025). SynergyAI: A Human–AI Pair Programming Tool Based on Dataflow. Information, 16(3), 178. https://doi.org/10.3390/info16030178