Next Article in Journal
Unveiling University Groupings: A Clustering Analysis for Academic Rankings
Previous Article in Journal
Spectral Library of Plant Species from Montesinho Natural Park in Portugal
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Data Descriptor

A Series Production Data Set for Five-Axis CNC Milling

by
Anna-Maria Schmitt
and
Bastian Engelmann
*
Institute of Digital Engineering, Technical University of Applied Sciences Würzburg-Schweinfurt, 97421 Schweinfurt, Germany
*
Author to whom correspondence should be addressed.
Submission received: 22 March 2024 / Revised: 22 April 2024 / Accepted: 26 April 2024 / Published: 30 April 2024

Abstract

:
The described data set contains features from the machine control of a five-axis milling machine. The features were recorded during thirteen series productions. Each series production includes a changeover process in which the machine was set up for the production of a different product. In addition to the timestamps and the twenty recorded features derived from Numerical Control (NC) variables, the data set also contains labels for the different production phases. For this purpose, up to 23 phases were assigned, which are based on a generalized milling process. The data set consists of thirteen .csv files, each representing a series production. The data set was recorded in a production company in the contract manufacturing sector for components with real series orders in ongoing industrial production.
Dataset: 10.5281/zenodo.10853254
Dataset License: Creative Commons Attribution 4.0 International (CC BY 4.0)

1. Summary

The data set presented offers researchers insight into the changes in machine parameters on a milling machine during different phases of the production process. The data set contains the recorded features of the Numerical control (NC), and it also includes an assignment of the underlying production phases, which was added by manual labeling.
Compared to other domains where dedicated data repositories are available, engineering data sets can generally be found in generalist data repositories [1]. Table 1 shows the result of a search for entries of milling-related data sets in the largest generalist repositories. In relation to the huge amount of existing data sets in the repositories, milling-related data sets can rarely be found. After closer analysis of the search results, the number of unique data records with a manufacturing reference appears to be few as well (see the last column in Table 1). Therefore, publishing the presented data set aims to make more manufacturing data available for research.
The recorded data set contains production data from the NC of a five-axis milling machine, which was recorded in thirteen sessions between the end of November 2021 and the beginning of April 2022. For each recorded session, the anonymous data from the NC of the machine covers the preparation for the next manufacturing order (also defined as changeover) and the subsequently produced parts.
The used milling machine, HERMLE C600 U, was equipped with an NC HEIDENHAIN iTNC 530, and was operated in regular production shifts at the company Pabst Komponentenfertigung GmbH in Schweinfurt, Germany. Pabst specializes in the design and manufacture of tools and special machines, as well as the individual and series production of machine components. The machined parts were bearing components from the field of aerospace. The Pabst company was a member of the publicly funded research project Optimization of Processes and Machine Tools through Provision, Analysis and Target/Actual Comparison of Production Data (OBerA), which was established to support metalworking companies with a focus on Small and medium-sized enterprises (SMEs) from northern Bavaria in digital techniques to optimize their production. The project lasted from 1 April 2018 to 31 December 2021. Several sessions of the data set were also recorded after the official end of the project. Overall, 13 sessions were recorded, forming the complete data set (see Table 2).
The changeover process of a machine is characterized by manual activities that leave their traces in the data of the machine control system. However, the production phase that follows changeover is characterized by machine movements that take place largely without human interaction. As the dataset contains data for both changeover and production, it is suitable for creating models of human–machine interaction during changeover, but also for the specific analysis of the production phase. As the dataset was recorded in series production, specific effects in series production can also be analyzed with the dataset.
Selected sessions from the data set were used in two publications of the authors so far.
In [2], the data set presented here was used to train a Machine Learning (ML), model for detecting changeover periods in production data. These models were then compared to ML models, which were trained with a data set from a DMG 100 U duoBLOCK milling machine. The DMG machine was equipped only with external sensors, which were not connected to its NC. The ML approaches for both machines were compared and discussed. Data from the DMG machine is not part of the data set presented here, only data from the HERMLE machine is included. Sessions 1 and 2 were used for this research.
In [3], the influence of different noise types on the training data of ML, models was evaluated. For this purpose, sessions from the data set were overlaid with simulated noise. These data set was then used to train ML models for detecting the changeover periods in manufacturing data. During the simulation, different noise types were selected, and their specific influence on the metrics of the ML model was evaluated. For this research, sessions 1 and 2 were used.
The available dataset is processed, and missing values were corrected to make the dataset easy to use.

2. Data Description

Table 2 shows the names of the .csv files for the 13 recorded sessions, the number of data rows, and the data rows per hour. It can be seen that the number of rows of all sessions ranges between 7526 and 19,810, and the data rows per hour have a comparable magnitude with an average of 1914.15 data rows per hour. Usually, the sessions were recorded during one working day. Only session No. 13 was recorded on the 5th, 7th, and 8th of April. On the 6th of April, the machine was in maintenance. Each file uses commas as the separator, has a column descriptor in the first line of the file, and contains data from one changeover sequence followed by the subsequent production sequence.
Table 3 shows the durations of the specific sessions in the column “Total time” and the duration of the changeover period in the column “Changeover time”. During a changeover, a machine is prepared for a new product type. The columns “Old product” and “New product” show an anonymized product number, which stands for the product that was produced before and after the changeover period.
In the data set, the first column contains the timestamp in the format YYYY-MM-DD HH:MM:SS. The following columns contain the recorded data from the individual features from the NC. Table 4 shows all 20 recorded features with a short description. The numerical control registers the signals from internal sensors, like spindle speed or status of a door lock, and exports them as features. The features contain information about the milling process, i.e., FeedRate, as well as status information from the milling process, i.e., ProgramStatus and status information from the machine, i.e., DriveStatus. No additional external sensors are part of the data set.
Table 5 shows that some .csv files contain 19 features, and some contain 20 features. In cases of files with 19 features, variable No. 5, “PocketTable”, was not recorded due to malfunctions in the recording interface.
In the last three columns of the data set, the authors assigned a production phase label for each timestamp. Each column represents a specific approach to label the specific production phase:
  • In the two-phase approach, only two phases are labeled if the machine is in a changeover state or intermittent idle time (No. 1) or the machine is in production state (No. 2). The column heading in the .csv files is “Production”. The labeling was performed for sessions 1, 2, and 13 by a researcher supervising the changeover and production process in real-time in situ. For the labeling of sessions 3 to 12, data from the worker terminal was used (see also the explanation below).
  • In the six-phase approach, six general phases are labeled (No. 1: Starting phase, No. 2: Main phase, No. 3: Ending phase, No. 4: Idle/break phase, No. 5: Production phase, No. 6: Quality control phase). The column heading in the .csv files is “Phase_compressed”. For the labeling of the six-phase approach, the single phases were assigned to the phases of the 23-phase approach by the authors. The exact mapping of the phases is shown in Table 6.
  • In the 23-phase approach, specific phases are labeled, which are related to the functional content of an abstracted milling process with changeover. The definition of the phases of the 23-phase approach were elaborated in the OBerA project: The phases with numbers 1–19 are milling-related sub-phases. Number 20 is assigned for idle time or breaks. Number 21 is assigned when the machine is in production phase. Phases number 22 and 23 are phases for quality control (No. 22: general quality checks, No. 23: quality checks concerning the workpiece quality). The column heading in the .csv files is “Phase”. Table 6 contains a short description of all 23 phases. For more details, please see [2].
Basic statistics of the features and labels are listed in Table A1 in Appendix A.
Labeling the specific phases to the timestamps from sessions 3 to 12 was performed using reported changeover start and stop times from the worker terminal. The timestamps from the worker terminal have a resolution of 5 min. This also implicates a rounding error of max. 2.5 min for the assigning of a specific production phase. For sessions 3 to 12, only labeling according to the two-phase approach was conducted.
For sessions 1, 2, and 13, the labeling was performed by a researcher who supervised the complete recording period in person. The timestamps have a resolution of 1 s. Deviations due to the human reaction time can be expected and are estimated to be 0.3 s. For these sessions, all three labeling approaches were conducted.
Figure 1 shows the different counts for the two-phase labeling approach in all thirteen sessions. Due to the varying order lot sizes, there are different numbers of data rows for the changeover and production phase for the thirteen sessions.
Figure 2 shows the different counts for the six-phase labeling approach for sessions 1, 2, and 13. The number of data rows for the six phases is comparable for sessions 1 and 13. Session 2 contains many data rows for the production phase “5”.
Figure 3 shows the different counts for the 23-phase labeling approach for sessions 1, 2, and 13. Session 2 contains many rows with the label of phase 21 (production). Session 13 shows more idle time (phase 20) than production (phase 21).
Figure 1 shows strong variability for the phase counts of the changeover and production class.
The variability in the counts of the changeover classes result from the different efforts required for a changeover to a new product. It should be noted here that, in theory, setting up product A after product B can result in different efforts than setting up product B after product A. This difference can arise, for example, from increased handling effort when preparing fixtures and machining tools for smaller or larger dimensions. Table 3 shows for each session from which product to which product the machine was set up.
The variability in the counts of the production classes result from different manufacturing order lot sizes, which are related to specific customer orders.
In Figure 2, session No. 1 and No. 13 show similar counts over all six phases. From Figure 1, it can be seen that these sessions have the same proportionality between the changeover and production class. In contrast to session No. 1 and No. 13, session No. 2 contains much more production and less changeover counts, which is expressed by the much higher count in its phase 5.
In Figure 3, the behavior for the variability of the counts from Figure 2 is more detailed with more subphases, but shows comparable characteristics.

3. Methods

The HEIDENHAIN numerical control iTNC 530 of the HERMLE C600 U machine is a legacy machine that does not support communication standards like OPC Unified Architecture (OPC UA). Therefore, for the data acquisition, a middleware by the company Cybus collected the NC data via the HEIDENHAIN DNC interface and the Cybus Agent transported it via the MQTT protocol to the Azure cloud and into an SQL database (Figure 4). The use of middleware resulted in a preselection of around 400 available variables. Of these variables, domain experts selected 19 variables which, based on their description, indicated a context for the milling process. Variable No. 20 “Warmup” was derived after the data acquisition from variable No. 3 “ProgramDetail” and added to the data set [2].
Due to the selected data recording concept, the data were either transferred to the database as soon as a new value was assigned to the variables, or every two seconds. The following procedure was used for the imputation of missing values:
  • For status variables, e.g., door and program statuses, missing values have been replaced by the last valid previous value.
  • For variables that can take on continuous values, such as the feed rate, a rolling mean was calculated using the previous and subsequent values.
The timestamps occur for approx. 99% of all cases in frequencies of 1 s, 2 s and 3 s. Frequencies of 1 s occur at approx. 57%. Frequencies of 2 s and 3 s occur at about 33% and 10%.

4. User Notes

The different approaches to label the specific phases of production were introduced to be able to compare the capability of different ML algorithms to classify multiple categories (multiclass classification). The data set contains different characters of imbalanced data depending on the specific labeling approach (see Section 2).
The data set is available on the Zenodo platform. Please see the DOI below. The chosen license for the data set is Creative Commons Attribution 4.0 International [5]. Researchers are free to share and adapt the presented data set, but they must:
  • Give credit to the authors with a reference;
  • Provide a link to the license;
  • Indicate changes to the original data set.

Author Contributions

Conceptualization, A.-M.S. and B.E.; methodology, B.E.; software, A.-M.S.; validation, A.-M.S. and B.E.; formal analysis, B.E. and A.-M.S.; resources, B.E.; data curation, A.-M.S.; writing—original draft preparation, A.-M.S. and B.E.; writing—review and editing, A.-M.S. and B.E.; visualization, A.-M.S.; supervision, B.E.; project administration, B.E.; funding acquisition, B.E. All authors have read and agreed to the published version of the manuscript.

Funding

The OBerA project was funded by the state of Bavaria (Bayerisches Staatsministerium für Wirtschaft, Landesentwicklung und Energie, grant no. IUK530/010). The publication is supported by the publication fund of the Technical University of Applied Sciences Würzburg-Schweinfurt.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The original data presented in these data descriptor are openly available in the Zenodo repository: https://doi.org/10.5281/zenodo.10853254 (accessed on 22 March 2024).

Acknowledgments

The authors gratefully thank Pabst Komponentenfertigung GmbH and Cybus GmbH for their contributions to the research.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
MLMachine Learning
NCNumerical Control
OBerAOptimization of Processes and Machine Tools through Provision, Analysis and Target/Actual Comparison of Production Data
OPC UAOPC Unified Architecture
SMEsSmall and medium-sized enterprises

Appendix A. Statistics for Data Set

Table A1 contains basic statistics for all the sessions of the data set. The first 20 rows contain the statistics for the recorded features. The last three rows contain the statistics for the three phase approaches (labels).
Table A1. Statistics for data set.
Table A1. Statistics for data set.
Features and LabelsCountMeanStdMin25%50%75%Max
CoolantFlow190,0460.31460.4643400011
ProgramStatus190,0461.22691.8800000211
ProgramDetail190,0461.23331.6699001211
ToolNumber190,04616.59088.839218202332
PocketTable84,4469.161610.44580032030
DriveStatus190,0460.56540.4957100111
DoorStatusMain190,0460.38470.4865200011
DoorStatusTooling190,0460.03240.1771000001
CabinDoorLockFront190,0460.00040.019300001
CabinDoorLockSide190,0460.00.000000
DNCMode190,0460.00100.031400001
SpindleCleaning190,0460.01070.102800001
ChipCleaningGunStatus190,0460.99920.027901111
OverrideSpindle190,04699.08179.53700100100100100
OverrideFeed190,04684.247132.5905089100100150
FeedRate190,046767.64454021.7358−32,7100021532,767
SpindleSpeed190,0462513.53643779.1187000509910,046
SpindleApproval190,0460.00.000000
RapidTraverseKey190,0460.00350.058900001
Warmup190,0460.03740.189800001
Phase42,11617.66145.0887113202123
Phase_compressed42,1163.83071.504612456
Production190,0460.61180.487300111

References

  1. Scientific Data. Data Repository Guidance. 2022. Available online: https://www.nature.com/sdata/policies/repositories (accessed on 4 March 2024).
  2. Engelmann, B.; Schmitt, A.M.; Theilacker, L.; Schmitt, J. Implications from Legacy Device Environments on the Conceptional Design of Machine Learning Models in Manufacturing. J. Manuf. Mater. Process. 2024, 8, 15. [Google Scholar] [CrossRef]
  3. Biju, V.G.; Schmitt, A.M.; Engelmann, B. Assessing the Influence of Sensor-Induced Noise on Machine-Learning-Based Changeover Detection in CNC Machines. Sensors 2024, 24, 330. [Google Scholar] [CrossRef] [PubMed]
  4. Miller, E.; Borysenko, V.; Heusinger, M.; Niedner, N.; Engelmann, B.; Schmitt, J. Enhanced changeover detection in industry 4.0 environments with machine learning. Sensors 2021, 21, 5896. [Google Scholar] [CrossRef] [PubMed]
  5. Creative Commons. CC BY 4.0 Deed Attribution 4.0 International. 2024. Available online: https://creativecommons.org/licenses/by/4.0/deed.en (accessed on 21 March 2024).
Figure 1. Occurrences: 2 phases.
Figure 1. Occurrences: 2 phases.
Data 09 00066 g001
Figure 2. Occurrences: six phases.
Figure 2. Occurrences: six phases.
Data 09 00066 g002
Figure 3. Occurrences: 23 phases. 14*: Phases 14 + 15, 16*: Phases 16 + 17.
Figure 3. Occurrences: 23 phases. 14*: Phases 14 + 15, 16*: Phases 16 + 17.
Data 09 00066 g003
Figure 4. Architecture for data acquisition [2].
Figure 4. Architecture for data acquisition [2].
Data 09 00066 g004
Table 1. Milling-related data sets in data generalist repositories.
Table 1. Milling-related data sets in data generalist repositories.
Repository NameFocusNo. of Milling Related Data SetsManufacturing Relation
Open Science Framework (OSF)Generalist30
FigshareGeneralist with engineering subcategories4 *3
ZenodoGeneralist with engineering subcategories43
DatadryadGeneralist00 **
Science Data BankGeneralist with engineering subcategories165
Harvard DataverseGeneralist with engineering subcategories62
* Categories: Mechanical, Materials, Other engineering, ** Mostly biological data sets.
Table 2. Recorded sessions in the data set.
Table 2. Recorded sessions in the data set.
Session No.File NameData RowsData Rows per Hour
1Data_from_2021-11-26.csv123881594
2Data_from_2021-12-07.csv134102038
3Data_from_2021-12-17.csv153961811
4Data_from_2022-01-20.csv181351844
5Data_from_2022-01-21.csv198101801
6Data_from_2022-02-14.csv75261926
7Data_from_2022-02-16.csv115821878
8Data_from_2022-02-18.csv78181956
9Data_from_2022-02-22.csv254222383
10Data_from_2022-03-02.csv153981949
11Data_from_2022-03-15.csv178282097
12Data_from_2022-04-05.csv82871579
13Data_from_2022-04.csv163182028
Table 3. Duration of complete sessions, duration of changeover period and related products.
Table 3. Duration of complete sessions, duration of changeover period and related products.
No.Total TimeChangeover TimeProduction TimeOld ProductNew Product
106:07:2904:43:0001:24:2919
206:34:4501:03:3705:31:08210
308:29:5402:04:5806:24:56311
409:49:5504:49:5804:59:5745
510:29:5501:44:5808:44:5754
603:54:2701:59:2801:54:3436
706:09:5703:10:0002:59:57612
803:59:4301:14:5802:45:3875
910:39:5604:04:5706:34:5951
1007:53:5301:33:5506:19:5844
1108:30:0403:54:5704:35:07413
1205:14:5701:49:5903:24:5858
1308:02:5305:36:3502:26:18814
Table 4. Recorded features.
Table 4. Recorded features.
No.Feature NameDescription
1CoolantFlowCoolant flow turned on/off
2ProgramStatusStatus Code:
idle: 0
started: 1
running: 2
stopped: 3
finished: 4
completed: 5
interrupted: 6
error: 7
canceled: 8
selected: 9
not_selected: 10
cleared: 11
3ProgramDetailSee ProgramStatus, additionally includes program name
4ToolNumberTool number and name
5PocketTablePlace table for tool changer, number, and tool name
6DriveStatusDrive turned on/off
7DoorStatusMainMain door open/closed
8DoorStatusToolingTooling door open/closed
9CabinDoorLockFrontFront cabin door locked/unlocked
10CabinDoorLockSideSide cabin door locked/unlocked
11DNCModeDNC mode on/off
12SpindleCleaningSpindle cleaning on/off
13ChipCleaningGunStatusChip cleaning gun on/off
14OverrideSpindleSpindle override (0–150%)
15OverrideFeedFeed override (0 to 150%)
16FeedRateFeed rate (−32,710 to 32,767 m/s)
17SpindleSpeedSpindle speed (0 to 10,046 rpm)
18SpindleApprovalPLC spindle lock on/off
19RapidTraverseKeyRapid traverse on/off
20WarmupMachine in warmup program on/off
Table 5. Number of features in the recorded sessions.
Table 5. Number of features in the recorded sessions.
No.Time Period StartTime Period EndNumber of Features
12021-11-26 07:15:362021-11-26 15:01:5920
22021-12-07 07:39:132021-12-07 14:13:5920
32021-12-17 14:00:012021-12-17 22:29:5919
42022-01-20 09:10:012022-01-20 18:59:5919
52022-01-21 11:30:012022-01-21 22:29:5919
62022-02-14 11:35:012022-02-14 15:29:3019
72022-02-16 11:30:012022-02-16 17:39:5919
82022-02-18 07:00:142022-02-18 11:00:0019
92022-02-22 11:45:012022-02-22 22:24:5820
102022-03-02 07:06:052022-03-02 15:00:0020
112022-03-15 06:58:002022-03-15 15:28:0820
122022-04-05 07:30:012022-04-05 12:45:0019
132022-04-05 13:06:112022-04-08 09:59:1019
Table 6. Different changeover phases, updated from [4].
Table 6. Different changeover phases, updated from [4].
6-Phases23-PhasesDescription
1-Start1Reporting changeover start at the terminal
2Machine table cleaning
3Remove fixture from the machine table
4Component fastening to fixture
5Move fixture to the machine table
6Attach fixture to the machine table
7NC program loading
8Tool presetting
9Tool magazine filling and tool control
10Enter tool dimensions
2-Main11Insert/remove tool directly into/from spindle
12Zero point setting
13NC program optimizing and running
3-End14 * + 15Component cleaning and dismantling
16 * + 17Component deburring and remeasuring
18 **Upoad the optimized NC program
19Reporting changeover stop at the terminal
4-Idle/break20Idle/break
5-Production21Production
6-Quality22General quality control
23Workpiece quality control
* only first number used for labeling. ** not used in this research.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Schmitt, A.-M.; Engelmann, B. A Series Production Data Set for Five-Axis CNC Milling. Data 2024, 9, 66. https://doi.org/10.3390/data9050066

AMA Style

Schmitt A-M, Engelmann B. A Series Production Data Set for Five-Axis CNC Milling. Data. 2024; 9(5):66. https://doi.org/10.3390/data9050066

Chicago/Turabian Style

Schmitt, Anna-Maria, and Bastian Engelmann. 2024. "A Series Production Data Set for Five-Axis CNC Milling" Data 9, no. 5: 66. https://doi.org/10.3390/data9050066

Article Metrics

Back to TopTop