1. Introduction
Due to its excellent property combinations and ability to specifically adjust tailor-made microstructures, steel is still the world’s most important engineering and construction material and is omnipresent in every aspect of our lives. It can also be recycled over and over again without loss of property [
1]. In addition to the variation in chemical composition, steel owes its tremendous variety of property combinations to the large spectrum of process routes and heat treatments. Steel research is still indispensable, continuously leading to constant further developments and improvements. There are more than 3500 steel grades, and 75% of modern steel grades have been developed in the last 20 years [
1]. One of many consequences is that the microstructures have constantly evolved and become finer and more complex, thus requiring advanced characterization and classification approaches. This is especially true for steel designs that include bainitic microstructures.
For a reliable and reproducible characterization of complex microstructures, machine learning (ML) based microstructure classification offers exciting potentials. Prominent examples for ML classifications of steel microstructures include Gola et al. [
2,
3], who used a combination of morphological and textural parameters with a support vector machine (SVM) to classify the carbon-rich second phase of two-phase steels into pearlite, bainite, and martensite. Azimi et al. [
4] applied deep learning (DL) to the same dataset to classify pearlite, bainite, martensite, and tempered martensite. DL was also used by DeCost et al. [
5] for the classification of ultrahigh carbon steel microstructures. General overviews of the spectrum of ML applications in microstructure research can be found in [
6,
7].
In this context, ML offers promising opportunities for the classification of the different subclasses of the steel microstructure bainite, as well. Bainite is a typical constituent of modern high strength steels, notably low-carbon and low-alloy steels, which combine high strength and high toughness, making these types of steel interesting for many applications. To adjust the desired strength or toughness of these steels, it is crucial to know and understand what types of bainite are present, depending on chemical composition and processing parameters. A ML classification of bainite subclasses can be the basis for a sophisticated microstructure quantification, enabling process–microstructure–property correlations to be established. Thereby, it can form the backbone for further microstructure-centered materials development, which is needed to fulfill the increasing demands and tighter tolerances in today’s steel industry.
The characterization or classification of bainite, however, is a difficult task, due to the variety and amount of the phases involved as well as the fineness and complexity of the structures. The continuous advancement of alloying concepts and processing routes has led to more and more diversity in bainitic structures, so that the simple first classification schemes, such as upper and lower bainite, are no longer sufficient. In this context, the definition of classes and the assignment of the ground truth for a ML classification must be discussed. It should be noted that, especially for complex microstructures, ML cannot be applied as a panacea, without precisely grasping the complex material-specific questions, but special attention must be paid when assigning the ground truth for the ML model [
8]. The diversity in bainitic structures can cause ambiguous interpretations and lead to a lack of consensus among human experts in labeling and classifying them. There is also no consistent nomenclature to describe bainitic microstructures [
9,
10], and many different classification schemes can be found in the literature. Existing schemes are usually based on the description of morphologies and arrangement of the ferritic and the carbon-rich phases. The first concept of classification schemes provides a description of the bainite type in one integral expression, e.g., [
11,
12,
13,
14,
15]. The second concept describes the ferritic and the carbon-rich phase separately, e.g., [
16,
17,
18,
19].
Approaches for a more objective ground truth assignment for ML segmentation or classification include Shen et al. [
20], who use electron backscatter diffraction (EBSD) to generate annotations for DL segmentation of steel microstructures. Müller et al. [
8] propose the use of EBSD, reference samples, and unsupervised learning as supporting methods for assigning the ground truth, demonstrated on a bainite case study. Given the above-mentioned challenges in dealing with bainite, it is not surprising that only a few approaches to the automated classification of steel microstructures, including simultaneously present bainite subclasses, are found in the literature. Although ML approaches for microstructure classification were applied by Gola et al. [
2,
3] and Azimi et al. [
4], all structures that were neither pearlite nor martensite were labeled as bainite and consequently, bainitic subclasses are not yet considered. Müller et al. [
21] employed textural parameters combined with ML to classify pearlite, martensite, and four bainite subclasses in specifically produced reference samples. Textural parameters and ML were also used by Tsutsui et al. [
22] for classifying samples with bainite and martensite. Non-ML based approaches for bainite classifications include Zajac et al. [
15,
23], who utilized misorientation angle distribution from EBSD measurements to differentiate granular, upper, and lower bainite. Ackermann et al. [
24] applied correlative characterization (electron probe microanalysis, EBSD, and nanohardness) to classify low-, medium-, and high-temperature bainite morphologies. A combination of EBSD and ML are used by Tsutsui et al. [
24], who utilize misorientation parameters and variant pairs from EBSD to distinguish bainite formed at high and low temperatures, as well as martensite and bainite-martensite mixtures.
The present paper follows the approach applied by Gola et al. [
3], i.e., the machine learning classification of the carbon-rich second phase objects in multi-phase-steels, based on scanning electron microscope (SEM) images. Here, bainite subclasses are now to be considered, resulting in seven classes for the ML classification: pearlite, martensite, and five bainite subclasses. The task is the automated, objective, and reproducible classification of the carbon-rich second phase objects in SEM micrographs, as illustrated in
Figure 1. Several classes can be present simultaneously in one micrograph. This classification in turn will enable a precise calculation of phase fractions and microstructural quantification, which again is the basis for establishing processing–microstructure–property correlations and further materials development.
First, dataset generation, ground truth assignment, and ML concepts will be described. Assigning the ground truth for the ML classification proved to be challenging. The investigated industrial samples do not show many textbook-like structures, as complex alloying concepts and industrial thermomechanical processing lead to structures that are not as clear and distinct as schematics reported in the literature. To achieve a well-founded and objective ground truth, round robin tests with a group of experts as well as supporting methods, such as the use of reference samples and correlative EBSD measurements, as described in previous works [
8,
25], were taken into account. In this context, it must be emphasized that the assignment of ground truth or available data and ML algorithms should not be treated in isolation, but rather as part of a holistic approach to building the ML model, starting with the selection of appropriate samples and achieving reproducible sample contrasting and suitable imaging techniques [
8].
Regarding ML approaches, different classification models and strategies will be tested and discussed. Also, misclassifications of the model will be evaluated. Considering the above-mentioned challenges regarding the characterization and classification of bainite, i.e., ambiguous interpretations by different experts, selection of the classification scheme, definition of classes and class boundaries, and assignment of the ground truth, a perfect classification result cannot be expected. Instead, an “inherent uncertainty” of a bainite classification can be assumed. Approaches on how to handle this uncertainty and how it influences the final phase fraction result will be discussed.