Next Article in Journal
Tailoring the Diagnostic Pathway for Medical and Surgical Treatment of Uterine Fibroids: A Narrative Review
Previous Article in Journal
Evaluation of Changes in the Patency of the Nasal Cavity and Eustachian Tube Depending on the Phase of the Menstrual Cycle: A Pilot Study
Previous Article in Special Issue
Hybrid Feature Mammogram Analysis: Detecting and Localizing Microcalcifications Combining Gabor, Prewitt, GLCM Features, and Top Hat Filtering Enhanced with CNN Architecture
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Quantitative Measurement Method for Nuclear-Pleomorphism Scoring in Breast Cancer

by
Chai Ling Teoh
1,
Xiao Jian Tan
1,2,3,*,
Khairul Shakir Ab Rahman
4,
Ikmal Hisyam Bakrin
5,
Kam Meng Goh
1,2,
Joseph Jiun Wen Siet
1 and
Wan Zuki Azman Wan Muhamad
3,6,7
1
Department of Electrical and Electronics Engineering, Faculty of Engineering and Technology, Tunku Abdul Rahman University of Management and Technology (TAR UMT), Jalan Genting Kelang, Setapak, Kuala Lumpur 53300, Malaysia
2
Biomedical and Bioinformatics Engineering (BBE) Research Group, Centre for Multimodal Signal Processing, Department of Electrical and Electronic Engineering, Faculty of Engineering and Technology, Tunku Abdul Rahman University of Management and Technology (TAR UMT), Jalan Genting Kelang, Setapak, Kuala Lumpur 53300, Malaysia
3
Sports Engineering Research Centre (SERC), Universiti Malaysia Perlis (UniMAP), Arau 02600, Perlis, Malaysia
4
Department of Pathology, Hospital Tuanku Fauziah, Jalan Tun Abdul Razak, Kangar 01000, Perlis, Malaysia
5
Department of Pathology, Faculty of Medicine and Health Sciences, Universiti Putra Malaysia (UPM) Serdang, Serdang 43400, Selangor, Malaysia
6
Institute of Engineering Mathematics, Universiti Malaysia Perlis (UniMAP), Kampus Pauh Putra, Arau 02600, Perlis, Malaysia
7
Centre of Excellence for Advanced Computing (ADVCOMP), Universiti Malaysia Perlis (UniMAP), Arau 02600, Perlis, Malaysia
*
Author to whom correspondence should be addressed.
Diagnostics 2024, 14(18), 2045; https://doi.org/10.3390/diagnostics14182045
Submission received: 14 August 2024 / Revised: 8 September 2024 / Accepted: 13 September 2024 / Published: 14 September 2024
(This article belongs to the Special Issue Quantitative and Intelligent Analysis of Medical Imaging, 2nd Edition)

Abstract

:
Background/Objectives: Nuclear pleomorphism, a crucial determinant of breast cancer grading under the Nottingham Histopathology Grading (NHG) system, remains inadequately quantified in the existing literature. Motivated by this gap, our study seeks to investigate and establish correlations among morphological features across various scores of nuclear pleomorphism, as per the NHG system. We aim to quantify nuclear pleomorphism across these scores and validate our proposed measurement method against ground-truth data. Methods: Initially, we deconstruct the descriptions of nuclear pleomorphism into three core elements: size, shape, and appearance. These elements are subsequently mathematically modeled into equations, termed E S i z e , E S h a p e , and E A p p e a r a n c e . These equations are then integrated into a unified model termed Harmonic Mean (HM). The HM equation yields a value approaching 1 for nuclei demonstrating characteristics of score-3 nuclear pleomorphism and near 0 for those exhibiting features of score-1 nuclear pleomorphism. Results: The proposed HM model demonstrates promising performance metrics, including Accuracy, Recall, Specificity, Precision, and F1-score, with values of 0.97, 0.96, 0.97, 0.94, and 0.95, respectively. Conclusions: In summary, this study proposes the HM equation as a novel feature for the precise quantification of nuclear pleomorphism in breast cancer.

1. Introduction

To date, breast cancer remains a global challenge. Recent reports have found that the incidence rates of breast cancer are soaring in developed countries, whereas high mortality rates are reported in lesser-developed countries, mainly due to late diagnoses [1,2,3]. In 2020, a total of 684,996 deaths (15.5% of all cancer cases amongst women) were reported, leading to the disease being proclaimed as the leading cause of mortality (cancer) amongst women across the globe [4,5]. The NHG system is a semi-quantitative system recommended by the World Health Organization (WHO), a modified grading procedure originating from the Scarff–Bloom–Richardson grading system [6]. Nowadays, the NHG system is widely used as the gold standard for breast cancer grading purposes, where nuclear pleomorphism is one of the main features underpinning the final grade of this grading procedure [7].
Breast cancer is a non-communicable disease which emerges in variegated forms, is self-subsistent, and interacts dynamically with its microenvironment through an adaptive process. Given these heterogeneous properties, scoring of nuclear pleomorphism in accordance with the semi-quantitative NHG system could be challenging. Table 1 shows the description of the semi-quantitative NHG system in nuclear pleomorphism, with scores ranging from 1 to 3 [7].

2. Related Works

Grading of nuclear pleomorphism is tedious, cumbersome, time-consuming, and highly susceptible to variations in the experience levels of histopathologists [8,9]. The conventional grading procedure suffers from inter- and intra-observer variability and could impinge the grading outputs, leading to suboptimal diagnostic and prognostic outcomes. Thanks to the advancements in imaging engineering, the introduction of the whole-slide-imaging scanner has made medical image processing/analysis possible. This has prompted the exploration of repeatable and reproducible algorithms in the detection of nuclear pleomorphism using variegated approaches.
Recent works pertaining to the detection of nuclear pleomorphism can be characterized into two main approaches: handcrafted- and learned-features approaches. Using the handcrafted-features approach, Das, Nair, and Peter proposed a kernel-based Fisher discriminant analysis on the Riemannian manifold (KFDAR), which takes advantage of the kernel trick to embed the non-linear Riemannian manifold M into a higher-dimensional linear Hilbert space H [10]. This subspace was then reduced to a lower-dimensional and more discriminative subspace wherein the samples can be linearly separated. The log-Euclidean metric and the two symmetrized Bregman divergences, the Stein and Jeffrey divergences, are the three Riemannian distance metrics that constitute the basis of the kernel technique developed for the Hilbert space-embedding and kernel-discriminant analysis. The experimental outcomes demonstrated that the proposed mapping to a highly discriminative space had been successful in separating the histopathology images belonging to different cancer grades. As a result, the proposed method outperformed some existing algorithms for cancer grading, both qualitatively and quantitatively.
In the same year, the same team worked on another approach in nuclear-pleomorphism detection via sparse coding and dictionary-learning on symmetric positive definite (SPD) matrices for the grading of breast cancer [11]. The motivation for this method mainly lies within the succession of SPD matrices arising from bundling the difficult approaches within computer vision and machine learning. A convex issue in the higher-dimensional RKHS space was created from the sparse coding issue on the SPD manifold Sn+ in order to better make use of the separability of the cancer grades. The suggested covariance-based SPD matrices were modelled as the sparse merging of Riemannian dictionary atoms, which together constitute a Riemannian manifold. By embedding kernels from the log-Euclidean metric to reproduce the kernel Hilbert space, Jeffrey and Stein divergences, and comparison with the non-kernel-based affine-invariant Riemannian metric, the nonlinearity of the SPD manifold is addressed. The task includes utilizing the kernel approach for the Riemannian manifold’s Hilbert space-embedding, which can improve breast cancer tissue discrimination through the use of sparse representation over learned dictionaries. The performance of the proposed method was found to be better than those of some existing algorithms, in both quantitative and qualitative analysis. The proposed work found that the learned geodesic distances with the Riemannian dictionary atoms were more sensitive and appropriate for the nuclear-pleomorphism score.
In 2020, Das et al. [9] proposed a Riemannian manifold method, a non-Euclidean framework used to investigate the potential for active learning on the nuclear-pleomorphism score. An active learning technique was used in a batch-mode framework that flexibly determined the appropriate batch size as well as the group of samples to query by using the submodular optimization framework. The score of the cancer nuclear pleomorphism depends on the batch-mode active learning, which relies on the Riemannian manifold. This flexibly recognizes the examples used for manual labelling, including the complexities and usefulness of tissue samples as well as the investment cost of labelling data. The Transductive Multi-Label Learning (TRAM) technique was used to replace the need for manual annotation by predicting the class labels of the histological breast carcinoma pictures. Jeffrey divergence, Stein divergence, and log-Euclidean metric were the three kernelized Riemannian metric variations that were subjected to sub-modularity-based dynamic batch-mode active learning and evaluated with contemporary algorithms to define the breast cancer nuclear-pleomorphism grading. Given that it utilizes data from unlabeled samples, adaptive Batch-Mode Active Learning’s results on the Riemannian metric indicates better performance than some existing approaches for evaluating the nuclear atypia in breast cancer.
Salahuddin et al. [12] proposed a novel approach to data sampling that made use of formal concept analysis. The proposed pattern-based hyper-conceptual sampling was a novel mix of conventional sampling methodologies. Formal concepts served as the foundation for the sampling process. The proposed method begins with a conversion to a formal concept analysis and then a conversion is made to an object pattern table. For pattern reduction, the proposed method made use of the hyper-context method. The sample was then generated using the coupling sampling process. To assess the quality of the samples, machine learning experiments are implemented. The performance of the suggested approach was justified by the results of another sampling technique. Using binary distribution, the final sample comprises a very condensed sample that accurately represents the functional dependencies and correlations found in the original dataset. As a result, the proposed method was found promising in producing outcomes that are competitive.
Wan et al. [13] introduced a computer-aided grading system which works dependently with multi-level characteristics and cascaded Support Vector Machine (SVM) classification. To objectively define morphological patterns and understandable concepts from the breast cancer tissue images, pixel-, object-, and semantic-level features were first retrieved. The nuclei from the images were segmented using an enhanced hybrid active contour model-based segmentation technique. In combination with object-level (architecture) features and pixel-level (texture) features, the semantic-level features such as the proportions of nuclei were abstracted with a Convolutional Neural Networks (CNN) approach and defined corresponding to various grade levels. This resulted in an interesting mix of image characteristics that may perform better than either feature subtype on its own. The performance was maximized by utilizing several feature sets collected at various levels; a cascaded technique was used to train multiple SVM classifiers by using the composition of the feature subtypes. To increase the accuracy of the grading process, a cascaded multi-class classification method was used to pyramidally classify images in a series of more-demanding categorization activities. The final class of the cancer grade was calculated by averaging the results of the various SVM classifiers. The proposed method was found computationally effective and adaptable to huge datasets, as the proposed method used only a three-layer CNN model and concurrent processing.
Faridi et al. [14] proposed an automated computer-aided detection (CAD) system that comprises methodologies such as detection, segmentation, and scoring of nuclear pleomorphism. The proposed CAD system starts with the pre-processing stage, such that the hematoxylin and eosin (H/E)-stained input images are first unmixed into their respective sub-component channels. The cores of malignant nuclei were captured in the nuclei-borders analyses that were extracted in Step Two after the preprocessed image had been subjected to morphological procedures and a Difference of Gaussian filter. In the segmentation stage, the segmented nuclei were graded to fulfil a requirement of the NHG system. Finalizing the suggested approach requires scoring segmented nuclei in accordance with the derived features using four criteria that distinguish between malignant and healthy nuclei. Features for nuclear-pleomorphism scoring include the size of the nuclei, the density of the chromatin, the regularity of the contours, and the presence of nucleoli. The proposed CAD system was found promising, and demonstrated an improved accuracy as compared to some existing techniques in the detection of malignant nuclei, with an accuracy of 86.6%.
In the learned-features approach, recent studies have used the deep CNN architecture for grading breast tumors. For cancer grading purposes in nuclear pleomorphism, Wan et al. proposed mixed CNN-derived semantic-level descriptors together with the object-based data and pixel-based data [13]. Another CNN-based approach was proposed by Mollahosseini and Mahoor, namely, the Deep Belief architecture based Deep Neural Network (DBN-DNN) and Multi-Resolution Convolutional Network (MR-CN) together with Plurality Voting (MR-CN-PV) models, which were built on Restricted Boltzmann Machines (RBM). The proposed method was a multitask CNN architecture that can simultaneously forecast the cancer malignancy attribute and level of magnification.
Numerous variegated forms of the CNN architecture have also been proposed for breast cancer grading focusing on nuclear pleomorphism, including, for example, deep belief networks [15], residual networks [16,17], and inception networks [18,19]. The deep CNN architecture had been used by [19,20] for grading breast tumors. The use of these deep neural networks had demonstrated promising performance in the detection of nuclear pleomorphism. These learning-based algorithms, however, require high-performance computing resources, for example, graphics processing units (GPUs), and were found extremely computationally intensive.
Das et al. [21] proposed a semi-supervised learning framework focusing on nuclear atypia scoring (NAS) with a deep neural network-based generative adversarial training, namely, NAS-SGAN, with a minimal labelled dataset. The proposed NAS-SGAN model comprises a generator and a discriminator model that has undergone adversarial training with both labelled and unlabeled samples. The discriminator model was created as an unsupervised model that contained layers, as opposed to the supervised model. The proposed model retrieved information pertaining to the population of data by removing discriminative features. The generator model was learned using a consistent attribute corresponding objective function that used compounded GAN architecture. Although a small number of labelled samples were used, experimental research demonstrated that the proposed model could achieve promising accuracy in distinguishing between various cancer grades, enhancing the durability and precision score of the system.
Karimi and Danyali [22] proposed a combination of the CNN for feature extraction and a two-layer Long Short-Term Memory (LSTM) for detection of nuclear pleomorphism in breast cancer. The size of the histopathology images and the limited amount of training data compel the introduction of a patch-based technique. First, the most significant features in the image were identified. Next, a three-hidden-layer CNN was created and used for feature extraction and to categorize the patches separately. For image grading, the LSTM network was used to take into account all features of an image concurrently. The novelty of the proposed method lies within the capability to distinguish between different cancer grades in accordance with the nuclear-pleomorphism scores without performing the nuclei-segmentation stage.
Alom et al. [19] proposed an Inception Recurrent Residual Convolutional Neural Network (IRRCNN) model, comprised of binary and multi-class approaches, for nuclear-pleomorphism scoring purposes. The IRRCNN, recognized as a strong Deep CNN (DCNN) model that integrates the strong performance of the Residual Network (ResNet), Recurrent Convolutional Neural Network (RCNN), and Inception Network (Inception-v4). The proposed IRRCNN was found to outperform the existing methods, including the Inception Networks, Residual Networks, and RCNNs. Testing and validation were performed using two publicly accessible datasets, namely, the Breast Cancer Classification Challenge 2015 and BreakHis. In terms of the patch-based, picture-based, patient-level, and image-level classification, the detection outputs of the proposed IRRCNN were compared with some existing methods using state-of-the-art machine learning and deep learning-based methodologies. The proposed IRRCNN was found promising, achieving higher outputs in terms of area under the curve (AUC) and overall detection accuracy.
Jiang et al. [20] proposed a CNN-based network, namely, the Breast Cancer Histopathology Image Categorization Network (BHCNet), for the categorization of breast cancer histopathology images. The proposed BHCNet comprises a Squeeze and Excitation (SE) ResNet module which could achieve comparable performance with fewer parameters. The proposed method implemented a new learning-rate scheduler, namely, the Gauss error scheduler, enabling the BHCNet to achieve promising performance without a comprehensive fine-tuning process.
Rakhlin et al. [23] proposed a computational strategy that depends on deep convolution neural networks for the categorization of breast cancer histopathology images. The gradient-boosted trees classifier and numerous deep neural network topologies were used in the proposed detection module. Strong data augmentation and deep convolutional features retrieved at various scales with freely available CNNs trained on ImageNet were used to boost the classifier’s robustness. Furthermore, the proposed module implemented a precise and susceptible-to-overfitting implementation of the gradient boosting algorithm. The proposed module achieved 87.2% accuracy for the four-class categorization task. For a two-class classification task, the proposed module demonstrated 93.8% in accuracy, 97.3% in AUC, and 96.5/ 88.0% in sensitivity/specificity in the detection of breast cancer at the high-sensitivity operational point.
In both handcrafted- and learned-features approaches, to date, in-depth analysis of the description of nuclear pleomorphism in accordance with the NHG system (as in Table 1) is very limited. Detection or modelling algorithms fully aligned with such descriptions across different scores are not available. There are a number of recent works that demonstrate promising capability in the detection of nuclear pleomorphism in breast cancer. These methods, however, partly or entirely differ from the description stipulated by the NHG system, barricading their implementation in real-world applications. In recent years, quantitative measurement and modelling of nature structures, for example, fetal cardiovascular structures [24], tubule structures [25], and androgenetic alopecia via hair diameter measurement [26], are emerging. These approaches reveal the in-depth features underpinning the object of interest and provide a systematic approach to understanding complex phenomena by capturing the underlying relationships and interactions between different variables [27,28]. As this approach aligns with human perception, trust can thus be established in the output model. Motivated by these determinations, the contributions of the present study are as follows: (1) to investigate and correlate morphological features across different scores concerning nuclear pleomorphism in accordance with the NHG system; (2) to quantify the nuclear pleomorphism via mathematical modelling; and (3) to validate and benchmark the proposed measurement method against the ground truth. The main novelty of the proposed measurements over the other existing findings is that the proposed measurements provide a clear morphological meaning for nuclear pleomorphism across different scores, alongside producing measurable output that enables quantification of qualitative nuclear-pleomorphism features in accordance with the NHG system.
The paper is organized as follows. Section 3 provides a detailed description of the proposed methods. Section 4 describes the dataset and presents the results obtained from this study. The limitations of the study and ideas for future work are given in Section 5. Section 6 summarizes the present study.

3. Methods

3.1. Theoretical Framework and Mathematical Modelling

Upon scrutinizing Table 1, the descriptions of nuclear pleomorphism across scores 1 to 3 can be broken down into three main elements: size, shape, and appearance. Figure 1 illustrates the aforementioned breakdown.
A natural question now is how to correlate the three main elements into one usable model to effectively quantify the qualitative descriptions of the nuclear pleomorphism. To achieve this, three mathematical equations are proposed, namely, E S i z e , E S h a p e , and E A p p e a r a n c e , for size, shape, and appearance elements, respectively. In this study, an equation, namely, HM, is proposed to integrate these three equations into a single, usable model. We hypothesize that the HM value for nuclear pleomorphism would fall within the range of [0, 1]. Specifically, the equation expects that a score-3 nuclear pleomorphism has an HM value approximated to 1, whereas a score-1 nuclear pleomorphism is approximated to 0.
The mathematical model of the size element is straightforward. The ratio of nucleus size variation is computed as follows:
E P S i z e = x i k k ;   s u c h   t h a t   1 i N
E S i z e = 1 ,    i f   E P S i z e 1 E P S i z e ,    i f   0.5 E P S i z e < 1 0 ,    i f   E P S i z e < 0.5
where x i denotes the size of the ith nucleus; N denotes the maximum number of nuclei; and k denotes a constant which refers to the size of the benign epithelial cell nuclei. In this study, the k constant is set as 849. The proposed k constant is determined systematically based on a comprehensive analysis of 200 benign epithelial cell nuclei. The 200 benign epithelial cell nuclei were first segmented using the CellProfiler 3.0 software, using the same procedure/setting as proposed in Step 2 in Section 3.2. Next, the mean value, in pixel area, of the benign epithelial cell nuclei was calculated. This pixel area was then used as the k constant herein. Prior to proceeding to the next step, Equations (1) and (2) were tested and validated using an independent dataset with the size of 150 nuclear pleomorphisms (specifically, 50 nuclear pleomorphisms from Scores 1, 2, and 3, respectively), to justify the applicability of the proposed k constant. The mathematical model for size element comprises a two-stage calculation, namely, E P S i z e and E S i z e . The E P S i z e is first calculated to determine the nucleus size variation, whereas E S i z e is then used to integrate rules in accordance with the descriptions of nuclear pleomorphism across different scores. The equation yields a value of 1 when the size of the nucleus is >2 times the size of the benign epithelial cell nuclei, whereas a value approaching 0 is returned when the size of the nucleus is 1.5 to 2 times or <1.5 times of the benign epithelial cell nuclei.
To model the shape element, an isoperimetric quotient is used. The equation is as follows:
E S h a p e = 1 4 π A r P e r i 2
where Ar and Peri denote Area and Perimeter, measured in pixel values, respectively. The core idea of Equation (3) is to differentiate between a circular and an irregularly shaped object using a single value within the range of [0, 1]. For a nucleus in a perfect circle shape, the equation yields a value of 0, reflecting the description of score-1 nuclear pleomorphism with an assumption of minimal pleomorphism. For irregular shapes or bizarre objects, the value would be greater than 0 (or approaching 1), reflecting the description of score-3 nuclear pleomorphism, with an assumption of marked variation in shape.
Motivated by one of the previous works, specifically one on the study of nucleus shape in breast histopathology images [8], the appearance element is modelled as follows:
E A p p e a r a n c e = H o l A r
where Hol denotes the number of hollow pixels in the cell nucleus (examples in Figure 2), reflecting the pattern and distribution of chromatin and visibility of nucleoli in the cell nucleus. The equation yields a value approaching 1 if and only if the hollow pixels are dominant in the nucleus, whereas a value approaching 0 is returned if and only if the hollow pixels are scarce.
The HM equation is then implemented to integrate the three elements (i.e., size, shape, and appearance) into one usable model by integrating Equations (2) to (4) into Equation (5), as follows:
H M = τ w 1 1 E S i z e + w 2 1 E S h a p e + w 3 1 E A p p e a r a n c e ;
s u c h   t h a t   w 1 = 0 ,   i f   E S i z e = 0 w 1 ,   o t h e r w i s e
s u c h   t h a t   τ = 2 ,   i f   w 1 = 0 3 ,   o t h e r w i s e
where w 1 , w 2 , and w 3 are the weightage of E S i z e , E S h a p e , and E A p p e a r a n c e , respectively. It is important to remark that when E S i z e = 0 (based on Equation (2)), w 1 is set as 0 (and τ is set as 2, as the HM equation now considers only two elements) and rules out the consideration of E S i z e in the HM equation. This is mainly because, mathematically, a division by a zero would result in an infinity value. In this study, all three elements shared same weightage, and thus w 1 , w 2 , and w 3 are set as 1. The HM is defined as the reciprocal of the arithmetic mean of the reciprocals of the three elements in nuclear pleomorphism. This equation provides a method to aggregate the three features, giving equal importance to each element (provided that w 1 , w 2 , and w 3 are set as 1). The weightage of each element can be adjusted by modifying the value of w 1 , w 2 , and w 3 and thus provide higher importance to a specific element. The HM equation yields a value approaching 1 when the nucleus demonstrates characteristics of score-3 nuclear pleomorphism, whereas a value approaching 0 is returned when the nucleus demonstrates characteristics of score-1 nuclear pleomorphism.

3.2. Methodology Pipeline

The implementation of proposed equations concerning the size, shape, and appearance, as well as the HM value, in quantifying the nuclear pleomorphism is simple. The methods are as follows:
  • Pre-processing: Extract the H-channel of the input breast histopathology images by converting the RGB input images into optical density space via singular value decomposition [29].
  • Nucleus segmentation: Segment the nucleus using CellProfiler 3.0 [30].
  • Post-processing: If necessary, manual intervention from an expert is involved, such that the cell boundary (pixel-based) is manually edited under the expert’s supervision.
  • Calculation: Calculate the E S i z e , E S h a p e , and E A p p e a r a n c e , using the Equations (2) to (4), respectively.
  • Measurement of nuclear pleomorphism: Quantify and measure the nuclear pleomorphism of a nucleus using the HM equation (Equation (5)).
Perfect nucleus segmentation is known as a highly challenging task in which continuous efforts are made over the years to improve and close the research gaps [31,32]. It is important to remark that in this study, the main purpose is to quantify the nuclear-pleomorphism scoring in accordance with the NHG system. The features extracted in the subsequent stages however are closely related to the succession of the nucleus segmentation stage. Thus, a semi-supervised nucleus segmentation method is employed such that the Cell Profiler 3.0 [30] is used as a software tool to segment the nuclei from the H-channel of the input histopathology images. To obtain the H-channel, the stain-unmixing method is implemented by converting the RGB input images to optical density space via singular value decomposition [29]. The segmentation outputs are reviewed and monitored by a histopathologist to ensure segmentation is performed accordingly. If necessary, manual intervention is used. This serves as a baseline to justify that the features extracted in the subsequent stages are sound and reflect the characteristics of nuclear pleomorphism across different scores. In Steps 4 and 5, calculation and quantitative measurement of nuclear pleomorphism are performed by using the proposed equations, namely, E S i z e , E S h a p e , E A p p e a r a n c e , and HM.
To validate the hypothesis and the applicability of the proposed measurement method, the measurement outputs are tested and validated against the ground truth provided by the histopathologist. For this purpose, a baseline classifier is used, namely, SVM [33,34]. The classification was performed using a fivefold cross-validation SVM classifier with the Radial Basis Function (RBF) kernel. The data are randomly divided into five equal pieces. Each selected piece is chosen as a test set, with training performed on the remaining portion of the data. The cross-validation is then repeated five times, with each subsample used exactly once as the validation data. All the observations are used for both training and validation, and each observation is specifically used for validation.
Evaluation metrics, for example, Accuracy, Recall, Specificity, Precision, and F1-score, are used to statistically analyze the performance against the ground truth. Table 2 summarizes the evaluation metrics used in this study; the tubule that was correctly labeled as a tubule is referred to as a True Positive (TP), the non-tubule that was correctly labeled as a non-tubule is referred to as a True Negative (TN), the non-tubule that was wrongly labeled as a tubule is referred to as a False Positive (FP), and the tubule that was wrongly labeled as a non-tubule is referred to as a False Negative (FN). A promising method would demonstrate high values in all the evaluation metrics.

4. Dataset, Results, and Discussions

4.1. Dataset

The dataset used in this study comprises real histopathology slides of breast cancer which were locally collected in Malaysia. The breast histopathology slides were obtained from the Pathology Department, Hospital Tuanku Fauziah, Kangar, Perlis, Malaysia. A total of 48 breast histopathology slides from 48 patients (one slide per patient) with anonymized identities were collected retrospectively. These slides were prepared under a standard staining procedure using the hematoxylin and eosin (H/E) stains. The staining procedure is aligned with the protocols of the Ministry of Health, Malaysia. An Aperio CS2 whole-slide-imaging scanner was used to convert the histopathology slides into digital form. The scanning procedure was performed under the careful supervision of an experienced histopathologist and an instrument specialist to ensure the histopathology slides were scanned under fixed instrument parameters (e.g., input format and objective lenses). The ethical protocol of this study has been approved by the Medical Research and Committee of the National Medical Research Register (NMRR) Malaysia, with protocol number: NMRR-21-949-58903. For validation and benchmarking purposes, a total of 600 nuclei were demarcated by a histopathologist from the 48 breast histopathology slides, such that 200 nuclei from each score were obtained via convenient sampling. The 600 demarcated nuclei served as ground truth in this study.

4.2. Analysis on the Outputs of E S i z e , E S h a p e , E A p p e a r a n c e , and HM

Figure 3 shows the boxplots analysis of E S i z e , E S h a p e , E A p p e a r a n c e , and HM for scores 1 to 3 for nuclear pleomorphism using 300 ground truth, such that 100 nuclei were included for each score in the analysis. Based on Figure 3a, it is evident that the E S i z e values of score-1 and score-2 nuclear pleomorphism exhibit clear separation, whereas the E S i z e values for scores 2 and 3 are overlapping. Notably, the E S i z e of score-3 nuclear pleomorphism remains consistent across all 100 nuclei, indicating remarkable variation in nucleus size (i.e., >2.0 times).
In Figure 3b, score-1 and score-2 nuclear pleomorphism display overlapping data in E S h a p e , with the E S h a p e outputs from score-2 nuclear pleomorphism slightly lower than those of score-1. Conversely, score-3 nuclear pleomorphism exhibits distinct separation from the others. This delineation is justifiable, as marked variation in shape is a common feature of score-3 nuclear pleomorphism. In the context of breast cancer, pleomorphism refers to variations in the size, shape, and structure of nucleus cells, particularly when compared to benign pre-existing epithelial cells [7]. In this study, E S i z e and E S h a p e are proposed primarily to identify variations in size and shape, respectively. As variation in size and shape are key features of pleomorphism, the proposed E S i z e and E S h a p e complement each other to effectively capture pleomorphism across scores 1 to 3. Therefore, we infer that the score-2 nucleus pleomorphism sampled herein demonstrates variation in size rather than shape, resulting in a boxplot with lower values compared to score-1 nuclear pleomorphism, as depicted in Figure 3b.
In Figure 3c, the outputs of E A p p e a r a n c e across scores 1 to 3 for nuclear pleomorphism comply to the hypothesis made, such that E A p p e a r a n c e yields a value approaching 1 if the hollow pixels are dominant in the nucleus, whereas a value approaching 0 is returned if the hollow pixels are scarce. We could deduce that the Hol is sufficiently effective to reflect chromatin pattern and visibility of nucleoli in a nucleus.
Figure 3d shows the boxplots of HM analysis across scores 1 to 3 for nuclear pleomorphism. It is evident that the boxplots exhibit clear separation across different scores and the compact boxplots reflect consistency and strong agreement when measuring nuclear pleomorphism using the proposed HM, and considering the three main elements, size, shape, and appearance, namely, E S i z e , E S h a p e , and E A p p e a r a n c e , respectively.

4.3. Measurement Outputs

To further illustrate the applicability of the proposed measurement, three samples from each score are provided in Table 3, detailing the original input nucleus patch, segmentation outputs, and measurement outputs of E S i z e , E S h a p e , E A p p e a r a n c e , and HM. Based on Table 3, the proposed measurements, E S i z e , E S h a p e , E A p p e a r a n c e , and HM, appear to be aligned with the hypothesis made in this study, such that the HM values for scores 1 and 3 nuclear pleomorphism are approaching 0 and 1, respectively. The HM values obtained for the score-1 nucleus are 0.3244, 0.3759, and 0.4382; for score-2, the values are: 0.4816, 0.4997, and 0.4985; and for score-3, the values are: 0.7542, 0.6524, and 0.8404.

4.4. Classification Outputs

As detailed in Section 3.2, the SVM with RBF kernel is used and serves as a baseline classifier to assess the viability of the proposed measurement. Here, the proposed E S i z e , E S h a p e , E A p p e a r a n c e , and HM are used as new input features to measure and discriminate nuclear pleomorphism across different scores. As fivefold cross-validation is used, the mean values of the classification outputs are initially computed. Figure 4 shows the classification outputs for E S i z e , E S h a p e , E A p p e a r a n c e , and HM, using the dataset.
Based on Figure 4a–c, using the proposed E S i z e , E S h a p e , and E A p p e a r a n c e alone as an input feature into the classifier is insufficient to discriminate nuclear pleomorphism into different scores accurately. The obtained Accuracy, Recall, Specificity, Precision, and F1-score for scores 1 to 3 are found to be lower than 0.80, denoting sub-optimal classification outcomes. In Figure 4a, the Accuracy, Specificity, Precision, and F1-score obtained for scores 2 and 3 nuclear pleomorphism are relatively low (i.e., <0.80). This suggests that using E S i z e alone is insignificant to discriminate between score-2 and score-3 nucleus pleomorphism but effective in identifying score-1 nuclear pleomorphism. This is justifiable, as the sizes of the nucleus cell for scores 2 and 3 are overlapping, as shown in Figure 3a. Supplementary features are required to better partition the data into correct categories. In Figure 4b, all evaluation metrics for score-3 nuclear pleomorphism show promising outcomes (i.e., >0.80), but relatively poor outputs are found in Accuracy, Recall, Precision, and F1-score (i.e., <0.80) for both score-1 and score-2 nuclear pleomorphism. This is primarily due to heterogeneous properties of pleomorphism concerning both size and shape features and the finding herein is aligned with the boxplot analysis as in Figure 3b. In Figure 4c, using E A p p e a r a n c e alone as an input feature to the classifier produced slightly better outcomes, compared to the E S i z e and E S h a p e , with values approaching 0.80, if not better. The classification outcomes, however, are not optimal. Based on Figure 4d, it is evident that the overall performance of the classification outputs is promising, achieving values of more than 0.90 in all evaluation metrics, such that overall values in Accuracy, Recall, Specificity, Precision, and F1-score are 0.97, 0.96, 0.97, 0.94, and 0.95, respectively. The proposed HM formulation aligns with the description of nuclear pleomorphism outlined in the NHG system, effectively incorporating all three main elements: size, shape, and appearance. Each element functions as a supplementary feature in the equation, contributing to the delineation and enhancement of classification outcomes. This comprehensive approach enables accurate discrimination of nuclear pleomorphism into different scores. Notably, the error bars in Figure 4d for all evaluation metrics are relatively narrow, indicating a strong consensus in classification outcomes with minimal standard deviation.

4.5. Ablation Study

To further justify the superiority of the proposed HM equation against the proposed equations of E S i z e , E S h a p e , and E A p p e a r a n c e , an ablation study is conducted. Here, specific input features are systematically removed from consideration while keeping the classifier parameters unchanged. This approach allows for the evaluation of the individual contributions of each feature to the overall performance. By systematically eliminating different input features and observing the resulting changes, insights into the importance and effectiveness of various features can be gleaned. Table 4 presents the F1-scores obtained from the ablation study, indicating the performance with different features used as input to the classifier. Based on Table 4, it is evident that the F1-scores for HM are highest across scores 1 to 3, with an overall output of 0.95.

4.6. Benchmarking with Existing Quantitative Features

To date, existing quantitative features focusing on nuclear pleomorphism are very limited. Due to the heterogeneous properties of pleomorphism in breast cancer, conventional features, for example, shape, texture, and fitness of the outline [35], as well as morphological and textural features [36], are commonly used. Additionally, to further validate the applicability of the proposed measurement method, recent deep learning methods, for example, the LeNet-5 [37] and CNN [38] methods, were compared with the proposed HM. Figure 5 illustrates the classification outputs of the existing quantitative features juxtaposed with the proposed HM. From the figure, it is evident that the outputs for all evaluation metrics obtained from the proposed HM are consistently higher than those of the outputs obtained from the existing quantitative features [35,36]. Compared with the recent deep learning methods [37,38], the proposed HM demonstrates comparable if not better performance. This demonstrates the robustness of the proposed HM in discriminating among nuclear pleomorphism scores 1 to 3. The proposed HM presents itself as a promising new feature for measuring nuclear pleomorphism in breast cancer.

5. Limitations and Future Works

Manual annotation and dataset preparation are known to be expensive, tedious, and time-consuming processes [39,40]. In this study, we conducted an experiment to quantify nuclear pleomorphism by thoroughly investigating the features associated with nuclear pleomorphism as described by the NHG system. The obtained outputs validate the applicability and robustness of the proposed HM as a new feature for measuring and discriminating nuclear pleomorphism using different scores. Compared to the crowdsourcing-based datasets, for example (the NuCLS dataset [41]), the dataset used in this study is seen to be small. Thus, we aim to expand and validate our findings using crowdsourcing-based datasets in the future. Furthermore, we plan to develop a fusion-oriented deep learning model to automate the nucleus segmentation stage and utilize the proposed HM as an input for measuring nuclear pleomorphism in breast cancer.

6. Conclusions

In this study, an in-depth investigation of the nuclei across score-1 to score-3 nuclear pleomorphism is first performed. The insights gained serve as the foundation for the proposed quantitative equations, which detail the three core elements of nuclear pleomorphism, namely, size, shape, and appearance, denoted as E S i z e , E S h a p e , and E A p p e a r a n c e , respectively. These equations are then integrated into a single usable model, termed HM. The output of the proposed HM is a quantitative value falling within the range of [0, 1], for which it is hypothesized that a score-3 nuclear pleomorphism has an HM value approximated to 1, while a score-1 nuclear pleomorphism is approximated to 0. To validate the applicability of the proposed measurement, a baseline classifier, namely, the SVM with RBF kernel, is employed. The proposed HM demonstrates promising outputs, achieving Accuracy, Recall, Specificity, Precision, and F1-score values of 0.97, 0.96, 0.97, 0.94, and 0.95, respectively. Compared to some existing quantitative features, the proposed HM outperforms these features, leading to the conclusion that it could serve as a new feature for measuring nuclear pleomorphism in breast cancer. Additionally, the proposed HM provides a clear morphological meaning for nuclear pleomorphism across different scores, producing measurable output that could be used as a second opinion in standard nuclear-pleomorphism scoring procedures in breast cancer.

Author Contributions

Conceptualization, C.L.T. and X.J.T.; methodology, C.L.T.; software, C.L.T. and J.J.W.S.; validation, C.L.T. and X.J.T.; formal analysis, C.L.T.; data curation, K.S.A.R. and I.H.B.; writing—original draft preparation, C.L.T.; writing—review and editing, X.J.T.; supervision, X.J.T., K.S.A.R., I.H.B., K.M.G., and W.Z.A.W.M.; funding acquisition, X.J.T., K.S.A.R., I.H.B., K.M.G., and W.Z.A.W.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by internal grant from the Tunku Abdul Rahman University of Management and Technology (TAR UMT), with grant number: UC/I/G2021-00078. The APC was funded in part by the BBE Research Group, Centre for Multimodal Signal Processing, TAR UMT.

Institutional Review Board Statement

The protocol of this study has been approved by the Medical Research and Committee of the National Medical Research Register (NMRR) Malaysia (NMRR-21-949-58903).

Informed Consent Statement

Not applicable.

Data Availability Statement

Restrictions apply to the datasets: The datasets presented in this article are not readily available because the data are part of the ongoing study and associated with privacy restrictions. Requests to access the datasets should be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wilkinson, L.; Gathani, T. Understanding breast cancer as a global health concern. Br. J. Radiol. 2022, 95, 20211033. [Google Scholar] [CrossRef]
  2. Tan, X.J.; Cheor, W.L.; Lim, L.L.; Ab Rahman, K.S.; Bakrin, I.H. Artificial Intelligence (AI) in Breast Imaging: A Scientometric Umbrella Review. Diagnostics 2022, 12, 3111. [Google Scholar] [CrossRef] [PubMed]
  3. Flavia, Z.F.; Rodney, H.; Richard, K.; Zodwa, D. Breast cancer in low-middle income countries: Abnormality in splicing and lack of targeted treatment options. Am. J. Cancer Res. 2020, 10, 1568–1591. [Google Scholar] [PubMed]
  4. Tan, X.J.; Cheor, W.L.; Cheng, E.M.; Ab Rahman, K.S.; Wan Muhamad, W.Z.A.; Leow, W.Z. Breast cancer status, grading system, etiology, and challenges in Asia: An updated review. Oncologie 2023, 25, 99–110. [Google Scholar] [CrossRef]
  5. Bray, F.; Laversanne, M.; Sung, H.; Ferlay, J.; Siegel, R.L.; Soerjomataram, I.; Jemal, A. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2024, 74, 229–263. [Google Scholar] [CrossRef]
  6. Amat, S.; Penault-Llorca, F.; Cure, H.; Le Bouedec, G.; Achard, J.-L.; Van Praagh, I.; Feillel, V.; Mouret-Reynier, M.-A.; Dauplat, J.; Chollet, P. Scarff-Bloom-Richardson (SBR) grading: A pleiotropic marker of chemosensitivity in invasive ductal breast carcinomas treated by neoadjuvant chemotherapy. Int. J. Oncol. 2002, 20, 791–796. [Google Scholar] [CrossRef]
  7. Lakhani, S.R.; Ellis, I.O.; Schnitt, S.J.; Tan, P.H.; Van de Vijver, M.J. (Eds.) WHO Classification of Tumours of the Breast, 4th ed.; International Agency for Research on Cancer (IARC): Lyon, France, 2012. [Google Scholar]
  8. Tan, X.J.; Mustafa, N.; Mashor, M.Y.; Ab Rahman, K.S. Automated knowledge-assisted mitosis cells detection framework in breast histopathology images. Math. Biosci. Eng. 2022, 19, 1721–1745. [Google Scholar] [CrossRef]
  9. Das, A.; Nair, M.S.; Peter, D.S. Batch Mode Active Learning on the Riemannian Manifold for Automated Scoring of Nuclear Pleomorphism in Breast Cancer. Artif. Intell. Med. 2020, 103, 101805. [Google Scholar] [CrossRef]
  10. Das, A.; Nair, M.S.; Peter, S.D. Kernel-based Fisher discriminant analysis on the Riemannian manifold for nuclear atypia scoring of breast cancer. Biocybern. Biomed. Eng. 2019, 39, 728–741. [Google Scholar] [CrossRef]
  11. Das, A.; Nair, M.S.; Peter, S.D. Sparse representation over learned dictionaries on the riemannian manifold for automated grading of nuclear pleomorphism in breast cancer. IEEE Trans. Image Process. 2019, 28, 1248–1260. [Google Scholar] [CrossRef]
  12. Salahuddin, T.; Haouari, F.; Islam, F.; Ali, R.; Al-Rasbi, S.; Aboueata, N.; Rezk, E.; Jaoua, A. Breast cancer image classification using pattern-based Hyper Conceptual Sampling method. Inform. Med. Unlocked 2018, 13, 176–185. [Google Scholar] [CrossRef]
  13. Wan, T.; Cao, J.; Chen, J.; Qin, Z. Automated grading of breast cancer histopathology using cascaded ensemble with combination of multi-level image features. Neurocomputing 2017, 229, 34–44. [Google Scholar] [CrossRef]
  14. Faridi, P.; Danyali, H.; Helfroush, M.S.; Jahromi, M.A. Cancerous Nuclei Detection and Scoring in Breast Cancer Histopathological Images. arXiv 2016, arXiv:1612.01237. [Google Scholar]
  15. Maqlin, P.; Thamburaj, R.; Mammen, J.J.; Manipadam, M.T. Automated nuclear pleomorphism scoring in breast cancer histopathology images using deep neural networks. In Mining Intelligence and Knowledge Exploration; Springer: Cham, Switzerland, 2015; Volume 9468, pp. 269–276. [Google Scholar] [CrossRef]
  16. Han, Z.; Wei, B.; Zheng, Y.; Yin, Y.; Li, K.; Li, S. Breast Cancer Multi-classification from Histopathological Images with Structured Deep Learning Model. Sci. Rep. 2017, 7, 4172. [Google Scholar] [CrossRef] [PubMed]
  17. Wollmann, T.; Rohr, K. Automatic breast cancer grading in lymph nodes using a deep neural network. arXiv 2017, arXiv:1707.07565. [Google Scholar]
  18. Vang, Y.S.; Chen, Z.; Xie, X. Deep Learning Framework for Multi-class Breast Cancer Histology Image Classification. In Image Analysis and Recognition; Springer: Cham, Switzerland, 2018; Volume 10882, pp. 914–922. [Google Scholar] [CrossRef]
  19. Alom, M.Z.; Yakopcic, C.; Nasrin, M.S.; Taha, T.M.; Asari, V.K. Breast Cancer Classification from Histopathological Images with Inception Recurrent Residual Convolutional Neural Network. J. Digit. Imaging 2019, 32, 605–617. [Google Scholar] [CrossRef] [PubMed]
  20. Jiang, Y.; Chen, L.; Zhang, H.; Xiao, X. Breast cancer histopathological image classification using convolutional neural networks with small SE-ResNet module. PLoS ONE 2019, 14, e0214587. [Google Scholar] [CrossRef]
  21. Das, A.; Devarampati, V.K.; Nair, M.S. NAS-SGAN: A Semi-Supervised Generative Adversarial Network Model for Atypia Scoring of Breast Cancer Histopathological Images. IEEE J. Biomed. Health Inform. 2022, 26, 2276–2287. [Google Scholar] [CrossRef]
  22. Karimi Jafarbigloo, S.; Danyali, H. Nuclear atypia grading in breast cancer histopathological images based on CNN feature extraction and LSTM classification. CAAI Trans. Intell. Technol. 2021, 6, 426–439. [Google Scholar] [CrossRef]
  23. Rakhlin, A.; Shvets, A.; Iglovikov, V.; Kalinin, A.A. Deep Convolutional Neural Networks for Breast Cancer Histology Image Analysis. In Image Analysis and Recognition; Springer: Cham, Switzerland, 2018; Volume 10882, pp. 737–744. [Google Scholar] [CrossRef]
  24. Faruk Topaloğlu, Ö.; Koplay, M.; Kılınçer, A.; Örgül, G.; Sedat Durmaz, M. Quantitative measurements and morphological evaluation of fetal cardiovascular structures with fetal cardiac MRI. Eur. J. Radiol. 2023, 163, 110828. [Google Scholar] [CrossRef]
  25. Tan, X.J.; Mustafa, N.; Mashor, M.Y.; Ab Rahman, K.S. A novel quantitative measurement method for irregular tubules in breast carcinoma. Eng. Sci. Technol. Int. J. 2022, 31, 101051. [Google Scholar] [CrossRef]
  26. Park, H.U.; Chung, K.B.; Kim, D.Y. Quantitative measurement of hair diameter diversity as a diagnostic indicator of androgenetic alopecia in Korean males: A cross-sectional study. JAAD Int. 2024, 15, 121–126. [Google Scholar] [CrossRef]
  27. Starshinova, A.; Osipov, N.; Dovgalyk, I.; Kulpina, A.; Belyaeva, E.; Kudlay, D. COVID-19 and Tuberculosis: Mathematical Modeling of Infection Spread Taking into Account Reduced Screening. Diagnostics 2024, 14, 698. [Google Scholar] [CrossRef]
  28. Albarova-Corral, I.; Segovia- Burillo, J.; Malo-Urries, M.; Rios-Asin, I.; Asin, J.; Castillo-Mateo, J.; Gracia-Tabuenca, Z.; Morales-Hernandez, M.M. A New Quantitative Tool for the Ultrasonographic Assessment of Tendons: A Reliability and Validity Study on the Patellar Tendon. Diagnostics 2024, 14, 1067. [Google Scholar] [CrossRef]
  29. Macenko, M.; Niethammer, M.; Marron, J.S.; Borland, D.; Woosley, T.; Guan, X.; Schmitt, C.; Thomas, N.E. A method for normalizing histology slides for quantitative analysis. In Proceedings of the 2009 IEEE International Symposium on Biomedical Imaging: Macro to Nano, Boston, MA, USA, 18 June–1 July 2009; pp. 1107–1110. [Google Scholar] [CrossRef]
  30. McQuin, C.; Goodman, A.; Chernyshev, V.; Kamentsky, L.; Cimini, B.A.; Karhohs, K.W.; Doan, M.; Ding, L.; Rafelski, S.M.; Thirstrup, D.; et al. CellProfiler 3.0: Next-generation image processing for biology. PLoS Biol. 2018, 16, e2005970. [Google Scholar] [CrossRef]
  31. Basu, A.; Senapati, P.; Deb, M.; Rai, R.; Dhal, K.G. A survey on recent trends in deep learning for nucleus segmentation from histopathology images. Evol. Syst. 2024, 15, 203–248. [Google Scholar] [CrossRef]
  32. Tan, X.J.; Mustafa, N.; Mashor, M.Y.; Ab Rahman, K.S. Hyperchromatic nucleus segmentation on breast histopathological images for mitosis detection. J. Telecommun. Electron. Comput. Eng. 2018, 10, 1–16. [Google Scholar]
  33. Abunassar, N.; Alas, M.; Ali, S.I.A. Prediction of Compressive Strength in Self-compacting Concrete Containing Fly Ash and Silica Fume Using ANN and SVM. Arab J. Sci. Eng. 2023, 48, 5171–5184. [Google Scholar] [CrossRef]
  34. Nehra, P.; Nagaraju, A. Host utilization prediction using hybrid kernel based support vector regression in cloud data centers. J. King Saud Univ.-Comput. Inf. Sci. 2022, 34, 6481–6490. [Google Scholar] [CrossRef]
  35. Cosatto, E.; Miller, M.; Graf, H.P.; Meyer, J.S. Grading nuclear pleomorphism on histological micrographs. In Proceedings of the ICPR 2008 19th International Conference on Pattern Recognition, Tampa, FL, USA, 8–11 December 2008; pp. 8–11. [Google Scholar] [CrossRef]
  36. Petushi, S.; Katsinis, C.; Coward, C.; Garcia, F.; Tozeren, A. Automated identification of microstructures on histology slides. In Proceedings of the 2004 2nd IEEE International Symposium on Biomedical Imaging: Macro to Nano, Arlington, VA, USA, 15–18 April 2004; Volume 1, pp. 424–427. [Google Scholar] [CrossRef]
  37. Gao, Z.; Wang, L.; Zhou, L.; Zhang, J. HEp-2 cell image classification with deep convolutional neural networks. IEEE J. Biomed. Health Inform. 2017, 21, 416–428. [Google Scholar] [CrossRef]
  38. Tripathi, S.; Singh, S.K. Cell Nuclei Classification in Histopathological Images using Hybrid Olconvnet. ACM Trans. Multimed. Comput. Commun. Appl. 2020, 16, 1–22. [Google Scholar] [CrossRef]
  39. Geuenich, M.J.; Gong, D.W.; Campbell, K.R. The impacts of active and self-supervised learning on efficient annotation of single-cell expression data. Nat. Commun. 2024, 15, 1014. [Google Scholar] [CrossRef]
  40. Lin, Y.; Qu, Z.; Chen, H.; Gao, Z.; Li, Y.; Xia, L.; Ma, K.; Zheng, Y.; Cheng, K.-T. Nuclei segmentation with point annotations from pathology images via self-supervised learning and co-training. Med. Image Anal. 2023, 89, 102933. [Google Scholar] [CrossRef]
  41. Amgad, M.; Atteya, L.A.; Hussein, H.; Mohammed, K.H.; Hafiz, E.; Elsebaie, M.A.T.; Alhusseiny, A.M.; AlMoslemany, M.A.; Elmatboly, A.M.; Pappalardo, P.A.; et al. NuCLS: A scalable crowdsourcing approach and dataset for nucleus classification and segmentation in breast cancer. Gigascience 2022, 11, giac037. [Google Scholar] [CrossRef]
Figure 1. The breakdown of descriptions for nuclear pleomorphism into the three main elements: size, shape, and appearance.
Figure 1. The breakdown of descriptions for nuclear pleomorphism into the three main elements: size, shape, and appearance.
Diagnostics 14 02045 g001
Figure 2. Sample synthetic images with Hol of 0, 3, and 5, where grey and white colors denote the foreground and background. (a) Hol = 0; (b) Hol = 3; (c) Hol = 5.
Figure 2. Sample synthetic images with Hol of 0, 3, and 5, where grey and white colors denote the foreground and background. (a) Hol = 0; (b) Hol = 3; (c) Hol = 5.
Diagnostics 14 02045 g002
Figure 3. The boxplots of E S i z e , E S h a p e , E A p p e a r a n c e , and HM for scores 1 to 3 for nuclear pleomorphism using 300 ground truth, such that 100 nuclei were included for each score. (a) Analysis for E S i z e ; (b) Analysis for E S h a p e ; (c) Analysis for E A p p e a r a n c e ; (d) Analysis for HM.
Figure 3. The boxplots of E S i z e , E S h a p e , E A p p e a r a n c e , and HM for scores 1 to 3 for nuclear pleomorphism using 300 ground truth, such that 100 nuclei were included for each score. (a) Analysis for E S i z e ; (b) Analysis for E S h a p e ; (c) Analysis for E A p p e a r a n c e ; (d) Analysis for HM.
Diagnostics 14 02045 g003
Figure 4. Classification outputs for E S i z e , E S h a p e , E A p p e a r a n c e , and HM, using the dataset. (a) Using the E S i z e as input feature; (b) Using the E S h a p e as input feature; (c) Using the E A p p e a r a n c e as input feature; (d) Using the HM as input feature.
Figure 4. Classification outputs for E S i z e , E S h a p e , E A p p e a r a n c e , and HM, using the dataset. (a) Using the E S i z e as input feature; (b) Using the E S h a p e as input feature; (c) Using the E A p p e a r a n c e as input feature; (d) Using the HM as input feature.
Diagnostics 14 02045 g004aDiagnostics 14 02045 g004b
Figure 5. Classification outputs comparing the existing quantitative features and deep learning methods with the proposed HM [35,36,37,38]. (a) Score 1; (b) Score 2; (c) Score 3.
Figure 5. Classification outputs comparing the existing quantitative features and deep learning methods with the proposed HM [35,36,37,38]. (a) Score 1; (b) Score 2; (c) Score 3.
Diagnostics 14 02045 g005
Table 1. Description of nuclear pleomorphism in the semi-quantitative NHG system.
Table 1. Description of nuclear pleomorphism in the semi-quantitative NHG system.
Description of Nuclear PleomorphismScores
Score-1 nuclei are very similar in size to the nuclei of benign pre-existing epithelial cells (<1.5 times the size), and they show minimal pleomorphism, and an even chromatin pattern, as well as nucleoli that are either not visible or very inconspicuous.1
Score-2 nuclei are larger (1.5–2 times the size of benign epithelial cell nuclei), with mild to moderate pleomorphism and visible, but small and inconspicuous, nucleoli. 2
Score-3 nuclei are even larger (>2 times the size of benign epithelial cell nuclei), with vesicular chromatin; they vary markedly in size and shape and often show prominent nucleoli. 3
Table 2. Evaluation metrics.
Table 2. Evaluation metrics.
MetricsEquations
Accuracy T P + T N T P + T N + F P + F N
Recall T P F N + T P
Specificity T N F P + T N
Precision T P F P + T P
F1-score 2 R e c a l l P r e c i s i o n R e c a l l + P r e c i s i o n
Table 3. Sample measurement outputs.
Table 3. Sample measurement outputs.
Nuclear Pleomorphism E S i z e E S h a p e E A p p e a r a n c e HM
Original InputSegmentation Outputs *
Score 1
Diagnostics 14 02045 i001Diagnostics 14 02045 i0020.39930.36610.29120.3244
Diagnostics 14 02045 i003Diagnostics 14 02045 i0040.50340.34070.32700.3759
Diagnostics 14 02045 i005Diagnostics 14 02045 i0060.08630.47150.40930.4382
Score 2
Diagnostics 14 02045 i007Diagnostics 14 02045 i0080.76940.38610.42740.4816
Diagnostics 14 02045 i009Diagnostics 14 02045 i0100.56690.43600.51390.4997
Diagnostics 14 02045 i011Diagnostics 14 02045 i0120.64490.41530.48550.4985
Score 3
Diagnostics 14 02045 i013Diagnostics 14 02045 i0141.00000.64790.69730.7542
Diagnostics 14 02045 i015Diagnostics 14 02045 i0161.00000.52690.58810.6524
Diagnostics 14 02045 i017Diagnostics 14 02045 i0181.00000.70490.86870.8404
* The green color lines show the boundary of the segmented cell nucleus via the CellProfiler 3.0 [30].
Table 4. Ablation study.
Table 4. Ablation study.
Feature(s) IncludedF1-Score
Score 1Score 2Score 3Overall
E S i z e 0.920.590.740.75
E S h a p e 0.580.570.890.68
E A p p e a r a n c e 0.810.790.780.79
E S i z e + E S h a p e 0.670.510.700.63
E S i z e + E A p p e a r a n c e 0.830.640.710.73
E S h a p e + E A p p e a r a n c e 0.660.610.810.69
HM
( E S i z e + E S h a p e + E A p p e a r a n c e )
0.960.930.970.95
Bold fonts highlight the highest output in each column.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Teoh, C.L.; Tan, X.J.; Ab Rahman, K.S.; Bakrin, I.H.; Goh, K.M.; Siet, J.J.W.; Wan Muhamad, W.Z.A. A Quantitative Measurement Method for Nuclear-Pleomorphism Scoring in Breast Cancer. Diagnostics 2024, 14, 2045. https://doi.org/10.3390/diagnostics14182045

AMA Style

Teoh CL, Tan XJ, Ab Rahman KS, Bakrin IH, Goh KM, Siet JJW, Wan Muhamad WZA. A Quantitative Measurement Method for Nuclear-Pleomorphism Scoring in Breast Cancer. Diagnostics. 2024; 14(18):2045. https://doi.org/10.3390/diagnostics14182045

Chicago/Turabian Style

Teoh, Chai Ling, Xiao Jian Tan, Khairul Shakir Ab Rahman, Ikmal Hisyam Bakrin, Kam Meng Goh, Joseph Jiun Wen Siet, and Wan Zuki Azman Wan Muhamad. 2024. "A Quantitative Measurement Method for Nuclear-Pleomorphism Scoring in Breast Cancer" Diagnostics 14, no. 18: 2045. https://doi.org/10.3390/diagnostics14182045

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop