Next Article in Journal
Programmed Death Ligand 1 (PD-L1) Expression in Lymphomas: State of the Art
Previous Article in Journal
Cardiac-Specific Suppression of Valosin-Containing Protein Induces Progressive Heart Failure and Premature Mortality Correlating with Temporal Dysregulations in mTOR Complex 2 and Protein Phosphatase 1
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Brief Report

Support Vector Machine-Based Formula for Detecting Suspected α Thalassemia Carriers: A Path toward Universal Screening

1
Allergy and Clinical Immunology Unit, Meir Medical Center, Kfar Saba 4428163, Israel
2
School of Medicine, Tel Aviv University, Tel Aviv 6997801, Israel
3
Hematology Laboratory, Emek Medical Center, Afula 1834111, Israel
4
Red Blood Cell Research Group, Vetsuisse Faculty, Institute of Veterinary Physiology, University of Zurich, 8057 Zürich, Switzerland
5
The Zurich Center for Integrative Human Physiology (ZIHP), 8057 Zürich, Switzerland
6
Hematology Division, Hadassah Medical Center, Jerusalem 9112001, Israel
7
Pediatric Hematology Unit, Research Laboratory, Emek Medical Center, Afula 1834111, Israel
8
The Ruth and Bruce Rappaport Faculty of Medicine, Technion–Israel Institute of Technology, Haifa 3525422, Israel
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2024, 25(12), 6446; https://doi.org/10.3390/ijms25126446
Submission received: 8 April 2024 / Revised: 3 June 2024 / Accepted: 5 June 2024 / Published: 11 June 2024

Abstract

:
The blood counts of α thalassemia carriers (α-thal) are similar to those of β thalassemia carriers, except for Hemoglobin A2 (Hb A2), which is not elevated. The objective of this study was to determine whether mathematical formulas are effective for detecting suspected α-thal. The data were obtained from the database of the prevention program for detecting couples at risk for having a child with hemoglobinopathy. Red Blood Cells (RBC) indices were analyzed using mathematical formulas, and the sensitivity and negative predictive value (NPV) were calculated. Among 1334 blood counts suspected of α-thal analyzed, only the Shine and Lal and the Support Vector Machine formulas revealed high sensitivity and NPV. Sensitivity was 85.54 and 99.33%, and NPV was 98.93 and 99.93%, respectively. Molecular defects were found in 291, and 81 had normal α genes. Molecular analysis was not performed in 962 of the samples. Based on these results, mathematical formulas incorporating one of these reliable formulas for detecting suspected α or β thalassemia carriers in the program of the automatic analyzers can flag these results, increase the awareness of the primary physicians about the carrier risk, and send an alert with a recommendation for further testing.

1. Introduction

Hemoglobinopathies are the most common monogenic genetic diseases worldwide, including β and α thalassemia and Sickle Cell Disease (SCD). Each year, more than 300,000 new patients are born with these diseases, principally in developing countries in Africa and Asia [1]. Patients with β thalassemia and SCD experience severe symptoms that are difficult for the patients and their families and present a significant burden on healthcare services [2,3,4]. Therefore, prevention programs that can detect couples at risk of having an affected child have been implemented in many countries. Since both transfusion dependent (TDT) and non-transfusion dependent β thalassemia (NTDT) and SCD are generally more severe than α thalassemia, screening programs are conducted to detect carriers of those diseases [5,6]. Alpha gene deletions or additional mutations are much more frequent than β gene defects and more widely distributed, even before the extensive population migrations of the last centuries. About 5% of the world’s population carries an α gene defect [7].
In Asian countries, the incidence of α thalassemia carriers is much higher than the mean frequency worldwide. For example, in Vietnam, Myanmar, Laos, Thailand, and Cambodia, the frequency is 12% to 37% [8]. In Mediterranean countries, Africa, the Middle East, and the Indian subcontinent, α thalassemia carriers are frequent, but with a significantly lower prevalence than in southeastern Asian countries [9]. The relatively higher frequency of α thalassemia carriers in tropical and subtropical countries can be attributed to the selective advantage of α thalassemia carriers in areas where malaria is frequent, similar to the advantage of SCD and β thalassemia carriers [10]. In contrast to the β globin diseases, α thalassemia is characterized by the presence of two α genes on chromosome 16, the α1 and α2 genes (αα/αα). More than 100 α thalassemia genetic defects have been reported. The most common α gene defects are deletions, but other mutations are also known [7]. Clinically, there are four possibilities: four α intact genes (αα/αα), one affected gene that causes the silent α trait (-α/αα), two affected genes that cause the α thalassemia trait (-α/-α or (--/αα), three affected genes that cause the clinical form known as Hb H disease (--/-α), and four affected genes that cause the severe form of α thalassemia known as hydrops fetalis (--/--), which is usually lethal.
In the two cases of α thalassemia carriers, the silent and the α thalassemia trait, the blood count indices, mean corpuscular volume (MCV), and mean corpuscular hemoglobin (MCH) are low and in similar ranges but less affected than in carriers of β thalassemia. Usually, the Hb level can be slightly lower than normal, and the red blood cell (RBC) count is elevated compared to the Hb level. All carriers of α globin defects have normal high performance liquid chromatography (HPLC) analyses, or even low Hb A2 levels. The threshold for MCV is usually ≤78 fl, despite the fact that α and β thalassemia carriers can have MCV in the normal range of 80 fl or higher. For MCH, the usually accepted threshold is ≤27 pg. In some studies, the threshold is <80 fl for [11]. In a study that analyzed the RBC indices in 1301 α thalassemia carriers, the mean MCV was 74.36 ± 6.83 fl, with a wide range from 45 fl to even 88.8 fl. The MCH was also under 27 pg, 23.6 ± 2.7 pg, but also with a wide range from 13.1 to 30.5 pg [12]. Since the lower range for RBC and Hb in that study was 2.69 mm3 and 4.1 g/dL, respectively, some of the patients with low MCV and MCH likely had Hb H disease or α thalassemia carriers combined with iron deficiency anemia (IDA) and not only simple α thalassemia carriers.
The most common defects in the α genes are deletions, with the -α3.7 kb and -α4.2 kb deletions being the most common [7]. Non deletional mutations in the α genes can also cause the same clinical and laboratory pictures.
The genotype of the α carriers differs in eastern Asian countries, where the cis form (--/αα) is more common compared to the western world, where the trans form (-α/-α) is more common. Due to this difference, the frequency of hydrops fetalis and even the Hb H disease is common in Asian countries; therefore, routine screening for detection of α carriers is usually performed in Asian countries. Nonetheless, cases of Hb H disease and even hydrops fetalis have been described in western countries when only two or three mutations in the α gene were diagnosed (-α/-α) [7]. Non-deletional defects involving the α2 gene are more severe than deletional defects because the α2 gene encodes more α globin than does the α1 gene [13].
Interaction between α globin abnormal genes and β globin mutations can ameliorate or increase the severity of the clinical picture of α thalassemia carriers. For example, in cases of triplication or quadruplication of α thalassemia genes, this combination aggravates the severity of β thalassemia carriers, and in those subjects, a clinical picture of NTDT or even TDT can be present [14].
As mentioned, screening for β thalassemia and SCD is common in many countries, but a byproduct of the screening programs is the detection of possible α thalassemia carriers, where only molecular analysis can confirm the presence of an α gene defect. Since screening for α thalassemia carriers is not the major goal of those screening programs, the true incidence of α thalassemia carriers is unknown.
Rarely, β thalassemia carriers may have normal Hb A2 in HPLC analysis, called silent β carriers. in cases where both parents have RBC indices in the range to be considered suspected α or β globin carriers, even if one is a “classical β carrier”, and the other is defined as a “suspected carrier”, molecular analysis is recommended.
The principal differential diagnosis for low MCV and MCH is, of course, iron deficiency, but iron status is not routinely analyzed in most screening programs, and iron deficiency does not exclude the possibility of being a carrier of β or α thalassemia.
In the last few decades, investigators around the world have tried to use blood cell indices to develop a mathematical formula that would flag cases of suspected thalassemia carriers, mainly β thalassemia. Currently, none of these formulas have been adopted for routine use, mainly because there was not enough information regarding the ranges for α thalassemia carriers and the fear of false negative (FN) diagnoses in those cases.
The prevention program for detecting couples at risk of having a child with hemoglobinopathies was instituted in northern Israel in 1987 [6]. Since 2010, blood samples from women with red blood cell indices suspicious of being α thalassemia carriers have been sent for molecular analysis when the partner was also suspected of being a carrier.
The purpose of this study was to determine the impact of this screening approach and to propose more significant and cost-effective screening decisions based on the outcomes. We also tried to determine whether the formulas described in the literature are effective for detecting α thalassemia carriers among a homogenous population of women of childbearing age [15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30]. Some of these formulas were also analyzed in recent papers [31,32].

2. Results

A total of 22,842 blood count results were included in the data analysis, including the results published by our group when we compared the results of β thalassemia carriers with results that were defined as non_β thalassemia carriers [26]. In this paper, we analyzed the results of blood counts from 1334 women who were diagnosed or suspected to be α thalassemia carriers, as described in the Methods. In summary, 1334 results were α thalassemia carriers or suspected carriers (5.9%), 2936 were β thalassemia carriers (12.8%), and 18,572 had blood counts that did not correspond to α or β thalassemia carriers (81.3%; Table 1). The α thalassemia carriers had significantly lower RBC and higher Hb, MCV, MCH, MCHC, and RDW values (p < 0.001 for all) compared to the β thalassemia carriers. As expected, the A2 and Hb F levels in α thalassemia carriers were lower than in the β thalassemia group (p < 0.001 for both).
The group defined as α thalassemia carriers included 372 samples that were analyzed for molecular diagnosis. The results revealed 291 samples with mutations in the α globin gene; a normal α globin gene was found in 81 samples. Another 962 samples were defined as suspected α thalassemia carriers but did not undergo molecular analysis. Comparison of the blood count results from these three groups did not reveal significant differences except that the RBC were higher in the blood counts with proven mutations and lower in the group with normal sequences (p < 0.001). The differences in the RDW levels had a similar trend as the RBC results (p < 0.004; Table 2).
Due to the minimal differences between the three groups defined as α thalassemia carriers, normal molecular analysis, or suspected carriers, the analysis of the mathematical formula was combined as one α thalassemia group.

2.1. Comparative Results of the Formulas

Only the Shine and Lal formula [28], and our previously published SVM formula [26] had high sensitivity and NPV for detecting suspected α thalassemia carriers. The Shine and Lal formula had a sensitivity value of 85.54%, 88.22% specificity, and an NPV of 98.93%. The SVM formula had a sensitivity of 99.33%, 73.67% specificity, and 99.93% NPV. FN results were found in both the Shine and Lal and SVM formulas (n = 25,819.3%). Seventeen FN results detected in Shine and Lal’s formula were not detected in the SVM formula, and three results detected in the SVM formula were not detected by Shine and Lal’s formula.
Overall, 275 FN results were found when applying the SVM formula (20.6% of the 1334 samples), 42 FN results (18.1%) were found in the samples with a proven mutation (-α3.7 or IVS I-1 (-bp) 22.4% and 12.4%, respectively, p > 0.25), 22 FN samples were in the normal molecular analysis samples (27.2%), and 111/962 in the suspected α thalassemia carriers group (11.5%). When we compared the proportion of FN results in those groups, the p-Value was < 0.05.
The results of applying all the other formulas published in the literature failed to reveal sufficiently significant levels of sensitivity, all results were below 80%. NPV above 90% was found in several formulas, but with a sensitivity of <80%. Two formulas had 100% sensitivity but 0 NPV (Table 3).

2.2. Analysis of Samples Suspected to Have a Diagnosis of Iron Deficiency Anemia

Of 291 samples, 28 (9.62%) in the group with α genotype mutation or deletion had RBC ≤ 4.5 × 109/dL and RDW ≥ 14.5%. Consequently, those samples can be defined as having IDA concomitant with an α thalassemia trait.
Of the 81 samples in a group with normal α gene sequencing, 20 (24.7%) had RBC ≤4.5 × 109/dL and RDW ≥ 14.5% and can be defined as having IDA. In the group defined as having suspected α thalassemia (no molecular analysis), 111,962 samples (11.5%) had RBC ≤ 4.5 × 109/dL and RDW ≥ 14.5%, which can then be defined as having IDA. The MCV and MCH values in all the α thalassemia groups were in the same range (p = NS).

2.3. Comparison of the Results from the Two Common α Globin Mutations Found

In 291 of 1334 blood counts, an α globin mutation was found (21.8%). The two most common deletions were -α3.7 kb and αIVS I-1 (-5 bp) (HBA2:c.95+2_95+6delTGAGG). In 134 samples (46%), the -α3.7 kb/αα genotype was found, and in another 24 (8.2%), the -α3.7 kb/-α3.7 kb genotype was detected. In 97 samples (33.3%), the αIVS I-1 (-5 bp)/αα genotype was found, and in another eight (2.7%), the genotype was αIVS I-1 (-5 bp)/αIVS I-1 (-5 bp). The genotype was αIVS I-1 (-5 bp)/-α3.7 kb in seven samples (2.4%). Another 25 genotypes were found, including α Poly A-HBA2:c.*92A>G in seven subjects, -α4.2 kb in five, Med in three, and others in five. We did not include the data analysis of the α CD 39—HBA:c.118-120delACC genotype commonly called Hb Taybe that presents with different clinical characteristics and was described elsewhere [34].
We compared the RBC indices for the two most common genotypes found in this study (Table 4).

3. Discussion

As hemoglobinopathies are the most common monogenic clinically severe disease worldwide, any effort toward detecting carriers and offering genetic counseling for couples at risk is recommended. Carriers of Sickle Cell Disease do not have any specific or characteristic features in RBC indices. Therefore, HPLC analysis is required to detect carriers. On the other hand, the indices of carriers of α or β thalassemia can be clearly identified and lead to suspicion of being carriers. Multiple efforts have ensued to find a reliable mathematical formula that is specific enough and has a high NPV to minimize further unnecessary laboratory analyses. A reliable formula will need to be tested on as many samples as possible. It can be incorporated into all laboratory blood analyzers and then flag results that need further analysis.
To date, no formula has been incorporated in any routinely used analyzers, despite publications that demonstrate the reliability of some formulas based on large samples. The likely cause is that HPLC analysis is also performed in standard, inexpensive analyzers, and most β thalassemia carriers can be detected. However, HPLC analysis is routinely done on request in screening programs, not in routine blood samples, and not for thalassemia screening. β thalassemia screening programs are routinely implemented in countries with many carriers, but not in all of them. In recent years, the migration of large numbers of people to countries where β thalassemia was not common has presented a new health problem that needs attention [2].
Detection of α thalassemia carriers is mostly relevant in populations of Asian origin, where the cis carriers are common (αα/--) and the risk of an infant with hydrops fetalis or severe forms of Hb H disease (α-/--) is frequent. Some trans carriers (α-/α-) with specific mutations in the α gene can also be found and produce an infant with severe α thalassemia disease. Mathematical formulas published in the literature do not usually include α thalassemia carriers in their analyses, except in two studies with a relatively small number of carriers: Amendiola et al. with 81 subjects and Mentzer [15,23] with 50 subjects. The current study is the largest study to detect α thalassemia carriers using mathematical formulas.
Since 1987, a systematic screening program for detecting couples at risk of having a child with β thalassemia has been instituted in northern Israel [5,6]. Since 2010, all samples with RBC indices suspicious for β or α thalassemia carriers were sent for molecular analysis, provided the partner was also suspected to be a carrier. Therefore, we have many blood count results that can be analyzed with mathematical formulas. The present study includes the analysis of three groups of blood counts and molecular analysis results. Since the RBC indices were similar in the three groups, especially MCV, MCH, and MCHC, we analyzed the results of the three groups together and compared the results with results previously published by our group, including β thalassemia carriers and normal samples [26].
The results indicate that only three published formulas have relatively high sensitivity: 68.63% in Ricerca et al. [24], 85.54% in Shine and Lal [28], and our previously described SVM formula with a sensitivity of 99.33% [26]. The respective NPVs for these formulas are 96.83%, 98.83%, and 99.93%. Two formulas used only the MCV and MCH values for calculation. Ricerca et al. [24]. used the RDW and RBC values. MCV and MCH values were also used by d’Onofrio et al. [33]. The sensitivity was 100%, but the NPV was 0. May be if the cutoff value were changed this formula would also become reliable, but we decided to compare the formulas using the same cutoff values that were used to detect β thalassemia, as shown in Table 3. Bordbar et al. also used the MCV and MCH indices; however, when we applied this formula to our results, the NPV was high enough, at 93.72%, but the sensitivity was only 21.46% [16]. Most of the published formulas that used different combinations of indices also had an NPV above 90% but a sensitivity below 60%. As shown in the meta-analysis published by Hoffmann et al. in 2015, indeed MCV/MCH are the most relevant parameters for discriminating thalassemia from IDA [35]. We also calculated the 75th and 95th percentiles of the formulas’ results and compared them with the published cutoff for each formula (Table 3). Our SVM formula passed the cutoff value for the 75th percentile but not the lower limit of the 95th percentile. None of the other formulas passed the cutoff value even for the 75th percentile, probably indicating that the cutoff is too high and some α thalassemia carriers might be missed. Recently, Phirom et al. described the use of machine learning in a small group of individuals with α thalassemia and showed that the Convolutional Neural Networks (CNN) formula is superior to the SVM formula that we use, but the specificity of the SVM formula, the formula that we apply for our study, is superior (81.23% vs. 84.94%), while the accuracy in their work is similar for both (CNN 80.69% vs. SVM 80.68%) [36].
We decided to include in our analysis only women and results with Hb > 9.0 g/dL to obtain a more homogeneous population for more accurate data analysis. Only 38 results (0.17%) were excluded due to Hb < 9.0 g/dL, and all except two samples were in the “suspected α carrier group”. The other two samples were in the normal molecular analysis group. In our opinion, excluding those samples did not affect the overall study results. Hb level < 9 g/dL may be due to IDA or even a combination of α carrier and IDA. Since the results analyzed in this study are the product of a massive screening program, iron status was not routinely analyzed.
The main differential diagnosis for microcytosis or hypochromia is, of course, iron deficiency, which could be present when the molecular analysis does not reveal alpha globin mutations. In all three groups of α thalassemia in the present study, 9.6% to 24.7% of the results can be indices suspicious for IDA, defined as RBC lower than 4.5 × 109/dL and RDW above 14.5%. The higher percentage was from the group that did not undergo molecular analysis, which is reasonable. On the other hand, the low percentage of samples suspected to have IDA in the group with negative molecular analysis (11.5%) suggests that a mutation outside the α gene can produce characteristics of α thalassemia trait indices.
Indeed, some studies have described that individuals with characteristics of α thalassemia carriers may have intact α globin genes that may be functionally inactive. In these cases, large deletions upstream of the α globin genes, in the locus control region can cause inactivation of both α1 and α2 globin genes [37,38,39].
Also, deletions of the major regulatory element of the α globin genes that is responsible for activating the α globin genes can inactivate the normal α globin genes. In those cases, the MCV values are similar to those of classical α thalassemia carriers. Fifteen such deletions were described [40]. Other defects, not located in the α genes, are support nucleotide polymorphisms between the α-globin genes and the α gene regulatory element [40,41].
In summary, the fact that we did not find molecular defects in 6% of our molecularly analyzed samples did not preclude that they are not α thalassemia carriers. The similar RBC indices suggest that at least some might be α carriers.
We compared the RBC indices among the two most common genotypes in our study, -α3.7 kb and IVS I-1(-5 bp) (HBA2:c.95+2_95+6delTGAGG) (Table 4).
The values in the RBC indices were significantly different when we compared the one or two -α3.7 kb deletions and in the IVS I-1(-5 bp) genotypes, which were lower in the genotype with two deletions.
In a study from Iran that included 722 α thalassemia carriers, the same features were found, as carriers with two deletions in the -α3.7 kb gene (334 individuals) had significantly lower MCV and/or MCH values when compared to subjects with one deletion (45 subjects). (MCV 76.8 ± 5.02 fl vs. 72.7 ± 4.55 fl, respectively, and for MCH 24.7 ± 1.6 pg vs. 22.46 ± 1.73 pg, respectively [42].
The α-3.7 kb deletion is most prevalent in Mediterranean countries and in Africans. The Arab Bedouins in northern Israel are probably of African origin, which can explain the high incidence of this deletion found in the present study [43,44,45,46]. This is also the most common deletion found in Iran, in heterozygous or homozygous form (41.4% and 11.6%, respectively) [12].
As described, α thalassemia carriers are considered healthy subjects, but they still have significant microcytosis and mild anemia, principally in the pediatric population [47].
In general, prenatal screening for detecting couples at risk of having a child with α thalassemia in one or the two clinical forms is not cost-effective in countries outside of Asia, but in countries where screening programs for β thalassemia are implemented, including screening for α thalassemia, it does not add significant costs, and the decision to perform molecular analysis should be chosen, depending on the partner’s blood count indices and on the most frequent α thalassemia gene defects in each country [48].
The cost effectiveness of α thalassemia screening has not been estimated. However, β thalassemia screening was proven to be cost-effective [3].
For couples at risk of having an offspring with hydrops fetalis, genetic counseling should be offered, and the possible prenatal diagnosis discussed. Couples at risk of having an infant with Hb H should be informed, but prenatal diagnosis is not indicated since usually the course of the disease is mild with stable hemolytic anemia, rarely requiring blood transfusions. Also, couples at risk of having an offspring with two mutations in the α gene should be informed since, in the neonatal period, those newborns may have hemolysis requiring blood transfusions.
In couples at risk of having a child with Hb Taybe (α CD 39—HBA:c.118-120delACC [34] or other severe α gene defects such as Med or Poly A, hydrops fetalis can occur [49]. In those cases, genetic counseling and the option of prenatal diagnosis can be considered.
Based on our results, incorporating one of the formulas that we found to be significant for detecting suspected α or β thalassemia carriers in the program of the automatic analyzers, can flag the abnormal results and increase the awareness of primary physicians about the carrier risk, and if relevant, in cases of women of childbearing age, refer them for further analysis: HPLC for β thalassemia carriers and molecular analysis for α thalassemia carriers.
The major limitation of the present study is that iron status was not analyzed since the RBC indices and HPLC results were obtained as part of a very large-scale screening program for detecting carriers of hemoglobinopathies. Obviously, women with low Hb and/or RBC counts are suspected to have at least IDA or a combination of IDA and thalassemia carriers, which is relatively frequent in pregnant women.
This study was conducted only on women of childbearing age, and we cannot conclude whether those formulas are applicable to males or children. Further studies are needed to validate the results in other populations. At a minimum, even if the first stage of the formula is applied to this specific population only, it will be effective for population screening.

4. Methods

A prevention program for detecting pregnant women who were carriers of β thalassemia was initiated in the northern region of Israel from 1987. In 1999, the program was extended to include sickle cell carriers due to the relatively high frequency of sickle cell carriers in this area [6]. Through the end of 2023, more than 100,000 subjects were screened, and 2346 pregnancies at risk for having an affected child were detected and referred for genetic counseling.
Since 2010, all samples from women with low MCV (≤78) and/or low MCH (≤27), even with normal Hb A2 levels in HPLC analysis, were requested to provide blood samples from their partners. If the partners were found to carry a hemoglobinopathy, diagnosed by HPLC, or suspected to be carriers of α thalassemia based on the same MCV and MCH parameters but normal Hb A2, the blood samples were routinely referred for molecular analysis to detect or exclude α thalassemia carriers. If the partner’s RBC indices and the HPLC analysis were normal, the investigation of the couple was terminated, and there was no risk of having a child with a severe or clinically significant hemoglobinopathy. Otherwise, the samples were sent for further molecular analysis.

4.1. Red Blood Count Analysis

For the data analysis, we included the results of RBC and HPLC analyses of pregnant women or women of childbearing age to obtain a more homogeneous population. We excluded 39 blood counts (2.9%) with an Hb level less than 9 g/dL because the possibility of IDA or even a combination of α thalassemia carriers and IDA, which is more common among pregnant women, was likely.
The complete blood count analysis was performed on an ADVIA i2120 analyzer (SIEMENS, Forchheim, Germany, 91301). The Hb fractionation was analyzed using the HPLC Bio-Rad Variant-II set-up (Bio-Rad, Hercules, CA, USA).

4.2. Molecular Analysis

Alpha globin gene deletions were detected using the GAP-PCR methodology. For each of the common gene deletions: -α3.7 kb, -α4.2 kb, --Mediterranean, and -α20.5 kb, a set of specific primers is used. Point mutations are detected using Sanger sequencing following PCR amplification of each of the two alpha-globin genes. In rare cases, we used MLPA: Multiplex Ligation-Dependent Probe Amplification technology to detect deletions or multiplications.
According to the results of the molecular analysis, three groups were defined. The first group had an α thalassemia trait with proven molecular diagnosis; a second group included samples suspected to be α thalassemia carriers, but without proven mutations in the α globin gene detected; and a third group had suspected α thalassemia carriers but without molecular analysis. The third group included all the samples from pregnant or childbearing age women suspected to be α thalassemia carriers, whose partners had normal RBC indices and normal HPLC.
Subsequently, the results of the blood counts of the three groups of α thalassemia carriers were compared to the results of women in the same age range not suspected to be α or β thalassemia carriers and with the results of the women diagnosed with β thalassemia trait based on blood indices and HPLC analysis. The results of blood counts from non-carrier women and β thalassemia carriers were obtained from the same database of the prevention program, as described in a previous paper from our group [26].

4.3. Analysis of the Data Using Mathematical Formulas

The results of the three α thalassemia groups were also analyzed using the mathematical formulas described in the literature for detecting β thalassemia trait and the differential diagnosis from normal samples and/or iron deficiency to demonstrate the reliability of those formulas for detecting α thalassemia trait or carriers. A small number of samples of α globin carriers were included in two of the studies [15,23]

4.4. The Support Vector Machine (SVM) Algorithm

After analyzing the data with the published formulas, we applied another mathematical method based on the SVM algorithm to the same data. The SVM method that was published by our group for detecting blood counts suspected to be from β thalassemia carriers compared to normal counts or suspected of stemming from women with IDA. Details of this algorithm have been published elsewhere [50]. The SVM algorithm initially used all the relevant data from the RBC counts, including Hb, Hct, RBC, MCV, MCH, MCHC, and RDW. After checking the entire database, the algorithm chooses the most relevant values and discards the rest. We applied the previous SVM method for analyzing our data from α thalassemia carriers in the three groups described above and compared the results of the samples in the same database used in our previous study [26].
1.449994 * MCV 82.80762 10.28125 + 0.662983 * MCH 27.02007 3.904478 + 0.981605

4.5. Analysis of the Red Blood Count Indices and HPLC Results in the Two Most Common α Gene Defects

Finally, the results of a group of proven α thalassemia trait carriers with the two most common identified deletions found in our study, -3.7 kb and IVS I-1 (-5 bp) (HBA2:c.95+2_95+6delTGAGG), were analyzed. The heterozygous results were compared to the homozygous results for each deletion, and the common combinations between those deletions were analyzed.
Samples with the mutation α CD 39–HBA:c.118-120delACC, commonly called Hb Taybe, were excluded from the analysis because they presented a different, specific clinical picture and blood count ranges [34].
Since the blood samples and HPLC analyses were part of a very large-scale screening program, iron or ferritin levels were not analyzed.

4.6. Data Analysis

A students t-test or chi square was used for paired comparisons, and two-sided analysis of variance tests, assuming equal variances, were used for group comparisons. A p-Value of < 0.05 was considered significant. All analyses were performed using the SPSS statistical program version 28.

4.7. Ethics

The study was approved by the Emek Medical Center Ethics Committee, approval number EMC-52109-06, and registered at the NIH Clinical trials registry—NCT00481221.

5. Conclusions

Using mathematical formulas, especially the new generation of artificial intelligence, represented in this study by the Support Vector Machine (SVM)-developed formula, can detect individuals suspicious for being α thalassemia carriers, similar to the results of applying these formulas for detecting suspected β thalassemia carriers, as was demonstrated previously by our group [26].
This formula can and should be incorporated into the program of the automatic blood count analyzers to send an alert to the referring physician with a recommendation for further diagnostic tests, HPLC, and molecular analysis in the case of α thalassemia. By doing this, a universal screening program can be implemented without added costs, and the subsequent analysis can avoid the birth of affected children in specific circumstances, in the case of α thalassemia. Using the SVM formula can be beneficial since this formula can initially be trained on different groups of subjects, such as different genders, age groups, and different cutoffs used in different countries.
After this suggestion is implemented, additional large data studies can be conducted, with subsequent reevaluation and correction of the SVM formula, if necessary.

Author Contributions

Conceptualization: I.L.-R. and A.K.; Methodology: I.L.-R. and A.K.; Software and Support Vector Machine formula: I.L.-R.; Laboratory analysis (blood counts and HPLC): S.P., H.Z. and E.H.; Molecular analysis: D.F.; Statistics and Validation: I.L.-R. and L.L.; Investigation: A.K. and C.L.; Writing—Original Draft Preparation, A.K.; Writing—Review and Editing, I.L.-R., C.L. and A.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of Emek Medical Center—Afula—ISRAEL (protocol code 52109-06, approved on 2006, and registered at the NIH Clinical trials registry—NCT00481221.

Informed Consent Statement

Informed consent was not required since the data was collected without subjects’ identifications.

Data Availability Statement

Data is contained within the article. The whole data presented in this study are available on a local database but the data are not publicly available since in the database the whole data included subjects identifications, identifications that were deleted before performing the analysis presented in the paper.

Acknowledgments

This paper is dedicated to the memory of Boaz Lachover, Idit Lachover Roth’s first husband, who was killed in a cycling “hit and run” accident. Boaz developed the SVM formula, which is the basis of the first paper, “Detection of β thalassemia carriers by red cell parameters obtained from automatic counters using mathematical formulas” [26]. and became the basis of this study, and to Guy Koren, who helped the authors after Boaz passed away.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Higgs, D.R. The Molecular Basis of -Thalassemia. Cold Spring Harb. Perspect. Med. 2013, 3, a011718. [Google Scholar] [CrossRef] [PubMed]
  2. Koren, A. The continuing global challenges of treating patients with beta-thalassemia. Br. J. Haematol. 2023, 201, 183–184. [Google Scholar] [CrossRef]
  3. Koren, A.; Profeta, L.; Zalman, L.; Palmor, H.; Levin, C.; Zamir, R.B.; Shalev, S.; Blondheim, O. Prevention of Β Thalassemia in Northern Israel—A cost-benefit analysis. Mediterr. J. Hematol. Infect. Dis. 2014, 6, e2014012. [Google Scholar] [CrossRef]
  4. Musallam, K.M.; Lombard, L.; Kistler, K.D.; Arregui, M.; Gilroy, K.S.; Chamberlain, C.; Zagadailov, E.; Ruiz, K.; Taher, A.T. Epidemiology of clinically significant forms of alpha- and beta-thalassemia: A global map of evidence and gaps. Am. J. Hematol. 2023, 98, 1436–1451. [Google Scholar] [CrossRef] [PubMed]
  5. Koren, A.; Zalman, L.; Palmor, H.; Zamir, R.B.; Levin, C.; Openheim, A.; Daniel-Spiegel, E.; Shalev, S.; Filon, D. Sickle cell anemia in northern Israel: Screening and prevention. Isr. Med. Assoc. J. 2009, 11, 229–234. [Google Scholar]
  6. Koren, A.; Zalman, L.; Palmor, H.; Ekstein, E.; Schneour, Y.; Schneour, A.; Shalev, S.; Rachmilewitz, E.A.; Filon, D.; Openhaim, A. The prevention programs for beta thalassemia in the Jezreel and Eiron valleys: Results of fifteen years experience. Harefuah 2002, 141, 938–943, 1210. [Google Scholar]
  7. Piel, F.B.; Weatherall, D.J. The α-Thalassemias. N. Engl. J. Med. 2014, 371, 1908–1916. [Google Scholar] [CrossRef] [PubMed]
  8. Goh, L.P.W.; Chong, E.T.J.; Lee, P.-C. Prevalence of Alpha(α)-Thalassemia in Southeast Asia (2010–2020): A Meta-Analysis Involving 83,674 Subjects. Int. J. Environ. Res. Public Health 2020, 17, 7354. [Google Scholar] [CrossRef]
  9. Galanello, R.; Cao, A. Alpha-thalassemia. Genet. Med. 2011, 13, 83–88. [Google Scholar] [CrossRef]
  10. Songdej, D.; Fucharoen, S. Alpha-Thalassemia: Diversity of Clinical Phenotypes and Update on the Treatment. Thalass. Rep. 2022, 12, 157–172. [Google Scholar] [CrossRef]
  11. Dehbozorgian, J.; Moghadam, M.; Daryanoush, S.; Haghpanah, S.; Fard, J.I.; Aramesh, A.; Shahsavani, A.; Karimi, M. Distribution of alpha-thalassemia mutations in Iranian population. Hematology 2015, 20, 359–362. [Google Scholar] [CrossRef]
  12. Keikhaei, B.; Slehi-Fard, P.; Shariati, G.; Khosravi, A. Genetics of Iranian Alpha-Thalassemia Patients: A Comprehensive Original Study. Biochem. Genet. 2018, 56, 506–521. [Google Scholar] [CrossRef]
  13. Liebhaber, S.A.; Kan, Y.W. Differentiation of the MRNA Transcripts Originating from the Al- and A2-Globin Loci in Normals and a-Thalassemics. J. Clin. Inwvest. 1981, 68, 439–446. [Google Scholar] [CrossRef] [PubMed]
  14. Oron, V.; Filon, D.; Oppenheim, A.; Rund, D. Severe thalassaemia intermedia caused by interaction of homozygosity for α-globin gene triplication with heterozygosity for β thalassaemia. Br. J. Haematol. 1994, 86, 377–379. [Google Scholar] [CrossRef] [PubMed]
  15. Amendolia, S.; Cossu, G.; Ganadu, M.; Golosio, B.; Masala, G.; Mura, G. A comparative study of K-Nearest Neighbour, Support Vector Machine and Multi-Layer Perceptron for Thalassemia screening. Chemom. Intell. Lab. Syst. 2003, 69, 13–20. [Google Scholar] [CrossRef]
  16. Bordbar, E.; Taghipour, M.; Zucconi, B.E. Reliability of Different RBC Indices and Formulas in Discriminating between β-Thalassemia Minor and other Microcytic Hypochromic Anemia. Mediterr. J. Hematol. Infect. Dis. 2015, 7, e2015022. [Google Scholar] [CrossRef] [PubMed]
  17. Ehsani, M.; Shahgholi, E.; Rahimineja, M.; Seighali, F.; Rashidi, A. A New Index for Discrimination Between Iron Deficiency Anemia and Beta-Thalassemia Minor: Results in 284 Patients. Pak. J. Biol. Sci. 2009, 12, 473–475. [Google Scholar] [CrossRef] [PubMed]
  18. England, J.; Fraser, P. Discrimination between iron-deficiency and heterozygous-thalassqmia syndromes in differential diagnosis of microcytosis. Lancet 1979, 313, 145–148. [Google Scholar] [CrossRef] [PubMed]
  19. England, J.; Fraser, P. Differentiation of iron deficiency from thalassæmia trait by routine blood-count. Lancet 1973, 301, 449–452. [Google Scholar] [CrossRef]
  20. Rahim, F.; Keikhaei, B. Better Differential Diagnosis of Iron Deficiency Anemia from Beta-Thalassemia Trait. Turk. J. Hematol. 2009, 26, 138–145. [Google Scholar]
  21. Green, R.; King, R. A new red cell discriminant incorporating volume dispersion for differentiating iron deficiency anemia from thalassemia minor. Blood Cells 1989, 15, 481. [Google Scholar] [PubMed]
  22. Getta, H.A.; Yasseen, H.A.; Said, H.M. Hi & Ha, Are New Indices in Differentiation between Iron Deficiency Anemia and Beta-Thalassaemia Trait/A Study in Sulaimani City-Kurdistan/Iraq. J. Dent. Med. Sci. 2015, 14, 67–72. [Google Scholar]
  23. Mentzer, W. Differentiation of iron deficiency from thalassæmia trait. Lancet 1973, 301, 882. [Google Scholar] [CrossRef] [PubMed]
  24. Ricerca, B.M.; Storti, S.; D’Onofrio, G.; Mancini, S.; Vittori, M.; Campisi, S.; Mango, G.; Bizzi, B. Differentiation of iron deficiency from thalassaemia trait: A new approach. Haematologica 1987, 72, 409–413. [Google Scholar]
  25. Artaza, J.R.; Carbia, C.D.; Ceballo, M.F.; Díaz, N.B. Red cell distribution width (RDW): Its use in the characterization of microcytic and hypochromic anemias. Medicina 1999, 59, 17–22. [Google Scholar]
  26. Roth, I.L.; Lachover, B.; Koren, G.; Levin, C.; Zalman, L.; Koren, A. Detection of β thalassemia carriers by red cell parameters obtained from automatic counters using mathematical formulas. Mediterr. J. Hematol. Infect. Dis. 2017, 10, 2018008. [Google Scholar] [CrossRef] [PubMed]
  27. Schriever, H.; Srivastava, P. Differentiation of thalassæmia minor from iron deficiency. Lancet 1973, 302, 154–155. [Google Scholar] [CrossRef] [PubMed]
  28. Shine, I.; Lal, S. A strategy to detect β-thalassæmia minor. Lancet 1977, 309, 692–694. [Google Scholar] [CrossRef] [PubMed]
  29. Sirachainan, N.; Iamsirirak, P.; Charoenkwan, P.; Kadegasem, P.; Wongwerawattanakoon, P.; Sasanakul, W.; Chansatitporn, N.; Chuansumrit, A. New Mathematical Formula for Differentiating Thalassemia Trait and Iron Deficiency Anemia in Thalas-semia Prevalent Area: A Study in Healthy School-Age Children. Southeast Asian J. Trop. Med. Public Health 2014, 45, 174–182. [Google Scholar] [PubMed]
  30. Sirdah, M.; Tarazi, I.; AL Najjar, E.; AL Haddad, R. Evaluation of the diagnostic reliability of different RBC indices and formulas in the differentiation of the β-thalassaemia minor from iron deficiency in Palestinian population. Int. J. Lab. Hematol. 2008, 30, 324–330. [Google Scholar] [CrossRef]
  31. Urrechaga, E.; Hoffmann, J.J. Critical appraisal of discriminant formulas for distinguishing thalassemia from iron deficiency in patients with microcytic anemia. Clin. Chem. Lab. Med. 2017, 55, 1582–1591. [Google Scholar] [CrossRef]
  32. Tang, H.; Yu, R.; Yu, Z.; Xi, H. Comparison of screening indicators for different types of thalassemia carriers in Hunan Province. J. Med. Biochem. 2023, 43, 281–289. [Google Scholar] [CrossRef]
  33. d’Onofrio, G.; Zini, G.; Ricerca, B.M.; Mancini, S.; Mango, G. Automated Measurement of Red Blood Cell Microcytosis and Hy-pochromia in Iron Deficiency and Beta-Thalassemia Trait. Arch. Pathol. Lab. Med. 1992, 116, 84–89. [Google Scholar] [PubMed]
  34. Koren, A.; Levin, C.; Zalman, L.; Palmor, H.; Filon, D.; Chubar, E.; Resnitzky, P.; Bennett, M. Hb TAYBE: Clinical and morphological findings IN 43 patients. Eur. J. Haematol. 2016, 97, 137–144. [Google Scholar] [CrossRef] [PubMed]
  35. Hoffmann, J.J.; Urrechaga, E.; Aguirre, U. Discriminant indices for distinguishing thalassemia and iron deficiency in patients with microcytic anemia: A meta-analysis. Clin. Chem. Lab. Med. 2015, 53, 1883–1894. [Google Scholar] [CrossRef]
  36. Phirom, K.; Charoenkwan, P.; Shoombuatong, W.; Charoenkwan, P.; Sirichotiyakul, S.; Tongsong, T. DeepThal: A Deep Learning-Based Framework for the Large-Scale Prediction of the α+-Thalassemia Trait Using Red Blood Cell Parameters. J. Clin. Med. 2022, 11, 6305. [Google Scholar] [CrossRef] [PubMed]
  37. Hatton, C.; Wilkie, A.; Drysdale, H.; Wood, W.; Vickers, M.; Sharpe, J.; Ayyub, H.; Pretorius, I.; Buckle, V. Higgs Alpha-thalassemia caused by a large (62 kb) deletion upstream of the human alpha globin gene cluster. Blood 1990, 76, 221–227. [Google Scholar] [CrossRef]
  38. Liebhaber, S.A.; Griese, E.U.; Weiss, I.; Cash, F.E.; Ayyub, H.; Higgs, D.R.; Horst, J. Inactivation of human alpha-globin gene expression by a de novo deletion located upstream of the alpha-globin gene cluster. Proc. Natl. Acad. Sci. USA 1990, 87, 9431–9435. [Google Scholar] [CrossRef]
  39. Wilkie, A.O.M.; Lamb, J.; Harris, P.C.; Finney, R.D.; Higgs, D.R. A truncated human chromosome 16 associated with α thalassaemia is stabilized by addition of telomeric repeat (TTAGGG)n. Nature 1990, 346, 868–871. [Google Scholar] [CrossRef]
  40. Ribeiro, D.M.; Sonati, M.F. Regulation of Human Alpha-Globin Gene Expression and Alpha-Thalassemia. Genet. Mol. Res. 2008, 7, 1045–1053. [Google Scholar] [CrossRef]
  41. De Gobbi, M.; Viprakasit, V.; Hughes, J.R.; Fisher, C.; Buckle, V.J.; Ayyub, H.; Gibbons, R.J.; Vernimmen, D.; Yoshinaga, Y.; de Jong, P.; et al. A Regulatory SNP Causes a Human Genetic Disease by Creating a New Transcriptional Promoter. Science 2006, 312, 1215–1217. [Google Scholar] [CrossRef] [PubMed]
  42. Akhavan-Niaki, H.; Youssefi Kamangari, R.; Banihashemi, A.; Kholghi Oskooei, V.; Azizi, M.; Tamaddoni, A.; Sedaghat, S.; Vakili, M.; Mahmoudi Nesheli, H.; Shabani, S. Hematologic Features of Alpha Thalassemia Carriers. Int. J. Mol. Cell Med. 2012, 1, 162–167. [Google Scholar] [PubMed]
  43. Oron-Karni, V.; Filon, D.; Shifrin, Y.; Fried, E.; Pogrebijsky, G.; Oppenheim, A.; Rund, D. Diversity of alpha-globin mutations and clinical presentation of alpha-thalassemia in Israel. Am. J. Hematol. 2000, 65, 196–203. [Google Scholar] [CrossRef] [PubMed]
  44. Shaulov, A.; Filon, D.; Rund, D. Haplotype analysis of α-thalassemia chromosomes reveals heterogeneity and multiple founders in Ashkenazi Jews. Eur. J. Med. Genet. 2016, 59, 555–558. [Google Scholar] [CrossRef]
  45. Kattamis, A.C.; Camaschella, C.; Sivera, P.; Surrey, S.; Fortina, P. Human α-Thalassemia Syndromes: Detection of Molecular Defects. Am. J. Hematol. 1996, 53, 81–91. [Google Scholar] [CrossRef]
  46. Embury, S.H.; A Miller, J.; Dozy, A.M.; Kan, Y.W.; Chan, V.; Todd, D. Two different molecular organizations account for the single alpha-globin gene of the alpha-thalassemia-2 genotype. J. Clin. Investig. 1980, 66, 1319–1325. [Google Scholar] [CrossRef] [PubMed]
  47. Gilad, O.; Steinberg-Shemer, O.; Dgany, O.; Krasnov, T.; Noy-Lotan, S.; Tamary, H.; Yacobovich, J. Alpha-Thalassemia Carrier due to -alpha(3.7) Deletion: Not So Silent. Acta Haematol. 2020, 143, 432–437. [Google Scholar] [CrossRef] [PubMed]
  48. Singer, S.T. Variable Clinical Phenotypes of α-Thalassemia Syndromes. Sci. World J. 2009, 9, 615–625. [Google Scholar] [CrossRef]
  49. Arnon, S.; Tamary, H.; Dgany, O.; Litmanovitz, I.; Regev, R.; Bauer, S.; Dolfin, T.; Yacobovich, J.; Wolach, B.; Jaber, L. Hydrops fetalis associated with homozygosity for hemoglobin Taybe (α 38/39 THR deletion) in newborn triplets. Am. J. Hematol. 2004, 76, 263–266. [Google Scholar] [CrossRef]
  50. Burges, C.J.C. A Tutorial on Support Vector Machines for Pattern Recognition. Data Min. Knowl. Discov. 1998, 2, 121–167. [Google Scholar] [CrossRef]
Figure 1. The Support Vector Machine Algorithm calculated formula [26]. MCV: mean corpuscular volume; MCH: mean corpuscular hemoglobin.
Figure 1. The Support Vector Machine Algorithm calculated formula [26]. MCV: mean corpuscular volume; MCH: mean corpuscular hemoglobin.
Ijms 25 06446 g001
Table 1. Data from non-β thalassemia trait subjects, β thalassemia trait, and suspected α thalassemia trait.
Table 1. Data from non-β thalassemia trait subjects, β thalassemia trait, and suspected α thalassemia trait.
DiagnosisNon β Thalassemia Trait
Mean ± SD (Range)
β Thalassemia Trait
Mean ± SD (Range)
α Thalassemia Trait and/or Suspected α Trait
Mean ± SD (Range)
p-Value between α and β Trait
Number of individuals (%)18,572 (81.3)2936 (12.8)1334 (5.9)
RBC (×109/dL)4.2 ± 0.43 (2.17–7.67)5.42 ± 0.55
(3.21–7.81)
4.87 ± 0.44
(3.48–6.34)
<0.001
Hb (g/dL)11.75 ± 1.06
(9.00–19.2)
10.65 ± 0.95
(9.00–15.4)
11.51 ± 1.07
(9.0–15.6)
<0.001
MCV (fl)85.9 ± 6.84
(34–125.9)
63.14 ± 5.76
(48–91.5)
73.56 ± 4.52
(53.3–91.4)
<0.001
MCH (pg)28.16 ± 2.7
(16.2–40.7)
19.71 ± 1.95
(14–31.3)
23.71 ± 1.92
(16.5–29.9)
<0.001
MCHC (g/dL)32.75 ± 1.76
(12.6–45.7)
31.23 ± 1.73
(17.5–65)
32.17 ± 1.68
(19.2–36.2)
<0.001
RDW (%)14.96 ± 2.02
(10.1–36.4)
16.39 ± 2.71
(12–22.8)
15.16 ± 1.94
(12.1–28)
<0.001
Hb F (%)0.34 ± 0.61
(0–14)
2.12 ± 2.67
(0–38)
0.4 ± 0.29
(0.1–1.9)
<0.001
Hb A2 (%)1.12 ± 1.38
(0–3.4)
5.6 ± 0.8
(3.5–8.8)
2.6 ± 0.26
(2–3.3)
<0.001
SD: standard deviation; RBC: red blood cells count; Hb: hemoglobin; MCV: mean corpuscular volume; MCH: mean corpuscular hemoglobin; MCHC: mean corpuscular hemoglobin concentration; RDW: red density width; Hb F: fetal hemoglobin; Hb A2: hemoglobin A2.
Table 2. Data from α thalassemia trait with proven genetic defect, suspected α thalassemia trait not molecularly analyzed, and suspected α thalassemia trait with normal molecular analysis.
Table 2. Data from α thalassemia trait with proven genetic defect, suspected α thalassemia trait not molecularly analyzed, and suspected α thalassemia trait with normal molecular analysis.
Diagnosisα Thalassemia Trait + Mutation
Mean ± SD (Range)
α Thalassemia Trait Suspected
Mean ± SD (Range)
α Thalassemia Trait Normal Sequence
Mean ± SD (Range)
p-Value (*)
Number of individuals (%)291 (21.8%)962 (72.12%)81 (6.07%)
RBC (×109/dL)4.93 ± 0.46
(3.79–6.34)
4.87 ± 0.43
(3.48–6.27)
4.67 ± 0.36
(3.7–5.69)
<0.001
Hb (g/dL)11.58 ± 0.97
(9.0–14.8)
11.51 ± 1.1
(9.0–15.6)
11.39 ± 0.93
(9.1–13.5)
NS
MCV (fl)72.95 ± 5.17
(53.3–83.9)
73.55 ± 4.25
(57.8–91.4)
75.1 ± 4.07
(62.2–85)
NS
MCH (pg)23.61 ± 2.1
(16.5–27.1)
23.66 ± 1.84
(16.5–29.9)
24.43 ± 1.81
(18.3–28.6)
NS
MCHC (g/dL)32.26 ± 1.72
(19.2–36.10)
32.15 ± 1.33
(25.9–36.2)
32.51 ± 1.38
(29.4–35.5)
NS
RDW (%)14.91 ± 1.8
(12.3–23.4)
15.19 ± 1.87
(12.1–28.0)
15.89 ± 2.35
(12.9–26.5)
0.004
Hb F (%)0.5 ± 0.52
(0.1–7.3)
0.4 ± 0.29
(0.1–1.9)
0.4 ± 0.34
(0–5.1)
NS
Hb A2 (%)2.6 ± 0.29
(1.3–3.6)
2.6 ± 0.26
(2–3.3)
2.6 ± 0.27
(0.8–3.3)
NS
SD: Standard deviation; RBC: red blood cells count; Hb: hemoglobin; MCV: mean corpuscular volume; MCH: mean corpuscular hemoglobin; MCHC: mean corpuscular hemoglobin concentration; RDW: red density width; Hb F: fetal hemoglobin; Hb A2: hemoglobin A2. (*) p-Value compared the three parameters.
Table 3. Analysis of the results of applying published mathematical formulas to our suspected α thalassemia carriers results. The published cutoff for β thalassemia was used.
Table 3. Analysis of the results of applying published mathematical formulas to our suspected α thalassemia carriers results. The published cutoff for β thalassemia was used.
No.Study
(Reference)
FormulaβThal Cut-Offα Thal PPVα Thal NPVα Thal Specificityα Thal SensitivityPercentile 75%Percentile 95% (Lower Limit)
1Srivastava [27]MCH/RBC<3.839.3993.5999.355.825.386.04 (5.97)
2
(1)
England and Fraser,
(2)
England and Fraser (1979) [18,19]
MCV-RBC-(5-Hb)-K*<00.8793.1097.530.370.2973.37 (72.96)
3Mentzer [23]MCV/RBC<1345.7293.9499.0111.5716.4518.24 (18.09)
4Shine and Lal [28]MCV2 × MCH/100<153034.4298.8388.2285.541466.581626.52 (1607.81)
5Ricerca et al. [24]RDW/RBC<3.313.6696.8368.8168.633.384.01 (3.95)
6Green and King [21]MCV2 × RDW/(Hb × 100)<6558.4595.1598.4430.4778.2692.63 (90.12)
7D’Onofrio et al. [33]MCV/MCH>0.96.74001003.183.34 (3.33)
8Romero Artaza et al. [25]RDW × MCV/RBC<22053.6595.8697.441.64251.18290.99 (285.58)
9Sirdah et al. [30]MCV-RBC-(3XHb)<2756.6393.6999.557.1537.1941.73 (41.11)
10Ehsani et al. [17]MCV-(10 × RBC)<1545.4593.8699.110.4229.835.4 (34.8)
11Sirachainan et al. [29]1.5 × Hb–0.05 × MCV<146.0891.9532.6960.3914.5816.01 (15.85)
12Bordbar et al. [16][80-MCV]*[27-MCH]>44.769.1793.7284.6321.4637.8101.32 (93.44)
13[20]Hb × RDW × 100/RBC2 × MCHC<2163.4495.298.7230.8925.1429.3 (28.66)
14Hisham Index [22]MCH × RDW/RBC<6755.8594.9998.428.1380.8794.52 (92.59)
15Hameed Index [22]MCH × Hct × RDW/(RBC × Hb)2<2206.73001004.996.52 (6.35)
16Amendolia et al.—SVM [15]SVM—(RBC, Hb, Hct, MCV) 98.7513.5998.9957.991.651.81 (1.79)
17SVM [26]SVM (MCV and MCH) (Figure 1)<021.5299.9373.6799.33−0.230.29 (0.22)
RBC: red blood cells count; Hb: hemoglobin; MCV: mean corpuscular volume; MCH: mean corpuscular hemoglobin; MCHC: mean corpuscular hemoglobin concentration; RDW: red density width; βThal: β thalassemia; α thal: α thalassemia; PPV: positive predictive value; NPV: negative predictive value; SVM: Support Vector Machine. Figures in bold represent results above 90%.
Table 4. Comparison of red blood cell count characteristics in the two most common deletions.
Table 4. Comparison of red blood cell count characteristics in the two most common deletions.
α Globin Geneticsαα/-α3.7 kb
Mean ± STD
(Range)
-α3.7 kb/-α3.7 kb
Mean ± STD
(Range)
p *αIVS I-1/αα
Mean ± STD
(Range)
αIVS I-1/αIVS I-1
Mean ± STD
(Range)
p **αIVS I-1/
-α3.7 kb
Mean ± STD
(Range)
Number of individuals (%)13424 978 7
RBC (×109/dL)4.8 ± 0.4
(6.16–3.79)
5.3 ± 0.49
(4.37–6.21)
<0.0014.9 ± 0.36
(4.1–6.1)
5.4 ± 0.57
(4.52–5.96)
0.0025.5 ± 0.56
(4.72–6.14)
Hb (g/dL)11.6 ± 0.96
(9.1–14.2)
11.2 ± 1.01
(9.0– 13.3)
NS11.8 ± 0.88
(9.4–14.8)
10.5 ± 1.19
(9.0–11.7)
<0.00111.1 ± 0.66
(10.3–11.8)
MCV (fl)74.4 ± 4.35
(59–83.9)
68.7 ±5.6
(58.7–77.1)
<0.00173.7 ± 4.36
(53.3– 82.2)
64.9 ± 4.66
(59.1–73.6)
<0.00163.3 ± 1.41
(61.8–65.2)
MCH (pg)24.3 ± 1.75
(18–27.1)
21.8 ± 2.11
(17.1–24.6)
<0.00123.9 ± 1.63
(16.5– 26.5)
19.5 ± 2.15
(17.9–24.7)
<0.00120.2 ± 0.96
(19.2–21.9)
MCHC (g/dL)32.6 ± 1.18
(29.7–36.1)
31.7 ± 1.52
(29.2–34.4)
<0.00132.4 ± 1.2
(28.9–35.7)
30.0 ± 2.2
(25.5–33.5)
<0.00128.7 ± 6.66
(19.2–35.4)
RDW (%)14.8 ± 1.65
(12.5–21.7)
15.5 ± 2.56
(13.3–23.4)
NS14.4 ± 1.29
(12.3– 20.1)
16.3 ± 2.35
(12.6–20.5)
0.00118.8 ± 2.64
(16.5–22.6)
Hb F (%)0.5 ± 0.67
(0.1–7.3)
0.4 ± 0.38
(0.2–1.6)
NS0.5 ± 0.35
(0.1–1.8)
0.6 ± 0.26
(0.3–1.1)
NS0.5 ± 0.46
(0.2–1.1)
Hb A2 (%)2.7 ± 0.26
(1.7–3.6)
2.6 ± 0.19
(2.2–2.9)
0.092.7 ± 0.3
(1.6–3.1)
2.3 ± 0.3
(1.7–2.7)
0.0022.8 ± 0.15
(2.5–2.9)
SD: standard deviation; RBC: red blood cells count; Hb: hemoglobin; MCV: mean corpuscular volume; MCH: mean corpuscular hemoglobin; MCHC: mean corpuscular hemoglobin concentration; RDW: red density width; Hb F: fetal hemoglobin; Hb A2: hemoglobin A2. -α3.7: -α3.7 deletion in the α gene; αIVS I-1 (-5 bp) in α2 gene. p (*)—Comparison between αα/-α3.7 kb and -α3.7 kb/-α3.7 kb. p (**)—Comparison between αIVS I-1 (-5 bp)/αα and αIVS I-1(-5 bp)/αIVS I-1 (-5 bp). -α3.7 kb vs. IVS I-1 (-5 bp)—RBC p = 0.002—all the other parameters NS. -α3.7 kb/-α3.7 kb vs. αIVS I-1 (-5 bp)/αIVS I-1 (-5 bp) p 0.014 for MCH and p 0.027 for MCHC. All the other parameters NS. -α3.7 kb/αIVS I-1 (-5 bp) vs. αIVS I-1 (-5 bp)/αIVS I-1 (-5 bp)—Hb A2 p = 0.005. The other indices NS.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lachover-Roth, I.; Peretz, S.; Zoabi, H.; Harel, E.; Livshits, L.; Filon, D.; Levin, C.; Koren, A. Support Vector Machine-Based Formula for Detecting Suspected α Thalassemia Carriers: A Path toward Universal Screening. Int. J. Mol. Sci. 2024, 25, 6446. https://doi.org/10.3390/ijms25126446

AMA Style

Lachover-Roth I, Peretz S, Zoabi H, Harel E, Livshits L, Filon D, Levin C, Koren A. Support Vector Machine-Based Formula for Detecting Suspected α Thalassemia Carriers: A Path toward Universal Screening. International Journal of Molecular Sciences. 2024; 25(12):6446. https://doi.org/10.3390/ijms25126446

Chicago/Turabian Style

Lachover-Roth, Idit, Sari Peretz, Hiba Zoabi, Eitam Harel, Leonid Livshits, Dvora Filon, Carina Levin, and Ariel Koren. 2024. "Support Vector Machine-Based Formula for Detecting Suspected α Thalassemia Carriers: A Path toward Universal Screening" International Journal of Molecular Sciences 25, no. 12: 6446. https://doi.org/10.3390/ijms25126446

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop