Crucial Role of Foxp3 Gene Expression and Mutation in Systemic Lupus Erythematosus, Inferred from Computational and Experimental Approaches

Birjan, Zahra; Khashei Varnamkhasti, Khalil; Parhoudeh, Sara; Naeimi, Leila; Naeimi, Sirous

doi:10.3390/diagnostics13223442

by Zahra Birjan ¹,†, Khalil Khashei Varnamkhasti ²,†, Sara Parhoudeh ¹, Leila Naeimi ¹ and Sirous Naeimi ¹,*

¹ Department of Genetics, College of Science, Kazerun Branch, Islamic Azad University, Kazerun 73, Iran

² Department of Medical Laboratory Sciences, Faculty of Medicine, Kazerun Branch, Islamic Azad University, Kazerun 73, Iran

* Author to whom correspondence should be addressed.

† These authors contributed equally to this work.

Diagnostics 2023, 13(22), 3442; https://doi.org/10.3390/diagnostics13223442

Submission received: 11 January 2023 / Revised: 24 April 2023 / Accepted: 12 May 2023 / Published: 14 November 2023

(This article belongs to the Special Issue Precision Medicine in Autoimmunity)

Abstract

The impaired suppressive function of regulatory T cells is well-understood in systemic lupus erythematosus. This is likely due to changes in Foxp₃ expression that are crucial for regulatory T-cell stability and function. There are a few reports on the correlation between the Foxp₃ altered expression level and single-nucleotide polymorphisms within the Foxp₃ locus. Moreover, some studies showed the importance of Foxp₃ expression in the same diseases. Therefore, to explore the possible effects of single-nucleotide polymorphisms, here, we evaluated the association of IVS9+459/rs2280883 (T>C) and −2383/rs3761549 (C>T) Foxp₃ polymorphisms with systemic lupus erythematosus. Moreover, through machine-learning and deep-learning methods, we assessed the connection of the expression level of the gene with the disease. Single-nucleotide polymorphisms of Foxp₃ (IVS9+459/rs2280883 (T>C) and −2383/rs3761549 (C>T)) were, respectively, genotyped using allele-specific PCR and direct sequencing and polymerase chain reaction-restriction fragment length polymorphism, in 199 systemic lupus erythematosus patients and 206 healthy age- and sex-matched controls. The Statistical Package for the Social Sciences version 19 and Fisher’s exact and chi-square tests were used to analyze the data. Moreover, six machine-learning models and two sequential deep-learning models were designed to classify patients from normal people in the E-MTAB-11191 dataset through the expression level of Foxp₃ and its correlated genes. The allele and genotype frequencies of both polymorphisms in question were found to be significantly associated with an increased risk of systemic lupus erythematosus. Furthermore, both of the two single-nucleotide polymorphisms were associated with some systemic-lupus-erythematosus-related risk factors. Three SVM models and the logistic regression model showed an 81% accuracy in classification problems. 
In addition, the first deep-learning model showed 83% and 89% accuracy for the training and validation data, respectively, while the second model showed 85% and 79% accuracy for the training and validation datasets. In this study, we sought to identify predisposing loci for systemic lupus erythematosus pathogenesis and to provide evidence-based support for the application of machine learning in identifying systemic lupus erythematosus. We predict that combining machine-learning algorithms with the simultaneous measurement of these single-nucleotide polymorphisms will increase the diagnostic accuracy for systemic lupus erythematosus, which will be very helpful in providing sufficient predictive value for individual subjects with systemic lupus erythematosus.

Keywords: systemic lupus erythematosus; polymorphism; Foxp₃; rs2280883; rs3761549; machine learning; deep learning

1. Introduction

To avoid responsiveness against self-antigens and reduce the risk of autoimmunity, the immune system has evolved immunological tolerance mechanisms, which are categorized as central and peripheral tolerance. The primary deletion of autoreactive T or B cells takes place through central tolerance, within the primary lymphoid organs (thymus and bone marrow). Nevertheless, central tolerance is imperfect, and self-reactive cells continuously escape into the periphery. Peripheral tolerance is the key mechanism for inactivating autoantigen-recognizing T or B cells that appear in the periphery [1,2]. A unique subset of CD₄⁺ T cells, known as regulatory T (Treg) cells, are essential mediators of peripheral tolerance to self-antigens. These specialized lymphocytes, which restrain immune responses [3,4,5], arise during thymic T-cell maturation and are characterized by the expression of the interleukin-2 receptor alpha (IL-2Rα) chain (CD₂₅) and the forkhead box P₃ (Foxp₃) transcription factor [6]. Foxp₃, the master forkhead/winged-helix transcription factor, controls the regulation, differentiation, and suppressor function of Treg cells. Conversely, impaired Treg function leads to systemic autoimmune diseases, which have been found to be associated with mutations in the Foxp₃ gene (located on the X chromosome at position Xp11.23) [7,8,9,10]. A common one is systemic lupus erythematosus (SLE), with prevalence rates varying between 3.7/100,000 person-years [11]. SLE is a complex, multisystem autoimmune disease involving progressive organ damage, with auto-antibodies and self-reactive T cells contributing directly to its pathologic changes [12,13]. Impaired immune system function in SLE has recently been reported to be associated with single nucleotide polymorphisms (SNPs) in the Foxp₃ gene, which can alter its expression level and impair the suppressive function of Tregs [3]. A promoter polymorphism (−2383/rs3761549 (C>T)) and an intronic polymorphism (IVS9+459/rs2280883 (T>C)) of Foxp₃ have been reported to be associated with autoimmune disease risk [14].

Furthermore, gene expression analysis advances our understanding of the molecular mechanisms underlying SLE. Machine learning is a rapidly developing area that is widely regarded as a revolution in science. Machine learning and its more developed subfield, deep learning, could represent a solution to the challenge of interpreting big data, and could be used to obtain understandable knowledge from massive gene expression data and to help predict changes in SLE [15]. Various machine-learning methods are designed to solve classification problems. One of them is logistic regression, which uses a sigmoid unit to classify each data point based on a set of input features. The performance of a logistic regression is evaluated by a quantity called the loss. An extended logistic regression with many units arranged in several layers is called a neural network, which is the basic unit of a deep-learning model [16]. A logistic regression model can therefore be regarded as the simplest deep-learning model. Currently, neural networks are suggested as the best way to solve big-data classification problems, while classical machine-learning models are better suited to small datasets. Recently, some studies have used these models to perform classifications based on gene expression data in biological problems [17].
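The sigmoid unit and the loss described above can be illustrated with a minimal sketch. It assumes a single input feature, hand-picked (not fitted) weights, and binary cross-entropy as the loss; the toy values and the names `predict_proba` and `binary_cross_entropy` are hypothetical and are not taken from the study.

```python
import math

def sigmoid(z):
    """Map a real-valued score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(features, weights, bias):
    """Single sigmoid unit: weighted sum of the inputs passed through the sigmoid."""
    z = sum(w * x for w, x in zip(weights, features)) + bias
    return sigmoid(z)

def binary_cross_entropy(y_true, y_prob, eps=1e-12):
    """Average log loss over all samples; lower values indicate a better fit."""
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)

# Hypothetical toy data: one scaled expression feature per sample, label 1 = patient.
samples = [[0.9], [0.8], [0.2], [0.1]]
labels = [1, 1, 0, 0]
weights, bias = [4.0], -2.0  # illustrative values, not fitted parameters

probs = [predict_proba(x, weights, bias) for x in samples]
loss = binary_cross_entropy(labels, probs)
```

Training a logistic regression amounts to adjusting `weights` and `bias` so that `loss` decreases; stacking several such units in layers gives the neural network mentioned in the text.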

The core finding from the present functional study may fill the existing gaps in our understanding about genetic factors predisposing to SLE and provide a promising way to utilize genetic computational methods for the prediction of risk for SLE.

To shed new light on the molecular mechanism underlying the development of SLE, the present study aimed to assess the probable association of the IVS9+459/rs2280883 (T>C) and −2383/rs3761549 (C>T) Foxp₃ polymorphisms, as well as the association of Foxp₃ expression, with SLE through in vitro and machine-learning methods.

2. Methods

2.1. Experimental Design

For our study, blood samples were collected from 199 SLE patients whose disease had been diagnosed by a rheumatologist at the Hafez Hospital Lupus Clinic (Shiraz, Iran), based on the proper constellation of clinical findings (butterfly rash, oral ulcers, single urine protein/creatinine ratio or 24 h urine protein > 0.5 g, seizures, psychosis, myelitis, and leukopenia) and immunological evidence (including ANA level, anti-dsDNA antibodies, and low complement). Blood samples were also collected from 206 age- and sex-matched healthy subjects (controls) at the blood transfusion organization (Shiraz, Iran). All samples were kept in the Autoimmune Diseases Research Center of Shiraz University of Medical Sciences (Shiraz, Iran) until experimental analysis. Subjects with other co-occurring autoimmune or underlying diseases were excluded. The study protocol was approved by the Ethics Committee of the Islamic Azad University, Kazerun Branch (IR.IAU.KAU.REC.1398.044), and written informed consent was obtained from all participants.

2.2. DNA Isolation and Quality Control

Genomic DNA was extracted from 200 µL of whole blood using the DNP™ DNA Extraction kit (DNP Extraction Kit, Sinagen Company, Tehran, Iran) and was stored frozen at −20 °C for later use. A NanoDrop ND-2000 (Thermo, Wilmington, DE, USA) was used to assess DNA concentration and quality.

2.3. Genotyping

Selected polymorphic sites (IVS9+459/rs2280883 (T>C) and −2383/rs3761549 (C>T)) were genotyped by two independent PCR methods.

The −2383/rs3761549 (C>T) polymorphism was genotyped by the polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) technique. The amplification program, primers, restriction enzyme, and product sizes are shown in Table 1. A total of 10 µL of PCR product was added to 0.5 µL of BseNI (BsrI) restriction enzyme, 2.5 µL of buffer, and 18 µL of nuclease-free water. The mixture was then incubated for 4 h at 65 °C. Next, 15 µL of each digested PCR product, mixed with 3 µL of loading buffer, was loaded into a lane of a 3% agarose gel. The DNA bands were then visualized on a UV transilluminator, and images were taken with a gel documentation system (UVITEC, UK). Finally, the genotypes of the −2383/rs3761549 (C>T) SNP were determined.

IVS9+459/rs2280883 (T>C) was genotyped through allele-specific PCR (AS-PCR) (amplification program, primers, and product sizes are shown in Table 1) and direct sequencing method. Direct sequencing of PCR products recovered by the GEL/PCR Purification Kit (Favorgen Biotech Corp., Ping-Tung, Taiwan) was performed using Genetic Analyzer 3130 x (Applied Biosystems, Waltham, MA, USA). Sequences were analyzed with the CodonCode Aligner V.5.1.5 software (CodonCode Corporation, Centerville, MA, USA).

2.4. Data Collection and Preprocessing

We searched for microarray expression datasets in the ArrayExpress (https://www.ebi.ac.uk/arrayexpress/) and GEO (https://www.ncbi.nlm.nih.gov/geo) databases on 7 July 2022. Various datasets were selected as first-level candidates; among them, we chose the E-MTAB-11191 dataset from the ArrayExpress database. The selection criteria were the number of samples, the study design, and the platform used. The platform of the selected dataset was the Affymetrix Human Genome U133 Plus 2.0 Array. We first downloaded the raw CEL files and then generated the expression matrix with the RMA method from the affy package in the R environment. This package is designed to generate and modify expression matrices from the Affymetrix platform series. The data values were then normalized and transformed to the log2(x + 1) scale. The matrix was then annotated with Ensembl IDs, Gene Symbols, and Entrez IDs. We did not filter out any genes with typical methods such as CPM (counts per million) because we had a specific target gene to study.

2.5. Machine-Learning Model Design

First, we extracted the expression level of the Foxp₃ gene from the expression matrix. The gene had 3 probe IDs; therefore, we considered all of them in our model. The data were first transposed so that columns represented the features (genes) and rows represented the samples, each labeled as patient or normal. We had 101 samples, of which 17 were normal and 84 were patients. The data were first scaled between zero and one (a common step in machine-learning models) by the following formula:

$$\frac{G_i - \min(G)}{\max(G) - \min(G)}$$

in which $G_i$ represents the expression value of the gene in sample $i$, and $\min(G)$ and $\max(G)$ are the minimum and maximum values of the gene among all samples.
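The scaling formula can be sketched in a few lines of plain Python. The expression values below are hypothetical, and the function name `min_max_scale` is an illustrative choice rather than anything used in the study.

```python
def min_max_scale(values):
    """Rescale values to [0, 1] using (G_i - min(G)) / (max(G) - min(G))."""
    lowest, highest = min(values), max(values)
    if highest == lowest:
        # A constant feature carries no information; map it to zero.
        return [0.0 for _ in values]
    return [(v - lowest) / (highest - lowest) for v in values]

# Hypothetical log2 expression values of one probe across four samples.
expression = [6.2, 7.5, 5.9, 8.1]
scaled = min_max_scale(expression)
```

After scaling, the smallest value maps to 0 and the largest to 1, so probes measured on different ranges become comparable as model inputs.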

Six machine-learning models were created to classify the samples based on the expression level of the Foxp₃ gene: logistic regression (LR), support vector machines (SVM) with RBF (SVM_1), linear (SVM_2), and polynomial (SVM_3) kernels, a decision tree (DT), and an extra-tree classifier (ETC). To train the models, we shuffled the data and assigned 70% to the training dataset and 30% to the testing dataset. For that purpose, we used the train_test_split function from the sklearn library in Python. Each model was first trained on the training dataset and then evaluated on the test dataset.
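The shuffled 70/30 split can be sketched with the standard library alone. This is a stand-in for illustration, not the sklearn call used in the study; the seed, the rounding of the test share (rounded up here), and the name `shuffle_split` are assumptions.

```python
import math
import random

def shuffle_split(items, test_fraction=0.3, seed=0):
    """Shuffle the items and return (train, test) lists."""
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    shuffled = list(items)
    rng.shuffle(shuffled)
    n_test = math.ceil(test_fraction * len(shuffled))  # assumption: round the test share up
    return shuffled[n_test:], shuffled[:n_test]

# 101 samples as described in the text: 17 normal (0) and 84 patients (1).
labels = [0] * 17 + [1] * 84
sample_ids = list(range(len(labels)))

train_ids, test_ids = shuffle_split(sample_ids)
n_test_controls = sum(1 for i in test_ids if labels[i] == 0)
```

With 101 samples this yields 70 training and 31 test samples. Because the split is random, the number of controls landing in the test set (`n_test_controls`) varies with the seed, which matters when only 17 controls are available.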

2.6. Co-Expression Network

To find genes associated with Foxp₃, a co-expression network analysis was performed. The expression values of Foxp₃ were extracted from the matrix. Afterwards, the Pearson correlation was computed between this gene and all other genes in the main expression matrix. Genes with a correlation coefficient (CC) > 0.8 or CC < −0.8 were selected.
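The correlation filter can be sketched as follows. The gene names and expression values are hypothetical placeholders, and the helper names `pearson` and `coexpressed` are illustrative; only the Pearson coefficient and the ±0.8 cut-off come from the text.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # correlation is undefined for a constant profile
    return cov / (sd_x * sd_y)

def coexpressed(target, matrix, threshold=0.8):
    """Keep genes whose correlation with the target is > threshold or < -threshold."""
    hits = {}
    for gene, values in matrix.items():
        r = pearson(target, values)
        if abs(r) > threshold:
            hits[gene] = r
    return hits

# Hypothetical expression profiles across five samples.
target_profile = [1.0, 2.0, 3.0, 4.0, 5.0]
other_genes = {
    "GENE_A": [2.1, 3.9, 6.2, 8.0, 9.8],  # rises with the target
    "GENE_B": [5.0, 4.1, 2.9, 2.2, 1.0],  # falls as the target rises
    "GENE_C": [3.0, 1.0, 4.0, 1.0, 3.0],  # no linear relationship
}
selected = coexpressed(target_profile, other_genes)
```

Both strongly positive and strongly negative correlations pass the filter, which is why the selection rule joins the two conditions with "or".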

2.7. Deep-Learning Model

Two deep-learning models were designed using the Keras and TensorFlow libraries in the Python environment. Keras is a high-level deep-learning library built on TensorFlow. It generally supports two types of models, sequential and functional; in the current study, both models were sequential. In the first model, we considered only the Foxp₃ probe IDs as features. The model had 2 hidden layers with 10 and 5 units, respectively, followed by an output layer with one unit. An activation function was applied to the hidden layers, and the output layer used a sigmoid activation. The biases of all layers were initialized to zero, and the weights were initialized with random numbers. Adam was used as the model optimizer and, because the task was a binary classification, binary cross-entropy was used to calculate the loss. The learning rate was set to 0.0001. The model was trained for 2000 epochs, and the validation dataset comprised 0.25 of the training samples. At the end of training, the model was evaluated on the test dataset.

For the second model, we considered all genes with a correlation coefficient > 0.8 or < −0.8 with Foxp₃. The second model had three hidden layers with 25, 25, and 12 units, respectively. Other parameters were the same as in the first model. However, because of the large number of features in this model, we did not set aside a test dataset and used only a validation dataset comprising 33% of the total samples. We chose this approach because a model cannot classify well when the number of samples falls below the number of features.
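To give a sense of model size relative to the 101 available samples, the number of trainable parameters of each fully connected network can be counted directly. The sketch assumes that the first model receives the 3 Foxp₃ probe IDs as inputs, that the second receives 76 correlated-gene features, and that every layer is a plain dense layer with a bias term; these input sizes are assumptions drawn from the description, and `dense_parameter_count` is an illustrative helper.

```python
def dense_parameter_count(n_inputs, layer_units):
    """Count weights and biases in a stack of fully connected layers."""
    total = 0
    previous = n_inputs
    for units in layer_units:
        total += previous * units + units  # weight matrix plus one bias per unit
        previous = units
    return total

# Model 1: assumed 3 inputs -> 10 -> 5 -> 1 (sigmoid output)
model_1_params = dense_parameter_count(3, [10, 5, 1])

# Model 2: assumed 76 inputs -> 25 -> 25 -> 12 -> 1 (sigmoid output)
model_2_params = dense_parameter_count(76, [25, 25, 12, 1])
```

Under these assumptions the first network has 101 parameters and the second 2900, which illustrates why a larger feature set calls for more samples.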

2.8. Software and Statistics

All statistical analyses for the genotyping part were performed in SPSS Statistics 19 software. Differences in genotype and allele frequencies between the two groups were assessed with the chi-square test, and genotype distributions were tested for Hardy–Weinberg (HW) equilibrium. Bonferroni corrections were applied to correct for multiple comparisons, and the threshold for statistical significance was set at p ≤ 0.05. In the machine-learning part, all mathematical and statistical calculations were performed in the R and Python environments. We used R version 4.0.1 and R Studio for data preparation and basic statistical tests. We used Python for deep learning, machine learning, and model evaluation in the Google Colab environment (https://colab.research.google.com/, accessed on 15 July 2022). The runtime was set to TPU, a hardware accelerator designed for machine-learning workloads. Figures were drawn using Python and the matplotlib library.
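A chi-square comparison of allele frequencies with a Bonferroni-adjusted threshold can be sketched as below. The allele counts are hypothetical (the study's counts are in its Table 3, which is not reproduced here), the number of tests is assumed to be two (one per SNP), and the statistic is the uncorrected Pearson chi-square for a 2 × 2 table; SPSS settings used by the authors may differ.

```python
import math

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator

def p_value_df1(chi2):
    """Upper-tail probability of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))

def odds_ratio(a, b, c, d):
    """Odds of the risk allele in patients relative to controls."""
    return (a * d) / (b * c)

# Hypothetical allele counts: rows = patients, controls; columns = risk allele, other allele.
patients_risk, patients_other = 60, 140
controls_risk, controls_other = 30, 170

chi2 = chi_square_2x2(patients_risk, patients_other, controls_risk, controls_other)
p_value = p_value_df1(chi2)
effect = odds_ratio(patients_risk, patients_other, controls_risk, controls_other)

n_tests = 2                       # assumption: one test per SNP
bonferroni_alpha = 0.05 / n_tests
significant = p_value <= bonferroni_alpha
```

Dividing the significance level by the number of tests is one common way to apply a Bonferroni correction; multiplying each p-value by the number of tests and comparing with 0.05 is equivalent.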

3. Results

The basic demographic data of all the SLE patients are summarized in Table 2.

The main findings of our study were (1) the associations between both Foxp₃ gene polymorphisms (IVS9+459/rs2280883 (T>C) and −2383/rs3761549 (C>T)) and SLE risk, and (2) the 81% accuracy of the three SVM models and the logistic regression model in classifying samples based on gene expression data.

The −2383/rs3761549 (C>T) genotype distribution was in accordance with the Hardy–Weinberg equilibrium (control group, X² = 3.2, df = 2, HWE p-value = 0.201; patient group, X² = 4.7, df = 2, HWE p-value = 0.095). The CC, CT, and TT genotypes of the −2383/rs3761549 (C>T) polymorphism are shown in Figure 1. The genotypic and allelic distributions of the Foxp₃ rs3761549 SNP are summarized in Table 3; the CT- and TT-genotype frequencies were significantly higher in the SLE patients than in the controls. Moreover, our results indicate that the T allele of rs3761549 is a risk allele for SLE development (Table 3). Regarding the association of the rs3761549 (C>T) polymorphism with SLE risk factors such as antinuclear antibody (ANA), anti-double-stranded DNA (anti-dsDNA), complement (C3/C4), and white blood cell count (WBC), a significant relationship was found only between CT-genotype carriers and anti-dsDNA (Table 4).

An evaluation of the Hardy–Weinberg equilibrium for the rs2280883 polymorphic locus showed a nonsignificant deviation in both the control and patient populations (control group, X² = 1.4, df = 2, HWE p-value = 0.496; patient group, X² = 1.6, df = 2, HWE p-value = 0.449). Figure 2 shows the CT and TT genotypes confirmed by direct sequencing of PCR products. The relationships between the rs2280883 risk genotypes and alleles and susceptibility to SLE were analyzed (Table 3). We found a significantly increased risk for SLE associated with the CT genotype and the C allele of the rs2280883 polymorphism. Except for C3, no statistically significant association was observed between the rs2280883 SNP and the various SLE risk factors (Table 4). After Bonferroni correction, both SNPs remained significant.
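The reported Hardy–Weinberg p-values can be checked against the reported chi-square statistics. For 2 degrees of freedom, as stated in the text, the upper-tail probability of a chi-square statistic has the closed form exp(−X²/2). The sketch below recomputes the four values; the dictionary layout and names are illustrative.

```python
import math

def chi_square_p_df2(chi2):
    """Upper-tail probability of a chi-square statistic with 2 degrees of freedom."""
    return math.exp(-chi2 / 2.0)

# (chi-square statistic, reported p-value) as given in the text
reported = {
    "rs3761549 controls": (3.2, 0.201),
    "rs3761549 patients": (4.7, 0.095),
    "rs2280883 controls": (1.4, 0.496),
    "rs2280883 patients": (1.6, 0.449),
}

recomputed = {group: chi_square_p_df2(x2) for group, (x2, _) in reported.items()}
max_difference = max(abs(recomputed[g] - p) for g, (_, p) in reported.items())
```

The recomputed values agree with the reported ones to within rounding, and all four exceed 0.05, consistent with no significant deviation from equilibrium.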

3.1. Foxp₃ Expression Level Might Efficiently Classify People with or without Lupus Erythematosus through Machine-Learning Methods

We applied six machine-learning models to distinguish healthy individuals from patients. The results are shown in Table 5. Overall, all SVM models and the logistic regression produced similar outcomes, with an accuracy of 81%. In contrast, the decision tree and ETC models were 68% and 74% accurate, respectively. All the classification models were built with only one gene as the feature. None of the models showed a macro-average F1-score of more than 45%. However, the weighted-average F1-scores of all models fell in a similar range, between 65% and 72%. The first four models (logistic regression and the three SVM models) showed the highest values on all metrics.
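The difference between accuracy, macro-average F1, and weighted-average F1 under class imbalance can be illustrated with a small worked example. The test split below (25 patients and 6 controls) and the classifier that labels every sample as a patient are hypothetical; the study's confusion matrices are not given in the text, so this is not a reconstruction of its results.

```python
def f1_for_label(y_true, y_pred, label):
    """F1-score of one class, treating that class as the positive label."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == label and p != label)
    if 2 * tp + fp + fn == 0:
        return 0.0
    return 2 * tp / (2 * tp + fp + fn)

def summarize(y_true, y_pred):
    """Return (accuracy, macro-average F1, weighted-average F1)."""
    labels = sorted(set(y_true))
    n = len(y_true)
    f1 = {lab: f1_for_label(y_true, y_pred, lab) for lab in labels}
    support = {lab: y_true.count(lab) for lab in labels}
    accuracy = sum(1 for t, p in zip(y_true, y_pred) if t == p) / n
    macro_f1 = sum(f1.values()) / len(labels)            # every class counts equally
    weighted_f1 = sum(f1[lab] * support[lab] for lab in labels) / n  # weighted by class size
    return accuracy, macro_f1, weighted_f1

# Hypothetical test split: 25 patients (1) and 6 controls (0),
# with a classifier that predicts "patient" for every sample.
y_true = [1] * 25 + [0] * 6
y_pred = [1] * 31

accuracy, macro_f1, weighted_f1 = summarize(y_true, y_pred)
```

In this hypothetical case the accuracy is about 0.81, the macro-average F1 about 0.45, and the weighted-average F1 about 0.72. Accuracy and weighted F1 can therefore look strong while macro F1 stays low when one class dominates, which is why the per-class metrics are worth reading alongside accuracy.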

3.2. A Deep-Learning Model with More Features: Similar Results to the Model with One Feature

We designed two deep-learning models to assess the performance of neural networks in classification with this number of samples. The training history of both models is shown in Figure 3A,B. As the first model had only one feature (Foxp₃), we used only two hidden layers but a longer training time (2000 epochs). The model showed 83% and 89% accuracy for the training and validation datasets, respectively. The loss for both datasets had reached its minimum, and training was stopped at this point. For the second model, we considered the 76 genes that had the highest CC with Foxp₃ (Supplementary Table S1). The expression levels of all genes were normalized and then scaled between zero and one. We designed the model with three hidden layers, as the number of features was larger. However, we reduced the number of epochs to achieve better results. The model showed 85% accuracy for the training set, which is higher than that of the previous model. However, the validation accuracy reached only 79%. Therefore, the first model, with Foxp₃ as its only feature, gave better results than the second model with correlated genes. This suggests that, despite the low number of samples available for this deep-learning modelling, Foxp₃ could classify patients and healthy individuals well.

4. Discussion

It has long been suggested that genetic factors not only increase the risk of developing SLE, but also play important roles in the pathogenesis of this disease. However, the exact cause of SLE remains elusive [18], and further experiments are still needed. Thus, in this study, we aimed to determine whether Foxp₃ expression and mutation are crucial in lupus erythematosus, through computational modelling and experimental approaches. Despite the success of SNP analyses in assessing the association between genetic determinants and complex diseases, disease-risk SNPs are usually neglected. On the other hand, although SNP discovery holds great promise, SNPs may not be the sole mediator of the relation between genetics and disease. Gene expression information can also increase the power to detect the overall effect of genetics on disease risk [19]. In this paper, we combined SNP and gene expression information to introduce Foxp₃ as a mediator, which highlights the clinical significance of our findings.

In the present study, we focused on the role of Foxp₃ polymorphisms in SLE pathogenesis. Our main findings suggest that the T allele of Foxp₃ −2383 C>T (rs3761549) could be a risk allele and that the CT and TT genotypes are associated with developing SLE. Other findings are in agreement with our result; for example, in a Brazilian population, the T allele of the rs2232368 polymorphism was found to be associated with endometriosis-related infertility [20]. An association between the CT genotype of the −2383 C>T (rs3761549) polymorphism and Hashimoto’s thyroiditis and Graves’ disease has also been reported in the South Indian population [21]. Moreover, we found a significant relationship between CT-genotype carriers and positive anti-dsDNA antibody status. This suggests that rs3761549 could be considered a genetic risk factor for SLE susceptibility. Likewise, the association of the CT genotype and C allele of the IVS9+459 T>C (rs2280883) polymorphism with an increased risk for SLE in our study implies that the rs2280883 SNP predisposes to SLE. There is also a report of associations between the rs2280883 SNP and psoriasis in a Han Chinese population [22]. The rs2280883 SNP is also linked to an increased risk of Graves’ disease [14]. Additionally, the TC genotype of Foxp₃ rs2280883 was found to be associated with the risk of connective-tissue-disease-associated ILD [23]. Findings in other studies, together with the Foxp₃-polymorphism-associated genetic effect on the risk of SLE identified in this study, support our hypothesis that Foxp₃ polymorphisms might be a suitable candidate for use in autoimmune disease screening, including for SLE. However, no evidence for an association of Foxp₃ gene polymorphisms with Graves’ disease and autoimmune Addison’s disease has been found in the UK population [24]. Clearly, there is inconsistency between different ethnic populations concerning the polymorphisms responsible for disease susceptibility. We suggest that the association between such identified susceptibility loci and SLE and other autoimmune diseases should be evaluated in every population.

In the current study, we also evaluated the potential of the Foxp₃ gene expression level for patient classification. Our result is similar to the findings of a previous study, which indicated that machine-learning algorithms could potentially be applied to identify gene expression features and subjects with higher degrees of disease activity [25]. Moreover, there are reports about the importance of the expression of this gene in SLE in the Egyptian population [26]. The dataset used in this study was selected from more than 20 candidate datasets. Unfortunately, we could not select a microarray dataset with a larger number of samples because of the type of study and the platform used. Therefore, we considered only 101 samples: 17 were without the disease and the others were patients. This means that around 17% of our samples were controls, which is acceptable for a small machine-learning project. Even with this small number of samples, the outcomes showed that Foxp₃ expression levels might separate the two groups to a considerable extent. A comparison of the deep-learning and machine-learning models shows that the first deep-learning model had the best performance, with about 90% accuracy. In this model, we considered only Foxp₃ as our feature. We believe that the better performance of this model is related to the number of samples since, as a rule of thumb in deep learning, the number of features should be much smaller than the number of samples to obtain the best result. Among the machine-learning models, logistic regression and all support vector machine models showed the same result, with an accuracy of 81%. Here, again, we think that the identical results are due to the number of control samples. If this number were increased, the results would likely change. Therefore, we cannot decide which of the machine-learning models classifies better. Considering a larger sample size to evaluate the association between the polymorphisms in question and SLE in future studies will address the present study’s limitation. Likewise, further studies on patients from a variety of ethnic populations are still required to increase our knowledge base for this gene. It would also be advantageous for other genetic association studies to evaluate other potential mediators, such as DNA methylation.

5. Conclusions

Although the role of Foxp₃ in autoimmunopathies has attracted interest in numerous genetic studies, the present study attempted to identify Foxp₃ gene expression features and related polymorphisms as more plausible genetic risk factors for SLE. The present data provide an approach to considering the Foxp₃ gene as a strong genetic component with high clinical significance for SLE, which could potentially be used to identify subjects with higher disease susceptibility.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/diagnostics13223442/s1.

Author Contributions

Conceptualization, S.N.; methodology, S.N.; formal analysis, S.N., Z.B., K.K.V., S.P. and L.N.; investigation, S.N., Z.B., K.K.V., S.P. and L.N.; resources, S.N., Z.B., K.K.V., S.P. and L.N.; data curation, S.N., Z.B., K.K.V., S.P. and L.N.; writing—original draft preparation, S.N. and K.K.V.; writing—review and editing, S.N., and K.K.V.; visualization, S.N.; supervision, S.N.; funding acquisition, S.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee Review Board of the Islamic Azad University’s Kazerun Branch.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

Special thanks to the Hafez Hospital Lupus Clinic healthcare workers and all other staff.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

Treg: regulatory T cells; Foxp₃: forkhead box P₃; IL-2Rα: interleukin-2 receptor alpha chain; SLE: systemic lupus erythematosus; SNPs: single nucleotide polymorphisms; RFLP: restriction fragment length polymorphism; AS-PCR: allele-specific PCR; LR: logistic regression; SVM: support vector machine; DT: decision tree; ETC: extra-tree classifier; HWE: Hardy–Weinberg equilibrium; ANA: antinuclear antibody; anti-dsDNA: anti-double-stranded DNA; C3/C4: complement; WBC: white blood cell count.

References

1. Waldmann, H. Mechanisms of immunological tolerance. Clin. Biochem. 2016, 49, 324–328.
2. Moorman, C.D.; Sohn, S.J.; Phee, H. Emerging Therapeutics for Immune Tolerance: Tolerogenic Vaccines, T cell Therapy, and IL-2 Therapy. Front. Immunol. 2021, 12, 657768.
3. Attias, M.; Al-Aubodah, T.; Piccirillo, C.A. Mechanisms of human FoxP3⁺ Treg cell development and function in health and disease. Clin. Exp. Immunol. 2019, 197, 36–51.
4. Gliwiński, M.; Iwaszkiewicz-Grześ, D.; Trzonkowski, P. Cell-Based Therapies with T Regulatory Cells. Biodrugs 2017, 31, 335–347.
5. Mikami, N.; Kawakami, R.; Sakaguchi, S. New Treg cell-based therapies of autoimmune diseases: Towards antigen-specific immune suppression. Curr. Opin. Immunol. 2020, 67, 36–41.
6. Stadtlober, N.P.; Flauzino, T.; da Rosa Franchi Santos, L.F.; Iriyoda, T.M.V.; Costa, N.T.; Lozovoy, M.A.B.; Dichi, I.; Reiche, E.M.V.; Simão, A.N.C. Haplotypes of FOXP3 genetic variants are associated with susceptibility, autoantibodies, and TGF-β1 in patients with systemic lupus erythematosus. Sci. Rep. 2021, 11, 5406.
7. Deng, B.; Zhang, W.; Zhu, Y.; Li, Y.; Li, D.; Li, B. FOXP3⁺ regulatory T cells and age-related diseases. FEBS J. 2022, 289, 319–335.
8. Grover, P.; Goel, P.N.; Greene, M.I. Regulatory T Cells: Regulation of Identity and Function. Front. Immunol. 2021, 12, 750542.
9. Ben-Skowronek, I. IPEX Syndrome: Genetics and Treatment Options. Genes 2021, 12, 323.
10. Ono, M. Control of regulatory T-cell differentiation and function by T-cell receptor signalling and FOXP3 transcription factor complexes. Immunology 2020, 160, 24–37.
11. Barber, M.R.W.; Drenkard, C.; Falasinnu, T.; Hoi, A.; Mak, A.; Kow, N.Y.; Svenungsson, E.; Peterson, J.; Clarke, A.E.; Ramsey-Goldman, R. Global epidemiology of systemic lupus erythematosus. Nat. Rev. Rheumatol. 2021, 17, 515–532.
12. Zucchi, D.; Elefante, E.; Calabresi, E.; Signorini, V.; Bortoluzzi, A.; Tani, C. One year in review 2019: Systemic lupus erythematosus. Clin. Exp. Rheumatol. 2019, 37, 715–722.
13. Lin, Y.-C.; Lee, J.-H.; Wu, A.S.; Tsai, C.-Y.; Yu, H.-H.; Wang, L.-C.; Yang, Y.-H.; Chiang, B.-L. Association of single-nucleotide polymorphisms in FOXP3 gene with systemic lupus erythematosus susceptibility: A case-control study. Lupus 2011, 20, 137–143.
14. Tan, G.; Wang, X.; Zheng, G.; Du, J.; Zhou, F.; Liang, Z.; Wei, W.; Yu, H. Meta-analysis reveals significant association between FOXP3 polymorphisms and susceptibility to Graves’ disease. J. Int. Med. Res. 2021, 49, 3000605211004199.
15. Catalina, M.D.; Owen, K.A.; Labonte, A.C.; Grammer, A.C.; Lipsky, P.E. The pathogenesis of systemic lupus erythematosus: Harnessing big data to understand the molecular basis of lupus. J. Autoimmun. 2020, 110, 102359.
16. Lei, C. Deep Learning Basics. In Deep Learning and Practice with MindSpore; Springer: Singapore, Singapore, 2021; pp. 17–28.
17. Tarca, A.L.; Carey, V.J.; Chen, X.-W.; Romero, R.; Drăghici, S. Machine Learning and Its Applications to Biology. PLoS Comput. Biol. 2007, 3, e116.
18. Ceccarelli, F.; Perricone, C.; Borgiani, P.; Ciccacci, C.; Rufini, S.; Cipriano, E.; Alessandri, C.; Spinelli, F.R.; Scavalli, A.S.; Novelli, G.; et al. Genetic Factors in Systemic Lupus Erythematosus: Contribution to Disease Phenotype. J. Immunol. Res. 2015, 2015, 745647.
19. Huang, Y.-T.; VanderWeele, T.J.; Lin, X. Joint analysis of SNP and gene expression data in genetic association studies of complex diseases. Ann. Appl. Stat. 2014, 8, 352–376.
20. André, G.M.; Barbosa, C.P.; Teles, J.S.; Vilarino, F.L.; Christofolini, D.M.; Bianco, B. Analysis of FOXP3 polymorphisms in infertile women with and without endometriosis. Fertil. Steril. 2011, 95, 2223–2227.
21. Fathima, N.; Narne, P.; Ishaq, M. Association and gene–gene interaction analyses for polymorphic variants in CTLA-4 and FOXP3 genes: Role in susceptibility to autoimmune thyroid disease. Endocrine 2019, 64, 591–604.
22. Gao, L.; Li, K.; Li, F.; Li, H.; Liu, L.; Wang, L.; Zhang, Z.; Gao, T.; Liu, Y. Polymorphisms in the FOXP3 gene in Han Chinese psoriasis patients. J. Dermatol. Sci. 2010, 57, 51–56.
23. Yao, J.; Zhang, T.; Zhang, L.; Han, K.; Zhang, L. FOXP3 polymorphisms in interstitial lung disease among Chinese Han population: A genetic association study. Clin. Respir. J. 2018, 12, 1182–1190.
24. Owen, C.J.; Eden, J.A.; Jennings, C.E.; Wilson, V.; Cheetham, T.D.; Pearce, S.H. Genetic association studies of the FOXP3 gene in Graves’ disease and autoimmune Addison’s disease in the United Kingdom population. J. Mol. Endocrinol. 2006, 37, 97–104.
25. Kegerreis, B.; Catalina, M.D.; Bachali, P.; Geraci, N.S.; Labonte, A.C.; Zeng, C.; Stearrett, N.; Crandall, K.A.; Lipsky, P.E.; Grammer, A.C. Machine learning approaches to predict lupus disease activity from gene expression data. Sci. Rep. 2019, 9, 9617.
Abbass, A.A.; Mohamed, N.A.; Abdel-Rehim, A.S. Association of FOXP3 regulatory gene expression with systemic lupus erythematosus disease activity among Egyptian patients. Egypt J. Immunol. 2013, 20, 21–28. [Google Scholar]

Figure 1. Agarose gel electrophoresis of rs3761549 polymorphism and its restricted fragments obtained by BseNI (BsrI) digestion.

Figure 2. The electrophoretic and sequencing results of PCR products of rs2280883 polymorphism.

Figure 3. Training history of the two deep-learning models. (A) First model, with only one feature. (B) Second model, with correlated genes.

Table 1. Amplification program, primers, restriction enzyme, and product sizes used for genotyping of rs3761549 SNP, and amplification program and primers used for genotyping of rs2280883 SNP.

	rs3761549 (Promoter Region)
Type of polymorphism	Single-base C>T
Site of polymorphism	−2383
PCR primers
Forward:	5′-CTGAGACTTTGGGACCGTAGAC-3′
Reverse:	5′-ACACCACGGAGGAAGAGAAGAG-3′
PCR conditions
Denaturation:	94 °C, 5 min
Annealing:	64 °C, 30 s
Extension:	72 °C, 7 min
No. of cycles:	35
Restriction enzyme:	BseNI (BsrI)
Restriction fragment sizes (bp):	CC (183, 128, and 61 bp); CT (311, 183, 128, and 61 bp); TT (311 and 61 bp)
	rs2280883 (Intronic region)
Type of polymorphism	Single-base T>C
Site of polymorphism	IVS9+459
PCR primers
T Allele
Forward:	5′-ACCACCATCCAGGCCAGAG-3′
Reverse:	5′-GTGTGGCGCTAGGATGAAGG-3′
C Allele
Forward:	5′-AATACACCCCCAACTGGGCA-3′
Reverse:	5′-GTGTGGCGCTAGGATGAAGG-3′
PCR conditions
Denaturation:	95 °C, 5 min
Annealing:	58 °C, 1 min
Extension:	72 °C, 3 min
No. of cycles:	30
Product Size (bp):	T (368 bp) C (136 bp)
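
The fragment patterns listed in Table 1 for rs3761549 can be read as a lookup from observed bands to genotype. The following is a minimal Python sketch of that reading, not the authors' procedure: the function name `call_genotype` and the `tolerance_bp` allowance for gel sizing error are assumptions introduced here for illustration; only the fragment sizes come from the table.

```python
# Expected BseNI (BsrI) fragment sizes (bp) per genotype, copied from Table 1.
FRAGMENT_PATTERNS = {
    "CC": (183, 128, 61),
    "CT": (311, 183, 128, 61),
    "TT": (311, 61),
}


def call_genotype(observed_bands, tolerance_bp=5):
    """Return the genotype whose expected bands match the observed bands.

    tolerance_bp is a hypothetical allowance for sizing error on the gel;
    it is not a value reported in the source. Returns None if nothing matches.
    """
    observed = list(observed_bands)

    def close(a, b):
        return abs(a - b) <= tolerance_bp

    for genotype, expected in FRAGMENT_PATTERNS.items():
        # Every expected band must be seen, and no unexpected band may appear.
        all_expected_seen = all(any(close(e, o) for o in observed) for e in expected)
        no_extra_bands = all(any(close(o, e) for e in expected) for o in observed)
        if all_expected_seen and no_extra_bands:
            return genotype
    return None


print(call_genotype([310, 185, 127, 60]))
```

A lane that matches none of the three patterns returns `None`, which in practice would flag the sample for a repeat digestion rather than a forced call.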

Table 2. Demographic characteristics of participants in two groups.

Variables	Controls	Patients	p-Value
	(N = 206)	(N = 199)
Age, years	40.46 ± 10.4	34.59 ± 10.9	0.223
Range	19–61	14–71	-
Sex
Male	15 (7.7%)	16 (8%)	0.134
Female	191 (92.3%)	183 (92%)	0.134

Table 3. Genotype and allele frequency distribution of rs3761549 and rs2280883 polymorphisms in SLE patients and controls.

Gene	SNP	Controls (n = 206)	Patients (n = 199)	OR (95% CI)	Uncorrected p	Corrected p
Foxp₃	rs3761549
	CC	145 (66.1%)	91 (46.7%)	1	<0.001	<0.003
	CT	61 (33.9%)	107 (52.8%)	2.2 (1.4–3.3)
	TT	0 (0)	1 (0.5%)	-
	C	299 (83.1%)	285 (47.5%)	1
	T	61 (16.9%)	315 (52.5%)	2.2 (1.4–3.3)	<0.001	<0.007
	rs2280883
	TT	117 (56.4%)	77 (45.6%)	1	0.037	0.045
	CT	89 (43.6%)	122 (54.4%)	0.6 (0.4–0.9)
	CC	0 (0)	0 (0)	-
	T	260 (72%)	289 (66%)	1
	C	143 (28%)	89 (34%)	0.5 (0.4–0.7)	<0.001	<0.005

Note: Corrected p-values were calculated by using Bonferroni’s correction.
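
For readers who want to see how an odds ratio, its 95% confidence interval, and a Bonferroni-corrected p-value are typically obtained from genotype counts, here is a minimal Python sketch. It assumes a Wald (log-scale) interval and a user-supplied number of comparisons; the source does not state which interval method or how many comparisons were used, so the function names, the `z=1.96` default, and `n_tests` are illustrative assumptions. The example call uses the rs3761549 CT and CC counts from Table 3, and its output need not reproduce the published values.

```python
import math


def odds_ratio_wald_ci(case_exposed, case_unexposed,
                       control_exposed, control_unexposed, z=1.96):
    """Odds ratio with a Wald confidence interval from a 2x2 table.

    z=1.96 gives an approximate 95% interval. All four counts must be > 0.
    """
    odds_ratio = (case_exposed * control_unexposed) / (case_unexposed * control_exposed)
    # Standard error of log(OR): square root of the summed reciprocal counts.
    se_log_or = math.sqrt(1 / case_exposed + 1 / case_unexposed
                          + 1 / control_exposed + 1 / control_unexposed)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


def bonferroni(p_value, n_tests):
    """Bonferroni correction: multiply by the number of tests, cap at 1."""
    return min(1.0, p_value * n_tests)


# CT versus CC (reference) for rs3761549, counts taken from Table 3.
or_ct, lo_ct, hi_ct = odds_ratio_wald_ci(
    case_exposed=107, case_unexposed=91,
    control_exposed=61, control_unexposed=145,
)
print(f"OR = {or_ct:.2f} (95% CI {lo_ct:.2f}-{hi_ct:.2f})")
```

The correction itself is a single multiplication, so the main judgment call is the value passed as `n_tests`, which should equal the number of comparisons actually performed.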

Table 4. Association of rs3761549 and rs2280883 polymorphisms with SLE-related risk factors.

rs3761549	Genotypes (%)		OR (95% CI)	p-Value
rs3761549	CC	CT	OR (95% CI)	p-Value
ANA
Negative	28	29	1
Positive	63	76	1.16 (0.6–1.2)	0.6
C3
Normal	52	58	1
Decrease	25	31	1 (0.5–2.1)	0.7
Increase	7	7	0.8 (0.2–2.7)	0.8
C4
Normal	66	76	1
Decrease	10	14	1.2 (0.5–2.9)	0.6
Increase	9	6	0.5 (0.19–1.7)	0.3
Anti-dsDNA
No	47	39	1
Yes	44	66	1.8 (1.03–3.1)	0.04
WBC
Normal	85	95	1
Decrease	4	10	2.2 (0.6–7.5)	0.17
rs2280883	Genotypes (%)		OR (95% CI)	p-Value
rs2280883	TT	CT	OR (95% CI)	p-Value
ANA
Negative	33	25	1
Positive	73	65	1.17 (0.6–1.3)	0.6
C3
Normal	69	43	1
Decrease	24	31	2 (1.07–3.9)	0.02
Increase	6	8	2 (0.6–6.9)	0.18
C4
Normal	83	61	1
Decrease	9	14	2 (0.8–5.2)	0.1
Increase	8	7	1.19 (0.3–3.4)	0.7
Anti-dsDNA
No	50	38	1
Yes	56	52	1.2 (0.6–2.1)	0.4
WBC
Normal	99	83	1
Decrease	7	7	0.7 (0.4–3.2)	0.7

Table 5. Machine-learning results for FOXP3-based model.

	Logistic Regression	SVM_1	SVM_2	SVM_3	Decision Tree	ETC
Accuracy	0.81	0.81	0.81	0.81	0.68	0.74
Macro Avg	0.45	0.45	0.45	0.45	0.40	0.43
Weighted Avg	0.72	0.72	0.72	0.72	0.65	0.69
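
Accuracy and class-averaged scores can diverge sharply when classes are imbalanced, which is worth keeping in mind when reading Table 5. The Python sketch below illustrates this with entirely hypothetical labels: it assumes the "Macro Avg" and "Weighted Avg" rows are averages of per-class F1 scores (the table does not say which metric is averaged), and the 81:19 class split and the always-predict-majority classifier are constructed for illustration, not taken from the E-MTAB-11191 dataset or from the authors' models.

```python
def f1_for_label(y_true, y_pred, label):
    """F1 score for one class; defined as 0.0 when the class is never predicted correctly."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def summarize(y_true, y_pred):
    """Return (accuracy, macro-averaged F1, support-weighted F1)."""
    n = len(y_true)
    labels = sorted(set(y_true))
    accuracy = sum(1 for t, p in zip(y_true, y_pred) if t == p) / n
    f1_scores = {label: f1_for_label(y_true, y_pred, label) for label in labels}
    macro_f1 = sum(f1_scores.values()) / len(labels)
    weighted_f1 = sum(f1_scores[label] * y_true.count(label) / n for label in labels)
    return accuracy, macro_f1, weighted_f1


# Hypothetical data: 81 samples of class 1, 19 of class 0,
# and a classifier that always predicts the majority class.
y_true = [1] * 81 + [0] * 19
y_pred = [1] * 100

accuracy, macro_f1, weighted_f1 = summarize(y_true, y_pred)
print(round(accuracy, 2), round(macro_f1, 2), round(weighted_f1, 2))
# prints: 0.81 0.45 0.72
```

Under these assumptions a model that never identifies the minority class still reaches 0.81 accuracy, so per-class precision and recall are more informative than accuracy alone when judging such classifiers.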

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
