Next Article in Journal
Postmortem Protein Degradation as a Tool to Estimate the PMI: A Systematic Review
Previous Article in Journal
Neuromuscular Ultrasound in the Pediatric Population
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Vaginal Microbiome-Based Bacterial Signatures for Predicting the Severity of Cervical Intraepithelial Neoplasia

1
Department of Obstetrics and Gynecology, School of Medicine, Kyungpook National University, Daegu 41404, Korea
2
Department of Obstetrics and Gynecology, Kyungpook National University Chilgok Hospital, Daegu 41404, Korea
3
Clinical Omics Research Center, School of Medicine, Kyungpook National University, Daegu 41940, Korea
4
Department of Applied Biosciences, Kyungpook National University, Daegu 41566, Korea
5
Department of Biomedical Convergence Science & Technology, Kyungpook National University, Daegu 41566, Korea
6
Department of Physiology, School of Medicine, Kyungpook National University, Daegu 41405, Korea
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this study and share the first authorship.
Diagnostics 2020, 10(12), 1013; https://doi.org/10.3390/diagnostics10121013
Submission received: 19 October 2020 / Revised: 15 November 2020 / Accepted: 25 November 2020 / Published: 26 November 2020
(This article belongs to the Section Diagnostic Microbiology and Infectious Disease)

Abstract

:
Although emerging evidence revealed that the gut microbiome served as a tool and as biomarkers for predicting and detecting specific cancer or illness, it is yet unknown if vaginal microbiome-derived bacterial markers can be used as a predictive model to predict the severity of CIN. In this study, we sequenced V3 region of 16S rRNA gene on vaginal swab samples from 66 participants (24 CIN 1−, 42 CIN 2+ patients) and investigated the taxonomic composition. The vaginal microbial diversity was not significantly different between the CIN 1− and CIN 2+ groups. However, we observed Lactobacillus amylovorus dominant type (16.7%), which does not belong to conventional community state type (CST). Moreover, a minimal set of 33 bacterial species was identified to maximally differentiate CIN 2+ from CIN 1− in a random forest model, which can distinguish CIN 2+ from CIN 1− (area under the curve (AUC) = 0.952). Among the 33 bacterial species, Lactobacillus iners was selected as the most impactful predictor in our model. This finding suggests that the random forest model is able to predict the severity of CIN and vaginal microbiome may play a role as biomarker.

1. Introduction

Cervical cancer is common in women worldwide [1], and 95–100% of patients with invasive cervical cancer are infected with human papillomavirus (HPV) [2]. The recognition that HPV is the causative agent of cervical cancer has changed the perception of cervical cancer screening. Compared to primary screening with Pap cytology, the primary HPV test has superior sensitivity for detecting cervical cancer [3]. Although HPV infection is necessarily a cause, it is not determinant of the development of cervical cancer. Most HPV infections are eliminated and only a small fraction of infected women progress to cervical intraepithelial neoplasia (CIN) or cervical cancer [4]. Thus, a persistent infection may not be enough to drive full tumorigenesis. The major risk factors for cervical cancer have been reported to be immune status (immunocompromised), high parity, smoking, combined oral contraceptive use, and other sexually transmitted infections, particularly chlamydia [5,6,7]. More recently, there has been evidence that the vaginal microbiome can either protect or stimulate CIN or cervical cancer progression [8].
Previous studies have demonstrated that the composition of the vaginal microbiome was associated with the acquisition, clearance, or persistence of HPV infection and the severity of CIN [9,10]. With its proven biomarker potential, high diversity microbiome has been frequently shown to have correlation with HPV status and different severity levels of CIN, suggesting that it is a potential predictor for the progression of disease related to HPV infection. Indeed, the presence of Atopobium vaginae, Gardnerella vaginalis, and Lactobacillus iners together with low levels of Lactobacillus crispatus was considered as the most hazardous combination for the development of CIN [11]. Moreover, a recent study showed that Bacteroides fragilis, Lactobacillus delbrueckii, and Streptococcus agalactiae have an indirect association with CIN status [12].
A previous study applied a machine learning technique to classify microbial communities according to their bacterial vaginosis characteristics [13]. Furthermore, several attempts have been made to predict diseases, such as nonalcoholic fatty liver, hypertension, and liver cirrhosis, using gut microbiome-based machine learning [14,15,16]. However, machine learning for predicting the severity of CIN based on microbiomes has not yet been reported. Therefore, in this study, we investigated the vaginal microbiome of a well-characterized cohort of participants with biopsy-proven patients in this study. We aimed to develop a panel of vaginal microbiome-derived biomarkers of the severity of CIN using a machine learning-based random forest classification model.

2. Materials and Methods

2.1. Study Population and Sample Collection

The ethical approval for the present study was acquired by the Institutional Review Board of Kyungpook National University Chilgok Hospital (KNUMC 2015-10-033) (16-11-2015), Republic of Korea. All participants gave written informed consent and experiments performed according to the Declaration of Helsinki. Vaginal smears were collected using pap brush-lines (Bion, Korea) from healthy women and patients with CIN after obtaining approval from our hospital’s Institutional Review Board, and written informed consent from the subjects. HC (Healthy Control) to CIN 1 was defined as CIN 1− and CIN 2+ to cervical cancer was defined as CIN 2+. The brush was placed into the posterior vaginal fornix and rotated 360 degrees clockwise. The samples were transported in DNase, RNase, and pyrogenic-free tubes and immediately stored at −80 °C.

2.2. HPV-Assay and HPV Genotyping

HPV genotyping was performed with cervico-vaginal swab specimens using the Anyplex II HPV 28 assay kit (Seegene, Korea). The Anyplex II HPV 28 assay was carried out following the manufacturer’s instructions. Briefly, the five μL of DNA was used in each of 2 20-μL reactions with primer set A or B. In the assay, HPV-specific dual priming oligonucleotides were used for multiplex (real-time) PCR. A total of 28 HPV types were tested to simultaneously detect 18 high-risk types (HPV 16, 18, 26, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, 68, 69, 73, and 82) and eight low-risk types (HPV 6, 11, 40, 42, 44, 53, 54, and 70).

2.3. DNA Extraction and Ion Torrent Sequencing

Bacterial genomic DNA was extracted using the QIAamp PowerFecal Pro DNA Kit (QIAGEN, Germany) according to the manufacturer’s instructions. Bacterial DNA concentration was evaluated using a Qubit® 3.0 Fluorometer (Invitrogen, Waltham, MA, USA) and the isolated DNA was immediately stored at −70 °C until further processing. For the 16S rRNA gene sequencing, the DNA in each sample was amplified with the primers targeting the V3 hypervariable regions of the bacterial 16S ribosomal RNA gene and PCR was performed as described in a previous study [17]. The amplicons were purified using a QIAquick gel extraction kit (QIAGEN, Germany). The purified libraries were pooled in equimolar concentrations and sequenced using the Ion Torrent PGM for 1250 flows with the Ion PGM™ Hi Q Sequencing Kit (Thermo Fisher, Waltham, MA, USA) according to the manufacturer’s instructions.

2.4. Bioinformatics Analysis

The amplicon sequencing reads were assessed by FastQC and Trimmomatic [18] used for the pre-processing to acquire the clean reads from raw FASTQ files. We then removed Chimeric sequences, picked operational taxonomic unit (OTU), and performed taxonomic assignment using Quantitative Insights Into Microbial Ecology (QIIME) v1.9.1 [19] and Microbiome Helper package [20]. In order to remove chimeric sequences in our sequences, the script chimera_filter.pl was used based on VSEARCH v1.11.1 [21]. For the OTU level identification, we aligned the processed sequencing reads into OTUs with 97% identity cut-off value with closed-reference and Greengenes database version 13.8. All samples inside of the final OTU table were normalized to equal depths (7231 reads) for the further analysis. To increase bacterial resolution at the species, we further built a custom bacterial identification database that is suitable for the V3 region 16S rRNA gene. Our database is based on the vaginal-microbiota specific database constructed by Fettweis et al. [22], and This database was published in 2012, originally constructed for the taxonomic assignment of hypervariable V1–V3 regions of 16S rRNA gene to show accurate classification at the species level. Several conflicts regarding taxonomic identification between our database and greengenes annotations happened; we then carried out BLAST searches for every OTU with hits at the NCBI nucleotide database and curated manually our database, excluding uncultured microorganisms. Lastly, all OTUs assigned to more than one species followed rules described by Lennard et al. [23].

2.5. Data and Statistical Analysis

Differences between subsets were evaluated using Student’s t-test, and the differences between proportions were compared using the Chi-square test or Fisher’s exact test. Data analysis was performed on a rarefied OTU table, and the relative and actual abundance were used for all statistical analyses. General analyses of the datasets including principle component analysis (PCA) and non-metric multi-dimensional scaling (NMDS) were conducted in R [24] with tidyr and ggplot2 packages. Phylogenies were manipulated as described by Youngblut et al. [25] and an ANOVA test was applied to test the statistical significance of alpha diversity estimates among groups at different CSTs (Shannon Index and Observed OTUs). The Wilcoxon test was applied for measuring differences in bacterial abundance. A Random Forest (RF) model was constructed to model the severity of CIN based on microbial signatures using the randomForest package in R [26,27]. The abundance of all the bacterial species identified in the CIN 1−and CIN 2+ samples were used as features for building this classifier. The sample sets (24 CIN 1− samples, and 42 CIN 2+ samples) were randomly split into 2 sets (training and test sets). Optimal parameters of the random forest classifier model were obtained through grid search on a five-run 10-fold cross validation procedure using a caret package. The first model was constructed using all the bacterial species variables. Initially, the top 40 discriminating features were obtained based on their importance according to the mean decrease Gini score (indicators of purity by classification splits of given variables). The obtained microbial features were gradually added into the model in order to create the final model. The area under the curve (AUC) of the receiver operating characteristics (ROC) curve was computed to select the best model and it was measured on the test set. To acquire the best threshold in our prediction model, we further analyzed cut-off value, specificity, and sensitivity with bootstrapping method (2000 times).

3. Results

3.1. Participants’ Characteristics

Sixty-six women in total were included in this study. CIN 1− accounted for 24 women and CIN 2+ for 42 women. The basic characteristics of the study subjects are presented in Table 1. Mean age of the participants in the CIN 2+ group was significantly lower than in the CIN 1− group (42.7 ± 13.2 years vs. 49.2 ± 7.3 years, p = 0.0313). However, menopausal status was similar between the two groups (37.5% vs. 26.2%, p = 0.3399). HPV infections were detected in seven patients (29.2%) of the CIN 1− group and in 41 patients (97.6%) of the CIN 2+ group (p < 0.0001). Moreover, HPV 16/18 was more frequent in the CIN 2+ group than in the CIN 1− group (8.3% vs. 52.4%, p = 0.0004).

3.2. Vaginal Microbiome

Using bacterial amplicon sequencing data, our samples were classified into different bacterial community state types (CSTs) according to the previous description of the vaginal microbiome [28] (Figure 1). The most common CST type was CST 3 (L. iners, 36.4%), followed by CST 4 (Diverse group, 47%) (Table 2). Based on the previous classification, CST 4 is grouped into CST 4-A and CST 4-B types. CST 4-A is generally characterized by genera such as Anaerococcus, Corynebacterium, Finegoldia, and Streptococcus. CST 4-B, on the other hand, is dominated by Gardnerella, Parvimonas, Prevotella, and Sneathia with low proportions of Lactobacillus [29]. In our study, out of all participants, samples from eleven participants (16.7%) were dominated by Lactobacillus amylovorus, which we proposed as CST* type. Notably, mapping phylum, order, and genus-level relative abundance onto the HPV infection did not demonstrate any clustering of microbiome composition. Vaginal microbiome composition according to the CSTs was evaluated in every sample (CST 3, CST 4-A, CST 4-B, and CST*). Vaginal communities dominated by aerobic bacteria (CST 4-A) had the highest Shannon index and observed OTUs, while in samples where anaerobic bacteria was dominant (CST 4-B), it showed the lowest Shannon index and observed species (Figure 2A). CST classification based on the bacterial species of this cohort showed that the observed clustering trend was significant (NMDS stress = 0.175, p < 0.001) (Figure 2B).

3.3. Differences between the Vaginal Microbiomes of CIN 1− and CIN 2+

Alpha diversity indices (Shannon index and observed OTUs), which account for the biodiversity of vaginal microbiomes, did not significantly differ according to CIN severity. Principle component analysis (PCA) showed that the full microbial community structure of CIN 1− was not significantly different (p = 0.443) from that of CIN 2+ (Figure S1). These results suggest that the full microbial community structures of CIN 1− and CIN 2+ were not distinguishable. Non culture-based analyses of the vaginal microbiome showed different taxonomic compositions according to severity of CIN (Figure 3A). There was no statistical difference between CIN 1- and CIN 2+ at the phylum level and. However, at the species level, Unclassified Prevotella, Corynebacterium coyleae, and Streptococcus canis were the most significantly different (p < 0.05) organisms between CIN 1− and CIN 2+. Most importantly, the relative abundance of the five Streptococcus species viz., S. agalactiae, S. anginosus, S. canis, S. vestibularis, and S. massilensis, and L. iners, was higher in CIN 2+ than in CIN 1−. Lactobacillus johnsonii was higher in the CIN 1− group compared to the CIN 2+ group (Figure 3B).

3.4. Vaginal Microbiome-Derived Signature Can Predict the Severity of CIN

A Random forest (RF) model was constructed to determine which bacterial species can be potential biomarkers for predicting the severity of CIN. Receiver operating characteristics (ROC) analysis was applied to confirm maximum area under curve (AUC) in order to select specific species as biomarkers and thereby build an optimal model. The highest accuracy was found when 33 bacterial species were selected as optimal marker sets in the random forest model with the lowest number of species (Figure 4A). The optimal 33 features from vaginal samples were selected according to a random forest classifier model to determine whether the model can predict severity of CIN (Figure 4B). The top 33 mean decrease Gini scores of the bacterial species are shown in Figure 4A. The diagnostic value of this model based on those 33 discriminating species was measured according to the AUC and our final model achieved an AUC value of 0.952 (95% CI 0.82–1.00, sensitivity = 1.00, specificity = 0.857, Figure 4C). Among the 33 most impactful bacterial species, L. iners was selected as the most discriminating predictor in our model. Also, L. Johnsonii, S. agalactiae, S. anginosus, and S. canis were selected as impactful predictors. Overall, these data imply that the selected 33 microbial features suggest further evidence for utilizing vaginal microbiomes as biomarkers and highlight their potential usefulness for detecting the severity of CIN.

4. Discussion

Machine learning techniques has been widely used to assess the relationship between microbiome and disease status [30]. However, the potential role and diagnostic accuracy of vaginal microbiome-derived bacterial signatures for predicting CIN severity remains rare. Thus, in this study, we explored several vaginal microbiome-derived bacterial signatures and develop a prediction model using a random forest for predicting the severity of CIN. Vaginal microbiome has previously been classified into six CST levels according to the hierarchical clustering of bacterial species [31], Although our results did not show CST 1, 2, or 5 CST levels. Demographic characterization of each cohort and technical sequencing approaches for identifying bacterial species might have caused this difference [32]. Primers targeting the V4 region of the 16S rRNA gene were used previously [33]. In the present study, on the other hand, we used primer pairs targeting the V3 region, which might contribute to the low abundance of bacterial species such as L. crispatus, L. gasseri, and L. jensenii. Notably, our results showed that L. amylovorus was found to be one of the dominant types instead. Similar to the present study, MacIntyre et al. [34] reported that L. amylovorus was detected exclusively in Asian women compared to white and black women. Therefore, our finding suggest that since Asian women are more likely to have L. amylovorus, it is worth considering L. amylovorus as belonging to a new CST classification, CST*.
Most HPV infections are a spontaneous regression due to local immune response. Additional factors might influence the progression of CIN to cervical cancer or its regression. The cervical microbial environment is composed of immune-related cells and its specific microbiota may contribute to the progression of CIN. In the previous studies, prediction of CIN progression was mostly based on HPV infection [35]. However, in this study, we classified the severity of CIN based on bacterial species and we developed a RF model to predict the severity of CIN. Therefore, the preliminary results of this study demonstrated that RF-based ROC analysis can predict the severity of CIN. In addition to this, we described the diagnostic test accuracy of vaginal microbiome-based biomarkers to predict the severity of CIN. Most of the 33 bacterial species used for building each model in this study were directly or indirectly linked to vaginal health [36,37,38,39,40]. Our study showed that the RF model had 33 relevant microbial signatures with differential abundance between CIN 1− and CIN 2+. Furthermore, this model had considerably high accuracy for predicting the severity of CIN (AUC = 0.952) (Figure 4C).
According to our model, L. iners was the most impactful predictor in the model (Figure 4). Interestingly, a previous study reported that L. iners was a predominant part of the microbial community under the presence of cervical cancer and precancerous lesions in women [41]. Furthermore, the cervical mucosal community dominated by L. iners was associated with a higher grade of CIN in women infected with HPV [10]. Additionally, S. agalactiae and S. anginosus showed high abundance, although non-significant, in CIN 2+ compared to CIN 1−. S. agalactiae, also known as Group B Streptococcus, is a facultative Gram-positive organism and an important pathogen in aerobic vaginitis. It rarely causes infections in healthy adults; however, it may occasionally cause morbidity in older women, pregnant women, or in patients with underlying predisposing conditions [42]. There are limited studies on the association between S. agalactiae and CIN. Only one study showed that S. agalactiae was associated with HPV infection and CIN 2+ lesion [12]. Moreover, we observed the highest abundance of S. anginosus in the genus Streptococcus among patients with CIN2+. Recently, Tao et al. [43] revealed that S. anginosus caused the lysis of vaginal epithelial cells. Furthermore, several studies showed the association of certain diseases with S. anginosus bacteremia [44,45]. Furthermore, Sasaki et al. [46,47] also reported that the presence of S. anginosus was associated with cancers of the upper digestive tracts Thus, our findings contribute important information about the relationships between L. iners, S. agalactiae, and S. anginosus, and the severity of CIN.
Progression of CIN can be detected and predicted with HPV genotyping, proteomics, DNA methylation, or according to the microbial community. The former 3 have been actively studied [35,48,49], but the latter has not been reported yet. HPV testing is used for primary screening of cervical cancer. However, since most HPV infections are harmless, additional triage testing is necessary to properly identify HPV-positive women with a cervical precancerous or cancerous lesion. CIN 2+ has a high probability (30%) of becoming an invasive cancer and detection of CIN 2+ in primary screening is crucial for preventing cervical cancer. Our model showed high accuracy in predicting the severity of CIN. The model would provide a huge benefit for the early diagnosis of women with CIN 2+ and it could be used as a powerful decision-making tool. In this context, it is very plausible that the biomarkers identified in this study could potentially predict the severity of CIN. Predicting the severity of CIN based on the identified biomarkers confers new insights for the prevention of cervical cancer. Hence, we suggest further investigation to quantitatively and qualitatively characterize the identified individual bacterial species and the bacterial community in future studies. The RF method is a machine learning algorithm with the capacity to identify an optimal set of variables with high discriminative power from many dependent or independent variables. The independent signatures can be viewed as a minimized representation of the microbial community for manipulating the dysbiotic microbiome. A few studies used RF machine learning to predict the status of various diseases. Beck et al. demonstrated that machine learning techniques are able to classify microbial communities according to their bacterial vaginosis characteristics with 90 to 95% accuracy [50]. Recent studies also have used the vaginal microbiome as a tool to detect the stage of endometriosis with RF-based machine-learning classification analysis [51]. To the best of our knowledge, the current study is the first study that reports a vaginal microbiome-based RF model for predicting the severity of CIN. Our study built a RF model consisting of 33 relevant bacterial species with differential abundance between CIN 1− and CIN 2+.
In summary, key strength of this current study includes a prospective prediction model, which successfully predicted women at risk of CIN 2+ with a high AUC score. Moreover, we identified 33 bacterial species that are differently abundant and present in both CIN 1− and CIN 2+ from a well-characterized cohort based on the vaginal microbiome. However, we further acknowledge the weakness of our study: (1) The sample size was small and this cohort does not represent the entire Korean population, which have under- or overestimated our prediction model and our study was based on 16S rRNA gene sequencing. Hence, further studies are warranted to increase sample size to validate our prediction model and perform whole metagenome sequencing to fine-tune the prediction model. Therefore, the generalization of this model should be cautious. (2) Our findings only provide preliminary potential of an association between vaginal microbiota and severity of CIN. Therefore, future clinical studies are needed to assess how these bacterial species play a role in vaginal health, affecting the severity of CIN in terms of causality. (3) Future studies should consider performing immunological assay to unravel how hosts interact with its microbiota. (4) We did not evaluate the association between ages and health status. The age difference may be a confounding factor in terms of disease progression. Further studies are essential to evaluate how ages influence the severity of CIN.

5. Conclusions

In conclusion, our study suggests the potential associations between vaginal bacterial species and CIN development. Furthermore, utilizing random forest classifiers to predict the severity of CIN can aid patients in the prevention of cervical cancer. This finding may provide an initial step toward exploring the vaginal microbiome for decision-making purposes during CIN development.

Supplementary Materials

Supplementary materials can be found at https://www.mdpi.com/2075-4418/10/12/1013/s1, Figure S1: Biplot of Principal component analysis (PCA) of the vaginal microbiome according to CIN severity. The PCA plot showed the variation according to severity of CIN. The red circle represents CIN 1−, and the blue triangle represents CI N2+. The arrow indicates the direction of strength of each clinical feature to the overall correlation.

Author Contributions

Conceptualization, G.O.C., J.-H.S., and G.-U.K.; methodology, Y.H.L., G.-U.K., H.Q.P., S.Y.J., M.-S.K., S.A., D.-R.J., and Y.-J.P.; data curation, Y.H.L., G.-U.K., and G.O.C.; writing—original draft preparation, Y.H.L., G.-U.K., and G.O.C.; writing—review and editing, Y.H.L., G.-U.K., G.O.C., S.B.T., and J.-H.S.; visualization, Y.H.L., G.-U.K., and G.O.C.; supervision, G.O.C., J.-H.S., and H.S.H.; project administration, G.O.C.; funding acquisition, G.O.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by a Biomedical Research Institute Grant, Kyungpook National University Hospital (2019).

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

CINCervical intraepithelial neoplasia
RFRandom Forest
ROCReceiver operating characteristic
AUCArea under curve
CSTCommunity state type
HPVHuman papillomavirus

References

  1. Arbyn, M.; Castellsagué, X.; de Sanjosé, S.; Bruni, L.; Saraiya, M.; Bray, F.; Ferlay, J. Worldwide burden of cervical cancer in 2008. Ann. Oncol. 2011, 22, 2675–2686. [Google Scholar] [CrossRef] [PubMed]
  2. Okuma, K.; Yamashita, H.; Yokoyama, T.; Nakagawa, K.; Kawana, K. Undetected human papillomavirus DNA and uterine cervical carcinoma. Strahlentherapie und Onkologie 2016, 192, 55–62. [Google Scholar] [CrossRef]
  3. Ronco, G.; Dillner, J.; Elfström, K.M.; Tunesi, S.; Snijders, P.J.; Arbyn, M.; Kitchener, H.; Segnan, N.; Gilham, C.; Giorgi-Rossi, P. Efficacy of HPV-based screening for prevention of invasive cervical cancer: Follow-up of four European randomised controlled trials. Lancet 2014, 383, 524–532. [Google Scholar] [CrossRef]
  4. Doorbar, J.; Egawa, N.; Griffin, H.; Kranjec, C.; Murakami, I. Human papillomavirus molecular biology and disease association. Rev. Med. Virol. 2015, 25, 2–23. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Westrich, J.A.; Warren, C.J.; Pyeon, D. Evasion of host immune defenses by human papillomavirus. Virus Res. 2017, 231, 21–33. [Google Scholar] [CrossRef] [PubMed]
  6. Hellberg, D. Sex steroids and cervical cancer. Anticancer Res. 2012, 32, 3045–3054. [Google Scholar] [PubMed]
  7. Zhu, H.; Shen, Z.; Luo, H.; Zhang, W.; Zhu, X. Chlamydia trachomatis infection-associated risk of cervical cancer: A meta-analysis. Medicine 2016, 95, e3077. [Google Scholar] [CrossRef] [PubMed]
  8. Kyrgiou, M.; Mitra, A.; Moscicki, A.-B. Does the vaginal microbiota play a role in the development of cervical cancer? Transl. Res. 2017, 179, 168–182. [Google Scholar] [CrossRef] [Green Version]
  9. King, C.C.; Jamieson, D.J.; Wiener, J.; Cu-Uvin, S.; Klein, R.S.; Rompalo, A.M.; Shah, K.V.; Sobel, J.D. Bacterial vaginosis and the natural history of human papillomavirus. Infect. Dis. Obstetr. Gynecol. 2011, 2011, 319460. [Google Scholar]
  10. Piyathilake, C.J.; Ollberding, N.J.; Kumar, R.; Macaluso, M.; Alvarez, R.D.; Morrow, C.D. Cervical microbiota associated with higher grade cervical intraepithelial neoplasia in women infected with high-risk human papillomaviruses. Cancer Prev. Res. 2016, 9, 357–366. [Google Scholar] [CrossRef] [Green Version]
  11. Oh, H.; Kim, B.-S.; Seo, S.-S.; Kong, J.-S.; Lee, J.-K.; Park, S.-Y.; Hong, K.-M.; Kim, H.-K.; Kim, M. The association of uterine cervical microbiota with an increased risk for cervical intraepithelial neoplasia in Korea. Clin. Microbiol. Infect. 2015, 21, 674.e671–674.e679. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Zhang, C.; Liu, Y.; Gao, W.; Pan, Y.; Gao, Y.; Shen, J.; Xiong, H. The direct and indirect association of cervical microbiota with the risk of cervical intraepithelial neoplasia. Cancer Med. 2018, 7, 2172–2179. [Google Scholar] [CrossRef] [PubMed]
  13. Beck, D.; Foster, J.A. Machine learning techniques accurately classify microbial communities by bacterial vaginosis characteristics. PLoS ONE 2014, 9, e87830. [Google Scholar] [CrossRef] [Green Version]
  14. Li, J.; Zhao, F.; Wang, Y.; Chen, J.; Tao, J.; Tian, G.; Wu, S.; Liu, W.; Cui, Q.; Geng, B. Gut microbiota dysbiosis contributes to the development of hypertension. Microbiome 2017, 5, 1–19. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Armour, C.R.; Nayfach, S.; Pollard, K.S.; Sharpton, T.J. A metagenomic meta-analysis reveals functional signatures of health and disease in the human gut microbiome. MSystems 2019, 4, e00332-18. [Google Scholar] [CrossRef] [Green Version]
  16. Loomba, R.; Seguritan, V.; Li, W.; Long, T.; Klitgord, N.; Bhatt, A.; Dulai, P.S.; Caussy, C.; Bettencourt, R.; Highlander, S.K. Gut microbiome-based metagenomic signature for non-invasive detection of advanced fibrosis in human nonalcoholic fatty liver disease. Cell Metab. 2017, 25, 1054–1062.e1055. [Google Scholar] [CrossRef]
  17. De Seta, F.; Campisciano, G.; Zanotta, N.; Ricci, G.; Comar, M. The vaginal community state types microbiome-immune network as key factor for bacterial vaginosis and aerobic vaginitis. Front. Microbiol. 2019, 10, 2451. [Google Scholar] [CrossRef]
  18. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [Green Version]
  19. Caporaso, J.G.; Lauber, C.L.; Walters, W.A.; Berg-Lyons, D.; Lozupone, C.A.; Turnbaugh, P.J.; Fierer, N.; Knight, R. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc. Natl. Acad. Sci. USA 2011, 108, 4516–4522. [Google Scholar] [CrossRef] [Green Version]
  20. Comeau, A.M.; Douglas, G.M.; Langille, M.G. Microbiome helper: A custom and streamlined workflow for microbiome research. mSystems 2017, 2, e00127-16. [Google Scholar] [CrossRef] [Green Version]
  21. Rognes, T.; Flouri, T.; Nichols, B.; Quince, C.; Mahé, F. VSEARCH: A versatile open source tool for metagenomics. PeerJ 2016, 4, e2584. [Google Scholar] [CrossRef] [PubMed]
  22. Fettweis, J.M.; Serrano, M.G.; Sheth, N.U.; Mayer, C.M.; Glascock, A.L.; Brooks, J.P.; Jefferson, K.K.; Buck, G.A.; Consortium, V.M. Species-level classification of the vaginal microbiome. BMC Genom. 2012, 13, S17. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Lennard, K.; Dabee, S.; Barnabas, S.L.; Havyarimana, E.; Blakney, A.; Jaumdally, S.Z.; Botha, G.; Mkhize, N.N.; Bekker, L.-G.; Lewis, D.A. Microbial composition predicts genital tract inflammation and persistent bacterial vaginosis in South African adolescent females. Infect. Immunity 2018, 86, e00410-17. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. R Foundation for Statistical Computing. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2016. [Google Scholar]
  25. Youngblut, N.D.; Reischer, G.H.; Walters, W.; Schuster, N.; Walzer, C.; Stalder, G.; Ley, R.E.; Farnleitner, A.H. Host diet and evolutionary history explain different aspects of gut microbiome diversity among vertebrate clades. Nat. Commun. 2019, 10, 1–15. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  27. Liaw, A.; Wiener, M. Classification and regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
  28. Ravel, J.; Gajer, P.; Abdo, Z.; Schneider, G.M.; Koenig, S.S.; McCulle, S.L.; Karlebach, S.; Gorle, R.; Russell, J.; Tacket, C.O. Vaginal microbiome of reproductive-age women. Proc. Natl. Acad. Sci. USA 2011, 108, 4680–4687. [Google Scholar] [CrossRef] [Green Version]
  29. Gajer, P.; Brotman, R.; Bai, G.; Sakamoto, J.; Schütte, U.; Zhong, X.; Koenig, S.; Fu, L.; Ma, Z.; Zhou, X. Temporal dynamics of the human vaginal microbiota. Sci. Transl. Med. 2012, 4, 132ra52. [Google Scholar] [CrossRef] [Green Version]
  30. Liu, Y.; Geng, R.; Liu, L.; Jin, X.; Yan, W.; Zhao, F.; Wang, S.; Guo, X.; Ghimire, G.; Wei, Y. Gut microbiota-based algorithms in the prediction of metachronous adenoma in colorectal cancer patients following surgery. Front. Microbiol. 2020, 11, 1106. [Google Scholar] [CrossRef]
  31. Lennard, K.; Dabee, S.; Barnabas, S.L.; Havyarimana, E.; Blakney, A.; Jaumdally, S.Z.; Botha, G.; Mkhize, N.N.; Bekker, L.-G.; Lewis, D.A. Vaginal microbiota varies by geographical location in South African women. arXiv 2019, arXiv:1905.11946. [Google Scholar]
  32. Schloss, P.D.; Jenior, M.L.; Koumpouras, C.C.; Westcott, S.L.; Highlander, S.K. Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system. PeerJ 2016, 4, e1869. [Google Scholar] [CrossRef] [Green Version]
  33. Zhang, H.; Lu, J.; Lu, Y.; Cai, Q.; Liu, H.; Xu, C. Cervical microbiome is altered in cervical intraepithelial neoplasia after loop electrosurgical excision procedure in china. Sci. Rep. 2018, 8, 1–8. [Google Scholar] [CrossRef] [PubMed]
  34. MacIntyre, D.A.; Chandiramani, M.; Lee, Y.S.; Kindinger, L.; Smith, A.; Angelopoulos, N.; Lehne, B.; Arulkumaran, S.; Brown, R.; Teoh, T.G. The vaginal microbiome during pregnancy and the postpartum period in a European population. Sci. Rep. 2015, 5, 8988. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Matsumoto, K.; Oki, A.; Furuta, R.; Maeda, H.; Yasugi, T.; Takatsuka, N.; Mitsuhashi, A.; Fujii, T.; Hirai, Y.; Iwasaka, T. Predicting the progression of cervical precursor lesions by human papillomavirus genotyping: A prospective cohort study. Int. J. Cancer 2011, 128, 2898–2910. [Google Scholar] [CrossRef] [PubMed]
  36. Gil, N.F.; Martinez, R.C.; Gomes, B.C.; Nomizo, A.; De Martinis, E.C. Vaginal lactobacilli as potential probiotics against Candida spp. Braz. J. Microbiol. 2010, 41, 6–14. [Google Scholar] [CrossRef] [Green Version]
  37. Marrazzo, J.M.; Thomas, K.K.; Fiedler, T.L.; Ringwood, K.; Fredricks, D.N. Relationship of specific vaginal bacteria and bacterial vaginosis treatment failure in women who have sex with women. Ann. Intern. Med. 2008, 149, 20–28. [Google Scholar] [CrossRef] [Green Version]
  38. Verhelst, R.; Verstraelen, H.; Claeys, G.; Verschraegen, G.; Delanghe, J.; Van Simaey, L.; De Ganck, C.; Temmerman, M.; Vaneechoutte, M. Cloning of 16S rRNA genes amplified from normal and disturbed vaginal microflora suggests a strong association between Atopobium vaginae, Gardnerella vaginalis and bacterial vaginosis. BMC Microbiol. 2004, 4, 16. [Google Scholar] [CrossRef] [Green Version]
  39. Boris, S.; Suárez, J.E.; Vázquez, F.; Barbés, C. Adherence of human vaginal lactobacilli to vaginal epithelial cells and interaction with uropathogens. Infect. Immunity 1998, 66, 1985–1989. [Google Scholar] [CrossRef] [Green Version]
  40. Vásquez, A.; Jakobsson, T.; Ahrné, S.; Forsum, U.; Molin, G. Vaginal Lactobacillus flora of healthy Swedish women. J. Clin. Microbiol. 2002, 40, 2746–2749. [Google Scholar] [CrossRef] [Green Version]
  41. Yang, X.; Da, M.; Zhang, W.; Qi, Q.; Zhang, C.; Han, S. Role of Lactobacillus in cervical cancer. Cancer Manag. Res. 2018, 10, 1219. [Google Scholar] [CrossRef] [Green Version]
  42. Kaambo, E.; Africa, C.; Chambuso, R.; Passmore, J.-A.S. Vaginal microbiomes associated with aerobic vaginitis and bacterial vaginosis. Front. Public Health 2018, 6, 78. [Google Scholar] [CrossRef] [PubMed]
  43. Tao, Z.; Zhang, L.; Zhang, Q.; Lv, T.; Chen, R.; Wang, L.; Huang, Z.; Hu, L.; Liao, Q. The Pathogenesis Of Streptococcus anginosus In Aerobic Vaginitis. Infect. Drug Resist. 2019, 12, 3745. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Masood, U.; Sharma, A.; Lowe, D.; Khan, R.; Manocha, D. Colorectal cancer associated with streptococcus anginosus bacteremia and liver abscesses. Case Rep. Gastroenterol. 2016, 10, 769–774. [Google Scholar] [CrossRef] [PubMed]
  45. Hui, M. Streptococcus anginosus bacteremia: Sutton’s law. J. Clin. Microbiol. 2005, 43, 6217. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Sasaki, H.; Ishizuka, T.; Muto, M.; Nezu, M.; Nakanishi, Y.; Inagaki, Y.; Watanabe, H.; Watanabe, H.; Terada, M. Presence of Streptococcus anginosus DNA in esophageal cancer, dysplasia of esophagus, and gastric cancer. Cancer Res. 1998, 58, 2991–2995. [Google Scholar] [PubMed]
  47. Sasaki, M.; Yamaura, C.; Ohara-Nemoto, Y.; Tajika, S.; Kodama, Y.; Ohya, T.; Harada, R.; Kimura, S. Streptococcus anginosus infection in oral cancer and its infection route. Oral Dis. 2005, 11, 151–156. [Google Scholar] [CrossRef]
  48. Kelly, H.; Benavente, Y.; Pavon, M.A.; De Sanjose, S.; Mayaud, P.; Lorincz, A.T. Performance of DNA methylation assays for detection of high-grade cervical intraepithelial neoplasia (CIN2+): A systematic review and meta-analysis. Br. J. Cancer 2019, 121, 954–965. [Google Scholar] [CrossRef]
  49. Uleberg, K.-E.; Øvestad, I.T.; Munk, A.C.; Brede, C.; Diermen, B.v.; Gudlaugsson, E.; Janssen, E.A.; Hjelle, A.; Baak, J. Prediction of spontaneous regression of cervical intraepithelial neoplasia lesions grades 2 and 3 by proteomic analysis. Int. J. Proteomics 2014, 2014, 129064. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Beck, D.; Foster, J.A. Machine learning classifiers provide insight into the relationship between microbial communities and bacterial vaginosis. BioData Min. 2015, 8, 23. [Google Scholar] [CrossRef] [Green Version]
  51. Perrotta, A.R.; Borrelli, G.M.; Martins, C.O.; Kallas, E.G.; Sanabani, S.S.; Griffith, L.G.; Alm, E.J.; Abrao, M.S. The Vaginal Microbiome as a Tool to Predict rASRM Stage of Disease in Endometriosis: A Pilot Study. Reproductive Sciences 2020, 27, 1064–1073. [Google Scholar] [CrossRef]
Figure 1. Phylogenetic trees of the vaginal microbiome network by metadata. From outside to inside, the stacked bar chart represents normalized relative abundances of community state type (CST)-related bacterial species; the heatmap represents each clinical stage; the bar charts were enriched in each HPV status, and the trees were grouped according to the phylum level.
Figure 1. Phylogenetic trees of the vaginal microbiome network by metadata. From outside to inside, the stacked bar chart represents normalized relative abundances of community state type (CST)-related bacterial species; the heatmap represents each clinical stage; the bar charts were enriched in each HPV status, and the trees were grouped according to the phylum level.
Diagnostics 10 01013 g001
Figure 2. Vaginal microbiome diversity in each CST. (A) Alpha diversity including Shannon index and observed operational taxonomic units (OTUs) at the species level. (B) Non-metric multidimensional scaling (NMDS) ordination plots based on Bray–Curtis. The microbial communities of each CST type were significantly (Adonis, p < 0.001) different from each other. *, p < 0.05; **, p < 0.01; ***, p < 0.001.
Figure 2. Vaginal microbiome diversity in each CST. (A) Alpha diversity including Shannon index and observed operational taxonomic units (OTUs) at the species level. (B) Non-metric multidimensional scaling (NMDS) ordination plots based on Bray–Curtis. The microbial communities of each CST type were significantly (Adonis, p < 0.001) different from each other. *, p < 0.05; **, p < 0.01; ***, p < 0.001.
Diagnostics 10 01013 g002
Figure 3. Bacterial differences between CIN stages. The circular phylogenetic tree at the species level is the same as shown in Figure 1. (A) The outer circle represents the enriched bacterial species in each CIN and (B) the bar charts show mean relative abundance of the most significant species in each CIN stage.
Figure 3. Bacterial differences between CIN stages. The circular phylogenetic tree at the species level is the same as shown in Figure 1. (A) The outer circle represents the enriched bacterial species in each CIN and (B) the bar charts show mean relative abundance of the most significant species in each CIN stage.
Diagnostics 10 01013 g003
Figure 4. Random forest classifier model using vaginal microbiome-derived bacterial signatures to predict the severity of CIN. (A) The 33 most important bacterial species based on their mean decrease Gini scores of the optimal random forest model. (B) The 33 bacterial species selected by optimizing the area under the curve (AUC) of the receiver operating characteristics (ROC) curve. (C) The AUC of vaginal microbiome-based classification. Random forest classifiers were used to predict CIN 1− and CIN 2+ based on species-level vaginal microbiome composition. The green area of the curve represents the 95% confidence interval (CI) shape.
Figure 4. Random forest classifier model using vaginal microbiome-derived bacterial signatures to predict the severity of CIN. (A) The 33 most important bacterial species based on their mean decrease Gini scores of the optimal random forest model. (B) The 33 bacterial species selected by optimizing the area under the curve (AUC) of the receiver operating characteristics (ROC) curve. (C) The AUC of vaginal microbiome-based classification. Random forest classifiers were used to predict CIN 1− and CIN 2+ based on species-level vaginal microbiome composition. The green area of the curve represents the 95% confidence interval (CI) shape.
Diagnostics 10 01013 g004
Table 1. Participants’ characteristics.
Table 1. Participants’ characteristics.
VariablesTotal (n = 66)CIN 1− (n = 24)CIN 2+ (n = 42)p Value
Age (years)45.1 ± 11.749.2 ± 7.342.7 ± 13.20.0313
Menopause (n, %)20 (30.3)9 (37.5)11 (26.2)0.3399
Marriage (n, %)58 (87.9)22 (91.7)36 (85.7)0.4794
Parity (n)1.7 ± 1.01.8 ± 0.91.6 ± 1.10.4169
Smoker (n, %)4 (6.1)1 (4.8)3 (12.0)0.6139
Contraceptive use (n, %)16 (24.6)7 (29.2)9 (22.0)0.5178
Human papillomavirus (HPV) positive (n, %)48 (72.7)7 (29.2)41 (97.6)<0.0001
HPV16/18 positive (n, %)24 (36.4)2 (8.3)22 (52.4)0.0004
Table 2. CSTs according to the severity of cervical intraepithelial neoplasia (CIN).
Table 2. CSTs according to the severity of cervical intraepithelial neoplasia (CIN).
CST TypeTotal (n = 66)CIN 1− (n = 24)CIN 2+ (n = 42)p Value
CST3 (n, %)24 (36.4)9 (37.5)15 (35.7)0.8855
CST 4-A (n, %)14 (21.2)4 (16.7)10 (23.8)0.4980
CST 4-B (n, %)17 (25.8)8 (33.3)9 (21.4)0.2911
CST* (n, %)11 (16.7)3 (12.5)8 (19.0)0.4956
CIN = Cervical intraepithelial neoplasia; CST = Community state type; CST3: Primarily composed of Lactobacillus iners; CST 4-A; aerobic bacteria such as Streptococcus dominated; CST 4-B: Gardnerella dominated; CST*: Lactobacillus amylovorus dominant type.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Lee, Y.H.; Kang, G.-U.; Jeon, S.Y.; Tagele, S.B.; Pham, H.Q.; Kim, M.-S.; Ahmad, S.; Jung, D.-R.; Park, Y.-J.; Han, H.S.; et al. Vaginal Microbiome-Based Bacterial Signatures for Predicting the Severity of Cervical Intraepithelial Neoplasia. Diagnostics 2020, 10, 1013. https://doi.org/10.3390/diagnostics10121013

AMA Style

Lee YH, Kang G-U, Jeon SY, Tagele SB, Pham HQ, Kim M-S, Ahmad S, Jung D-R, Park Y-J, Han HS, et al. Vaginal Microbiome-Based Bacterial Signatures for Predicting the Severity of Cervical Intraepithelial Neoplasia. Diagnostics. 2020; 10(12):1013. https://doi.org/10.3390/diagnostics10121013

Chicago/Turabian Style

Lee, Yoon Hee, Gi-Ung Kang, Se Young Jeon, Setu Bazie Tagele, Huy Quang Pham, Min-Sueng Kim, Sajjad Ahmad, Da-Ryung Jung, Yeong-Jun Park, Hyung Soo Han, and et al. 2020. "Vaginal Microbiome-Based Bacterial Signatures for Predicting the Severity of Cervical Intraepithelial Neoplasia" Diagnostics 10, no. 12: 1013. https://doi.org/10.3390/diagnostics10121013

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop