Quantifying In-Host Quasispecies Evolution
Abstract
:1. Introduction
2. Results
2.1. Simulated Pairs of Quasispecies
2.1.1. Correlations
- and : At high values only high values of may occur, on the other hand at very low values almost all values are possible. This is consistent with the definition of both indices.
- and : are highly correlated and seemingly do not convey significant additional information.
- and : At low values, takes high values, but at high values, spans the highest range of values. The lower values correspond to high values of .
- and : At high values, takes the lower values, but at low values, spans a high range of values.
2.1.2. Illustration of Selected Pairs
2.2. Simulated Treatment
2.3. A Clinical Case
3. Discussion
4. Materials and Methods
4.1. Data
4.1.1. Simulation of Paired Quasispecies
- Distribution pattern: 20,000 random occurrences of a geometric distribution, with parameter , are generated, simulating 20,000 reads of over 35 haplotypes. The frequencies of this distribution are used as pattern distribution on which to apply random selection criteria of frequencies.
- Select frequencies for quasispecies A: From the above pattern distribution, 12 frequencies are randomly selected to represent the composition of quasispecies A.
- Select frequencies for quasispecies B: From a new pattern distribution generated with the same parameters as above, randomly select 12 frequencies to represent the composition of quasispecies B.
- Confront both simulated quasispecies: The two quasispecies are composed together of 20 haplotypes, some common to both quasispecies, some unique to either one. Assign randomly the 12 frequencies of quasispecies A among the 20, and do the same with the 12 frequencies of quasispecies B. Remove from the 20 any haplotype not populated (0 reads in both quasispecies).
4.1.2. Simulation of a Viral Treatment Follow-Up
- The dominant haplotype, initially at a frequency of 99.9% evolving at a pace of a constant uniformly distributed between 0.85 and 1.05, at each evolution step.
- A minoritary haplotype initially at (0.1/39)%, and evolving at a pace of a constant uniformly distributed between 0.95 and 1.25, at each evolution step.
- The remaining 38 haplotypes, initially at (0.1/39)%, and evolving at a pace of a constant uniformly distribution between 0.8 and 2.5. Only a random number of these, between 2 and 10, are submitted to evolution at each step. The remaining are left as they were.
4.1.3. A Clinical HEV Case
4.2. Methods
4.2.1. Similarity between Distributions
- Commons: As the fraction of reads belonging to haplotypes populated in both quasispecies.
- Overlap: As the sum of the minimum proportion of common haplotypes.
- Yue–Clayton: This index takes fuller account of all proportion information, considering the proportions of both common and unique haplotypes. [11]
4.2.2. Genetic Distance between Quasispecies
4.2.3. Quasispecies Fitness Partition (QFP)
- Master: the fraction of molecules belonging to the most frequent haplotype; that is, the one present at the highest percentage ().
- Emerging: the fraction of molecules present at a frequency greater then and less than the master percentage, belonging to haplotypes that are potentially able to compete with the predominant one and possibly replace it ().
- Low fitness: the fraction of molecules present at frequencies from to , belonging to haplotypes that have a low probability of progressing to higher frequencies ().
- Very low fitness: the fraction of molecules present at frequencies below , belonging to haplotypes with very low fitness and to defective genomes. The likely fate of these molecules individually is degradation, but the fraction is continuously fed with new very low fitness genomes produced by replication errors or by host editing activities ().
4.3. Software and Statistics
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
HNP | Hill Numbers Profile |
MDS | Multidimensional scaling |
NGS | Next Generation Sequencing |
QFF | Quasispecies fitness fractions |
QFP | Quasispecies fitness partition |
RHL | Rare haplotype Load |
References
- Domingo, E.; Parrish, C.R.; Holland, J.J. (Eds.) Origin and Evolution of Viruses, 2nd ed.; Academic Press, Elsevier: London, UK, 2008. [Google Scholar]
- Nei, M. Molecular Evolutionary Genetics, 1st ed.; Columbia University Press: New York, NY, USA, 1987; pp. 276–279. [Google Scholar]
- Gregori, J.; Perales, C.; Rodriguez-Frias, F.; Esteban, J.I.; Quer, J.; Domingo, E. Viral Quasispecies Complexity Measures. Virology 2016, 493, 227–237. [Google Scholar] [CrossRef] [PubMed]
- Cha, S.-H. Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions. Int. J. Math. Model. Methods Appl. Sci. 2007, 1, 300–307. [Google Scholar]
- Gregori, J.; Colomer-Castell, S.; Campos, C.; Ibañez-Lligoña, M.; Garcia-Cehic, D.; Rando-Segura, A.; Adombie, C.M.; Pintó, R.; Guix, S.; Bosch, A.; et al. Quasispecies fitness partition to characterize the molecular status of a viral population. Negative effect of early ribavirin discontinuation in a chronically infected HEV patient. Int. J. Mol. Sci. 2022, 23, 14654. [Google Scholar] [CrossRef] [PubMed]
- Gallego, I.; Gregori, J.; Soria, M.E.; García-Crespo, C.; García-Álvarez, M.; Gómez-González, A.; Valiergue, R.; Gómez, J.; Esteban, J.I.; Quer, J.; et al. Resistance of high fitness hepatitis C virus to lethal mutagenesis. Virology 2018, 523, 100–109. [Google Scholar] [CrossRef] [PubMed]
- Domingo, E. Virus as Populations: Composition, Complexity, Dynamics, and Biological. In Virus as Populations, 1st ed.; Academic Press, Elsevier: London, UK, 2016; pp. 1–412. [Google Scholar]
- Galaxy Community Hub. Available online: https://galaxyproject.org/ (accessed on 7 January 2023).
- Soria, M.E.; Gregori, J.; Chen, Q.; García-Cehic, D.; Llorens, M.; de Ávila, A.I.; Beach, N.M.; Domingo, E.; Rodríguez-Frías, F.; Buti, M.; et al. Pipeline for specific subtype amplification and drug resistance detection in hepatitis C virus. BMC Infect. Dis. 2018, 18, 446. [Google Scholar] [CrossRef] [PubMed]
- Eliseev, A.; Gibson, K.M.; Avdeyev, P.; Novik, D.; Bendall, M.L.; Pérez-Losada, M.; Alexeev, N.; Crandall, K.A. Evaluation of haplotype callers for next-generation sequencing of viruses. Infect. Genet. Evol. 2020, 82, 104277. [Google Scholar] [CrossRef] [PubMed]
- Yue, J.C.; Clayton, M.K. A Similarity Measure Based on Species Proportions. Commun. Stat. Theory Methods 2005, 34, 2123–2131. [Google Scholar] [CrossRef] [Green Version]
- R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2020; Available online: https://www.R-project.org/ (accessed on 7 January 2023).
- Paradis, E.; Schliep, K. ape 5.0: An environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 2019, 854, 526–528. [Google Scholar] [CrossRef] [PubMed]
- Wickham, H. Welcome to Master the Tidyverse. J. Open Source Softw. 2019, 4, 1686. [Google Scholar] [CrossRef] [Green Version]
- Wickham, H. ggplot2: Elegant Graphics for Data Analysis; Springer: New York, NY, USA, 2016; Available online: https://ggplot2.tidyverse.org (accessed on 7 January 2023).
A | 2000 | 1400 | 50.06 | 38.36 |
B | 600 | 800 | 15.02 | 21.92 |
C | 400 | 310 | 10.01 | 8.49 |
D | 100 | 0 | 2.5 | 0 |
E | 0 | 70 | 0 | 1.92 |
F | 0 | 30 | 0 | 0.82 |
G | 15 | 0 | 0.38 | 0 |
H | 600 | 750 | 15.02 | 20.55 |
I | 200 | 220 | 5.01 | 6.03 |
J | 50 | 0 | 1.25 | 0 |
K | 0 | 70 | 0 | 1.92 |
L | 30 | 0 | 0.75 | 0 |
Min. | 0.00070 | 0.01075 | 0.00000 |
1stQ | 0.04237 | 0.46139 | 0.00994 |
Median | 0.10081 | 0.61189 | 0.03628 |
Mean | 0.14544 | 0.60412 | 0.09800 |
3rdQ | 0.20982 | 0.76187 | 0.12192 |
Max. | 0.84055 | 1.00000 | 0.97111 |
Over 0.50 | 245 | 6944 | 336 |
Over 0.75 | 10 | 2698 | 62 |
Over 0.90 | 0 | 692 | 12 |
Min. | 0.0034 | 0.7500 | 0.0005 |
1stQ | 0.0886 | 0.7923 | 0.0336 |
Median | 0.1891 | 0.8395 | 0.1049 |
Mean | 0.2289 | 0.8501 | 0.1863 |
3rdQ | 0.3395 | 0.9022 | 0.2737 |
Max. | 0.8405 | 1.0000 | 0.9711 |
1.0000 | 0.4616 | 0.4256 | −0.3442 | |
0.4616 | 1.0000 | 0.9372 | −0.7961 | |
0.4256 | 0.9372 | 1.0000 | −0.8011 | |
−0.3442 | −0.7961 | −0.8011 | 1.0000 |
Suppl. Figure | |||||||
---|---|---|---|---|---|---|---|
4213 | 16 | 8 | 0.9420 | 0.7477 | 0.8900 | 0.0180 | S6 |
7426 | 15 | 9 | 0.9899 | 0.7232 | 0.8310 | 0.0711 | S7 |
774 | 17 | 7 | 0.8611 | 0.0137 | 0.0071 | 0.7719 | S8 |
5463 | 18 | 6 | 0.8521 | 0.0093 | 0.0039 | 0.6835 | S9 |
3053 | 17 | 7 | 0.5741 | 0.2250 | 0.1789 | 0.4961 | S10 |
5955 | 17 | 7 | 0.6312 | 0.1149 | 0.0417 | 0.5403 | S11 |
1159 | 18 | 6 | 0.7983 | 0.5691 | 0.6454 | 0.1483 | S12 |
2528 | 18 | 6 | 0.7946 | 0.6503 | 0.8052 | 0.0743 | S13 |
345 | 16 | 8 | 0.7308 | 0.4823 | 0.6668 | 0.1199 | S14 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Gregori, J.; Ibañez-Lligoña, M.; Quer, J. Quantifying In-Host Quasispecies Evolution. Int. J. Mol. Sci. 2023, 24, 1301. https://doi.org/10.3390/ijms24021301
Gregori J, Ibañez-Lligoña M, Quer J. Quantifying In-Host Quasispecies Evolution. International Journal of Molecular Sciences. 2023; 24(2):1301. https://doi.org/10.3390/ijms24021301
Chicago/Turabian StyleGregori, Josep, Marta Ibañez-Lligoña, and Josep Quer. 2023. "Quantifying In-Host Quasispecies Evolution" International Journal of Molecular Sciences 24, no. 2: 1301. https://doi.org/10.3390/ijms24021301
APA StyleGregori, J., Ibañez-Lligoña, M., & Quer, J. (2023). Quantifying In-Host Quasispecies Evolution. International Journal of Molecular Sciences, 24(2), 1301. https://doi.org/10.3390/ijms24021301