Next Article in Journal
Pasteurized Akkermansia muciniphila Ameliorates Preeclampsia in Mice by Enhancing Gut Barrier Integrity, Improving Endothelial Function, and Modulating Gut Metabolic Dysregulation
Previous Article in Journal
The Killer Saccharomyces cerevisiae Toxin: From Origin to Biomedical Research
Previous Article in Special Issue
Genomic Characterization of Bacillus sp. THPS1: A Hot Spring-Derived Species with Functional Features and Biotechnological Potential
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Unlocking the Potential of Metagenomics with the PacBio High-Fidelity Sequencing Technology

by
Yanhua Han
1,2,†,
Jinling He
1,2,†,
Minghui Li
1,2,
Yunjuan Peng
3,
Hui Jiang
1,2,
Jiangchao Zhao
3,
Ying Li
1,2 and
Feilong Deng
1,2,*
1
Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding, College of Life Science and Engineering, Foshan University, Foshan 528225, China
2
School of Life Science and Engineering, Foshan University, Foshan 528225, China
3
College of Animal Science, South China Agricultural University, Guangzhou 510642, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Microorganisms 2024, 12(12), 2482; https://doi.org/10.3390/microorganisms12122482
Submission received: 4 November 2024 / Revised: 28 November 2024 / Accepted: 29 November 2024 / Published: 2 December 2024

Abstract

:
Traditional methods for studying microbial communities have been limited due to difficulties in culturing and sequencing all microbial species. Recent advances in third-generation sequencing technologies, particularly PacBio’s high-fidelity (HiFi) sequencing, have significantly advanced metagenomics by providing accurate long-read sequences. This review explores the role of HiFi sequencing in overcoming the limitations of previous sequencing methods, including high error rates and fragmented assemblies. We discuss the benefits and applications of HiFi sequencing across various environments, such as the human gut and soil, which provides broader context for further exploration. Key studies are discussed to highlight HiFi sequencing’s ability to recover complete and coherent microbial genomes from complex microbiomes, showcasing its superior accuracy and continuity compared to other sequencing technologies. Additionally, we explore the potential applications of HiFi sequencing in quantitative microbial analysis, as well as the detection of single nucleotide variations (SNVs) and structural variations (SVs). PacBio HiFi sequencing is establishing a new benchmark in metagenomics, with the potential to significantly enhance our understanding of microbial ecology and drive forward advancements in both environmental and clinical applications.

1. Introduction

Traditional microbial analysis methods primarily rely on culturing techniques. However, they fail to capture the full diversity of microbial communities, a limitation often referred to as the “great plate count anomaly”. This phenomenon highlights that the vast majority of microbial species remain uncultured and uncharacterized [1]. In recent years, advancements in molecular techniques, such as high-throughput sequencing and metagenomics, have begun to bridge this gap by enabling researchers to analyze microbial diversity directly from environmental samples without the need for cultivation [2]. Studies have demonstrated that employing a combination of culture-dependent and culture-independent methods can yield a more comprehensive understanding of microbial communities [3], as exemplified in the analysis of sediments from geothermal springs [4]. The metagenomic method has provided unprecedented insights into microbial diversity, ecological interactions, and metabolic functions across a wide range of environments [5,6].
To date, metagenomic studies relying primarily on second-generation short-read sequencing (SRS) technologies have been limited by high error rates and difficulties in accurately reconstructing complex microbial communities. As a result, researchers have struggled with issues related to assembly quality and the recovery of metagenome-assembled genomes (MAGs), which are crucial for studying microbial ecology and function. Recent advancement in sequencing technologies, particularly the emergence of long-read sequencing (LRS), have begun to address these challenges [7,8]. LRS offers the ability to generate longer contiguous sequences, which significantly enhances the assembly of complex metagenomes. This shift has allowed for better characterization of microbial diversity and improved taxonomic classification, ultimately leading to a more comprehensive understanding of microbial communities in various environments [9,10,11]. For example, hybrid sequencing approaches that combine short-read and long-read technologies have shown promise in optimizing the strengths of each method [7,12]. As the field progresses, the integration of these advanced sequencing technologies will ultimately bring about changes in metagenomic research, enabling more accurate and detailed investigations into the complexities of microbial life to become a reality.
This review examines the transformative impact of third-generation sequencing technologies on metagenomics, including Oxford Nanopore Technologies (ONT) and PacBio. We highlight their application across diverse environments, such as the human gut, soil, and other complex ecosystems, demonstrating their enhanced ability to recover microbial genomes, detect structural and single nucleotide variations, and improve quantitative microbial analysis. By showcasing key studies and discussing potential future applications, this review emphasizes the critical role of long reads sequencing in advancing microbial ecology research and its implications for both environmental and clinical studies.

2. The Role and Applications of Metagenomics

Metagenomics, which involves sequencing genetic material directly from environmental samples without culturing, has revolutionized our understanding of microbial diversity, function, and ecology. The field’s growth has been driven by advances in molecular biology, particularly 16S rRNA gene sequencing and next-generation sequencing (NGS). Initially applied to soil and water, metagenomics has expanded to extreme environments and clinical studies, such as the Human Microbiome Project, which highlighted its potential in health research [13]. It has led to the discovery of novel microorganisms, including bacteria, viruses, archaea, and other previously unknown organisms, expanding traditional microbial taxonomy and the tree of life. By bypassing traditional methods, it has uncovered the vast majority of unculturable microorganisms and revealed new metabolic pathways and bioactive compounds [14], as well as the microbial “dark matter” present in diverse environments across the globe [15,16,17] (Figure 1). Pavlopoulos et al. [18] revealed an immense global diversity of previously uncharacterized proteins in global metagenomes by generating reference-free protein families, identifying over 106,000 novel protein clusters with no similarity to known sequences, thereby doubling the number of known protein families and highlighting vastly untapped functional and structural diversity within microbial “dark matter”. Yan et al. [19] established a comprehensive global rumen virome database (RVD), identifying 397,180 viral operational taxonomic units (vOTUs) from 975 rumen metagenomes, revealing the previously unexplored viral “dark matter” of the rumen.
Metagenomic sequencing technology has significantly improved methods for assembling microbial genomes, particularly for uncultured microorganisms [20]. These advancements enable researchers to extract and reconstruct microbial genomes from complex communities, providing deeper insights into their diversity and ecological functions. Parks et al. [21] reconstructed 7903 bacterial and archaeal genomes from over 1500 metagenomes, expanding phylogenetic diversity by over 30%, including the first representatives of 17 bacterial and three archaeal candidate phyla. Nishimura et al. reconstructed 52,325 genomes from 2057 marine metagenomes across diverse environments, revealing 6256 novel species and expanding the phylogenetic diversity of marine prokaryotes by 34.2%. The human gut microbiota has been extensively studied through large-scale cultivation over the years; however, many uncultured microorganisms remain [22,23]. Almeida et al. [24] identifies 1952 uncultured candidate bacterial species by reconstructing 92,143 MAGs from 11,850 human gut microbiomes, expanding the phylogenetic diversity of the human gut microbiota by 281%. For fields with limited foundational research, such as gut microbiota of nonmodel organisms, metagenomics provides a rapid and cost-effective approach to constructing reference genomes of microbes. Moreover, MAGs are highly useful for exploring microbial gene functions in specific environments and identifying candidate microorganisms with targeted functions. For example, Yang et al. [25] used metagenome assembly to mine candidate microbes with strong fiber-degrading capabilities from the gut of the Lantang pig (a Chinese indigenous breed), identifying several gut microorganisms with potential fiber-degrading abilities. Simultaneously, metagenomic sequencing has improved and guided traditional culturing methods, offering a more comprehensive understanding of the complete microbial diversity within a sample [26].
Metagenomic sequencing technology provides significantly higher resolution for quantifying microbial abundance and gene expression compared to 16S rRNA sequencing, making it a critical tool for advancing research on microbial functions and their associations with diseases [27,28]. This enhanced resolution enables a more detailed investigation of microbial communities, revealing not only their taxonomic composition but also their metabolic capabilities and interactions within various ecosystems, such as the human gut, aquatic, and anaerobic environments [29]. For instance, metagenomic studies have illuminated the complex symbiotic relationships between humans and their microbiota, highlighting the beneficial roles of certain microbes in promoting health, as well as the involvement of pathogens in disease [30,31], and have linked host genetics to gut microbiome composition in animals [32]. Additionally, this approach facilitates the discovery of novel genes and metabolic pathways that may be undetectable through traditional culturing methods, further expanding our understanding of microbial diversity and functional potential [29] (Figure 2). Li et al. [33] employed a HiSeq-PacBio hybrid metagenomic method to study the metabolic potential of microorganisms in mangrove wetland sediments comprehensively, leading to the discovery and characterization of the metabolic traits of a new bacterial phylum.
With advancements in sequencing technologies and bioinformatics, metagenomics has significantly improved strain-level resolution, enhancing the accuracy of source tracking and advancing the study of horizontal gene transfer (HGT), particularly in the spread of antibiotic resistance genes (ARGs) [34,35]. Peng et al. analyzed the colonization and expression of probiotic strains in the gut and rumen microbiomes of various hosts using metagenomic and meta-transcriptomic data, revealing that probiotic expression frequently exceeds genomic abundance, while also highlighting strain-level differences in both abundance and expression [6]. High-resolution strain analysis enables researchers to more precisely identify contaminants in clinical and environmental samples, revealing the critical role of plasmids in the horizontal transfer of resistance genes [36]. In studies of wastewater treatment plants, although gene exchange between different bacterial populations is limited, certain resistance genes exhibit a high transfer potential, impacting microbial community structure and posing potential health risks [37]. Katariina et al. [38] used a metagenomic approach at the species and strain level to elucidate the origins of ARGs in infants, demonstrating that infants inherit the effects of maternal antibiotic usage through gene transmission from mother to infant. Recently, numerous bioinformatic tools, such as Strainy [39], inStrain [40], and Strainphlan4 [41], have been developed to enhance the exploration of metagenomic sequencing technologies with strain-level resolution.

3. Current Challenges in Metagenomics

Despite its potential, metagenomics faces several challenges that need to be addressed, with the complexity of metagenomic samples being a major obstacle. Homologous genes from different species or genera, arising through evolutionary processes or frequent HGT [42], pose challenges to current sequencing technologies and bioinformatic analysis methods [43]. Furthermore, metagenomic studies are often hindered by issues related to sequence assembly and binning, particularly when dealing with highly diverse microbial populations. Assembly methods, such as De Bruijn graph-based assemblers like MegaHit [44] and metaSPAdes [45], have improved the ability to reconstruct genomes from mixed microbial communities, but challenges remain due to uneven genome coverage and the presence of homologous regions among different species.
In addition, HGT, a common phenomenon in microbial communities, exacerbates the difficulties in distinguishing between closely related species and genera. The exchange of genetic material across species boundaries can blur the taxonomic lines, complicating both sequence assembly and functional annotation [46]. As a result, the interpretation of metagenomic data often requires advanced bioinformatics tools that can account for these evolutionary processes; yet, even the most sophisticated algorithms may struggle to capture the full scope of microbial diversity and function in complex environments. Due to the prevalence of homologous genes and HGT, a major challenge in metagenomic sequencing and analysis is the inability of SRS technologies to accurately resolve identical or highly similar genomic regions [47]. Third-generation sequencing technologies, such as PacBio RS and Nanopore’s ONT, which are based on single-molecule sequencing, hold significant potential to conquer these limitations by providing longer read lengths and improved resolution of complex genomic regions [48,49].
In addition, the bioinformatics analyses of metagenomic data are computationally intensive, requiring specialized expertise and efficient algorithms for denoising, assembly, annotation, and functional prediction. Accurate species and gene identification also relies on a deep understanding of microbial diversity and ecological functions. Additionally, despite reduced sequencing costs, expenses remain a barrier for large-scale studies, particularly those requiring deep sequencing or extensive sampling, often exceeding available research funding and limiting study scope.

4. Advantages of Long-Read Sequencing

LRS is an advanced genomic sequencing technology capable of generating reads spanning thousands of base pairs, offering significant advantages over SRS in detecting certain types of genetic variation [50]. While the general workflow of LRS shares similarities with NGS platforms, it stands out due to the length of reads it generates and the sequencing mechanisms it employs. The workflow begins with the extraction and purification of DNA, followed by library construction, where long DNA fragments are ligated with adapters to form a sequencing library [51]. Unlike SRS, LRS allows for sequencing without extensive fragmentation of the long strands, enabling the generation of long contiguous sequences. The sequencing process differs depending on the platform; for example, Pacific Biosciences uses real-time synthesis detection, while ONT leverages electrical current changes to detect individual bases passing through nanopores [52,53].
Recent advancements in LRS technology have greatly enhanced its read lengths. PacBio’s HiFi sequencing, for instance, produces long reads up to 10–25 kb [54], making it ideal for resolving complex genomic regions, especially for detecting structural variants and regions that are difficult to map (Figure 3). ONT can generate ultra-long reads exceeding 2 Mb [55], offering real-time sequencing capabilities that are particularly advantageous for rapid genomic analysis. Initially, LRS faced challenges with lower accuracy compared to SRS; however, both PacBio and ONT have made substantial improvements in sequencing accuracy. For instance, the PacBio sequencing platform, through its HiFi sequencing approach, has improved the single-base accuracy of circular consensus sequencing (CCS) reads to 99.5% [54,56]. Recent reports indicate that the Nanopore ONT sequencing platform, using the latest R10.4.1 chemistry and updated flow cells, can reduce the single-base error rate to as low as 1% [57]. LRS has become essential in genomic research, enabling highly contiguous genome assemblies, particularly for complex or repetitive genomes [54,58], and excelling in the detection of structural variations [59].
In the context of microbial and metagenomic studies, LRS largely provides advantages in resolving extremely complex microbial communities, benefiting from the length of contiguous sequencing reads. It facilitates the assembly of complete microbial genomes from environmental samples, providing more accurate species-level, even strain-level, resolution. This is particularly valuable for studying unculturable organisms and complex microbiomes in environmental settings. ONT sequencing methods have been confirmed to provide more continuity and integrity contigs while being used for MAGs assembly from complex microbial environments. Caitlin et al. [60] assembled 1083 high-quality MAGs, including 57 closed circular genomes, from the wastewater industry by ONT long reads. Liu et al. [61] sequenced the complex activated sludge microbiome and successfully reconstructed 275 MAGs with median completeness of approximately 90%. Similarity, Huang et al. [62] utilized ONT-based metagenomic data to assemble bacterial genomes from the giant panda gut, achieving higher assembly quality and obtaining 40 completely closed bacterial genomes. However, researchers have raised concerns about the high sequencing error rate (~10% [48]) associated with ONT, which could introduce errors into the MAGs and significantly impact downstream analyses. Moss et al. [63] assembled 20 complete MAGs (cMAGs) from Oxford Nanopore LRS of 13 human fecal samples, but with low nucleotide accuracy. Alternative approaches include correcting ONT-assembled contigs using Illumina reads or assembling contigs with Illumina reads and subsequently utilizing ONT long reads to extend and enhance the contigs [64,65,66]. The incorporation of high-quality Illumina reads is expected to substantially enhance the accuracy of contig assembly compared to methods relying solely on ONT long reads. Additionally, the supplementary sequencing depth provided by Illumina reads can improve the assembly of microbial genomes present in lower abundance. This approach also offers cost savings by reducing the required sequencing depth of long reads. However, these hybrid strategies cannot entirely mitigate the impact of the high error rate inherent to ONT sequencing. Moss’s study suggests that ONT-based MAG assembly results in decreased nucleotide accuracy compared to methods based on Illumina reads [63]. It is important to note that the existing studies are based on earlier versions of the ONT platform/chemistry, which had lower sequencing accuracy (an average accuracy of 90%) [67]. The 99% accuracy claimed by Nanopore for its newer version has not yet been widely adopted in research studies [57].

5. Advantages of Pacbio HiFi Sequencing

PacBio HiFi sequencing is an advanced third-generation sequencing technology designed to deliver highly accurate, long-read genomic data. PacBio HiFi sequencing operates based on SMRT sequencing, where individual DNA molecules are sequenced in real time. The process begins with sample preparation, where DNA is extracted and the sequencing library is constructed, typically involving DNA fragmentation and ligation of sequencing adapters. During the sequencing process, each DNA molecule is captured in a ZMW, and DNA polymerase is introduced to initiate synthesis [68]. As nucleotides are incorporated, real-time monitoring captures each addition at the single-molecule level. This single-molecule approach minimizes sample cross-contamination and eliminates amplification bias common in other sequencing technologies.
PacBio HiFi sequencing offers distinct technical advantages over other sequencing technologies, primarily due to its ability to generate long reads, typically ranging from 10 to 25 kb [54]. This read length enables comprehensive coverage of large structural variants and challenging genomic regions, including repetitive elements and “dark” regions that are difficult to resolve with SRS [54,69]. Furthermore, HiFi sequencing achieves exceptionally high accuracy, exceeding 99.5%, comparable to short-read and Sanger sequencing, effectively minimizing sequencing errors [54,56,70]. The technology’s versatility is reflected in its broad application across multiple domains, including genome assembly, variant detection, epigenetic profiling, and haplotype phasing [54,69,71]. Additionally, the increased read length and reduced error rate of the HiFi sequencing method significantly lower the coverage depth required for whole-genome sequencing, thereby reducing both sample input and computational load. Despite these advantages, the high cost of PacBio HiFi sequencing continues to limit its widespread use in large-scale metagenomic studies.

6. Applications of HiFi Sequencing in Metagenomics

In recent years, metagenomic sequencing technology has significantly advanced the study of complex microbial communities. Through metagenomic assembly, researchers have been able to reconstruct the genomes of gut microbiota from a variety of hosts, including humans [72,73], mice [74,75], cattle [76,77], sheep [78], chickens [79], pigs [80], and giant pandas [62,65]. Studying microbial communities in these diverse hosts is crucial for understanding how microbiomes contribute to host health, nutrient metabolism, and disease resistance.
Studies using second-generation sequencing technology have limitations in handling repetitive fragments, which are common in complex microbial environments. This issue is particularly problematic when dealing with DNA segments that are repeated within a single organism or shared among different organisms, resulting in challenges in assembling MAGs and potential distortions in microbial quantification [81]. These limitations can lead to misidentification of species, incomplete metabolic reconstructions, and inaccurate representation of microbial community structure.
Pacbio SMRT sequencing uses ZMW technology to synthesize DNA strands, emitting a fluorescent signal for each nucleotide incorporated. This method produces long reads with an initial error rate of approximately 13% [82]. PacBio has refined its initial sequencing methodology by implementing CCS technology, which entails iteratively sequencing a single DNA molecule on a circular template to generate multiple read sequences [83]. Combining multiple reads creates a HiFi read, improving sequencing accuracy. HiFi sequencing technology has the capability to achieve a 99.9% accuracy rate with read lengths ranging from 10 to 20 kb or longer [84]. This characteristic serves to mitigate a limitation present in both Illumina and ONT/PacBio LRS platforms when applied to the field of metagenomics. In previous studies, researchers have focused on accurately recovering microbial genomes to uncover key microorganisms within complex environments. PacBio’s HiFi sequencing technology has significantly advanced the recovery of MAGs due to its long read lengths and high accuracy. For instance, Richy et al. [49] utilized HiFi technology to sequence and assemble MAGs from deadwood microbiomes, discovering nitrogen-fixing bacteria. Similarly, Jiang et al. [85] used PacBio HiFi sequencing in the anaerobic digestion of food waste to construct 60 nonredundant microbial genomes, including 39 complete circular genomes, revealing novel microbial compositions and enhancing genome assembly quality compared to traditional short-read methods. Tao et al. [86] utilized HiFi long reads and Illumina short reads to assemble MAGs and viral genomes from Tibetan Saline Lake Sediment, demonstrating a notable improvement in the retrieval of medium- to high-quality MAGs and viral genomes through the implementation of a hybrid assembly approach. Not limited to environmental samples, HiFi technology has also facilitated the assembly of numerous high-quality MAGs in human and animal gut microbiome research. Kim et al. [87] employed PacBio HiFi sequencing to assemble high-quality MAGs from human fecal samples. They successfully obtained 102 cMAGs with nucleotide accuracy comparable to Illumina sequencing. These cMAGs included genomes from novel orders and encompassed difficult-to-culture regions such as genomic islands and rRNAs, highlighting the robustness of HiFi sequencing in resolving complex genomic regions. To address the limitations of incomplete and lower quality MAGs, Zhang et al. [88] used Pacbio HiFi technology to sequence and assemble 337 microbial genomes at the species level and 461 at the strain level, with over half being circular. They also discovered potential new taxonomy at various levels, showing that HiFi sequencing enhances metagenome assemblies and reveals novel genomes and genes.
Several studies have compared the differences between HiFi metagenomic assembly and other sequencing technologies. Eisenhofer et al. [7] compared SRS, HiFi LRS, and hybrid strategies for genome-resolved metagenomics. Their results demonstrated that HiFi LRS produced superior assembly statistics, including the highest N50, the lowest number of contigs, and a greater number of high-quality MAGs. In contrast, SRS resulted in the highest number of refined bins. Zhang et al. [88] demonstrated that genomes assembled using the HiFi method presented significant enhancements in terms of continuity, completeness, and contamination metrics when compared to MAGs derived from SRS. Additionally, the HiFi approach enabled the identification of 384 previously unknown strains and 89 novel species that were not detected in short-read metagenome studies. Furthermore, the gene catalog generated from HiFi assemblies exhibited a markedly higher level of structural completeness (>99%) in comparison to short-read assemblies (40–60%), with approximately one-third of the genes being novel. Feng et al. [89] discovered that HiFi-based MAG assembly may have worse assembly outcome for species with higher abundance due to strain diversity, leading to fragmented contigs. This emphasizes the importance of accurately assembling genomes of diverse and abundant species, with alternative solutions proposed by the authors.
There is no doubt that PacBio HiFi metagenomic sequencing is an effective method for large-scale recovery of high-accuracy microbial reference genomes. However, some issues still need to be addressed. Moss’s study suggests that ONT-based MAG assembly results in decreased nucleotide accuracy compared to methods based on Illumina reads [63]. To date, no studies have directly compared the nucleotide accuracy of MAGs assembled using HiFi long reads. Additionally, the cost of HiFi sequencing remains relatively high compared to other sequencing methods, which can limit its widespread adoption, especially for large-scale studies.

7. Other Applications of HiFi Sequencing in Metagenomics

7.1. Quantitative Microbial Analysis with HiFi Sequencing

When quantifying microbial composition in samples, either relatively or absolutely, 16S rRNA sequencing is prone to PCR bias, and second-generation metagenomic methods struggle with interference from homologous sequences, making accurate quantification difficult. However, HiFi LRS holds significant promise for quantitative microbial analysis due to its high efficiency in taxonomic classification. Portik et al. [90] found that long-read datasets produced significantly better results than short-read datasets. Furthermore, shorter long reads (<2 kb) led to reduced precision and inaccurate abundance estimates. Their results indicated that while long reads did not perfectly recover the abundance of mock communities, they provided an important initial reference for the application of HiFi long reads in estimating the abundance of complex environmental microbiomes.

7.2. Detection of Genetic Variations with HiFi Sequencing

Detecting SNVs and SVs in complex microbial environments poses significant challenges. Chen et al. [50] utilized ONT technology to identify SVs in the human gut microbiome. Given the superior accuracy of HiFi sequencing, we propose that employing HiFi technology to detect microbial SVs and even single nucleotide variations in complex environmental samples is a promising application direction. Fedarko et al. [91] utilized deep HiFi sequencing with long reads to identify rare mutations within a single MAG. The results indicated that assembling MAGs from metagenomic LRS is an efficient method for detecting infrequent genetic variations.

7.3. Integration of Emerging Technologies with HiFi Sequencing

Integrating other emerging technologies with HiFi sequencing could enhance its value in metagenomics research. For instance, Bickhart et al. [92] combined Hi-C and HiFi sequencing technologies to assemble 428 high-quality MAGs from human fecal samples. This approach effectively addressed the challenges posed by closely related strains in complex microbial communities, facilitating more accurate genome assembly.

8. Future Applications of PacBio HiFi Sequencing Technology in Metagenomics

PacBio HiFi sequencing technology has significantly advanced the field of metagenomics by providing HiFi, long-read sequences that enhance the recovery and assembly of MAGs from complex microbial communities. This technology offers several advantages, including improved genome continuity, completeness, and the ability to uncover novel microbial strains and species. Studies have demonstrated the effectiveness of HiFi sequencing in various environments, including deadwood microbiomes [49], anaerobic digestion of food waste [85], and human gut microbiomes [87]. It is clear that PacBio HiFi sequencing technology has already made significant contributions to MAGs in metagenomics studies. Moving forward, several promising areas where this technology is likely to play an increasingly important role include the following.
A key application of PacBio HiFi sequencing lies in generating high-quality reference genomes directly from environmental samples, bypassing the need for cultivation. Recent exploratory studies have demonstrated the advantages of HiFi sequencing in metagenome assembly, particularly at the species level, where its ability to resolve fine taxonomic distinctions stands out [87]. However, assembling genomes at the strain level may still require further advancements in bioinformatics approaches, specifically in distinguishing subtle genetic differences between strains. Future developments in algorithms designed to address strain-level resolution could unlock even greater potential for this application.
The long-read capabilities and nonamplification-based sequencing strategy of PacBio HiFi technology offer substantial advantages in calculating the relative abundance of microbes in metagenomic studies. Its ability to generate ultra-long reads provides a superior framework for analyzing the structure of microbial communities. However, current applications remain limited, largely due to biases introduced during the library construction process, where DNA fragmentation and size selection are still necessary. Understanding and mitigating these biases represent an important area of future research, and significant progress in this field could enable more accurate and reliable microbial community analyses.
Another significant area for future exploration involves the host association of microbial functional genes, such as ARGs or virulence factors. At present, the primary methods used to identify the microbial hosts of these genes rely on correlation analyses or MAGs assembly, both of which are prone to false positives. The high accuracy and long reads generated by PacBio HiFi sequencing can significantly improve species-level annotations and offer direct evidence for linking functional genes to their microbial hosts. By annotating HiFi reads at the species level, researchers can more confidently trace the origins of functional genes, reducing false positives and providing direct evidence of gene–host associations.
However, challenges remain, such as the high cost of HiFi sequencing and the need for substantial computational resources. Future research should focus on developing cost-effective methods, improving computational tools, and enhancing error correction algorithms to fully realize the potential of HiFi sequencing in metagenomics. While PacBio HiFi sequencing presents a promising avenue for metagenomic research, addressing these challenges will be crucial for its widespread adoption and for unlocking the full potential of microbial genomics in various environmental and clinical applications.

9. Conclusions

Culturomics has made significant advancements, but metagenomics remains a vital tool in microbiology. Advanced sequencing technologies, especially PacBio HiFi, have greatly improved our ability to generate high-quality bacterial reference genomes, enabling deeper exploration of microbial diversity and function. This review highlights the unique advantages of PacBio HiFi, including its role in producing high-quality MAGs, detecting SVs, and integrating with emerging technologies to enhance metagenomic research. Despite challenges like high cost and computational demand, ongoing advancements in sequencing and bioinformatics will expand metagenomics applications. The integration of HiFi with technologies like Hi-C and ONT offers promising solutions to current limitations. Further improvements in these methodologies will enhance our understanding of microbial communities and their potential for biotechnological and therapeutic applications, advancing in both fundamental and applied microbiology.

Author Contributions

Y.H.: Writing—original draft. J.H.: Writing—review and editing. M.L.: Visualization. Y.P.: Writing—review and editing. H.J.: Writing—review and editing. Y.L.: Conceptualization, writing—review and editing. J.Z.: Conceptualization, writing—review and editing. F.D.: Conceptualization, visualization, writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by National Key Research and Development Program of China (SQ2023YFE0102739), the Youth project of Guangdong Foshan joint fund of the Guangdong Natural Science Foundation (2022A1515110819), and National Natural Science Foundation of China (no. 32170430).

Data Availability Statement

No new data was created for this study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Junkins, E.N.; Stevenson, B.S. Using Plate-Wash PCR and High-Throughput Sequencing to Measure Cultivated Diversity for Natural Product Discovery Efforts. Front. Microbiol. 2021, 12, 675798. [Google Scholar] [CrossRef] [PubMed]
  2. Lind, A.L.; Pollard, K.S. Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing. Microbiome 2021, 9, 58. [Google Scholar] [CrossRef] [PubMed]
  3. You, H.S.; Lee, S.H.; Lee, Y.J.; Lee, H.; Kang, S.S.; Hyun, S.H. Next-Generation Sequencing Results Vary Between Cultured and Uncultured Microbes. Curr. Microbiol. Int. J. 2022, 79, 187. [Google Scholar] [CrossRef]
  4. Yasir, M.; Qureshi, A.K.; Azhar, E.I. 16S amplicon sequencing of microbial communities in enriched and non-enriched sediments of non-volcanic hot spring with temperature gradients. PeerJ 2021, 9, e10995. [Google Scholar] [CrossRef] [PubMed]
  5. Bittleston, L.S. Connecting microbial community assembly and function. Curr. Opin. Microbiol. 2024, 80, 102512. [Google Scholar] [CrossRef] [PubMed]
  6. Peng, Y.; Chen, R.; Zhang, Z.; Jin, R.; Xie, T.; Liu, X.; Chai, J.; Howe, S.; Zhao, J.; Li, Y.; et al. Metagenomic and Meta-Transcriptomic Analysis Reveal the Colonization and Expression Profile of Probiotic Strains in Humans and Animals. Fermentation 2023, 9, 417. [Google Scholar] [CrossRef]
  7. Eisenhofer, R.; Nesme, J.; Santos-Bay, L.; Koziol, A.; Sørensen, S.J.; Alberdi, A.; Aizpurua, O. A comparison of short-read, HiFi long-read, and hybrid strategies for genome-resolved metagenomics. Microbiol. Spectr. 2024, 12, e03590-23. [Google Scholar] [CrossRef]
  8. Gupta, P.; O’Neill, H.; Wolvetang, E.J.; Chatterjee, A.; Gupta, I. Advances in single-cell long-read sequencing technologies. NAR Genom. Bioinform. 2024, 6, lqae047. [Google Scholar] [CrossRef]
  9. Jia, H.; Tan, S.; Zhang, Y.E. Chasing Sequencing Perfection: Marching Toward Higher Accuracy and Lower Costs. Genom. Proteom. Bioinform. 2024, 22, qzae024. [Google Scholar] [CrossRef]
  10. Agustinho, D.P.; Fu, Y.; Menon, V.K.; Metcalf, G.A.; Treangen, T.J.; Sedlazeck, F.J. Unveiling microbial diversity: Harnessing long-read sequencing technology. Nat. Methods 2024, 21, 954–966. [Google Scholar] [CrossRef]
  11. Chai, J.; Zhuang, Y.; Cui, K.; Bi, Y.; Zhang, N. Metagenomics reveals the temporal dynamics of the rumen resistome and microbiome in goat kids. Microbiome 2024, 12, 14. [Google Scholar] [CrossRef] [PubMed]
  12. Cook, R.; Brown, N.; Rihtman, B.; Michniewski, S.; Redgwell, T.; Clokie, M.; Stekel, D.J.; Chen, Y.; Scanlan, D.J.; Hobman, J.L.; et al. The long and short of it: Benchmarking viromics using Illumina, Nanopore and PacBio sequencing technologies. Microb. Genom. 2024, 10, 001198. [Google Scholar] [CrossRef] [PubMed]
  13. Turnbaugh, P.J.; Ley, R.E.; Hamady, M.; Fraser-Liggett, C.M.; Knight, R.; Gordon, J.I. The human microbiome project. Nature 2007, 449, 804–810. [Google Scholar] [CrossRef] [PubMed]
  14. Ngara, T.R.; Zhang, H. Recent advances in function-based metagenomic screening. Genom. Proteom. Bioinform. 2018, 16, 405–415. [Google Scholar] [CrossRef] [PubMed]
  15. Jiao, J.-Y.; Liu, L.; Hua, Z.-S.; Fang, B.-Z.; Zhou, E.-M.; Salam, N.; Hedlund, B.P.; Li, W.-J. Microbial dark matter coming to light: Challenges and opportunities. Natl. Sci. Rev. 2021, 8, nwaa280. [Google Scholar] [CrossRef] [PubMed]
  16. Osburn, E.D.; McBride, S.G.; Strickland, M.S. Microbial dark matter could add uncertainties to metagenomic trait estimations. Nat. Microbiol. 2024, 9, 1427–1430. [Google Scholar] [CrossRef] [PubMed]
  17. Li, S.; Lian, W.-H.; Han, J.-R.; Ali, M.; Lin, Z.-L.; Liu, Y.-H.; Li, L.; Zhang, D.-Y.; Jiang, X.-Z.; Li, W.-J.; et al. Capturing the microbial dark matter in desert soils using culturomics-based metagenomics and high-resolution analysis. Npj Biofilms Microbiomes 2023, 9, 67. [Google Scholar] [CrossRef]
  18. Pavlopoulos, G.A.; Baltoumas, F.A.; Liu, S.; Selvitopi, O.; Camargo, A.P.; Nayfach, S.; Azad, A.; Roux, S.; Call, L.; Ivanova, N.N.; et al. Unraveling the functional dark matter through global metagenomics. Nature 2023, 622, 594–602. [Google Scholar] [CrossRef]
  19. Yan, M.; Pratama, A.A.; Somasundaram, S.; Li, Z.; Jiang, Y.; Sullivan, M.B.; Yu, Z. Interrogating the viral dark matter of the rumen ecosystem with a global virome database. Nat. Commun. 2023, 14, 5254. [Google Scholar] [CrossRef]
  20. Yang, B.; Yang, J.; Chen, R.; Chai, J.; Wei, X.; Zhao, J.; Zhao, Y.; Deng, F.; Li, Y. Metagenome-Assembled Genomes of Pig Fecal Samples in Nine European Countries: Insights into Antibiotic Resistance Genes and Viruses. Microorganisms 2024, 12, 2409. [Google Scholar] [CrossRef]
  21. Parks, D.H.; Rinke, C.; Chuvochina, M.; Chaumeil, P.-A.; Woodcroft, B.J.; Evans, P.N.; Hugenholtz, P.; Tyson, G.W. Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life. Nat. Microbiol. 2017, 2, 1533–1542. [Google Scholar] [CrossRef] [PubMed]
  22. Zou, Y.; Xue, W.; Luo, G.; Deng, Z.; Qin, P.; Guo, R.; Sun, H.; Xia, Y.; Liang, S.; Dai, Y.; et al. 1520 reference genomes from cultivated human gut bacteria enable functional microbiome analyses. Nat. Biotechnol. 2019, 37, 179–185. [Google Scholar] [CrossRef] [PubMed]
  23. Forster, S.C.; Kumar, N.; Anonye, B.O.; Almeida, A.; Viciani, E.; Stares, M.D.; Dunn, M.; Mkandawire, T.T.; Zhu, A.; Shao, Y.; et al. A human gut bacterial genome and culture collection for improved metagenomic analyses. Nat. Biotechnol. 2019, 37, 186–192. [Google Scholar] [CrossRef] [PubMed]
  24. Almeida, A.; Mitchell, A.L.; Boland, M.; Forster, S.C.; Gloor, G.B.; Tarkowska, A.; Lawley, T.D.; Finn, R.D. A new genomic blueprint of the human gut microbiota. Nature 2019, 568, 499–504. [Google Scholar] [CrossRef]
  25. Yang, J.; Fan, Y.; Jin, R.; Peng, Y.; Chai, J.; Wei, X.; Zhao, Y.; Deng, F.; Zhao, J.; Li, Y. Exploring the Intestinal Microbial Community of Lantang Pigs through Metagenome-Assembled Genomes and Carbohydrate Degradation Genes. Fermentation 2024, 10, 207. [Google Scholar] [CrossRef]
  26. Liu, S.; Moon, C.D.; Zheng, N.; Huws, S.; Zhao, S.; Wang, J. Opportunities and challenges of using metagenomic data to bring uncultured microbes into cultivation. Microbiome 2022, 10, 76. [Google Scholar] [CrossRef] [PubMed]
  27. Franzosa, E.A.; McIver, L.J.; Rahnavard, G.; Thompson, L.R.; Schirmer, M.; Weingart, G.; Lipson, K.S.; Knight, R.; Caporaso, J.G.; Segata, N.; et al. Species-level functional profiling of metagenomes and metatranscriptomes. Nat. Methods 2018, 15, 962–968. [Google Scholar] [CrossRef]
  28. Blanco-Míguez, A.; Beghini, F.; Cumbo, F.; McIver, L.J.; Thompson, K.N.; Zolfo, M.; Manghi, P.; Dubois, L.; Huang, K.D.; Thomas, A.M.; et al. Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4. Nat. Biotechnol. 2023, 41, 1633–1644. [Google Scholar] [CrossRef]
  29. Taş, N.; de Jong, A.E.; Li, Y.; Trubl, G.; Xue, Y.; Dove, N.C. Metagenomic tools in microbial ecology research. Curr. Opin. Biotechnol. 2021, 67, 184–191. [Google Scholar] [CrossRef]
  30. Wang, X.; Fang, Y.; Liang, W.; Wong, C.C.; Qin, H.; Gao, Y.; Liang, M.; Song, L.; Zhang, Y.; Fan, M.; et al. Fusobacterium nucleatum facilitates anti-PD-1 therapy in microsatellite stable colorectal cancer. Cancer Cell 2024, 42, 1729–1746. [Google Scholar] [CrossRef]
  31. Gu, Z.; Pei, W.; Shen, Y.; Wang, L.; Zhu, J.; Zhang, Y.; Fan, S.; Wu, Q.; Li, L.; Zhang, Z. Akkermansia muciniphila and its outer protein Amuc_1100 regulates tryptophan metabolism in colitis. Food Funct. 2021, 12, 10184–10195. [Google Scholar] [CrossRef] [PubMed]
  32. Yang, H.; Wu, J.; Huang, X.; Zhou, Y.; Zhang, Y.; Liu, M.; Liu, Q.; Ke, S.; He, M.; Fu, H.; et al. ABO genotype alters the gut microbiota by regulating GalNAc levels in pigs. Nature 2022, 606, 358–367. [Google Scholar] [CrossRef] [PubMed]
  33. Zhang, Z.-F.; Liu, L.-R.; Pan, Y.-P.; Pan, J.; Li, M. Long-read assembled metagenomic approaches improve our understanding on metabolic potentials of microbial community in mangrove sediments. Microbiome 2023, 11, 188. [Google Scholar] [CrossRef] [PubMed]
  34. Lou, Y.C.; Hoff, J.; Olm, M.R.; West-Roberts, J.; Diamond, S.; Firek, B.A.; Morowitz, M.J.; Banfield, J.F. Using strain-resolved analysis to identify contamination in metagenomics data. Microbiome 2023, 11, 36. [Google Scholar] [CrossRef] [PubMed]
  35. McLellan, S.L.; Eren, A.M. Discovering new indicators of fecal pollution. Trends Microbiol. 2014, 22, 697–706. [Google Scholar] [CrossRef] [PubMed]
  36. Quan, J.; Hu, H.; Zhang, H.; Meng, Y.; Liao, W.; Zhou, J.; Han, X.; Shi, Q.; Zhao, D.; Wang, Q.; et al. Investigating Possible Interspecies Communication of Plasmids Associated with Transfer of Third-Generation Cephalosporin, Quinolone, and Colistin Resistance Between Simultaneously Isolated Escherichia Coli and Klebsiella Pneumoniae. Microbiol. Spectr. 2023, 11, e0355422. [Google Scholar] [CrossRef]
  37. Ma, J.; Sun, H.; Li, B.; Wu, B.; Zhang, X.; Ye, L. Horizontal transfer potential of antibiotic resistance genes in wastewater treatment plants unraveled by microfluidic-based mini-metagenomics. J. Hazard. Mater. 2024, 465, 133493. [Google Scholar] [CrossRef]
  38. Pärnänen, K.; Karkman, A.; Hultman, J.; Lyra, C.; Bengtsson-Palme, J.; Larsson, D.J.; Rautava, S.; Isolauri, E.; Salminen, S.; Kumar, H. Maternal gut and breast milk microbiota affect infant gut antibiotic resistome and mobile genetic elements. Nat. Commun. 2018, 9, 3891. [Google Scholar] [CrossRef]
  39. Kazantseva, E.; Donmez, A.; Frolova, M.; Pop, M.; Kolmogorov, M. Strainy: Phasing and assembly of strain haplotypes from long-read metagenome sequencing. Nat. Methods 2024, 21, 2034–2043. [Google Scholar] [CrossRef]
  40. Olm, M.R.; Crits-Christoph, A.; Bouma-Gregson, K.; Firek, B.A.; Morowitz, M.J.; Banfield, J.F. inStrain profiles population microdiversity from metagenomic data and sensitively detects shared microbial strains. Nat. Biotechnol. 2021, 39, 727–736. [Google Scholar] [CrossRef]
  41. Truong, D.T.; Tett, A.; Pasolli, E.; Huttenhower, C.; Segata, N. Microbial strain-level population structure and genetic diversity from metagenomes. Genome Res. 2017, 27, 626–638. [Google Scholar] [CrossRef]
  42. Méheust, R.; Watson, A.K.; Lapointe, F.J.; Papke, R.T.; Lopez, P.; Bapteste, E. Hundreds of novel composite genes and chimeric genes with bacterial origins contributed to haloarchaeal evolution. Genome Biol. 2018, 19, 75. [Google Scholar] [CrossRef] [PubMed]
  43. Bharti, R.; Grimm, D.G. Current challenges and best-practice protocols for microbiome analysis. Brief. Bioinform. 2021, 22, 178–193. [Google Scholar] [CrossRef] [PubMed]
  44. Li, D.; Liu, C.-M.; Luo, R.; Sadakane, K.; Lam, T.-W. MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 2015, 31, 1674–1676. [Google Scholar] [CrossRef] [PubMed]
  45. Nurk, S.; Meleshko, D.; Korobeynikov, A.; Pevzner, P.A. metaSPAdes: A new versatile metagenomic assembler. Genome Res. 2017, 27, 824–834. [Google Scholar] [CrossRef] [PubMed]
  46. Brito, I.L. Examining horizontal gene transfer in microbial communities. Nat. Rev. Microbiol. 2021, 19, 442–453. [Google Scholar] [CrossRef]
  47. Ayling, M.; Clark, M.D.; Leggett, R.M. New approaches for metagenome assembly with short reads. Brief. Bioinform. 2020, 21, 584–594. [Google Scholar] [CrossRef] [PubMed]
  48. Nicholls, S.M.; Quick, J.C.; Tang, S.; Loman, N.J. Ultra-deep, long-read nanopore sequencing of mock microbial community standards. Gigascience 2019, 8, giz043. [Google Scholar] [CrossRef]
  49. Richy, E.; Dobbler, P.T.; Tláskal, V.; López-Mondéjar, R.; Baldrian, P.; Kyselková, M. Pacbio HiFi Sequencing Sheds Light on Key Bacteria Contributing to Deadwood Decomposition Processes. 2024. Available online: https://www.researchsquare.com/article/rs-4181686/v1 (accessed on 4 April 2024).
  50. Chen, L.; Zhao, N.; Cao, J.; Liu, X.; Xu, J.; Ma, Y.; Yu, Y.; Zhang, X.; Zhang, W.; Guan, X.; et al. Short-and long-read metagenomics expand individualized structural variations in gut microbiomes. Nat. Commun. 2022, 13, 3175. [Google Scholar] [CrossRef]
  51. Branton, D.; Deamer, D.W.; Marziali, A.; Bayley, H.; Benner, S.A.; Butler, T.; Di Ventra, M.; Garaj, S.; Hibbs, A.; Huang, X.; et al. The potential and challenges of nanopore sequencing. Nat. Biotechnol. 2008, 26, 1146–1153. [Google Scholar] [CrossRef]
  52. McCarthy, A. Third generation DNA sequencing: Pacific biosciences’ single molecule real time technology. Chem. Biol. 2010, 17, 675–676. [Google Scholar] [CrossRef] [PubMed]
  53. Wang, Y.; Zhao, Y.; Bollas, A.; Wang, Y.; Au, K.F. Nanopore sequencing technology, bioinformatics and applications. Nat. Biotechnol. 2021, 39, 1348–1365. [Google Scholar] [CrossRef] [PubMed]
  54. Hon, T.; Mars, K.; Young, G.; Tsai, Y.-C.; Karalius, J.W.; Landolin, J.M.; Maurer, N.; Kudrna, D.; Hardigan, M.A.; Steiner, C.C.; et al. Highly accurate long-read HiFi sequencing data for five complex genomes. Sci. Data 2020, 7, 399. [Google Scholar] [CrossRef] [PubMed]
  55. Payne, A.; Holmes, N.; Rakyan, V.; Loose, M. BulkVis: A graphical viewer for Oxford nanopore bulk FAST5 files. Bioinformatics 2019, 35, 2193–2198. [Google Scholar] [CrossRef]
  56. Wenger, A.M.; Peluso, P.; Rowell, W.J.; Chang, P.-C.; Hall, R.J.; Concepcion, G.T.; Ebler, J.; Fungtammasan, A.; Kolesnikov, A.; Olson, N.D.; et al. Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome. Nat. Biotechnol. 2019, 37, 1155–1162. [Google Scholar] [CrossRef]
  57. Zidane, N.; Rodrigues, C.; Bouchez, V.; Rethoret-Pasty, M.; Passet, V.; Brisse, S.; Crestani, C. Accurate genotyping of three major respiratory bacterial pathogens with ONT R10. 4.1 long-read sequencing. bioRxiv 2024. [Google Scholar] [CrossRef]
  58. Kolmogorov, M.; Yuan, J.; Lin, Y.; Pevzner, P.A. Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 2019, 37, 540–546. [Google Scholar] [CrossRef]
  59. Zhang, Z.; Zhang, J.; Kang, L.; Qiu, X.; Xu, S.; Xu, J.; Guo, Y.; Niu, Z.; Niu, B.; Bi, A.; et al. Structural variation discovery in wheat using PacBio high-fidelity sequencing. Plant J. 2023, 120, 687–698. [Google Scholar] [CrossRef]
  60. Singleton, C.M.; Petriglieri, F.; Kristensen, J.M.; Kirkegaard, R.H.; Michaelsen, T.Y.; Andersen, M.H.; Kondrotaite, Z.; Karst, S.M.; Dueholm, M.S.; Nielsen, P.H.; et al. Connecting structure to function with the recovery of over 1000 high-quality metagenome-assembled genomes from activated sludge using long-read sequencing. Nat. Commun. 2021, 12, 2009. [Google Scholar] [CrossRef]
  61. Liu, L.; Yang, Y.; Deng, Y.; Zhang, T. Nanopore long-read-only metagenomics enables complete and high-quality genome reconstruction from mock and complex metagenomes. Microbiome 2022, 10, 209. [Google Scholar] [CrossRef]
  62. Huang, G.; Shi, W.; Wang, L.; Qu, Q.; Zuo, Z.; Wang, J.; Zhao, F.; Wei, F. PandaGUT provides new insights into bacterial diversity, function, and resistome landscapes with implications for conservation. Microbiome 2023, 11, 221. [Google Scholar] [CrossRef] [PubMed]
  63. Moss, E.L.; Maghini, D.G.; Bhatt, A.S. Complete, closed bacterial genomes from microbiomes using nanopore sequencing. Nat. Biotechnol. 2020, 38, 701–707. [Google Scholar] [CrossRef] [PubMed]
  64. Deng, F.; Han, Y.; Huang, Y.; Li, D.; Chai, J.; Deng, L.; Wei, M.; Wu, K.; Zhao, H.; Yang, G.; et al. A comprehensive analysis of antibiotic resistance genes in the giant panda gut. Imeta 2024, 3, e171. [Google Scholar] [CrossRef] [PubMed]
  65. Deng, F.; Wang, C.; Li, D.; Peng, Y.; Deng, L.; Zhao, Y.; Zhang, Z.; Wei, M.; Wu, K.; Zhao, J.; et al. The unique gut microbiome of giant pandas involved in protein metabolism contributes to the host’s dietary adaption to bamboo. Microbiome 2023, 11, 180. [Google Scholar] [CrossRef] [PubMed]
  66. Xia, Y.; Li, X.; Wu, Z.; Nie, C.; Cheng, Z.; Sun, Y.; Liu, L.; Zhang, T. Strategies and tools in illumina and nanopore-integrated metagenomic analysis of microbiome data. iMeta 2023, 2, e72. [Google Scholar] [CrossRef]
  67. Shendure, J.; Balasubramanian, S.; Church, G.M.; Gilbert, W.; Rogers, J.; Schloss, J.A.; Waterston, R.H. DNA sequencing at 40: Past, present and future. Nature 2017, 550, 345–353. [Google Scholar] [CrossRef]
  68. Rhoads, A.; Au, K.F. PacBio sequencing and its applications. Genom. Proteom. Bioinform. 2015, 13, 278–289. [Google Scholar] [CrossRef]
  69. Chen, Y.; Wu, B.; Ding, Y.; Niu, L.; Bai, X.; Lin, Z.; Xiao, C.-L. High accuracy methylation identification tools on single molecular level for PacBio HiFi data. bioRxiv 2024. [Google Scholar] [CrossRef]
  70. van Dijk, E.L.; Naquin, D.; Gorrichon, K.; Jaszczyszyn, Y.; Ouazahrou, R.; Thermes, C.; Hernandez, C. Genomics in the long-read sequencing era. Trends Genet. 2023, 39, 649–671. [Google Scholar] [CrossRef]
  71. Kucuk, E.; van der Sanden, B.P.; O’Gorman, L.; Kwint, M.; Derks, R.; Wenger, A.M.; Lambert, C.; Chakraborty, S.; Baybayan, P.; Rowell, W.J.; et al. Comprehensive de novo mutation discovery with HiFi long-read sequencing. Genome Med. 2023, 15, 34. [Google Scholar] [CrossRef]
  72. Nayfach, S.; Shi, Z.J.; Seshadri, R.; Pollard, K.S.; Kyrpides, N.C. New insights from uncultivated genomes of the global human gut microbiome. Nature 2019, 568, 505–510. [Google Scholar] [CrossRef] [PubMed]
  73. Arikawa, K.; Ide, K.; Kogawa, M.; Saeki, T.; Yoda, T.; Endoh, T.; Matsuhashi, A.; Takeyama, H.; Hosokawa, M. Recovery of strain-resolved genomes from human microbiome through an integration framework of single-cell genomics and metagenomics. Microbiome 2021, 9, 202. [Google Scholar] [CrossRef] [PubMed]
  74. Lesker, T.R.; Durairaj, A.C.; Gálvez, E.J.; Lagkouvardos, I.; Baines, J.F.; Clavel, T.; Sczyrba, A.; McHardy, A.C.; Strowig, T. An integrated metagenome catalog reveals new insights into the murine gut microbiome. Cell Rep. 2020, 30, 2909–2922.e6. [Google Scholar] [CrossRef]
  75. Zhu, J.; Ren, H.; Zhong, H.; Li, X.; Zou, Y.; Han, M.; Li, M.; Madsen, L.; Kristiansen, K.; Xiao, L. An expanded gene catalog of mouse gut metagenomes. MSphere 2021, 6, 10–1128. [Google Scholar] [CrossRef] [PubMed]
  76. Tong, F.; Wang, T.; Gao, N.L.; Liu, Z.; Cui, K.; Duan, Y.; Wu, S.; Luo, Y.; Li, Z.; Yang, C.; et al. The microbiome of the buffalo digestive tract. Nat. Commun. 2022, 13, 823. [Google Scholar] [CrossRef]
  77. Xie, F.; Jin, W.; Si, H.; Yuan, Y.; Tao, Y.; Liu, J.; Wang, X.; Yang, C.; Li, Q.; Yan, X.; et al. An integrated gene catalog and over 10,000 metagenome-assembled genomes from the gastrointestinal microbiome of ruminants. Microbiome 2021, 9, 137. [Google Scholar] [CrossRef] [PubMed]
  78. Zhang, K.; He, C.; Wang, L.; Suo, L.; Guo, M.; Guo, J.; Zhang, T.; Xu, Y.; Lei, Y.; Liu, G.; et al. Compendium of 5810 genomes of sheep and goat gut microbiomes provides new insights into the glycan and mucin utilization. Microbiome 2024, 12, 104. [Google Scholar] [CrossRef]
  79. Wang, Y.; Qu, M.; Bi, Y.; Liu, W.J.; Ma, S.; Wan, B.; Hu, Y.; Zhu, B.; Zhang, G.; Gao, G.F. The multi-kingdom microbiome catalog of the chicken gastrointestinal tract. Biosaf. Health 2024, 6, 101–115. [Google Scholar] [CrossRef]
  80. Chen, C.; Zhou, Y.; Fu, H.; Xiong, X.; Fang, S.; Jiang, H.; Wu, J.; Yang, H.; Gao, J.; Huang, L. Expanded catalog of microbial genes and metagenome-assembled genomes from the pig gut microbiome. Nat. Commun. 2021, 12, 1106. [Google Scholar] [CrossRef]
  81. Olson, N.D.; Treangen, T.J.; Hill, C.M.; Cepeda-Espinoza, V.; Ghurye, J.; Koren, S.; Pop, M. Metagenomic assembly through the lens of validation: Recent advances in assessing and improving the quality of genomes assembled from metagenomes. Brief. Bioinform. 2019, 20, 1140–1150. [Google Scholar] [CrossRef]
  82. Van Dijk, E.L.; Jaszczyszyn, Y.; Naquin, D.; Thermes, C. The third revolution in sequencing technology. Trends Genet. 2018, 34, 666–681. [Google Scholar] [CrossRef] [PubMed]
  83. Pourmohammadi, R.; Abouei, J.; Anpalagan, A. Error analysis of the PacBio sequencing CCS reads. Int. J. Biostat. 2023, 19, 439–453. [Google Scholar] [CrossRef] [PubMed]
  84. Tedersoo, L.; Albertsen, M.; Anslan, S.; Callahan, B. Perspectives and benefits of high-throughput long-read sequencing in microbial ecology. Appl. Environ. Microbiol. 2021, 87, e00626-21. [Google Scholar] [CrossRef] [PubMed]
  85. Jiang, F.; Li, Q.; Wang, S.; Shen, T.; Wang, H.; Wang, A.; Xu, D.; Yuan, L.; Lei, L.; Chen, R.; et al. Recovery of metagenome-assembled microbial genomes from a full-scale biogas plant of food waste by pacific biosciences high-fidelity sequencing. Front. Microbiol. 2023, 13, 1095497. [Google Scholar] [CrossRef]
  86. Tao, Y.; Xun, F.; Zhao, C.; Mao, Z.; Li, B.; Xing, P.; Wu, Q.L. Improved assembly of metagenome-assembled genomes and viruses in Tibetan saline lake sediment by HiFi metagenomic sequencing. Microbiol. Spectr. 2023, 11, e03328-22. [Google Scholar] [CrossRef]
  87. Kim, C.Y.; Ma, J.; Lee, I. HiFi metagenomic sequencing enables assembly of accurate and complete genomes from human gut microbiota. Nat. Commun. 2022, 13, 6367. [Google Scholar] [CrossRef]
  88. Zhang, Y.; Jiang, F.; Yang, B.; Wang, S.; Wang, H.; Wang, A.; Xu, D.; Fan, W. Improved microbial genomes and gene catalog of the chicken gut from metagenomic sequencing of high-fidelity long reads. GigaScience 2022, 11, giac116. [Google Scholar] [CrossRef]
  89. Feng, X.; Li, H. Evaluating and improving the representation of bacterial contents in long-read metagenome assemblies. Genome Biol. 2024, 25, 92. [Google Scholar] [CrossRef]
  90. Portik, D.M.; Brown, C.T.; Pierce-Ward, N.T. Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets. BMC Bioinform. 2022, 23, 541. [Google Scholar] [CrossRef]
  91. Fedarko, M.W.; Kolmogorov, M.; Pevzner, P.A. Analyzing rare mutations in metagenomes assembled using long and accurate reads. Genome Res. 2022, 32, 2119–2133. [Google Scholar] [CrossRef] [PubMed]
  92. Bickhart, D.M.; Kolmogorov, M.; Tseng, E.; Portik, D.M.; Korobeynikov, A.; Tolstoganov, I.; Uritskiy, G.; Liachko, I.; Sullivan, S.T.; Shin, S.B.; et al. Generating lineage-resolved, complete metagenome-assembled genomes from complex microbial communities. Nat. Biotechnol. 2022, 40, 711–719. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Potential application of shotgun metagenomics.
Figure 1. Potential application of shotgun metagenomics.
Microorganisms 12 02482 g001
Figure 2. A comparison of metagenomic and culturomic approaches in microbial research.
Figure 2. A comparison of metagenomic and culturomic approaches in microbial research.
Microorganisms 12 02482 g002
Figure 3. Schematic diagram of the advantages of long reads.
Figure 3. Schematic diagram of the advantages of long reads.
Microorganisms 12 02482 g003
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Han, Y.; He, J.; Li, M.; Peng, Y.; Jiang, H.; Zhao, J.; Li, Y.; Deng, F. Unlocking the Potential of Metagenomics with the PacBio High-Fidelity Sequencing Technology. Microorganisms 2024, 12, 2482. https://doi.org/10.3390/microorganisms12122482

AMA Style

Han Y, He J, Li M, Peng Y, Jiang H, Zhao J, Li Y, Deng F. Unlocking the Potential of Metagenomics with the PacBio High-Fidelity Sequencing Technology. Microorganisms. 2024; 12(12):2482. https://doi.org/10.3390/microorganisms12122482

Chicago/Turabian Style

Han, Yanhua, Jinling He, Minghui Li, Yunjuan Peng, Hui Jiang, Jiangchao Zhao, Ying Li, and Feilong Deng. 2024. "Unlocking the Potential of Metagenomics with the PacBio High-Fidelity Sequencing Technology" Microorganisms 12, no. 12: 2482. https://doi.org/10.3390/microorganisms12122482

APA Style

Han, Y., He, J., Li, M., Peng, Y., Jiang, H., Zhao, J., Li, Y., & Deng, F. (2024). Unlocking the Potential of Metagenomics with the PacBio High-Fidelity Sequencing Technology. Microorganisms, 12(12), 2482. https://doi.org/10.3390/microorganisms12122482

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop