Next Article in Journal
The Key Enzymes of Carbon Metabolism and the Glutathione Antioxidant System Protect Yarrowia lipolytica Yeast Against pH-Induced Stress
Previous Article in Journal
Four New Species of Deconica (Strophariaceae, Agaricales) from Subtropical Regions of China
Previous Article in Special Issue
Genome Sequencing of Lentinula edodes Revealed a Genomic Variant Block Associated with a Thermo-Tolerant Trait in Fruit Body Formation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparative Genomics Reveals Species-Specific Genes and Symbiotic Adaptations in Tricholoma matsutake

1
Department of Plant Medicine, Sunchon National University, Suncheon 57922, Republic of Korea
2
Forest Microbiology and Application Division, Forest Bioresources Department, National Institute of Forest Science, Suwon 16631, Republic of Korea
3
Department of Multimedia Engineering, Sunchon National University, Suncheon 57922, Republic of Korea
4
Department of Agricultural Life Science, Sunchon National University, Suncheon 57922, Republic of Korea
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
J. Fungi 2024, 10(11), 746; https://doi.org/10.3390/jof10110746
Submission received: 2 September 2024 / Revised: 3 October 2024 / Accepted: 26 October 2024 / Published: 29 October 2024

Abstract

:
Tricholoma matsutake, a highly valued ectomycorrhizal fungus, requires a symbiotic relationship with pine trees for growth, complicating its cultivation. This study presents a comprehensive comparative genomic analysis of Tricholoma species, with a focus on T. matsutake. Genomic data from 19 assemblies representing 13 species were analyzed to identify genus-, species-, and strain-specific genes, revealing significant evolutionary adaptations. Notably, T. matsutake exhibits a higher proportion of repetitive elements compared to other species, with retrotransposons like LTR Gypsy dominating its genome. Phylogenomic analyses showed that T. matsutake forms a monophyletic group closely related to T. bakamatsutake. Gene family expansion and contraction analyses highlighted the unique evolutionary pressures on T. matsutake, particularly the loss of tryptophan-related metabolic pathways and the gain of genes related to iron ion homeostasis, which may be crucial for its adaptation to nutrient-limited environments. Additionally, the reduction in secreted proteins and carbohydrate-active enzymes reflects the host-dependent lifestyle of T. matsutake and related species. These findings enhance our understanding of the genetic and evolutionary mechanisms underlying the complex symbiotic relationships of T. matsutake, offering potential avenues for optimizing its cultivation and commercial value.

1. Introduction

Species in the genus Tricholoma are mushroom fungi belonging to the family Tricholomataceae and the order Agaricales in the phylum Basidiomycota. They are found on every continent across the globe, including Africa, America, Asia, Europe, and Oceania [1,2,3,4]. All Tricholoma species are known to be ectomycorrhizal (ECM) fungi, forming symbiotic relationships with various broad-leaved or coniferous trees [5]. A recent phylogenetic study based on 50 multi-loci indicated that the genus Tricholoma is divided into at least four subgenera and fourteen sections [6]. Furthermore, with the advancement of next-generation sequencing (NGS) technology, the genomes of numerous Tricholoma species have been sequenced and reported (Table 1). However, comprehensive comparative genomic analysis within this genus has not yet been conducted.
Among the species in the genus Tricholoma, T. matsutake, which has the highest commercial value, necessitates a symbiotic relationship with its host (mainly Pinus, but also Abies, Tsuga, Picea, Quercus, and Castanopsis) for growth, yet its slow growth rate significantly complicates cultivation [7,8]. To address these challenges, various ecological, physiological, and genetic studies have been conducted. Research at the genomic level is particularly important, as comparing the genome of T. matsutake with those of closely related species can reveal more specific genetic differences. This comparative genomics approach is instrumental in identifying species-specific genes that have emerged from the co-evolution process between T. matsutake and its hosts. For instance, specific genes crucial for interactions with Pinus can be identified through gene variations that have adapted during the co-evolution process.
According to the latest telomere-to-telomere assembly, T. matsutake strain Sample A has been found to possess 13 chromosomes [9]. However, the number of annotated proteins varied significantly for different studies, ranging from 15,305 to 23,170 [9,10,11,12,13]. This discrepancy is due to different annotation pipelines and differences in gene structural annotation before and after repeat annotation. In this study, a comprehensive comparative analysis of the T. matsutake genome was performed using the same annotation method, identifying genus-, species-, and strain-specific genes within the Tricholoma genus. Comparative genomic studies with closely related species provide crucial insights for optimizing the growth conditions and improving the cultivation success rate of T. matsutake. Ongoing research based on these findings can maximize the commercial value of T. matsutake and ensure a stable supply.

2. Materials and Methods

2.1. Dataset Acquisition and Quality Assessment

To conduct a comparative genomic analysis of species within the genus Tricholoma, a search for “Tricholomataceae (Taxonomy ID: 5351)” was performed on NCBI, resulting in the download of 19 assemblies from 13 species (Cystodermella granulosa, Flagelloscypha sp., Macrocystidia cucumis, Rugosomyces carneus, Tricholoma bakamatsutake, Tricholoma flavovirens, Tricholoma furcatifolium, Tricholoma hemisulphureum, T. matsutake, Tricholoma saponaceum, Tricholoma terreum, and two Tricholoma species) (Table 1). The assembly quality was analyzed using BUSCO v5.3.2 and Quast v5.2.0 [14,15].

2.2. Structural Annotation of Repetitive Elements, Genes, and RNAs

For a comprehensive comparative genomic analysis, protein-coding genes, ribosomal RNAs (rRNAs), and transfer RNAs (tRNAs) were predicted following the de novo repeat annotation. The repetitive sequences were predicted using the RepeatModeler v2.0.5 and RepeatMasker v4.1.6 pipeline incorporated with LTR_retreiver v2.9.9, RepeatScout v1.0.6, and RECON v1.08 (-LTRStruct mode) [16,17,18,19,20]. The structural annotation of genes was performed with the repeat-masked genomes using BRAKER v3.0.7 based on the 330,798 Tricholomatineae (Taxonomy ID: 2982303) proteins retrieved from the NR database of NCBI and the fungal ab initio method [21]. Augustus v3.5.0 and GeneMark v3.68 were included for BRAKER3 annotation [22,23]. The rRNAs were predicted using Barrnap v0.9 [24], the tRNAs were predicted using tRNAscan-SE v2.0.12 [25], and the eukaryote model was used for both programs.

2.3. Synteny Analysis for T. matsutake Genomes

The genome mapping of T. matsutake strains to the reference strain Sample A was performed using a reference-based scaffolding tool RagTag v2.1.0 [26]. The synteny among the T. matsutake strains and between T. bakamatsutake was analyzed using the nucleotide sequences comparison tool nucmer in MUMmer v4.0.0rc1 package [27].

2.4. Functional Annotation of the Genes and Enrichment Tests

The functional annotation of genes was performed using DIAMOND alignments against the NR database, InterProScan, and Gene Ontology (GO), and the gene function enrichment analyses were performed by Fisher’s exact test incorporated in OmicsBox v3.1.11. An adjusted p-value of <0.05 was used for the cutoff for the functional enrichment test, but a p-value of <0.05 was used instead when no enriched GO IDs were found. Unless otherwise specified, the functional enrichment analysis used T. matsutake NIFoS 2001 as the reference. Additionally, KofamScan v1.3.0 was used for KEGG annotations [28].
SignalP v6.0 with eukaryote mode, dbCAN3 v4.1.4, and EffectorP v3.0 default options were used for the prediction of secreted proteins, carbohydrate-active enzymes (CAZymes), and putative effector proteins, respectively [29,30,31]. For the secondary metabolite biosynthesis gene clusters (SMBGCs), antiSMASH v7.0.0 fungal version with all features and relaxed strictness was implemented [32].

2.5. Phylogenomics, Orthology, and Gene Family Expansion/Contraction Analyses

The annotated proteins from the 19 strains were subjected to OrthoFinder v2.5.4 to reveal their orthology and phylogenomics relationship [33]. For gene family expansion and contraction analysis, the output from ortholog analysis and the species tree was subject to CAFE v5.1.0 [34]. The additive species tree was calibrated to an ultrametric tree using treePL v1.0 with a divergence time of 32 MYA between T. matsutake and T. flavovirens as reference [35]. The divergence time was derived from TimeTree 5 [36].

2.6. Idenfication of a Trytophan Metabolism Pathway Gene by PCR Amplification

T. matsutake and T. bakamatsutake strains (as shown in Supplementary Table S1) were cultured on potato dextrose agar (PDA) medium at 25 °C for 2 months. For DNA extraction, 100 mg of mycelium was frozen in liquid nitrogen and ground to a fine powder using a TissuLyser II (Qiagen, Hilden, Germany). Genomic DNA was then extracted from lyophilized mycelia using the Allprep Fungal DNA/RNA/Protein kit (Qiagen) following the manufacturer’s protocol. Primers were designed using Primer 3 web version 4.1.0 (primer3.ut.ee) and synthesized by Macrogen Co. Ltd. (Seoul, Republic of Korea). PCR amplification of the 3-hydroxyanthranilate 3,4-dioxygenase (3HAO) gene was performed using specific primers (Supplementary Figure S1 and Supplementary Table S2) and AccuPower Taq PCR PreMix (Bionner, Daejeon, Republic of Korea). The PCR conditions were as follows: initial denaturation at 95°C for 3 min, followed by 35 cycles of denaturation at 95 °C for 30 s, annealing at 59 °C for 30 s, and extension at 72 °C for 90 s, with a final extension at 72 °C for 5 min. PCR products were analyzed on a 1.0% agarose gel in 1% TAE buffer, stained with EcoDye DNA Staining Solution (SolGent, Daejeon, Republic of Korea), and run at 100 V for 20 min.

3. Results

3.1. Genome Quality and Metadata of the Available Agaricales Strains

T. matsutake strains were isolated from China, Korea, and Japan, while T. bakamatsutake strains were isolated from China and Japan (Table 1). Additionally, Tricholoma MG strains were isolated from China for mushroom genome analysis study [13]. Among the Tricholoma species, only the T. hemisulphureum strain was isolated outside Asia, specifically from the United Kingdom [37]. The species assembled at the chromosome level include T. matsutake sample A (13 chromosomes), T. bakamatsutake Sf-Tf05 (14 chromosomes) [38], and T. hemisulphureum (12 chromosomes) (Table 1). According to BUSCO analysis, essential genes at the Agaricales level were well preserved across all strains of T. matsutake and T. bakamatsutake, as well as T. flavovirens and T. hemisulphureum, regardless of the number of scaffolds (Supplementary Figure S2). Although the BUSCO scores for Tricholoma outgroup species at the Agaricales level were relatively insufficient, all were used to predict Tricholoma genus-specific genes.
Table 1. The genome statistics of Tricholoma and outgroup strains used in this study.
Table 1. The genome statistics of Tricholoma and outgroup strains used in this study.
SpeciesStrainIsolated
Location
No. of
Scaffolds
Assembly
Size (bp)
N50
(bp)
L50GC
(%)
Repeat
Size (bp)
Repeat
Content (%)
Reference
(GenBank Accession)
Tricholoma matsutakeNIFoS 2001Korea80162,632,6065,103,8591045.50116,305,33271.51Kang et al., 2023 [12]
(GCA_026213095.1)
Tricholoma matsutakeKMCC04578Korea5255189,030,25693,40658945.2599,821,68669.22Min et al., 2020 [10]
(GCA_002939025.2)
Tricholoma matsutakeMG52China58,276155,036,8749294397145.35108,421,24871.23Li et al., 2018 [13]
(GCA_003314635.1)
Tricholoma matsutakeNBRC 30605Japan88,884131,718,9252909952645.4688,187,24467.19Unpublished
(GCA_001652835.1)
Tricholoma matsutake945NA2545175,758,422320,87516745.40104,910,41169.01Miyauchi et al., 2020 [11]
(GCA_014904895.1)
Tricholoma matsutakeSample AJapan13161,040,72112,376,747645.50115,538,23871.74Kurokochi et al., 2023 [9]
(GCA_026075535.2)
Tricholoma bakamatsutakeSF-Tf05Japan14142,190,77712,427,164643.92108,654,71276.41Ichida et al., 2023 [38]
(GCA_029959215.1)
Tricholoma bakamatsutakeMG51China51,308140,676,1259944326043.93106,664,01577.08Li et al., 2018 [13]
(GCA_003313665.1)
Tricholoma flavovirensMG32China49,305119,315,1217637361145.0080,893,95568.90Li et al., 2018 [13]
(GCA_003313805.1)
Tricholoma sinoportentosum *MG77China53,256118,334,1155062605147.0965,740,83957.47Li et al., 2018 [13]
(GCA_003314665.1)
Tricholoma saponaceumMG146China14,50055,708,85711,929122847.0010,707,82519.35Li et al., 2018 [13]
(GCA_003313625.1)
Tricholoma albobrunneum **MG99China18,07259,536,7329454154046.7616,205,71727.82Li et al., 2018 [13]
(GCA_003521275.1)
Tricholoma hemisulphureumKDTOL00252UK26128,976,82610,683,071547.1388,317,38168.48Unpublished
(GCA_964034925.1)
Tricholoma terreumMG45China25,40484,437,8279877225947.8738,467,97046.28Li et al., 2018 [13]
(GCA_003316345.1)
Praearthromyces corneri ***D33Malaysia17,09449,805,31910,87087647.4411,299,28722.73Unpublished
(GCA_018854895.1)
Rugosomyces carneusD47Netherlands39,73358,963,7003469257647.0215,608,58326.69Unpublished
(GCA_018221805.1)
Cystodermella granulosaDBG-29008USA23,41935,367,9942089402445.654,136,58311.71Kraisitudomsook et al., 2024 [37]
(GCA_036937035.1)
Flagelloscypha sp.PMI 526NA39772,728,8451,132,4801846.849,103,48012.52Unpublished
(GCA_021399455.1)
Macrocystidia cucumisKM177596UK204,50846,746,85915,51159948.764,791,42710.33Unpublished
(GCA_001179725.1)
* Originally Tricholoma sp. MG77; ** Tricholoma sp. MG99; and *** Tricholoma furcatifolium.

3.2. Evolutionary Relationship of the Agaricales Strains

The evolutionary relationships of Agaricales species were established using 1056 ortholog clusters shared among 19 strains. The species tree indicates that the genus Tricholoma, excluding T. furcatifolium, is monophyletic (Figure 1A). Previous DNA analyses suggested that T. furcatifolium is not associated with the Tricholoma genus and proposed its classification into a new genus, Praearthromyces [39]. Additionally, a BLAST search of the TEF1A sequence of T. furcatifolium, presumed to be Praearthromyces, in the NR database showed 100% coverage and identity with Praearthromyces corneri. This indicates that T. furcatifolium in the NCBI genome dataset is P. corneri (Supplementary Figure S3). Tricholoma sp. MG77 appears closely related to T. flavovirens; however, the relatively long branch length and differences in GC content suggest species-level divergence. Genome-based ITS analysis revealed that the MG77 strain is identified as T. sinoportentosum, which belongs to the same section Tricholoma as T. flavovirens, while the MG99 strain is identified as T. albobrunneum, belonging to the section Genuina (Supplementary Figure S4). Henceforth, the newly identified name will be used in the manuscript.

3.3. Evolutionary Relationship of the Strains in Section Matsutake

T. matsutake strains formed a monophyletic group, appearing as a sister clade to T. bakamatsutake strains. Among the T. matsutake strains, two Korean strains, NIFoS 2001 and KMCC04578, formed a clade, with the Chinese strain MG2 as their outgroup and the Japanese strain NBRC 30605 and sample A each appearing as additional outgroups (Figure 1B). Although the origin of strain 945 is unclear [11], phylogenomic analysis suggests a high likelihood that it was collected from Japan. In addition, the genomes of T. matsutake strains were mapped to Sample A with 161 Mbp size chromosome-level assembly as reference (Figure 1B). The genomes of two Korean strains and strain 945 were almost fully mapped to the reference, but the strains KMCC04578 and 945 had additional genome sequences. The genomes of Chinese strain MG52 and Japanese strain NBRC 30605 were not fully mapped to the reference, but they had their own unique genome sequences. Overall, the syntenic relationship was preserved within T. matsutake strains and was partially conserved with T. bakamatsutake, which belongs to the same section Matsutake (Supplementary Figure S5). Additionally, the synteny was not conserved with Tricholoma species outside this section.

3.4. Genome Statics of the Selected Agaricales Strains

The genome size of Agaricales species shows a broad range from 35.3 Mbp in C. granulosa to 189.0 Mbp in T. matsutake KMCC04578 (Figure 1A, Table 1). Within the Tricholoma genus, all species except T. terreum, T. albobrunneum (MG99), and T. saponaceum possess genomes larger than 100 Mbp. Notably, species within the Matsutake section, including T. matsutake and T. bakamatsutake, have larger genomes than other Tricholoma species. Compared to the outgroup, the relatively large genome sizes of Tricholoma species can be attributed to an increased content of repetitive DNA.
The GC content of Agaricales varies across species (Figure 1A, Table 1). Nevertheless, strains within the same species exhibit nearly identical GC content. T. matsutake strains isolated from different regions have a GC content ranging from 45.3% to 45.5%, while T. bakamatsutake strains show a GC content of 43.9%. Regarding tRNA, excluding M. cucumis, which had poor assembly, the number of tRNA genes ranged from a minimum of 284 to a maximum of 577, and rRNA genes ranged from 0 to a maximum of 45 (Table 2). The variability in the number of tRNA and rRNA genes is attributed to these genes being recognized as known repeats [40], which can cause repetitive sequences to be inadequately represented depending on the sequencing methods used.
After repeat masking and re-annotation, the number of protein-coding genes in Tricholoma species ranged from 12,460 in T. hemisulphureum to 17,939 in T. sinoportentosum (MG77) (Figure 1A, Table 2). Previously, T. matsutake strains showed a wide range in gene counts; however, on average, they were found to have 13,809 genes coding for 16,221 proteins. Of these, only 65.6% have associated Gene Ontology terms, with the remaining genes presumed to encode proteins of unknown function.

3.5. Repeat Content within the Selected Strains

Analysis of the repeat content in selected species revealed that, except for T. saponaceum and T. albobrunneum (MG99), Tricholoma species possess more repeat elements compared to outgroup species (Figure 2A). The average repeat content in Tricholoma species was approximately 60%. Excluding the unclassified repeats, the most abundant repeat elements in Tricholoma genomes are retrotransposons, including LTRs and LINEs, with LTR Gypsy being the most prevalent and constituting a high proportion. DNA transposons are evenly distributed among Tricholoma species, except for the two species with smaller repeat content mentioned above, while LINEs appear increased only in T. matsutake strains. Additionally, T. bakamatsutake has a higher proportion of rolling circle elements compared to other Tricholoma species.
The distribution of repeat sequences within genomes according to the Kimura substitution level indicates the relative age of repeat element generation [41]. Flagelloscypha sp. has lower repeat content compared to Tricholoma species, but a high proportion of its repeats were generated long ago, indicating a relatively stable genomic environment (Figure 2B). In contrast, Tricholoma species show high proportions of repeat sequences at low Kimura substitution levels, suggesting a recent surge in repeat element activity in each species.

3.6. The Pan and Core Orthogroups of Selected Strains

The ortholog analysis of 325,167 proteins in 19 strains revealed 34,538 orthogroups, including 22,708 unassigned proteins (singletons) (Table 3). Among them, 11,788 Tricholoma genus-specific orthogroups were identified, with 10,027 orthogroups sharing with the outgroup (Figure 3A). The number of Tricholoma genus-specific orthogroups without singletons was 6359 and was divided into six sections. The division identified 137 Tricholoma core orthogroups, and their putative functions were fungal-type cell wall, pentose phosphate shunt, gamma-aminobutyric acid catabolic process, glutamate decarboxylation to succinate, and thiamine triphosphatase activity (Figure 3B). These functions are ubiquitous in fungi, and gamma-aminobutyric acid and succinate have been reported to be produced by T. matsutake [42].
The number of Matsutake section-specific orthogroups was 2803, and 120 orthogroups shared between T. bakamatsutake and T. matsutake were identified (Figure 3A). The proteins of the shared orthogroup are primarily active in the extracellular region and are involved in functions such as flavin adenine dinucleotide binding, pyranose dehydrogenase activity, chitin-binding and the catabolic process, choline dehydrogenase activity, glycine betaine biosynthesis, and the sphingoid catabolic process (Figure 3B). Molecules such as chitin, sphingoid, choline, and betaine related to the functions are major fungal components found in T. matsutake extracts [42].
Lastly, the 2245 species-specific orthogroups of T. matsutake were divided among six strains, yielding region- and strain-specific orthogroups and 195 T. matsutake species-specific core orthogroups (Figure 3A). The major functions of the core orthogroups included the negative regulation of intracellular signal transduction and TORC1 signaling, the D-galactonate catabolic process, glycogen (starch) synthase activity, the GATOR1 complex, the pyridoxine biosynthetic process, and quercetin 3-O-glucosyltransferase activity (Figure 3B). These functions highlight specialized metabolic and regulatory strategies of T. matsutake that are essential for its ecological niche and symbiotic lifestyle.

3.7. The Gene Family Expansion and Contraction in Tricholoma Evolution

The CAFE analysis revealed the numbers of expanded and contracted gene families in the evolution of Tricholoma and outgroup strains (Figure 4A). During the divergence of the Tricholoma genus, 457 gene families expanded and 1265 gene families contracted. At the divergence of the Matsutake section, 383 gene families expanded and 1056 contracted, while, at the divergence of T. matsutake, 592 gene families expanded and 239 contracted.
To understand the functional characteristics of the Tricholoma genome, the functions of gene families that expanded and contracted at the divergence points of the Tricholoma genus, the Matsutake section, and T. matsutake were analyzed (Figure 4B). For the expanded gene families, T. matsutake NIFoS 2001 was used as a reference to identify enriched functions. Gene families commonly expanded at all divergence points were associated with DNA metabolism and repair as well as medium-chain fatty acid metabolism, while each divergence point exhibited unique functions specific to that time (Figure 4B). In contrast, for the contracted gene families, the functions of gene families completely lost in the Tricholoma genus, Matsutake section, and T. matsutake were identified using P. corneri, T. flavovirens, and T. bakamatsutake SF-Tf05 as outgroups, respectively. No common functions were found among the contracted gene families (Figure 4A).
The functions of gene families expanded in the Tricholoma genus were primarily associated with amide, peptide, glycerol, and water transport, as well as the nuclear envelope and RNA polymerase. The functions of completely deleted gene families were related to the catabolic processes of plant components such as cellulose, pectin, and glycerolipids, along with various enzyme activities. In the Matsutake section, the functions of expanded and lost gene families were diverse. The main functions of the expanded gene families were related to transmembrane transport, hydrolase activity, and fungal-type vacuoles, while the lost gene families were associated with various degradative enzymes such as oxygenase, peptidase, and glucosidase, as well as metabolic processes. In T. matsutake, gene families related to iron ion binding and homeostasis, small RNA regulation, and the cell cycle were expanded. To investigate the genes related to iron binding, a major portion of the expanded gene families in T. matsutake, proteins involved in the four main iron uptake mechanisms in fungi (Fet4p, Fre1p, Fet3p, Ftr1p, Rbt5, and Sit1p) were identified through a BLAST search (E-value < 0.05) in both T. matsutake and T. bakamatsutake (Figure 5) [43]. No homologs of Fet4p, responsible for direct ferrous iron permeation, were found at an E-value below 0.05 (though they were detected at an E-value above 1). This corresponds to the gene families that are contracted in T. matsutake (Figure 4B). Homologs of Fre1p and Fet3p, associated with iron ion binding, were more abundant in T. matsutake compared to T. bakamatsutake. However, iron permease Ftr1p, heme receptor Rbt5, and siderophore uptake transporter Sit1p were more frequently identified in T. bakamatsutake.
On the other hand, the lost gene families were related to the quinolinate biosynthetic process, anthranilate metabolic process, and, consequently, the biosynthesis of NAD, all of which are associated with tryptophan metabolism (Figure 4B). KEGG analysis of the tryptophan metabolism pathway revealed that the pathway from tryptophan to quinolinate and NAD+ was preserved in all T. bakamatsutake strains, while, in T. matsutake strains, the absence of 3-hydroxyanthranilate 3,4-dioxygenase (3HAO; EC:1.13.11.6) resulted in an incomplete pathway (Figure 6, Supplementary Figure S6). Two homologous copies of 3HAO were found in T. bakamatsutake, while none were present in T. matsutake. However, BLAST results for 3HAO in T. bakamatsutake show the existence of a homolog with low conservation in T. matsutake (Figure 6). Additional PCR amplification of the 3HAO gene from T. bakamatsutake revealed that the gene is present in T. bakamatsutake strains but absent in T. matsutake strains (Supplementary Figure S7). Although tryptophan has been found in T. matsutake extracts [44], the absence of 3HAO suggests that T. matsutake may depend on its host to supply intermediates, particularly in the steps leading to 3-hydroxyanthranilate and quinolinate.

3.8. CAZyme, Secreted Protein, Effector, and Secondary Metabolite Gene Cluster Content in Selected Strains

The analysis of secreted proteins, CAZymes, and putative effectors in Tricholoma species revealed that, consistent with the gene family loss observed at the divergence of the Tricholoma genus, Tricholoma species had fewer predicted proteins compared to the outgroup (Figure 7). While the outgroup species possessed more than 1000 secreted proteins, Tricholoma species had approximately 500 to fewer than 1000 proteins. Notably, T. bakamatsutake had the smallest number of secreted proteins, ranging from about 532 to 555. The total number of CAZymes and secreted CAZymes in Tricholoma species ranged from 634 to 801 and 143 to 190, respectively, which was significantly lower than in the outgroup. The number of secreted effectors in Tricholoma species also ranged from 90 to 229, which was markedly lower than in the outgroup.
In the case of SMBGCs, most selected species here were found to possess one or more type 1 PKS and NI-siderophore clusters. The fungal-RiPP-like cluster was absent in the Matsutake section and presented only in the outgroup, while the indole cluster was found exclusively in the Flagelloscypha sp. and M. cucumis clade. On the other hand, NRPS-like and terpene clusters were widely present within Tricholoma and outgroup species. Notably, the (+)-δ-cadinol sesquiterpene synthase, which is found in many conifers and other fungi, was detected in all strains except for the T. matsutake Korean strains, Sample A, T. bakamatsutake SF-Tf05, and C. granulosa [45,46]. This may serve as evidence of horizontal gene transfer with hosts, and the diversity of SMBGCs across Tricholoma species suggests their potential to produce various secondary metabolites.

4. Discussion

The comparative genomic analysis of Tricholoma species, mainly focusing on T. matsutake, reveals critical insights into the evolutionary mechanisms shaping the genomic architecture and functional specialization of these fungi. The size of non-repetitive DNA, excluding repeats, in Tricholoma genomes averaged around 48 Mbp, similar to the average genome size of outgroup species and other Basidiomycetes [47]. The similarity in the size of non-repetitive DNA between Tricholoma and outgroup species suggests that the expansion of Tricholoma genomes is attributed to the proliferation of repetitive sequences rather than an increase in non-repetitive DNA. Especially the relatively large genome sizes in species like T. matsutake and T. bakamatsutake can be primarily attributed to the expansion of repetitive elements, particularly LTR Gypsy retrotransposons (Figure 2A). Previous studies have indicated that merely LTR elements constitute the most significant portion of the T. matsutake genome [10,11]. The recent activity of these elements, indicated by their low Kimura substitution levels, points to ongoing genomic evolution. In addition, the increased content of LINEs in T. matsutake strains and the higher proportion of rolling circle elements in T. bakamatsutake highlight specific evolutionary adaptions in these species (Figure 2B). For instance, the symbiotic relationship between T. matsutake and its hosts, which thrive in specific forest ecosystems [48], may have driven the selection of these genetic changes, promoting resilience to external stressors like soil nutrient depletion. These ongoing genomic changes not only reflect evolutionary adaptation but also may enhance the functional specialization of these fungi in their unique ecological niches.
The phylogenomic analyses revealed that the genus Tricholoma forms a monophyletic group, supporting previous taxonomic classifications (Figure 1A) [39]. The divergence of T. matsutake from its close relatives, such as T. bakamatsutake, is particularly intriguing as it suggests that specific evolutionary pressures have shaped the genome of T. matsutake differently from other members of the genus. The presence of region-specific clades within T. matsutake strains underscores the impact of geographic isolation on genetic divergence (Figure 1B). These findings provide a foundation for further exploration into how environmental factors and host interactions drive speciation and genetic diversity within this genus.
In this study, GO annotation was used to explore the functional enrichment of genes in T. matsutake and related species. Despite only 65.6% of the proteins in T. matsutake being assigned GO terms, significant functions were identified in core orthogroups and orthogroups that underwent expansion or contraction through enrichment tests (Figure 3 and Figure 4). Although the functions of most proteins within these orthogroups remain unknown, the proteins that have been assigned GO terms reflect the biological characteristics of T. matsutake and its related species at the genomic level. However, the analysis is limited by the incomplete annotation of proteins, as many remain functionally uncharacterized, which may restrict a comprehensive understanding of the full biological roles of these orthogroups.
The identification of core orthogroups within the section Matsutake, particularly those involved in extracellular functions and metabolic processes, reflects the specialized lifestyle of these fungi (Figure 3B). Functions such as chitin-binding, terpene synthesis, and sphingoid catabolic processes are crucial for interacting with their environment, breaking down organic matter, and synthesizing bioactive compounds. These processes support nutrient acquisition and cellular metabolism, enabling the fungi to adapt to various environmental conditions and maintain their symbiotic relationship with host plants. Additionally, the involvement of mitochondrial-nucleus signaling pathways and protein tagging mechanisms indicates the importance of cellular regulation and stress adaptation in these fungi’s survival and ecological roles.
The identification of T. matsutake-specific orthogroups related to TORC1 signaling, the D-galactonate catabolic process, starch synthase activity, the GATOR1 complex, the pyridoxine biosynthetic process, and quercetin 3-O-glucosyltransferase activity provides valuable insights into how this fungus may have adapted to modulate its growth and metabolism in response to the nutrient-limited environments that it typically inhabits (Figure 3B). These pathways suggest that T. matsutake has evolved specialized mechanisms to optimize nutrient acquisition and utilization under challenging ecological conditions. Quercetin, which has been identified in T. matsutake extracts [42], alongside essential vitamins such as thiamin, nicotinic acid, and pyridoxine, as well as starch, has been shown to enhance mycelial growth and promote amylase production, a critical factor for fruit body formation [7,49,50,51]. Additionally, TORC1 is a crucial regulator of cell growth and development in fungi, controlling nutrient absorption, metabolism, protein synthesis and degradation, morphology, and cytotoxicity, and the GATOR complex serves as an inhibitor of TORC1 [52]. The negative regulation of TORC1 likely influences the slow growth of Matsutake mushrooms. This negative regulation by the GATOR complex could lead to reduced TORC1 activity, thereby slowing down cellular processes and overall growth rates. This modulation allows the fungi to adapt to nutrient-limited environments, which is essential for their survival and symbiotic relationship with their host plants.
The gene family expansion and contraction analysis provides deeper insights into the adaptive strategies employed by T. matsutake throughout its evolution. The expansion of gene families related to iron ion homeostasis and small RNA regulation suggests that the fungus has developed sophisticated mechanisms to manage nutrient availability and regulate gene expression, which are essential for survival in its specific ecological niche (Figure 4B). Several studies have reported that iron ions play a significant role in promoting T. matsutake growth, with evidence showing that the soil that it inhibits is generally nutrient-poor but rich in iron ions [7,53,54,55]. In these environments, the ability to efficiently acquire, store, and regulate iron offers a significant competitive advantage. Enhanced iron uptake mechanisms, potentially through more efficient transporters, likely enable T. matsutake to outcompete other organisms for this essential nutrient (Figure 5). Moreover, the expansion of iron-related gene families may provide precise control over iron-dependent cellular processes, optimizing growth and metabolism in iron-rich environments.
The gene loss observed in T. matsutake related to tryptophan metabolism, particularly the absence of 3HAO, suggests a significant evolutionary shift toward a symbiotic lifestyle (Figure 4B and Figure 6). This gene loss may result in a metabolic dependency on its host or environment for intermediates like quinolinate and NAD+, classifying T. matsutake as an auxotroph for these compounds. The contraction of these gene families may reflect an evolutionary trade-off, where the fungus has streamlined its metabolism to conserve energy by outsourcing the biosynthesis of certain compounds to its host, enhancing its ecological fitness within a stable symbiotic relationship. However, as these findings are based solely on comparative genome analysis, experimental validation is necessary to confirm the functional implications of the gene loss and metabolic dependency in T. matsutake. Future studies should focus on investigating whether the fungus indeed acquires tryptophan intermediates from its host and how this metabolic reliance impacts the dynamics of its symbiotic relationship.
Compared to outgroup species, the reduction in the number of secreted proteins and CAZymes in T. matsutake and related species indicates their adaptation to a host-dependent lifestyle (Figure 7). The limited diversity of CAZymes suggests that these fungi have streamlined their enzymatic toolkit to suit the specific needs of their symbiotic relationship, possibly focusing on the breakdown of particular substrates provided by their host plants. These results reflect the host-dependent lifestyle of Tricholoma species, a phenomenon also observed in other ectomycorrhizae [56]. The analysis of SMBGCs reveals the potential for T. matsutake to produce unique bioactive compounds, which may play roles in host interaction, defense mechanisms, or ecological competition (Figure 7). The absence of certain SMBGCs, like the fungal-RiPP-like cluster in the section Matsutake, could indicate a divergence in secondary metabolism pathways that corresponds to the distinct ecological roles that these fungi play.
The comparative genomics analysis of Tricholoma species, with a focus on T. matsutake, reveals significant evolutionary and functional adaptations within this fungal group. Key findings include a higher proportion of repeat elements, special functions identified in species-specific orthogroups, the expansion and contraction of gene families, and a reduction in secreted proteins and CAZymes compared to outgroup species, reflecting their host-dependent lifestyle. These insights underscore the specialized ecological and metabolic traits of Tricholoma species, driven by their unique evolutionary pressures and symbiotic relationships.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jof10110746/s1. References [57,58] are cited in the Supplementary Materials.

Author Contributions

J.H.K.: software, formal analysis, investigation, resources, data curation, writing—original draft, visualization. E.-K.B.: validation, formal analysis, investigation, resources, data curation, writing—original draft, funding acquisition. Y.H.: software, formal analysis, data curation, visualization. B.C.: software, data curation. M.-J.K.: validation, resources. E.-J.P.: validation, resources, writing—review and editing, supervision, funding acquisition. K.-T.K.: conceptualization, methodology, software, validation, formal analysis, investigation, writing—original draft, visualization, supervision, project administration. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by a National Institute of Forest Science grant (Forest Science Research project number FG0603-2021-01-2024) to E.-K.B.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The genome sequences, repeat and gene annotation, coding and protein sequences, and functional annotation data were deposited to FigShare and available at https://doi.org/10.6084/m9.figshare.26869750.v1.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Reschke, K.; Popa, F.; Yang, Z.L.; Kost, G. Diversity and taxonomy of Tricholoma species from Yunnan, China, and notes on species from Europe and North America. Mycologia 2018, 110, 1081–1109. [Google Scholar] [CrossRef] [PubMed]
  2. Benazza-Bouregba, M.; Savoie, J.-M.; Fortas, Z.; Billette, C. A new record of Tricholoma caligatum (Tricholomataceae) from North Africa with a discussion of related species. Phytotaxa 2016, 282, 119–128. [Google Scholar] [CrossRef]
  3. Cooper, J.A.; Park, D. The fungal genus Tricholomopsis (Agaricales) in New Zealand, including Tricholomopsis scabra sp. nov. Phytotaxa 2016, 288, 69–76. [Google Scholar] [CrossRef]
  4. Singer, R. Monographs of south American Basidiomycetes, especially those of the east slope of the Andes and Brazil. IX. Tricholoma in Brazil and Argentina. Darwiniana 1966, 14, 19–35. [Google Scholar]
  5. Sánchez-García, M.; Matheny, P.B. Is the switch to an ectomycorrhizal state an evolutionary key innovation in mushroom-forming fungi? A case study in the Tricholomatineae (Agaricales). Evolution 2017, 71, 51–65. [Google Scholar] [CrossRef] [PubMed]
  6. Ding, X.X.; Xu, X.; Cui, Y.Y.; Kost, G.; Wang, P.M.; Yang, Z.L. A fifty-locus phylogenetic analysis provides deep insights into the phylogeny of Tricholoma (Tricholomataceae, Agaricales). Persoonia 2023, 50, 1–26. [Google Scholar] [CrossRef] [PubMed]
  7. Yamanaka, T.; Yamada, A.; Furukawa, H. Advances in the cultivation of the highly-prized ectomycorrhizal mushroom Tricholoma matsutake. Mycoscience 2020, 61, 49–57. [Google Scholar] [CrossRef]
  8. Aoki, W.; Bergius, N.; Kozlan, S.; Fukuzawa, F.; Okuda, H.; Murata, H.; Ishida, T.A.; Vaario, L.M.; Kobayashi, H.; Kalmiş, E.; et al. New findings on the fungal species Tricholoma matsutake from Ukraine, and revision of its taxonomy and biogeography based on multilocus phylogenetic analyses. Mycoscience 2022, 63, 197–214. [Google Scholar] [CrossRef]
  9. Kurokochi, H.; Tajima, N.; Sato, M.P.; Yoshitake, K.; Asakawa, S.; Isobe, S.; Shirasawa, K. Telomere-to-telomere genome assembly of matsutake (Tricholoma matsutake). DNA Res. 2023, 30, dsad006. [Google Scholar] [CrossRef]
  10. Min, B.; Yoon, H.; Park, J.; Oh, Y.L.; Kong, W.S.; Kim, J.G.; Choi, I.G. Unusual genome expansion and transcription suppression in ectomycorrhizal Tricholoma matsutake by insertions of transposable elements. PLoS ONE 2020, 15, e0227923. [Google Scholar] [CrossRef]
  11. Miyauchi, S.; Kiss, E.; Kuo, A.; Drula, E.; Kohler, A.; Sánchez-García, M.; Morin, E.; Andreopoulos, B.; Barry, K.W.; Bonito, G. Large-scale genome sequencing of mycorrhizal fungi provides insights into the early evolution of symbiotic traits. Nat. Commun. 2020, 11, 5125. [Google Scholar] [CrossRef] [PubMed]
  12. Kang, M.J.; Bae, E.K.; Park, E.J.; Ka, K.H.; Son, M.R.; Kim, K.T.; Lee, J.W. Draft genome sequence for the symbiotic pine mushroom Tricholoma matsutake. Microbiol. Resour. Announc. 2023, 12, e0127122. [Google Scholar] [CrossRef] [PubMed]
  13. Li, H.; Wu, S.; Ma, X.; Chen, W.; Zhang, J.; Duan, S.; Gao, Y.; Kui, L.; Huang, W.; Wu, P.; et al. The Genome Sequences of 90 Mushrooms. Sci. Rep. 2018, 8, 9982. [Google Scholar] [CrossRef] [PubMed]
  14. Simão, F.A.; Waterhouse, R.M.; Ioannidis, P.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 2015, 31, 3210–3212. [Google Scholar] [CrossRef]
  15. Gurevich, A.; Saveliev, V.; Vyahhi, N.; Tesler, G. QUAST: Quality assessment tool for genome assemblies. Bioinformatics 2013, 29, 1072–1075. [Google Scholar] [CrossRef] [PubMed]
  16. Flynn, J.M.; Hubley, R.; Goubert, C.; Rosen, J.; Clark, A.G.; Feschotte, C.; Smit, A.F. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. USA 2020, 117, 9451–9457. [Google Scholar] [CrossRef] [PubMed]
  17. Tarailo-Graovac, M.; Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinform. 2009, 25, 4.10.11–14.10.14. [Google Scholar] [CrossRef]
  18. Ou, S.; Jiang, N. LTR_retriever: A highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 2018, 176, 1410–1422. [Google Scholar] [CrossRef]
  19. Price, A.L.; Jones, N.C.; Pevzner, P.A. De novo identification of repeat families in large genomes. Bioinformatics 2005, 21, i351–i358. [Google Scholar] [CrossRef]
  20. Bao, Z.; Eddy, S.R. Automated de novo identification of repeat sequence families in sequenced genomes. Genome Res. 2002, 12, 1269–1276. [Google Scholar] [CrossRef]
  21. Gabriel, L.; Brůna, T.; Hoff, K.J.; Ebel, M.; Lomsadze, A.; Borodovsky, M.; Stanke, M. BRAKER3: Fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS, and TSEBRA. Genome Res. 2024, 34, 1–9. [Google Scholar] [CrossRef] [PubMed]
  22. Stanke, M.; Diekhans, M.; Baertsch, R.; Haussler, D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics 2008, 24, 637–644. [Google Scholar] [CrossRef] [PubMed]
  23. Brůna, T.; Lomsadze, A.; Borodovsky, M. GeneMark-EP+: Eukaryotic gene prediction with self-training in the space of genes and proteins. NAR Genom. Bioinform. 2020, 2, lqaa026. [Google Scholar] [CrossRef] [PubMed]
  24. Seemann, T. Barrnap 0.9: Rapid Ribosomal RNA Prediction. 2013. Available online: https://github.com/tseemann/barrnap (accessed on 31 May 2024).
  25. Chan, P.P.; Lin, B.Y.; Mak, A.J.; Lowe, T.M. tRNAscan-SE 2.0: Improved detection and functional classification of transfer RNA genes. Nucleic Acids Res. 2021, 49, 9077–9096. [Google Scholar] [CrossRef] [PubMed]
  26. Alonge, M.; Lebeigle, L.; Kirsche, M.; Jenike, K.; Ou, S.; Aganezov, S.; Wang, X.; Lippman, Z.B.; Schatz, M.C.; Soyk, S. Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing. Genome Biol. 2022, 23, 258. [Google Scholar] [CrossRef]
  27. Marcais, G.; Delcher, A.L.; Phillippy, A.M.; Coston, R.; Salzberg, S.L.; Zimin, A. MUMmer4: A fast and versatile genome alignment system. PLoS Comput. Biol. 2018, 14, e1005944. [Google Scholar] [CrossRef] [PubMed]
  28. Aramaki, T.; Blanc-Mathieu, R.; Endo, H.; Ohkubo, K.; Kanehisa, M.; Goto, S.; Ogata, H. KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics 2020, 36, 2251–2252. [Google Scholar] [CrossRef]
  29. Teufel, F.; Almagro Armenteros, J.J.; Johansen, A.R.; Gíslason, M.H.; Pihl, S.I.; Tsirigos, K.D.; Winther, O.; Brunak, S.; von Heijne, G.; Nielsen, H. SignalP 6.0 predicts all five types of signal peptides using protein language models. Nat. Biotechnol. 2022, 40, 1023–1025. [Google Scholar] [CrossRef]
  30. Zheng, J.; Ge, Q.; Yan, Y.; Zhang, X.; Huang, L.; Yin, Y. dbCAN3: Automated carbohydrate-active enzyme and substrate annotation. Nucleic Acids Res. 2023, 51, W115–W121. [Google Scholar] [CrossRef]
  31. Sperschneider, J.; Dodds, P.N. EffectorP 3.0: Prediction of apoplastic and cytoplasmic effectors in fungi and oomycetes. Mol. Plant Microbe Interact. 2022, 35, 146–156. [Google Scholar] [CrossRef]
  32. Blin, K.; Shaw, S.; Augustijn, H.E.; Reitz, Z.L.; Biermann, F.; Alanjary, M.; Fetter, A.; Terlouw, B.R.; Metcalf, W.W.; Helfrich, E.J. antiSMASH 7.0: New and improved predictions for detection, regulation, chemical structures and visualisation. Nucleic Acids Res. 2023, 51, W46–W50. [Google Scholar] [CrossRef] [PubMed]
  33. Emms, D.M.; Kelly, S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 2019, 20, 238. [Google Scholar] [CrossRef] [PubMed]
  34. Mendes, F.K.; Vanderpool, D.; Fulton, B.; Hahn, M.W. CAFE 5 models variation in evolutionary rates among gene families. Bioinformatics 2020, 36, 5516–5518. [Google Scholar] [CrossRef] [PubMed]
  35. Smith, S.A.; O’Meara, B.C. treePL: Divergence time estimation using penalized likelihood for large phylogenies. Bioinformatics 2012, 28, 2689–2690. [Google Scholar] [CrossRef] [PubMed]
  36. Kumar, S.; Suleski, M.; Craig, J.M.; Kasprowicz, A.E.; Sanderford, M.; Li, M.; Stecher, G.; Hedges, S.B. TimeTree 5: An expanded resource for species divergence times. Mol. Biol. Evol. 2022, 39, msac174. [Google Scholar] [CrossRef]
  37. Kraisitudomsook, N.; Ahrendt, S.; Riley, R.; LaButti, K.; Lipzen, A.; Daum, C.; Barry, K.; Grigoriev, I.V.; Rama, T.; Martin, F.; et al. On the origin of bird’s nest fungi: Phylogenomic analyses of fungi in the Nidulariaceae (Agaricales, Basidiomycota). Mol. Phylogenet. Evol. 2024, 193, 108010. [Google Scholar] [CrossRef]
  38. Ichida, H.; Murata, H.; Hatakeyama, S.; Yamada, A.; Ohta, A. Near-complete de novo assembly of Tricholoma bakamatsutake chromosomes revealed the structural divergence and differentiation of Tricholoma genomes. G3 Genes Genomes Genet. 2023, 13, jkad198. [Google Scholar] [CrossRef]
  39. van de Peppel, L.J.; Aime, M.C.; Læssøe, T.; Pedersen, O.S.; Coimbra, V.R.; Kuyper, T.W.; Stubbe, D.; Aanen, D.K.; Baroni, T.J. Four new genera and six new species of lyophylloid agarics (Agaricales, Basidiomycota) from three different continents. Mycol. Prog. 2022, 21, 85. [Google Scholar] [CrossRef]
  40. Frenkel, F.E.; Chaley, M.B.; Korotkov, E.V.; Skryabin, K.G. Evolution of tRNA-like sequences and genome variability. Gene 2004, 335, 57–71. [Google Scholar] [CrossRef]
  41. Kimura, M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 1980, 16, 111–120. [Google Scholar] [CrossRef]
  42. Hu, L.; Weng, J.; Wang, Z.; Huang, C.; Zhang, L. Effect and mechanism of Tricholoma matsutake extract combined with bakuchiol and ergothioneine on UVB-induced skin aging. J. Cosmet. Dermatol. 2024. [Google Scholar] [CrossRef] [PubMed]
  43. Rajapitamahuni, S.; Kang, B.R.; Lee, T.K. Exploring the roles of arbuscular mycorrhizal fungi in plant–iron homeostasis. Agriculture 2023, 13, 1918. [Google Scholar] [CrossRef]
  44. Ying, X.; Ma, J.; Liang, Q.; Wang, Y.; Bai, G.; Luo, G. Identification and analysis of the constituents in an aqueous extract of Tricholoma matsutake by HPLC coupled with diode array detection/electrospray ionization mass spectrometry. J. Food Sci. 2013, 78, C1173–C1182. [Google Scholar] [CrossRef] [PubMed]
  45. Krauze-Baranowska, M.; Mardarowicz, M.; Wiwart, M.; Pobłocka, L.; Dynowska, M. Antifungal activity of the essential oils from some species of the genus Pinus. Z. Naturforsch. C 2002, 57, 478–482. [Google Scholar] [CrossRef] [PubMed]
  46. Ringel, M.; Dimos, N.; Himpich, S.; Haack, M.; Huber, C.; Eisenreich, W.; Schenk, G.; Loll, B.; Brück, T. Biotechnological potential and initial characterization of two novel sesquiterpene synthases from Basidiomycota Coniophora puteana for heterologous production of δ-cadinol. Microb. Cell Fact. 2022, 21, 64. [Google Scholar] [CrossRef]
  47. Mohanta, T.K.; Bae, H. The diversity of fungal genome. Biol. Proced. Online 2015, 17, 1–9. [Google Scholar] [CrossRef] [PubMed]
  48. Vaario, L.M.; Yang, X.; Yamada, A. Biogeography of the Japanese gourmet fungus, Tricholoma matsutake: A review of the distribution and functional ecology of Matsutake. In Biogeography of Mycorrhizal Symbiosis; Springer: Cham, Switzerland, 2017; pp. 319–344. [Google Scholar]
  49. Lee, C.-Y.; Hong, O.-P.; Jung, M.-J.; Han, Y.-H. Effect of carbon sources and vitamins on mycelial growth of Tricholoma matsutake DGUM 26001. Kor. J. Mycol. 1997, 25, 226–232. [Google Scholar]
  50. Kusuda, M.; Ueda, M.; Konishi, Y.; Yamanaka, K.; Terashita, T.; Miyatake, K. Effects of carbohydrate substrate on the vegetative mycelial growth of an ectomycorrhizal mushroom, Tricholoma matsutake, isolated from Quercus. Mycoscience 2007, 48, 358–364. [Google Scholar] [CrossRef]
  51. Kusuda, M.; Nagai, M.; Hur, T.C.; Terashita, T.; Ueda, M. Purification and some properties of α-amylase from an ectomycorrhizal fungus, Tricholoma matsutake. Mycoscience 2003, 44, 311–317. [Google Scholar] [CrossRef]
  52. Wang, Y.; Zheng, X.; Li, G.; Wang, X. TORC1 signaling in Fungi: From yeasts to filamentous Fungi. Microorganisms 2023, 11, 218. [Google Scholar] [CrossRef]
  53. Chung, D.-Y.; Lee, K.-S.; Lee, J.-S.; Youn, Y.-N. Characteristics of a forest soil on pine mushroom habitat located in Ponghwa, Kyungbuk and Gansung, Kangwon. 1. Physical and chemical properties of O horizon and surface soil. Korean J. Soil Sci. Fert. 2008, 41, 206–213. [Google Scholar]
  54. Li, T.; Zhang, J.; Shen, T.; Shi, Y.D.; Yang, S.B.; Zhang, T.; Li, J.Q.; Wang, Y.Z.; Liu, H.G. Mineral element content in prized matsutake mushroom (Tricholoma matsutake) collected in China. Chem. Pap. 2013, 67, 672–676. [Google Scholar] [CrossRef]
  55. Li, Q.; Li, S.H.; Huang, W.L.; Liu, C.Y.; Xiong, C.; Li, X.L.; Zheng, L.Y. Mineral constituents of a prized edible mushroom (Tricholoma matsutake) and soils beneath the fruiting bodies from the production areas across China. J. Mt. Sci. 2016, 13, 2046–2052. [Google Scholar] [CrossRef]
  56. Looney, B.; Miyauchi, S.; Morin, E.; Drula, E.; Courty, P.E.; Kohler, A.; Kuo, A.; LaButti, K.; Pangilinan, J.; Lipzen, A. Evolutionary transition to the ectomycorrhizal habit in the genomes of a hyperdiverse lineage of mushroom-forming fungi. New Phytol. 2022, 233, 2294–2309. [Google Scholar] [CrossRef] [PubMed]
  57. Jeon, S.-M.; Ka, K.-H. Korean Tricholoma matsutake strains that promote mycorrhization and growth of pinus densiflora seedling. Kor. J. Mycol. 2001, 44, 155–165. [Google Scholar] [CrossRef]
  58. Madeira, F.; Madhusoodanan, N.; Lee, J.; Eusebi, A.; Niewielska, A.; Tivey, A.; Lopez, R.; Butcher, S. The EMBL-EBI Job Dispatcher sequence analysis tools framework in 2024. Nucleic Acids Res. 2024, 52, W521–W525. [Google Scholar] [CrossRef]
Figure 1. Phylogenomic tree of Agaricales strains along with genome size, GC content, and number of annotated genes. (A) The species tree was constructed using concatenated orthologous protein sequences conserved among the selected species/strains, employing the maximum likelihood method. The sections of the Tricholoma genus are based on findings from a previous study [6]. T. sinoportentosum *, T. albobrunneum **, and P. corneri *** are originally Tricholoma sp. MG77, MG99, and T. furcatifolium, respectively. (B) An expanded tree focusing on T. matsutake strains is presented, showing bootstrap values and the proportion of genomes that align with the reference strain Sample A. The bar graph depicts the size of matched (black) and unmatched (gray) sequences.
Figure 1. Phylogenomic tree of Agaricales strains along with genome size, GC content, and number of annotated genes. (A) The species tree was constructed using concatenated orthologous protein sequences conserved among the selected species/strains, employing the maximum likelihood method. The sections of the Tricholoma genus are based on findings from a previous study [6]. T. sinoportentosum *, T. albobrunneum **, and P. corneri *** are originally Tricholoma sp. MG77, MG99, and T. furcatifolium, respectively. (B) An expanded tree focusing on T. matsutake strains is presented, showing bootstrap values and the proportion of genomes that align with the reference strain Sample A. The bar graph depicts the size of matched (black) and unmatched (gray) sequences.
Jof 10 00746 g001
Figure 2. Repeat content in Tricholoma and outgroup strains and their evolution. (A) The repeat content of Tricholoma and outgroup species/strains is shown, categorized by the number of repeats (left), the total occupied size (middle), and the occupied proportion in the genomes (right). (B) The asterisks in (A) indicate the strains that are further analyzed for Kimura substitution levels. The non-repeatitive DNA is depicted in black, while different types of repeats are color-coded.
Figure 2. Repeat content in Tricholoma and outgroup strains and their evolution. (A) The repeat content of Tricholoma and outgroup species/strains is shown, categorized by the number of repeats (left), the total occupied size (middle), and the occupied proportion in the genomes (right). (B) The asterisks in (A) indicate the strains that are further analyzed for Kimura substitution levels. The non-repeatitive DNA is depicted in black, while different types of repeats are color-coded.
Jof 10 00746 g002
Figure 3. Pan/core genome of Tricholoma and outrgroup strains and functional analysis of core orthogroups. (A) Venn diagrams illustrate the numbers of orthogroups that are shared among or unique to the evolutionary clades of Tricholoma and outgroup strains. Numbers in parentheses indicate orthogroups excluding singletons. Asterisks denote the number of unique core orthogroups in the genus Tricholoma *, section Matsutake **, and T. matsutake species *** clades. (B) The bar graphs shows the enriched functions within each unique core orthogroup compared to the total proteome (Fisher’s exact test with p-value < 0.05). Black and gray bars represent the percentage of proteins in the core orthogroup and the total proteome, respectively.
Figure 3. Pan/core genome of Tricholoma and outrgroup strains and functional analysis of core orthogroups. (A) Venn diagrams illustrate the numbers of orthogroups that are shared among or unique to the evolutionary clades of Tricholoma and outgroup strains. Numbers in parentheses indicate orthogroups excluding singletons. Asterisks denote the number of unique core orthogroups in the genus Tricholoma *, section Matsutake **, and T. matsutake species *** clades. (B) The bar graphs shows the enriched functions within each unique core orthogroup compared to the total proteome (Fisher’s exact test with p-value < 0.05). Black and gray bars represent the percentage of proteins in the core orthogroup and the total proteome, respectively.
Jof 10 00746 g003
Figure 4. Gene family expansion and contraction in Tricholoma and outgroup strains and functional analysis at key evolutionary divergence points. (A) The phylogenomic tree depicts the dynamics of gene family expansion and contraction across various evolutionary lineages within selected Agaricales. Numbers in green indicate expanded gene families, while numbers in red indicate contracted gene families. The common ancestor of the genus Tricholoma is marked as *, the common ancestor of section Matsutake is marked as **, and the common ancestor of T. matsutake species is marked as ***. Additionally, a functional analysis of the expanded and lost gene families in these groups was conducted. (B) Bar graphs present the functions of gene families that have expanded (green bars) or are completely lost (red bars) during the evolutionary divergence of the Tricholoma genus (left), the section Matsutake (middle), and the species T. matsutake (right). Each bar represents a specific gene function, with the length of the bar corresponding to the relative abundance of that function in the expanded or completely lost gene families compared to the total proteins (gray).
Figure 4. Gene family expansion and contraction in Tricholoma and outgroup strains and functional analysis at key evolutionary divergence points. (A) The phylogenomic tree depicts the dynamics of gene family expansion and contraction across various evolutionary lineages within selected Agaricales. Numbers in green indicate expanded gene families, while numbers in red indicate contracted gene families. The common ancestor of the genus Tricholoma is marked as *, the common ancestor of section Matsutake is marked as **, and the common ancestor of T. matsutake species is marked as ***. Additionally, a functional analysis of the expanded and lost gene families in these groups was conducted. (B) Bar graphs present the functions of gene families that have expanded (green bars) or are completely lost (red bars) during the evolutionary divergence of the Tricholoma genus (left), the section Matsutake (middle), and the species T. matsutake (right). Each bar represents a specific gene function, with the length of the bar corresponding to the relative abundance of that function in the expanded or completely lost gene families compared to the total proteins (gray).
Jof 10 00746 g004
Figure 5. The number of protein homologs involved in the four major iron uptake mechanisms in fungi in T. matsutake and T. bakamatsutake. Rajapitamahuni et al. investigated the proteins involved in fungal iron uptake mechanisms [43], and the bar graphs display the number of homologs for each protein identified in T. matsutake (Tmat, black bars) and T. bakamatsutake (Tbak, gray bars) through a BLAST search. E-value cutoffs are shown below the protein names.
Figure 5. The number of protein homologs involved in the four major iron uptake mechanisms in fungi in T. matsutake and T. bakamatsutake. Rajapitamahuni et al. investigated the proteins involved in fungal iron uptake mechanisms [43], and the bar graphs display the number of homologs for each protein identified in T. matsutake (Tmat, black bars) and T. bakamatsutake (Tbak, gray bars) through a BLAST search. E-value cutoffs are shown below the protein names.
Jof 10 00746 g005
Figure 6. Tryptophan to nicotinate D-ribonucleotide pathway in T. matsutake and T. bakamatsutake. The pathway from tryptophan to nicotinate D-ribonucleotide in T. matsutake and T. bakamatsutake is depicted based on the tryptophan, nicotinate, and nicotinamide metabolism pathway from KEGG (Supplementary Figure S6). Each metabolic step is annotated with its corresponding EC number, and genes present in T. matsutake NIFoS 2001 (Tmat) and T. bakamatsutake Sample A (Tbak) are highlighted in green and red, respectively. Next to each EC number, ortholog sequence alignments between Tmat and Tbak are shown, along with their sequence conservation. EC1.13.11.52 has four paralogs in both Tmat and Tbak, while EC1.13.11.6 has only two paralogs in Tbak, with none in Tmat. The most similar sequence in Tmat was identified through a BLAST search and aligned.
Figure 6. Tryptophan to nicotinate D-ribonucleotide pathway in T. matsutake and T. bakamatsutake. The pathway from tryptophan to nicotinate D-ribonucleotide in T. matsutake and T. bakamatsutake is depicted based on the tryptophan, nicotinate, and nicotinamide metabolism pathway from KEGG (Supplementary Figure S6). Each metabolic step is annotated with its corresponding EC number, and genes present in T. matsutake NIFoS 2001 (Tmat) and T. bakamatsutake Sample A (Tbak) are highlighted in green and red, respectively. Next to each EC number, ortholog sequence alignments between Tmat and Tbak are shown, along with their sequence conservation. EC1.13.11.52 has four paralogs in both Tmat and Tbak, while EC1.13.11.6 has only two paralogs in Tbak, with none in Tmat. The most similar sequence in Tmat was identified through a BLAST search and aligned.
Jof 10 00746 g006
Figure 7. Carbohydrate-active enzymes, secreted proteins, and secondary metabolite biosynthetic gene clusters of the Agaricales strains. The bar graphs provide a comparative analysis of various functional categories across different Agaricales strains. The first panel shows the number of secreted proteins identified in each strain. The second and third panels display the number of CAZymes and secreted CAZymes, categorized into six classes: glycoside hydrolases (GHs), glycosyl transferases (GTs), polysaccharide lyases (PLs), carbohydrate esterases (CEs), auxiliary activities (AAs), and carbohydrate-binding modules (CBMs). The fourth panel shows the distribution of secreted effectors, classified based on their predicted localization: apoplast, cytoplasm, or both (Apo/Cyto). The last panel presents the diversity of SMBGCs across the species, including type I polyketide synthases (T1PKSs), non-ribosomal peptide synthetases (NRPSs), NRPS-like, terpene synthases, NI-siderophores, fungal-RiPP-like clusters, and indole clusters.
Figure 7. Carbohydrate-active enzymes, secreted proteins, and secondary metabolite biosynthetic gene clusters of the Agaricales strains. The bar graphs provide a comparative analysis of various functional categories across different Agaricales strains. The first panel shows the number of secreted proteins identified in each strain. The second and third panels display the number of CAZymes and secreted CAZymes, categorized into six classes: glycoside hydrolases (GHs), glycosyl transferases (GTs), polysaccharide lyases (PLs), carbohydrate esterases (CEs), auxiliary activities (AAs), and carbohydrate-binding modules (CBMs). The fourth panel shows the distribution of secreted effectors, classified based on their predicted localization: apoplast, cytoplasm, or both (Apo/Cyto). The last panel presents the diversity of SMBGCs across the species, including type I polyketide synthases (T1PKSs), non-ribosomal peptide synthetases (NRPSs), NRPS-like, terpene synthases, NI-siderophores, fungal-RiPP-like clusters, and indole clusters.
Jof 10 00746 g007
Table 2. The annotation result of Tricholoma and outgroup strains used in this study.
Table 2. The annotation result of Tricholoma and outgroup strains used in this study.
SpeciesStraintRNAsrRNAsProtein-Coding
Genes
ProteinsBLAST
(%)
GO
(%)
With EC no.
(%)
InterPro
(%)
Tricholoma matsutakeNIFoS 20012772514,65717,25716,410 (95.1)11,086 (64.2)3717 (21.5)10,890 (63.1)
Tricholoma matsutakeKMCC04578275411,44913,36412,862 (96.2)9318 (69.7)3444 (25.8)9230 (69.1)
Tricholoma matsutakeMG523001813,99216,43715,636 (95.1)10,739 (65.3)3774 (23.0)10,491 (63.8)
Tricholoma matsutakeNBRC 30605304913,93116,35615,559 (95.1)10,780 (65.9)3893 (23.8)10,325 (63.1)
Tricholoma matsutake945281314,44217,04816,292 (95.6)11,138 (65.3)3818 (22.4)11,028 (64.7)
Tricholoma matsutakeSample A2751114,38516,86316,003 (94.9)10,820 (64.2)3639 (21.6)10,774 (63.9)
Tricholoma bakamatsutakeSF-Tf0528731969711,07110,646 (96.2)8464 (76.5)3145 (28.4)8486 (76.7)
Tricholoma bakamatsutakeMG5128345993111,63511,254 (96.7)8842 (76.0)3343 (28.7)8758 (75.3)
Tricholoma flavovirensMG32304911,97213,64312,747 (93.4)9523 (69.8)3424 (25.1)9481 (69.5)
Tricholoma sinoportentosumMG775771317,93920,38218,748 (92.0)13,799 (67.7)5037 (24.7)12,998 (63.8)
Tricholoma saponaceumMG146616116,77719,19117,956 (93.6)13,533 (70.5)4827 (25.2)13,564 (70.7)
Tricholoma albobrunneumMG995162215,59717,58716,261 (92.5)12,478 (71.0)4542 (25.8)12,271 (69.8)
Tricholoma hemisulphureumKDTOL00252305112,46014,10313,017 (92.3)9819 (69.6)3523 (25.0)10,089 (71.5)
Tricholoma terreumMG454242015,81517,96216,569 (92.2)12,503 (69.6)4286 (23.9)11,972 (66.7)
Praearthromyces corneriD33524015,84618,80218,339 (97.5)12,342 (65.6)4662 (24.8)12,961 (78.9)
Rugosomyces carneusD47340016,54918,64016,597 (89.0)13,393 (71.9)5428 (29.1)13,092 (70.2)
Cystodermella granulosaDBG-29008248617,65120,23618,717 (92.5)15,841 (78.3)6974 (34.5)15,095 (74.6)
Flagelloscypha sp.PMI 5264101426,59931,41031,146 (99.2)18,012 (57.3)4205 (13.4)22,038 (70.2)
Macrocystidia cucumisKM177596- *2011,80613,18012,344 (93.7)10,079 (76.5)3441 (26.1)10,732 (81.4)
* The genome is too fragmented to allow for reliable tRNA prediction.
Table 3. The orthogroup analysis result.
Table 3. The orthogroup analysis result.
Total number of species/strains19
Total number of proteins325,167
Total number of orthogroups34,538
Number of orthogroups with homologous proteins22,708
Number of orthogroups with single protein (unassigned proteins)11,830
Number of orthogroups with all species present4131
Number of single-copy orthogroups204
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kim, J.H.; Bae, E.-K.; Hue, Y.; Choi, B.; Kang, M.-J.; Park, E.-J.; Kim, K.-T. Comparative Genomics Reveals Species-Specific Genes and Symbiotic Adaptations in Tricholoma matsutake. J. Fungi 2024, 10, 746. https://doi.org/10.3390/jof10110746

AMA Style

Kim JH, Bae E-K, Hue Y, Choi B, Kang M-J, Park E-J, Kim K-T. Comparative Genomics Reveals Species-Specific Genes and Symbiotic Adaptations in Tricholoma matsutake. Journal of Fungi. 2024; 10(11):746. https://doi.org/10.3390/jof10110746

Chicago/Turabian Style

Kim, Jea Hyeoung, Eun-Kyung Bae, Yoeguang Hue, Byungheon Choi, Min-Jeong Kang, Eung-Jun Park, and Ki-Tae Kim. 2024. "Comparative Genomics Reveals Species-Specific Genes and Symbiotic Adaptations in Tricholoma matsutake" Journal of Fungi 10, no. 11: 746. https://doi.org/10.3390/jof10110746

APA Style

Kim, J. H., Bae, E. -K., Hue, Y., Choi, B., Kang, M. -J., Park, E. -J., & Kim, K. -T. (2024). Comparative Genomics Reveals Species-Specific Genes and Symbiotic Adaptations in Tricholoma matsutake. Journal of Fungi, 10(11), 746. https://doi.org/10.3390/jof10110746

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop