Next Article in Journal
Driving Chromatin Organisation through N6-methyladenosine Modification of RNA: What Do We Know and What Lies Ahead?
Previous Article in Journal
FDDM1 and FDDM2, Two SGS3-like Proteins, Function as a Complex to Affect DNA Methylation in Arabidopsis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genomic Analyses of the Fungus Paraconiothyrium sp. Isolated from the Chinese White Wax Scale Insect Reveals Its Symbiotic Character

1
Institute of Highland Forest Science, Chinese Academy of Forestry, Kunming 650224, China
2
Key Laboratory of Breeding and Utilization of Resource Insects of National Forestry and Grassland Administration, Kunming 650224, China
3
College of Agriculture and Life Sciences, Kunming University, Kunming 650214, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2022, 13(2), 338; https://doi.org/10.3390/genes13020338
Submission received: 19 January 2022 / Revised: 9 February 2022 / Accepted: 10 February 2022 / Published: 12 February 2022
(This article belongs to the Section Microbial Genetics and Genomics)

Abstract

:
The Chinese white wax scale, Ericerus pela, is an insect native to China. It harbors a variety of microbes. The Paraconiothyrium fungus was isolated from E. pela and genome sequenced in this study. A fungal cytotoxicity assay was performed on the Aedes albopictus cell line C6/36. The assembled Paraconiothyrium sp. genome was 39.55 Mb and consisted of 14,174 genes. The coding sequences accounted for 50.75% of the entire genome. Functional pathway analyses showed that Paraconiothyrium sp. possesses complete pathways for the biosynthesis of 20 amino acids, 10 of which E. pela lacks. It also had complementary genes in the vitamin B groups synthesis pathways. Secondary metabolism prediction showed many gene clusters that produce polyketide. Additionally, a large number of genes associated with ‘reduced virulence’ in the genome were annotated with the Pathogen–Host Interaction database. A total of 651 genes encoding carbohydrate-active enzymes were predicted to be mostly involved in plant polysaccharide degradation. Pan-specific genomic analyses showed that genes unique to Paraconiothyrium sp. were enriched in the pathways related to amino acid metabolism and secondary metabolism. GO annotation analysis yielded similar results. The top COG categories were ‘carbohydrate transport and metabolism’, ‘lipid transport and metabolism’, and ‘secondary metabolite biosynthesis, transport and catabolism’. Phylogenetic analyses based on gene family and pan genes showed that Paraconiothyrium sp is clustered together with species from the Didymosphaeriaceae family. A multi-locus sequence analysis showed that it converged with the same branch as P. brasiliense and they formed one group with fungi from the Paraconiothyrium genus. To validate the in vitro toxicity of Paraconiothyrium sp., a cytotoxicity assay was performed. The results showed that medium-cultured Paraconiothyrium sp. had no harmful effect on cell viability. No toxins were secreted by the fungus during growth. Our results imply that Paraconiothyrium sp. may establish a symbiotic relationship with the host to supply complementary nutrition to E. pela.

1. Introduction

The Chinese white wax scale insect (Ericerus pela) is well known for its wax production. White wax secreted by males has high economic value and is widely used in machinery, food, medicine, and other fields [1,2,3]. The white wax is produced by the second-instar male larvae. The males live from about May to August. The lifespan of females is about one year and they produce eggs in the summer. Males and females of E. pela parasitize the branches of the Chinese ash tree (Fraxinus chinensis) and glossy privet (Ligustrum lucidum) for almost their entire lifespan and remain immobile due to the degeneration of their appendages [4,5,6]. E. pela display a varied relationship with microorganisms as a result of their sedentary lifestyle [7]. They may also inherit a variety of microorganisms from their ground-dwelling ancestors [8].
In previous studies, we measured the diversity of microorganisms in E. pela and found that they house a variety of microbes; we identified 20 phyla from 128 bacterial families and 4 phyla from 48 fungal families [7]. These microorganisms may be transmitted vertically or obtained from the diet or the environment. The bacteria are mainly concentrated in three families: Comamonadaceae, Streptococcaceae, and Rickettsiaceae. The fungi are less abundant and diverse than the bacteria and are mainly concentrated in the Hypocreales, Pleosporales, and Capnodiales orders Studies on scale insect symbiotic microorganisms have primarily focused on bacterial symbionts [9,10]. However, it is unclear whether these fungi play a role in the long parasitic lifespan of E. pela.
With the development of high-through sequencing technology, genomic sequencing has provided a robust tool to explore the genetic aspects of the relationships between fungi and insects [11,12,13]. To date, 3413 fungi genomes have been published in GenBank. More than 500 fungi of the three orders mentioned above have been genome sequenced. These data allow for the reliable taxonomic classification of the fungi that E. pela host and will also facilitate comparative genomic analyses to reveal the evolution and gene functions of these fungi. Genome sequences also prove information on the complementariness of genes and pathways related to amino acid biosynthesis with the host. Because plant sap supplies unbalanced nutrition to insects, piercing–sucking insects must acquire essential nutrients through nutrition partners [14,15]. In addition, the genome sequences of a large number of insect symbionts have revealed that all of the symbionts have smaller genomes and a higher AT content than free-living relative species, as well as fast evolutionary rates according to their coding genes [16,17].
To understand the genetic information related to the function of the fungi, we cultured fungi from E. pela using homogenized eggs in this study. The isolated fungus was genome sequenced, and its toxicity to cells was analyzed. These studies will provide an important foundation for studying the genome character and understanding the relationship between E. pela and fungi.

2. Materials and Methods

2.1. Fungal Culture and Genomic DNA Isolation

E. pela eggs were collected and quickly washed with 75% alcohol. The sterilized eggs were homogenized in a 1.5 mL centrifuge tube (Axygen, San Francisco, CA, USA). The homogenate was diluted with sterile water and 50 µL of the diluted homogenate was inoculated onto PDA medium. The medium was incubated at a constant temperature of 30 °C. The culture status was periodically observed; when mycelium growth was observed, the marginal mycelium was promptly transferred to new PDA medium for purification. This process was repeated several times until a pure culture strain was obtained. The strain was used for sequencing.
Genomic DNA was isolated with the BGI Customized Magnetic Plant Genomic DNA Kit (Tiangen Biotech, Beijing, China) according to the manufacturer’s recommended protocol. The DNA concentration was determined using a Qubit fluorometer (Thermo Fisher Scientific, Waltham, MA, USA) and DNA quality was measured using a Nanodrop 2000 spectrophotometer (Thermo Fisher Scientific, Carlsbad, CA, USA). Then, DNA integrity was evaluated using 0.5% agarose gel electrophoresis.
Universal ITS primers [18] were used to amplify the DNA fragments and the amplification results were sequenced and aligned with the ITS sequences in NCBI using BLASTN.

2.2. Library Construction and Sequencing

In this study, whole-genome sequencing was performed based on NGS (BGISEQ platform) and SMRT (PacBio Sequel system).
For BGISEQ sequencing, genomic DNA was fragmented using a g-TUBE device (Covaris, MA, USA). Fragments of 300–400 bp were selected with magnetic beads. After DNA purification, dsDNA end repair, 3′ adenylation, and adapter ligation, the templates were amplified by PCR. Then, the amplification products were again selected with magnetic beads. With a splint oligo sequence and ligase, single-stranded circular DNA (ssCir-DNA) was denatured and circularized. With digestion of the linear DNA and qualification using Agilent 2100, a final library containing an insert with a paired-end sequencing length of 150 bp was formed.
The DNBSEQ library was sequenced with the BGISEQ platform through rolling circle amplification to transform ssCir-DNA into DNA nanoballs (DNBs)—nanospheres containing more than 300 copies. Through high-density DNA nanochip technology, the obtained DNBs were added to the holes of the chip mesh with high-density DNA nanochip technology and sequenced by combined probe anchoring polymerization (CPA).
Approximately one microgram of DNA from the strain was used for library construction. The PacBio library was constructed using the SMRTbell Express Template Preparation Kit 2.0 (Pacific Biosciences, Menlo Park, CA, USA). First, DNA samples were sheared using a g-TUBE device and 10–15 Kb fragments were selected using the BluePippin size selection system (Sage Science, Beverly, MA, USA). After the single-stranded overhang was removed, DNA damage and DNA ends were repaired, A-tailing was conducted, and inserts were ligated to adapters (blunt hairpins) to form the SMRTbell library. Ligation products were purified using AMPure PB beads and then pooled. Selection was conducted and inserts less than 10 kb in size were discarded using the BluePippin size selection system (Sage Science, Beverly, MA, USA). Finally, the library was detected with a Qubit DNA HS Assay Kit, and the insert was checked with an Agilent HS DNA Kit (Agilent Technologies, Santa Clara, CA, USA). Sequencing of the SMRTbell library was performed using the Sequel (PacBio) Sequencing Kit 2.0.1 The SMRT bell corresponds to 1 million ZMWs. With the Sequel Binding Kit 3.0 (Pacific Biosciences, Menlo Park, CA, USA), ZMWs loaded with one template and one DNA sample were prepared for sequencing. Sequence information was analyzed through fluorescent signals linked to dNTPs. Subreads obtained from sequencing also contained redundancy, which was removed in the subsequent procedure.

2.3. Genome Assembly

Data generated from BGISEQ were filtered to obtain clean data. Reads with a certain proportion of low-quality bases and Ns were filtered, and contaminating duplicates were removed.
For PacBio sequencing, after removal of the adapters several subreads were generated from the same polymerase reads in one ZMW, and subreads less than 1000 bp were trimmed. Then, these subreads were transformed into a circular consensus sequence with enforced consistency (Table S1).
Before genome assembly, the genome size was estimated by K-mer analysis based on BGISEQ data. In the study, we assigned K a value of 15 (Table S2).
Genome assembly and long-read polishing were performed after data polishing. A Falcon (v0.3.0) was used for de novo assembly based on PacBio long reads. Due to the high error rate, preassembly errors still needed to be corrected. First, corrected subreads were obtained through Pbdagcon (https://github.com/PacificBiosciences/pbdagcon, accessed on 20 March 2020) and Falcon (v0.3.0) after assembly with several software programs, such as Celera (v8.3) and Falcon, and the best assembly result remained. Single-base errors were corrected using GATK (v3.8), and the last gaps were filled with pbjelly2 (v15.8.24) after reads with a long insert size were assembled to transform contigs into scaffolds using SSPACE_Basic (v2.0). To further improve assembly accuracy, polishing steps were executed. Initial polishing, which is available for only PacBio long reads, was carried out. Then, the high-accuracy PacBio-corrected assembly was obtained with the help of BGISEQ short reads.
With rudimentary sequencing and subsequent fragment assembly, it was feasible to carry out genome analysis under general conditions. Genome size, gene number, genome characteristics, and assembly statistics, including contig/scaffold (in both number and length), N50, the GC content, and the gap number, were also preserved.

2.4. Repetitive Sequences

The assembly results were compared with the transposon sequence database using the de novo method. A database of assembly sequences was established using RepeatMasker software (v4-0-6) and the de novo method [19]. A transposon model was built using RepeatModeler (v2.0.1) based on the database. Transposon prediction was executed using RepeatMasker software (v4-0-6) with the help of the established model. Tandem repeats were predicted using Tandem Repeat Finder (TRF) [20].

2.5. Noncoding RNA

The prediction of ncRNA genes (tRNA, rRNA, sRNA, snRNA, and miRNA) was conducted using several software programs. rRNA was predicted ab initio using RNAmmer [21]. The tRNA region and the secondary structure of the tRNA were predicted by tRNAscan-SE (v1.3.1) [22]. sRNA was obtained by aligning sequences with the database using Rfam (v9.1) [23,24].

2.6. Gene Prediction

To determine gene location, homology prediction was conducted by comparing the genome sequence with a protein set from several other reference species using GeneWise (v2.20) [25]. With the help of SNAP (v2010-07-28) [26] and GeneMarkES (v4.21) [27], the gene structure and general statistics of the genome regarding functional elements containing introns, exons and CDSs were identified. Specifically, Augustus (v3.2.1) was used for the prediction of protein-coding genes [28].

2.7. Gene Annotation

After gene prediction, gene annotation was performed by aligning the predicted protein sequence with Swiss-Prot protein data using BLASTP to assign each gene an annotation of the best match. Predicted proteins were annotated by searching against the NR database. KEGG [29,30,31] annotation was performed through sequence alignment to identify the pathways in which genes might be involved. GO (v07012019) [32] and COG (v11102014) [33] were also used for gene functional annotation in this study.
Fungal databases are essential, especially for identifying the functions of specific genes in the target fungus. Genes involved in the interaction of pathogen and host were analyzed using PHI (v4.6) [34,35]. In addition, CAZy (v201906) [36] was used to identify genes encoding CAZymes, which can damage the cell wall of the host. Additionally, genes that might be related to the transportation of toxic secondary metabolites were searched for in other universal gene annotation databases.
The prediction of secondary metabolites was performed using the online software antiSMASH [37].

2.8. Functional Pathway Analysis of E. pela and Paraconiothyrium sp.

We considered plant sap to be deficient in essential amino acids and vitamins for E. pela and, combined with E. pela genome annotation, aligned the annotated genes of both with the KEGG database-related pathways and constructed metabolic complementary pathway maps.

2.9. Core/Pan-Gene Analysis

To explore the functional similarities and differences between the identified genes, an analysis of core/pan-genes among all samples was carried out. Molecular evidence was also obtained to explain the underlying cause for the phenotypic variation. Gene cluster analysis of protein-coding genes from all samples was performed using CD-HIT (v4.6.6) [38], generating final gene clusters that were regarded as pan-gene clusters. The pan-gene clusters were divided into core genes, specific genes, and dispensable genes. The core genes were contained in every strain, and most were essential for the growth of the strain, such as genes related to metabolism and the production of energy. Certain genes, however, were present in only a specific strain to show their unique characteristics or carry out specific metabolic activities. The rest of the genes were dispensable genes. The related species are Bimuria novae-zelandiae (GenBank: GCA_010015655.1), Didymosphaeria enalia (GenBank: GCA_010094045.1), Karstenula rhodostoma (GenBank: GCA_010093485.1), Laburnicola sp. JP-R-44 (ascomycetes) (GenBank: GCA_009805535.1), Lentithecium fluviatile (Genbank: GCA_010405425.1), Paraphaeosphaeria sporulosa (GenBank: GCA_001642045.1), and Paraphaeosphaeria minitans (GenBank: GCA_009707825.1).

2.10. Phylogenetic Tree Was Constructed

A gene family was constructed based on the genes of related species and the target strain, after which gene family identification was carried out.
The genome sequences of the related species downloaded from NCBI were used for reference to analyze gene families. The species used were the same as those mentioned in Section 2.9, but L. fluviatile was an outgroup. The identification of gene families was performed by aligning the protein sequences using BLAST, eliminating redundancy using solar, TreeFam [39] gene family clustering treatment of the alignment results using hcluster_sg, converting the protein alignment results into multiple sequence amino acids in the CDS area, and aligning multiple sequences with clustered gene families using Muscle. A phylogenetic tree was constructed based on a module of core/pan genes, gene families, and resequencing data. In this study, with a gene family module, a phylogenetic tree was constructed based on multiple sequence alignment results by adopting the NJ method with TreeBeST (v1.9.2).
In order to verify the results, the three conserved sequences of ITS, LSU, and Tub of 32 species (Table S3) were selected on GenBank for multi-locus sequence analysis with Paraconiothyrium sp. [3,18,40]. Multiple sequence alignment was performed for each conserved gene using Muscle (v5.1). Sequence Matrix was used to concatenate the three conserved genes of each species, and the concatenation order was ITS–LSU–Tub. A phylogenetic tree was constructed using the maximum composite likelihood model of neighbor-joining method with a bootstrap value of 1000 using MEGA11.

2.11. Synteny Analysis

To detect the evolution of homologous genomes, we performed synteny analysis of Paraconiothyrium sp. and the other six species mentioned in Section 2.9 except for Lentithecium fluviatile in pairs at the nucleotide and protein levels. The genetic order of the relatives was used as a standard for analysis. Then, the upper and following axes of the linear synteny graph were constructed after the length of both sequences was reduced by the same proportion. According to BLAST, each pair of nucleic acid sequences in the two alignments was marked in the coordinate diagram based on its position information after size reduction at the same proportion. Then, the amino acid levels were compared using the following method. Paraconiothyrium sp. was aligned with the relative as a database and the best hit value of each protein as selected. Then, Paraconiothyrium sp. was used as a database and other species were aligned with it. The best hits from the two alignments were selected for synteny analysis.

2.12. Cytotoxicity Assay

To validate whether Paraconiothyrium sp. could secrete toxic materials, the cytotoxicity of secondary metabolites secreted from Paraconiothyrium sp. were evaluated in the Aedes albopictus cell line C6/36. The fungus was cultured using liquid PDA medium. After replacement of the medium in the test cell with new RPMI 1640 medium (MeilunBio, Dalian, China), 100 μL of the inoculum was inoculated into 96-well plates that had been preincubated for 24 h in an incubator (37 °C). Then, each dilute Paraconiothyrium sp. medium was added to each well. Seven gradients were set up; for each, 0, 1, 2, 4, 6, 8, and 10 µL of liquid PDA medium were added. After 24 h of incubation, 10 μL of CCK-8 solution (Proteintech, Wuhan, China) was added to each well. The samples were incubated until the absorbance at 450 nm was approximately 1.0, as measured with a microplate reader (Thermo Fisher Scientific, Waltham, MA, USA). Homogenates of the strains were also assayed for cytotoxicity. DMSO (Sangon, Shanghai, China) was used as a positive control. The volume gradients of DMSO were 0, 1, 2, 4, 6, 8, and 10 µL in each well. The survival rate was calculated using the following formula:
Survival   Rate ( % ) = OD ( Negative   control   group ) OD ( Experimental   group ) OD ( Negative   control   group ) OD ( Blank   control   group ) × 100 %

3. Results

3.1. Fungal Culture

The collected E. pela was homogenized and cultured. Two pure fungi were isolated. Internal transcribed spacer (ITS) analyses showed that one fungus exhibits 99.42% similarity with the ITS sequences of Paraconiothyrium brasiliense. This fungus was genome sequenced in this study. The morphology of the grown fungus is shown in Figure 1, which exhibits the obvious characteristics of Ascomycetes.

3.2. Genome Sequencing and Assembly

The Paraconiothyrium sp. genome size was estimated to be approximately 41.79 Mb using K-mer analysis (Figure S1). The Paraconiothyrium sp. genome was sequenced using a combination of the BGISEQ and PacBio approaches, and the sequence depths were 86× and 236×, respectively. The assembled Paraconiothyrium sp. genome was 39.55 Mb with a scaffold N50 of 4.92 Mb (Figure 2, Table 1 and Table S4). The overall GC content was 51.36% (Table 1).

3.3. Genome Components

The Paraconiothyrium sp. genome consists of 14,174 genes with an average length of approximately 1.5 kb (Table 1). The CDS accounted for 50.75% of the entire genome with an average length of nearly 1.4 kb, and each gene contained approximately 2.75 exons and 1.75 introns.
Noncoding RNA in Paraconiothyrium sp. consisted of tRNA, rRNA, sRNA, snRNA, and miRNA, with copy numbers of 114, 32, 18, 32, and 29, respectively. The average lengths of tRNA, rRNA, sRNA, snRNA, and miRNA were 99.1 bp, 704 bp, 97 bp, 119 bp, and 51 bp, respectively. The total lengths of tRNA, rRNA, sRNA, snRNA, and miRNA were approximately 11.3 kb, 22.5 kb, 1.8 kb, 3.8 kb, and 1.5 kb, respectively (Table S5).
The repetitive sequences were approximately 305, 308, 1124, and 174 kb based on the Repbase database, the ProteinMask database, de novo prediction, and the TRF database, respectively, accounting for 0.77%, 0.78%, 2.84%, and 0.44% of the entire genome, respectively (Table S6). After redundant sequences in the above four databases were removed, 139 kb of repetitive sequences were obtained, accounting for 3.52% of the entire genome. Transposons account for approximately 3.18% of the genome and include DNA (0.16%), LINE (0.06%), LTR (2.46%), SINE (0.01%), and unknown types (0.56%) (Table S7).

3.4. Genomic Annotation

GO annotation returned 7238 proteins, accounting for 51.06% of the total proteins (Table 2, Figure S2). These genes were assigned different GO terms. In the biological process category, genes involved in ‘metabolic process’, ‘cellular process’, ‘localization’, ‘biological regulation’, and ‘regulation of biological process’ accounted for the majority. Among the molecular function category, genes were mainly associated with ‘catalytic activity’, ‘binding’, ‘transporter activity’, ‘transcription regulator activity’, and ‘structural molecule activity’.
A total of 1490 classified genes were obtained in 22 clusters of orthologous groups (COG) functional categories (Table 2, Figure S3). Among these categories, except ‘general function prediction only’, the ‘carbohydrate transport and metabolism’ cluster represented the largest group, followed by the ‘lipid transport and metabolism’, ‘amino acid transport and metabolism’, and ‘energy production and conversion’ clusters.
Among all the predicted genes, 4655 (32.84%) genes were mappable through the KEGG pathway database and found to be distributed in 148 metabolic pathways (Table 2, Figure S4). At the third level, these pathways were mainly classified into several categories related to metabolism, such as ‘metabolic pathways’, ‘biosynthesis of secondary metabolites’, ‘microbial metabolism in diverse environments’, ‘biosynthesis of antibiotics’, ‘biosynthesis of amino acids’, and ‘carbon metabolism’.
Secondary metabolism prediction showed many gene clusters that produced polyketide (Table S8).

3.5. Analyses of Functional Pathway Related to Nutrition Contribution of Paraconiothyrium sp.

Functional pathway analysis showed that E. pela and Paraconiothyrium sp. were complementary at least in the amino acid synthesis and vitamin synthesis pathways. Genomic annotation indicated that E. pela lacked the ability to de novo synthesize 10 essential amino acids (valine, leucine, isoleucine, lysine, arginine, methionine, histidine, phenylalanine, tyrosine, and tryptophan). However, Paraconiothyrium sp. has the genes necessary for the synthesis of these essential amino acids (Figure 3). In the vitamin B synthesis pathways, a complementary form of the other is presented. Except for vitamin B6 and riboflavin, which can be synthesized from scratch by Paraconiothyrium sp., the complete pathway exists for the other kinds of B vitamins only if the genes of both are complementary (Figure S5).

3.6. Identification of Pathogenic Factors

In our analysis, 1276 genes with high homology were identified in the PHI database, accounting for 9% of the total number of genes in the Paraconiothyrium sp. genome, and they covered 65 fungal species (Table 2). In addition, 540 genes and 511 genes were involved in ’unaffected pathogenicity’ and ’reduced virulence’, respectively. Only 82 genes concern the survival state of the pathogen—‘lethal factor’. In particular, the number of genes strongly associated with ’pathogenicity’ and ‘increased virulence (hypervirulence)’ was only 28 (Figure 4, Table S9).
A total of 595 carbohydrate-active enzyme (CAZyme)-coding gene homologs were identified in the Paraconiothyrium sp. genome (Table 2); among these homologs were 261 glycoside hydrolases (GHs), 124 carbohydrate-binding modules (CBMs), 64 glycosyl transferases (GTs), 45 carbohydrate esterases (CEs), 30 polysaccharide lyases (PLs), and 120 enzymes with auxiliary activities (AAs) (Table S10). GHs and AAs were the two most abundant annotated CAZyme genes. Classification of the GH family revealed that the majority of the GHs are members of the GH16 and GH28 families (Table S11). Classification of the AA family showed up to 31 members of AA3, including cellobiose dehydrogenase, glucose 1-oxidase, aryl alcohol oxidase, alcohol oxidase, and pyranose oxidase (Table S11).

3.7. Comparative Genome Analyses

Analysis of MUMmer outputs revealed that the genomes of K. rhodostoma (Pleosporales: Didymosphaeriaceae) and Paraconiothyrium sp. share 1453 syntenic blocks at the nucleotide level, accounting for approximately 584 kb of the sequence (Figure 5, Table S11). In contrast, Paraconiothyrium sp. shares fewer regions of synteny with others (Figure S6, Table S12). This result revealed that Paraconiothyrium sp. is more closely related to K. rhodostoma than to the other species mentioned in Section 2.9. Similar conclusions were obtained at the protein level (Figure S6, Table S13).
The pan-genome from eight species from Pleosporales contains 44,480 genes (Table S14). The core genome consists of 3027 genes (Figure 6). Except for P. minitans, the percentage of unique genes to the number of coding genes was the smallest for Paraconiothyrium sp. Additionally, the proportion of dispensable genes was large among the eight species. Paraconiothyrium sp. possesses 2592 specific genes, most of which are concentrated in metabolism-related processes. The top three COG categories were ‘carbohydrate transport and metabolism’, ‘lipid transport and metabolism’, and ‘secondary metabolite biosynthesis, transport, and catabolism’ (Figure 7). Meanwhile, the GO annotation of unique genes revealed genes enriched in the metabolic pathways of secondary products, such as the austinol metabolic process, dehydroaustinol metabolic process, and toxin metabolic process (Table S15).
The phylogenetic tree was constructed using L. fluviatile (Pleosporales: Lentitheciaceae) as an outgroup and eight species based on the pan-genome core genes or single-copy ortholog genes, respectively. The topology of the two trees is the same, but the divergence of bases differs slightly. The clusters composed of K. rhodostoma, P. minitans, and P. sporulosa are sister groups to Paraconiothyrium sp. (Figure S7). The species of these branches belongs to the Didymosphaeriaceae family.
Further multi-locus sequence analysis (Figure 8) showed that Paraconiothyrium sp. and Paraconiothyrium brasiliense converged upon the same branch. They clustered together with all the species from the Paraconiothyrium genus and formed one group in the phylogenetic tree.

3.8. Cytotoxicity Assay

To estimate the toxicity of materials secreted by Paraconiothyrium sp, the cytotoxicity assay was performed. The results of a cytotoxicity assay showed slight inhibition of the C6/36 cell line by the PDA liquid medium containing secretions. The cell viability was over 90% with different doses of medium. Cell survival was the lowest when 6 µL of medium was added, at approximately 91.3%. We used DMSO, which is highly cytotoxic, as a positive control, and the results showed that the more DMSO was added, the lower the cell survival rate was (Figure 9).

4. Discussion

Paraconiothyrium is widely distributed and has a variety of host habitats [41]. It has been found living within insects. It was reported that P. brasiliense was isolated from Acrida cinerea [42]. Another fungus from the Paraconiothyrium genus, P. hawaiiense, was isolated from the scale insect Diaspidiotus sp. [43]. We also found ITS sequences of Paraconiothyrium in the microorganism diversity analyses in our previous study [7]. They exist in insects, suggesting that they may play some role in the insect host. We isolated Paraconiothyrium sp. From E. pela and sequenced the genome in this study. Phylogenetic trees based on gene families showed that Paraconiothyrium sp. is evolutionally close to K. rhodostoma, P. minitans, and P. sporulosa.
The genome assembly is approximately 39.55 Mb and smaller than the genomes of the relatives of Didymosphaeriaceae. For example, B. novae-zelandiae, the species with the largest genome in the phylogenetic analysis, has a genome size of 78.18 Mb, and the Paraconiothyrium sp. genome is reduced in size by nearly half. However, the genomes of obligate symbiotic bacteria range from 0.5–2 Mb. In contrast, the genomes of related free-living bacteria are approximately five times larger (usually 4–10 Mb) [44].
The P. sporulosa genome has a GC content of 53.3%, higher than that of the majority of the fungi used for phylogeny. The GC content of the Paraconiothyrium sp. genome was 51.36%, similar to that of the P. sporulosa genome. However, the GC content of the obligate symbiotic bacterial genome was much lower than that of its free-living relatives [45,46,47,48]. This suggests that Paraconiothyrium sp. represents a transition stage between endogenous fungi and obligate symbionts and that the symbiotic relationship between Paraconiothyrium sp. and E. pela is relatively casual.
By synteny analysis, we found some sequence fragments that are lacking in the Paraconiothyrium sp. genome. E. pela is a scale insect. According to Gullan, scale insects may have long ago inherited mutual relationships with a variety of microorganisms from their ancestors due to their lifestyles [11]. After establishing a symbiotic relationship with the host, the symbiont becomes host-dependent due to the loss of genes involved in some biological processes [49,50]. Although they lose many genes, symbionts retain certain genes required for nutrient synthesis that are somewhat complementary to the missing parts of the host [51,52]. Additionally, genome annotation of Paraconiothyrium sp. showed retention of an intact essential amino acid synthesis pathway and that Paraconiothyrium sp. is also capable of producing nonessential amino acids, which should compensate for the host in this regard (Figure 3). In addition, E. pela lacks the complete vitamin B synthesis pathway, as shown by the gene functional annotation of the previously reported genome sequence [2]. However, Paraconiothyrium sp. contains synthetic pathways for vitamin B group members. Paraconiothyrium sp. and E. pela may collaborate in the production of most of the vitamin B group members (Figure S5). The above shows that Paraconiothyrium sp. plays a significant role in providing nutritional metabolic assistance to the host.
Fungi of the Paraconiothyrium genus have often been considered pathogens to plants [41,53]. They can infect the leaves of plants and cause leaf spots. They also cause human cutaneous phaeohyphomycosis [54,55]. However, they have been found living inside scale insects without causing disease, implying their adaptation to an insect host [41,42]. PHI annotation of the Paraconiothyrium sp. genome showed genes associated with reduced virulence and unaffected pathogenicity to be common in Paraconiothyrium sp. To verify this prediction, we performed a cytotoxicity assay. The Paraconiothyrium sp. liquid medium containing secretions had no significant inhibitory effect against the C6/36 cell line. In contrast, DMOS was substantially toxic to the cells. The presence of a symbiotic relationship between Paraconiothyrium sp. and E. pela can be inferred from this finding.
Evidence suggests that certain symbionts can assist the host in penetrating plant cell walls [47,48,56]. A large number of GH genes, such as GH16, GH28, GH43, and GH47, have also been found in the Paraconiothyrium sp. genome, and their main role involves the degradation of plant cell wall hemicellulose and pectin, suggesting that Paraconiothyrium sp. can assist E. pela in piercing plant tissues.
Lignocellulose-degrading fungi usually contain genes encoding lytic polysaccharide monooxygenases (LPMOs) [57], which are classified in the AA family in the CAZyme database. These enzymes are mainly involved in the depolymerization of lignin [58]. Thirty-one AA3 and 37 AA9 family genes were found in the Paraconiothyrium sp. genome. Members of the AA3 family assist other AA family enzymes through hydrogen peroxide or hydroquinone or assists glycoside hydrolases in the degradation of lignocellulose. LPMOs, which belong to the AA9 family, are involved in the degradation of various hemicelluloses from cellulose and lignocellulose, such as xyloglucan, xylan, and glucomannan [58]. A large number of AA3 and AA9 genes work together to degrade lignin by oxidation [58]. Experiments have shown that Paraconiothyrium sp. is highly abundant in the host epidermis and digestive tract. Both families are hypothesized to assist in digestion or in the invasion of plants by the insect through epidermal contact. These AAs facilitate the establishment of a symbiotic relationship between E. pela and Paraconiothyrium sp.

Supplementary Materials

The following supporting material can be downloaded at: https://www.mdpi.com/article/10.3390/genes13020338/s1, Figure S1. 15-mer analysis. X-axis is depth, and Y-axis is proportion. Theoretically, 15-mer distributions should follow a Poisson distribution. In fact, heterozygotes cause the possible appearance of other peaks at 1/2 of the main peak, and duplication causes the possible appearance of duplicate peaks near integer multiples of the main peak. Figure S2. Functional categories of the annotated genes, broadly separated into ‘biological process’, ‘cellular component’ and ‘molecular function’ based on Gene Ontology. X-axis indicates 45 functional GO categories. Blue boxes represent biological processes, yellow boxes represent cell composition, and orange boxes represent molecular functions. Y-axis indicates number of genes in a category. Figure S3. Cluster of orthologous groups (COG) classification of putative proteins. Y-axis indicates 22 functional COG categories. X-axis indicates number of genes in a category. Figure S4. KEGG classification of the genes. A total of 4655 genes were assigned to 383 KEGG pathways. X-axis indicates number of genes in a pathway. Y-axis indicates 46 second level pathways. Figure S5. Pathways of vitamin B synthesis involving E. pela and Paraconiothyrium sp. Red boxes and lines indicate genes and pathways unique to Paraconiothyrium sp. Green boxes and lines indicate genes and pathways unique to E. pela. Gray dashed lines and boxes indicate that neither gene or pathway is present. Figure S6. Synteny analysis between Paraconiothyrium sp. and the other species at the nucleic acid and amino acid level. Yellow box stands for forward chain and blue box stands for reverse chain within the upper and following sequence region. In the box of sequence, the yellow region stands for the nucleic acid sequence in the forward chain of this genome sequence and the blue region stands for the nucleic acid sequence in the reverse chain of this genome sequence. In the middle region of two sequences, the yellow line stands for forward alignment and the blue line stands for reverse complementary alignment. Column A is based on nucleotide level, column B is based on amino acid level. The species analyzed from top to bottom with Paraconiothyrium sp. are Bimuria novae-zelandiae, Didymosphaeria enalia, Laburnicola sp. JP-R-44, Paraphaeosphaeria sporulosa and Paraphaeosphaeria minitans. Figure S7. Phylogenetic tree in the eight species determined by the maximum likelihood method. L. fluviatile is the outgroup. (a) based on the pan-genome; (b) based on single-copy ortholog genes. The scale bar corresponds to 0.06 nucleotide substitutions per two sites; Table S1. PacBio reads statistics. Table S2. The statistics of BGISEQ based on next generation sequencing. Table S3. Three conserved sequences used in phylogenetic analyses. Table S4. Genome assembly of Paraconiothyrium sp. Table S5. Statistics of noncoding RNA in Paraconiothyrium sp. genome. Table S6. Repeat statistic in Paraconiothyrium sp. genome. Table S7. Transposons statistic in Paraconiothyrium sp. genome. Table S8. Predictions of secondary metabolite using antiSMASH. Table S9. Identification of pathogenicity genes through querying the Paraconiothyrium sp. genome with the PHI database. Table S10. Results of classification and annotation of carbohydrate enzymes. CBM, carbohydrate-binding module; CE, carbohydrate esterases; GH, glycoside hydrolases; GT, glycosyl transferase; PL, polysaccharide lyase; AA, auxiliary activity. Table S11. Statistics of all classifications in CAZy database. Table S12. Statistics for synteny at the nucleic acid level. Scaffold represents the 10 scoffolds assembled by Paraconiothyrium sp. Refs Length represents the length of each scoffold. Map number and map length represent the genome of the species corresponding to one of the Paraconiothyrium sp. scaffold gene and length, respectively. Table S13.Statistics for synteny at the amino acid level. Aligned indicates the number of genes of Paraconiothyrium sp. on the corresponding species match. Target percent (%) represents the number of genes of the corresponding species match as a percentage of Paraconiothyrium sp. Query percent (%) represents the number of genes of the corresponding substance match as a percentage of the total number of own genes. Table S14. The statistics of pangenome.

Author Contributions

Conceptualization, P.Y.; methodology, P.Y.; formal analysis, Z.-Y.F.; investigation, Z.-Y.F., J.-Q.A., H.-P.Z. and W.L.; resources, P.Y.; data curation, Z.-Y.F.; writing—original draft, Z.-Y.F., J.-Q.A. and P.Y.; writing—review & editing, Z.-Y.F., J.-Q.A., H.-P.Z., W.L. and P.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key Program of Fundamental Research Funds for the Chinese Academy of Forestry (grant number CAFYBB2017ZB005), the Applied Basic Research Foundation of Yunnan Province (grant number 202101AS070042), the Young- and Middle-Aged Academic and Technical Leaders Reserve Talent Project of Yunnan Province (grant number 202105AC160031), and the National Natural Science Foundation of China (grant number 31572337).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data is available from GenBank repository with accession number JAJUBI000000000, and BioProject number PRJNA791143.

Acknowledgments

We thank Jin-Yang Zhang of the Faculty of Life Science and Technology, Kunming University of Science and Technology for providing Aedes albopictus cell line C6/36.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhang, C.; Tang, X.; Cheng, J. The utilization and industrialization of insect resources in China. Èntomol. Res. 2008, 38, S38–S47. [Google Scholar] [CrossRef]
  2. Yang, P.; Yu, S.; Hao, J.; Liu, W.; Zhao, Z.; Zhu, Z.; Sun, T.; Wang, X.; Song, Q. Genome sequence of the Chinese white wax scale insect Ericerus pela: The first draft genome for the Coccidae family of scale insects. GigaScience 2019, 8, 8. [Google Scholar] [CrossRef] [PubMed]
  3. Yu, S.-H.; Yang, P.; Sun, T.; Qi, Q.; Wang, X.-Q.; Xu, D.-L.; Chen, X.-M. Identification and evaluation of reference genes in the Chinese white wax scale insect Ericerus pela. SpringerPlus 2016, 5, 791. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Yang, P.; Zhu, J.-Y.; Gong, Z.-J.; Xu, D.-L.; Chen, X.-M.; Liu, W.-W.; Lin, X.-D.; Li, Y.-F. Transcriptome Analysis of the Chinese White Wax Scale Ericerus pela with Focus on Genes Involved in Wax Biosynthesis. PLoS ONE 2012, 7, e35719. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Liu, W.-W.; Yang, P.; Chen, X.-M.; Xu, D.-L.; Hu, Y.-H. Cloning and Expression Analysis of Four Heat Shock Protein Genes in Ericerus pela (Homoptera: Coccidae). J. Insect Sci. 2014, 14, 14. [Google Scholar] [CrossRef] [PubMed]
  6. Yang, P.; Chen, X.-M. Protein profiles of chinese white wax scale, ericerus pela, at the male pupal stage by high-throughput proteomics. Arch. Insect Biochem. Physiol. 2014, 87, 214–233. [Google Scholar] [CrossRef] [PubMed]
  7. Sun, T.; Wang, X.-Q.; Zhao, Z.-L.; Yu, S.-H.; Yang, P.; Chen, X.-M. A Lethal Fungus Infects the Chinese White Wax Scale Insect and Causes Dramatic Changes in the Host Microbiota. Sci. Rep. 2018, 8, 5324. [Google Scholar] [CrossRef] [PubMed]
  8. Gullan, P.J.; Kosztarab, M. Adaptations in scale insects. Annu. Rev. Èntomol. 1997, 42, 23–50. [Google Scholar] [CrossRef]
  9. Rosenblueth, M.; Martínez-Romero, J.; Ramírez-Puebla, S.T.; Vera-Ponce de León, A.; Rosas-Pérez, T.; Bustamante-Brito, R.; Rincón-Rosales, R.; Martínez-Romero, E. Endosymbiotic microorganisms of scale insects. TIP. Rev. Espec. Cienc. Químico-Biológicas 2018, 21, 53–69. [Google Scholar] [CrossRef]
  10. Szklarzewicz, T.; Michalik, A.; Michalik, K. The Diversity of Symbiotic Systems in Scale Insects. In Symbiosis: Cellular, Molecular, Medical and Evolutionary Aspects; Kloc, M., Ed.; Springer: Berlin/Heidelberg, Germany, 2020; pp. 469–495. [Google Scholar]
  11. Wang, C.; Wang, S. Insect Pathogenic Fungi: Genomics, Molecular Interactions, and Genetic Improvements. Annu. Rev. Èntomol. 2017, 62, 73–90. [Google Scholar] [CrossRef]
  12. Fu, N.; Wang, M.; Wang, L.; Luo, Y.; Ren, L. Genome Sequencing and Analysis of the Fungal Symbiont of Sirex noctilio, Amylostereum areolatum: Revealing the Biology of Fungus-Insect Mutualism. mSphere 2020, 5, 15. [Google Scholar] [CrossRef] [PubMed]
  13. Brown, B.P.; Wernegreen, J.J. Genomic erosion and extensive horizontal gene transfer in gut-associated Acetobacteraceae. BMC Genom. 2019, 20, 472. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Hansen, A.K.; Moran, N.A. The impact of microbial symbionts on host plant utilization by herbivorous insects. Mol. Ecol. 2013, 23, 1473–1496. [Google Scholar] [CrossRef]
  15. Douglas, A.E. Phloem-sap feeding by animals: Problems and solutions. J. Exp. Bot. 2006, 57, 747–754. [Google Scholar] [CrossRef] [PubMed]
  16. Hongoh, Y.; Sasaki, T.; Ishikawa, H. Cloning, sequence analysis and expression in Escherichia coli of the gene encoding a uricase from the yeast-like symbiont of the brown planthopper, Nilaparvata lugens. Insect Biochem. Mol. Biol. 2000, 30, 173–182. [Google Scholar] [CrossRef]
  17. Martin, E.; Boccazzi, I.V.; De Marco, L.; Bongiorno, G.; Montagna, M.; Sacchi, L.; Mensah, P.; Ricci, I.; Gradoni, L.; Bandi, C.; et al. The mycobiota of the sand flyPhlebotomus perniciosus: Involvement of yeast symbionts in uric acid metabolism. Environ. Microbiol. 2018, 20, 1064–1077. [Google Scholar] [CrossRef] [PubMed]
  18. Sung, G.-H.; Hywel-Jones, N.L.; Sung, J.-M.; Luangsa-Ard, J.J.; Shrestha, B.; Spatafora, J.W. Phylogenetic classification of Cordyceps and the clavicipitaceous fungi. Stud. Mycol. 2007, 57, 5–59. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Bao, W.; Kojima, K.K.; Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA 2015, 6, 11. [Google Scholar] [CrossRef] [Green Version]
  20. Benson, G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999, 27, 573–580. [Google Scholar] [CrossRef] [Green Version]
  21. Lagesen, K.; Hallin, P.; Rødland, E.A.; Staerfeldt, H.-H.; Rognes, T.; Ussery, D.W. RNAmmer: Consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007, 35, 3100–3108. [Google Scholar] [CrossRef]
  22. Lowe, T.M.; Eddy, S. tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic Sequence. Nucleic Acids Res. 1997, 25, 955–964. [Google Scholar] [CrossRef] [PubMed]
  23. Griffiths-Jones, S.; Bateman, A.; Marshall, M.; Khanna, A.; Eddy, S. Rfam: An RNA family database. Nucleic Acids Res. 2003, 31, 439–441. [Google Scholar] [CrossRef] [Green Version]
  24. Griffiths-Jones, S. Rfam: Annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 2004, 33, D121–D124. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Madeira, F.; Park, Y.M.; Lee, J.; Buso, N.; Gur, T.; Madhusoodanan, N.; Basutkar, P.; Tivey, A.R.N.; Potter, S.C.; Finn, R.D.; et al. The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Res. 2019, 47, W636–W641. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Johnson, A.D.; Handsaker, R.E.; Pulit, S.L.; Nizzari, M.M.; O’Donnell, C.J.; de Bakker, P.I.W. SNAP: A web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics 2008, 24, 2938–2939. [Google Scholar] [CrossRef] [Green Version]
  27. Borodovsky, M.; Lomsadze, A. Gene Identification in Prokaryotic Genomes, Phages, Metagenomes, and EST Sequences with GeneMarkS Suite. Curr. Protoc. Bioinform. 2011, 35, 4.5.1–4.5.17. [Google Scholar]
  28. Keller, O.; Kollmar, M.; Stanke, M.; Waack, S. A novel hybrid gene prediction method employing protein multiple sequence alignments. Bioinformatics 2011, 27, 757–763. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Kanehisa, M. From genomics to chemical genomics: New developments in KEGG. Nucleic Acids Res. 2006, 34, D354–D357. [Google Scholar] [CrossRef]
  30. Kanehisa, M.; Sato, Y.; Kawashima, M.; Furumichi, M.; Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016, 44, D457–D462. [Google Scholar] [CrossRef] [Green Version]
  31. Kanehisa, M.; Goto, S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000, 28, 27–30. [Google Scholar] [CrossRef] [PubMed]
  32. Ashburner, M.; Ball, C.A.; Blake, J.A.; Botstein, D.; Butler, H.; Cherry, J.M.; Davis, A.P.; Dolinski, K.; Dwight, S.S.; Eppig, J.T.; et al. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000, 25, 25–29. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Tatusov, R.L.; Galperin, M.Y.; Natale, D.A.; Koonin, E.V. The COG database: A tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 2000, 28, 33–36. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Winnenburg, R. PHI-base: A new database for pathogen host interactions. Nucleic Acids Res. 2006, 34, D459–D464. [Google Scholar] [CrossRef]
  35. Urban, M.; Cuzick, A.; Rutherford, K.; Irvine, A.; Pedro, H.; Pant, R.; Sadanadan, V.; Khamari, L.; Billal, S.; Mohanty, S.; et al. PHI-base: A new interface and further additions for the multi-species pathogen–host interactions database. Nucleic Acids Res. 2017, 45, D604–D610. [Google Scholar] [CrossRef] [PubMed]
  36. Lombard, V.; Golaconda Ramulu, H.; Drula, E.; Coutinho, P.M.; Henrissat, B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014, 42, D490–D495. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Medema, M.H.; Blin, K.; Cimermancic, P.; De Jager, V.; Zakrzewski, P.; Fischbach, M.A.; Weber, T.; Takano, E.; Breitling, R. antiSMASH: Rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011, 39, W339–W346. [Google Scholar] [CrossRef]
  38. Li, W.; Godzik, A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22, 1658–1659. [Google Scholar] [CrossRef] [Green Version]
  39. Ruan, J.; Li, H.; Chen, Z.; Coghlan, A.; Coin, L.; Guo, Y.; Hériché, J.-K.; Hu, Y.; Kristiansen, K.; Li, R.; et al. TreeFam: 2008 Update. Nucleic Acids Res. 2007, 36, D735–D740. [Google Scholar] [CrossRef]
  40. Heled, J.; Drummond, A.J. Bayesian inference of population size history from multiple loci. BMC Evol. Biol. 2008, 8, 289. [Google Scholar] [CrossRef] [Green Version]
  41. Wang, J.; Shao, S.; Liu, C.; Song, Z.; Liu, S.; Wu, S. The genus Paraconiothyrium: Species concepts, biological functions, and secondary metabolites. Crit. Rev. Microbiol. 2021, 47, 781–810. [Google Scholar] [CrossRef]
  42. Liu, C.-X.; Wang, L.; Chen, J.-F.; Guo, Z.-Y.; Tu, X.; Deng, Z.-S.; Zou, K. Paraconfuranones A-H, eight new furanone analogs from the insect-associated fungus Paraconiothyrium brasiliense MZ-1. Org. Magn. Reson. 2015, 53, 317–322. [Google Scholar] [CrossRef] [PubMed]
  43. Ren, F.; Chen, S.; Zhang, Y.; Zhu, S.; Xiao, J.; Liu, X.; Su, R.; Che, Y. Hawaiienols A–D, Highly Oxygenated p-Terphenyls from an Insect-Associated Fungus, Paraconiothyrium hawaiiense. J. Nat. Prod. 2018, 81, 1752–1759. [Google Scholar] [CrossRef] [PubMed]
  44. Boscaro, V.; Kolisko, M.; Felletti, M.; Vannini, C.; Lynn, D.H.; Keeling, P.J. Parallel genome reduction in symbionts descended from closely related free-living bacteria. Nat. Ecol. Evol. 2017, 1, 1160–1167. [Google Scholar] [CrossRef]
  45. Lo, W.-S.; Huang, Y.-Y.; Kuo, C.-H. Winding paths to simplicity: Genome evolution in facultative insect symbionts. FEMS Microbiol. Rev. 2016, 40, 855–874. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Rispe, C.; Moran, N.A. Accumulation of Deleterious Mutations in Endosymbionts: Muller’s Ratchet with Two Levels of Selection. Am. Nat. 2000, 156, 425–441. [Google Scholar] [CrossRef] [PubMed]
  47. Salem, H.; Kirsch, R.; Pauchet, Y.; Berasategui, A.; Fukumori, K.; Moriyama, M.; Cripps, M.; Windsor, D.; Fukatsu, T.; Gerardo, N.M. Symbiont Digestive Range Reflects Host Plant Breadth in Herbivorous Beetles. Curr. Biol. 2020, 30, 2875–2886.e4. [Google Scholar] [CrossRef] [PubMed]
  48. Degnan, P.H.; Ochman, H.; Moran, N.A. Sequence Conservation and Functional Constraint on Intergenic Spacers in Reduced Genomes of the Obligate Symbiont Buchnera. PLoS Genet. 2011, 7, e1002252. [Google Scholar] [CrossRef] [Green Version]
  49. Tamas, I.; Klasson, L.; Canback, B.; Naslund, A.K.; Eriksson, A.S.; Wernegreen, J.J.; Sandstrom, J.P.; Moran, N.A.; Andersson, S.G.E. 50 Million Years of Genomic Stasis in Endosymbiotic Bacteria. Science 2002, 296, 2376–2379. [Google Scholar] [CrossRef] [Green Version]
  50. McCutcheon, J.P.; von Dohlen, C.D. An Interdependent Metabolic Patchwork in the Nested Symbiosis of Mealybugs. Curr. Biol. 2011, 21, 1366–1372. [Google Scholar] [CrossRef] [Green Version]
  51. Douglas, A.E. The microbial dimension in insect nutritional ecology. Funct. Ecol. 2009, 23, 38–47. [Google Scholar] [CrossRef]
  52. Ligoxigakis, E.K.; Papaioannou, I.A.; Markakis, E.A.; Typas, M.A. First Report of Leaf Spot of Phoenix theophrasti Caused by Paraconiothyrium variabile in Greece. Plant Dis. 2013, 97, 1250. [Google Scholar] [CrossRef]
  53. Hamed, A.; Hughes, O.B.; Palavecino, E.L.; Jakharia, N. Cutaneous infection caused by paraconiothyrium cyclothyrioides in a renal transplant recipient. Transpl. Infect. Dis. 2021, e13624. [Google Scholar] [CrossRef]
  54. Gordon, R.A.; Sutton, D.A.; Thompson, E.H.; Shrikanth, V.; Verkley, G.J.M.; Stielow, J.B.; Mays, R.; Oleske, D.; Morrison, L.K.; Lapolla, W.J.; et al. Cutaneous Phaeohyphomycosis Caused by Paraconiothyrium cyclothyrioides. J. Clin. Microbiol. 2012, 50, 3795–3798. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Calderón-Cortés, N.; Quesada, M.; Watanabe, H.; Cano-Camacho, H.; Oyama, K. Endogenous Plant Cell Wall Digestion: A Key Mechanism in Insect Evolution. Annu. Rev. Ecol. Evol. Syst. 2012, 43, 45–71. [Google Scholar] [CrossRef]
  56. Kracher, D.; Scheiblbrandner, S.; Felice, A.K.G.; Breslmayr, E.; Preims, M.; Ludwicka, K.; Haltrich, D.; Eijsink, V.G.H.; Ludwig, R. Extracellular electron transfer systems fuel cellulose oxidative degradation. Science 2016, 352, 1098–1101. [Google Scholar] [CrossRef] [PubMed]
  57. Frommhagen, M.; Mutte, S.K.; Westphal, A.H.; Koetsier, M.J.; Hinz, S.W.A.; Visser, J.; Vincken, J.-P.; Weijers, D.; Van Berkel, W.J.H.; Gruppen, H.; et al. Boosting LPMO-driven lignocellulose degradation by polyphenol oxidase-activated lignin building blocks. Biotechnol. Biofuels 2017, 10, 121. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  58. Garajova, S.; Mathieu, Y.; Beccia, M.R.; Granier, C.B.; Biaso, F.; Fanuel, M.; Ropartz, D.; Guigliarelli, B.; Record, E.; Rogniaux, H.; et al. Single-domain flavoenzymes trigger lytic polysaccharide monooxygenases for oxidative degradation of cellulose. Sci. Rep. 2016, 6, 28276. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The morphological characters of symbiotic fungus in Ericerus pela. Scale bars = 1 cm.
Figure 1. The morphological characters of symbiotic fungus in Ericerus pela. Scale bars = 1 cm.
Genes 13 00338 g001
Figure 2. Circular representation of the genomic features of Paraconiothyrium sp. From the outer to the inner tracks: Genome Size, Gene density (gene number in 50,000 bp nonoverlapping windows), ncRNA (ncRNA number in 100,000 bp nonoverlapping windows), Repeat coverage (in 50,000 bp nonoverlapping windows), GC rate (in 20,000 bp nonoverlapping windows), GC skew (in 20,000 bp nonoverlapping windows).
Figure 2. Circular representation of the genomic features of Paraconiothyrium sp. From the outer to the inner tracks: Genome Size, Gene density (gene number in 50,000 bp nonoverlapping windows), ncRNA (ncRNA number in 100,000 bp nonoverlapping windows), Repeat coverage (in 50,000 bp nonoverlapping windows), GC rate (in 20,000 bp nonoverlapping windows), GC skew (in 20,000 bp nonoverlapping windows).
Genes 13 00338 g002
Figure 3. Amino acid synthesis pathways in E. pela and Paraconiothyrium sp. The green and blue areas represent E. pela and Paraconiothyrium sp., respectively. The blue and red oval boxes indicate nonessential amino acids and essential amino acid, respectively. Yellow and black boxes represent enzymes whose genes are encoded by E. pela and Paraconiothyrium sp., respectively. The name of the gene in the box is the EC number or the name of the enzyme corresponding to the KEGG annotation.
Figure 3. Amino acid synthesis pathways in E. pela and Paraconiothyrium sp. The green and blue areas represent E. pela and Paraconiothyrium sp., respectively. The blue and red oval boxes indicate nonessential amino acids and essential amino acid, respectively. Yellow and black boxes represent enzymes whose genes are encoded by E. pela and Paraconiothyrium sp., respectively. The name of the gene in the box is the EC number or the name of the enzyme corresponding to the KEGG annotation.
Genes 13 00338 g003
Figure 4. Statistics of virulence-associated genes based on PHI annotation.
Figure 4. Statistics of virulence-associated genes based on PHI annotation.
Genes 13 00338 g004
Figure 5. Synteny analysis of Paraconiothyrium sp. and K. rhodostoma at the nucleic acid and amino acid levels. (a) Synteny analysis at the nucleic acid level; (b) Synteny analysis at the amino acid level. The yellow box represents the forward chain and the blue box represents the reverse chain within the upper and following sequence regions. In the boxed sequence, the yellow region represents the nucleic acid or amino acid sequence in the forward chain of the genome sequence, and the blue region represents the nucleic acid or amino acid sequence in the reverse chain of the genome sequence. In the region between two sequences, the yellow line represents forward alignment and the blue line represents reverse complementary alignment.
Figure 5. Synteny analysis of Paraconiothyrium sp. and K. rhodostoma at the nucleic acid and amino acid levels. (a) Synteny analysis at the nucleic acid level; (b) Synteny analysis at the amino acid level. The yellow box represents the forward chain and the blue box represents the reverse chain within the upper and following sequence regions. In the boxed sequence, the yellow region represents the nucleic acid or amino acid sequence in the forward chain of the genome sequence, and the blue region represents the nucleic acid or amino acid sequence in the reverse chain of the genome sequence. In the region between two sequences, the yellow line represents forward alignment and the blue line represents reverse complementary alignment.
Genes 13 00338 g005
Figure 6. Flower plot based on pan-genomes of the eight species. Each ellipse represents one strain and the number in the ellipse refers to the cluster number. Each cluster has more than 50 percent gene identity and less than 0.3 length diversity.
Figure 6. Flower plot based on pan-genomes of the eight species. Each ellipse represents one strain and the number in the ellipse refers to the cluster number. Each cluster has more than 50 percent gene identity and less than 0.3 length diversity.
Genes 13 00338 g006
Figure 7. Comparison of COG-based specific genes in eight species. C represents “Energy production and conversion”; E represents “Amino acid transport and metabolism”; F represents “Nucleotide transport and metabolism”; G represents “Carbohydrate transport and metabolism”; H represents “Coenzyme transport and metabolism”; I represents “Lipid transport and metabolism”; J represents “Translation, ribosomal structure and biogenesis”; L represents “Replication, recombination and repair”; M represents “Cell wall/membrane/envelope biogenesis”; N represents “Cell motility”; O represents “Posttranslational modification, protein turnover, chaperones”; P represents “Inorganic ion transport and metabolism”; Q represents “Secondary metabolite biosynthesis, transport, and catabolism”; R represents “General function prediction only”; S represents “Function unknown”; T represents “Signal transduction mechanisms”; U represents “Intracellular trafficking, secretion, and vesicular transport”; V represents “Defense mechanisms”; and X represents “Mobilome: prophages, transposons”.
Figure 7. Comparison of COG-based specific genes in eight species. C represents “Energy production and conversion”; E represents “Amino acid transport and metabolism”; F represents “Nucleotide transport and metabolism”; G represents “Carbohydrate transport and metabolism”; H represents “Coenzyme transport and metabolism”; I represents “Lipid transport and metabolism”; J represents “Translation, ribosomal structure and biogenesis”; L represents “Replication, recombination and repair”; M represents “Cell wall/membrane/envelope biogenesis”; N represents “Cell motility”; O represents “Posttranslational modification, protein turnover, chaperones”; P represents “Inorganic ion transport and metabolism”; Q represents “Secondary metabolite biosynthesis, transport, and catabolism”; R represents “General function prediction only”; S represents “Function unknown”; T represents “Signal transduction mechanisms”; U represents “Intracellular trafficking, secretion, and vesicular transport”; V represents “Defense mechanisms”; and X represents “Mobilome: prophages, transposons”.
Genes 13 00338 g007
Figure 8. Phylogenetic tree of Paraconiothyrium sp. and 32 other species based on the neighbor-joining method. Bolds are the species involved in the phylogenetic analysis. The number below the branch indicates bootstrap values obtained from 1000 replications. Red numbers on the branch nodes indicate bootstrap values. Only bootstrap values equal to or greater than 50 are listed on the branches. The scale bar indicates the genetic distances.
Figure 8. Phylogenetic tree of Paraconiothyrium sp. and 32 other species based on the neighbor-joining method. Bolds are the species involved in the phylogenetic analysis. The number below the branch indicates bootstrap values obtained from 1000 replications. Red numbers on the branch nodes indicate bootstrap values. Only bootstrap values equal to or greater than 50 are listed on the branches. The scale bar indicates the genetic distances.
Genes 13 00338 g008
Figure 9. Cell survival rate with different additions. Blue indicates that the additive is PDA liquid medium containing Paraconiothyrium sp. metabolites. Red indicates that the additive is DMSO, which was used as a positive control. X-axis indicates the different volumes added, from left to right, 0, 1, 2, 4, 6, 8, and 10 µL. Y-axis indicates the cell survival rate, where the survival rate without the addition is 100%.
Figure 9. Cell survival rate with different additions. Blue indicates that the additive is PDA liquid medium containing Paraconiothyrium sp. metabolites. Red indicates that the additive is DMSO, which was used as a positive control. X-axis indicates the different volumes added, from left to right, 0, 1, 2, 4, 6, 8, and 10 µL. Y-axis indicates the cell survival rate, where the survival rate without the addition is 100%.
Genes 13 00338 g009
Table 1. Main assembly features of Paraconiothyrium sp.
Table 1. Main assembly features of Paraconiothyrium sp.
Assembly Features and Genome FeaturesValue
Total genome size (Mb)39.55
Max Length (bp)6,769,356
N50 (bp)4,916,146
N90 (bp)2,311,653
GC (%)51.36
Number of genes14,174
Number of exons38,925
Number of CDS14,174
Number of introns24,751
Gene length/genome length (%)4.43
Exons length/genome length (%)55.61
CDS length/genome length (%)50.75
Intron length/genome length (%)4.86
Average gene length (bp)1552
Average exon length (bp)515.7
Average CDS length (bp)1416
Average intron length (bp)77.7
tRNA114
Table 2. Statistics of annotation of different databases.
Table 2. Statistics of annotation of different databases.
DatabaseNumberPercent
AnnotatedP450243617.18%
CAZY5954.19%
KINASE1410.99%
SWISSPROT353124.91%
NOG924865.24%
COG149010.51%
CARD20.01%
NR12,41187.56%
DBCAN7825.51%
TCDB6204.37%
IPR10,22072.10%
PHI12769%
KEGG465532.84%
GO723851.06%
PHOSPHATASE350.24%
KOG220115.52%
Unannotated-13429.47%
Total-12,83290.53%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fu, Z.-Y.; An, J.-Q.; Liu, W.; Zhang, H.-P.; Yang, P. Genomic Analyses of the Fungus Paraconiothyrium sp. Isolated from the Chinese White Wax Scale Insect Reveals Its Symbiotic Character. Genes 2022, 13, 338. https://doi.org/10.3390/genes13020338

AMA Style

Fu Z-Y, An J-Q, Liu W, Zhang H-P, Yang P. Genomic Analyses of the Fungus Paraconiothyrium sp. Isolated from the Chinese White Wax Scale Insect Reveals Its Symbiotic Character. Genes. 2022; 13(2):338. https://doi.org/10.3390/genes13020338

Chicago/Turabian Style

Fu, Zuo-Yi, Jia-Qi An, Wei Liu, Hong-Ping Zhang, and Pu Yang. 2022. "Genomic Analyses of the Fungus Paraconiothyrium sp. Isolated from the Chinese White Wax Scale Insect Reveals Its Symbiotic Character" Genes 13, no. 2: 338. https://doi.org/10.3390/genes13020338

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop