Next Article in Journal
Improved Script Identification Algorithm Using Unicode-Based Regular Expression Matching Strategy
Previous Article in Journal
Terrestrial Carbon Storage Estimation in Guangdong Province (2000–2021)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Data Descriptor

Linking Fungal Genomics to Thermal Growth Limits: A Dataset of 730 Sequenced Species

School of Physics & Astronomy, Cardiff University, 4 The Parade, Cardiff CF24 3AA, UK
Data 2025, 10(4), 42; https://doi.org/10.3390/data10040042
Submission received: 10 February 2025 / Revised: 18 March 2025 / Accepted: 21 March 2025 / Published: 25 March 2025

Abstract

:
The response of fungal species to changes in temperature is of theoretical and practical importance in a world of changing temperatures, ecologies and populations. Genomic sequencing to identify fungal species and their potential metabolic capabilities is well established, but linking this to growth temperature conditions has been limited. To that end, I describe a dataset that brings together the maximum and minimum temperature growth limits for 730 species of Fungi and Oomycetes for which genome sequences are available, together with supporting proteome and taxonomic data and literature references. The set will provide an entry for studies into how genomic structure and sequence can be used to predict the potential for growth at low or high temperatures, and hence the potential industrial use or pathogenic liability of existing or new fungal species.
Dataset: Available in Supplementary Information to this paper.
Dataset License: This dataset is made available under Creative Commons license CC-BY-NC-SA

1. Summary

This paper describes a compilation of information on fungal species for which genome sequences and growth temperatures are known. The data were collected as part of a project to explore the genomic and proteomic parameters that relate to the maximum temperature at which eukaryotic life can flourish.
Organisms are known that can grow at 120 °C [1,2,3], but no eukaryote is known that can grow above 65 °C [4] (although many can withstand higher temperatures for a short period, or as dormant forms). In particular, fungi show a range of optimal growth temperatures from below 15 °C to 60 °C, and some can grow below 0 °C [5,6,7,8]. Exploring what sets these limits is of fundamental interest to understanding the limits of life on Earth, and may inform whether complex life could evolve on other worlds with substantially different surface temperatures from Earth [9]. It is also of substantial practical importance. Enzymes from thermophilic organisms are widely used in industrial processes [10,11,12,13,14]. Of medical relevance, fungi are particularly difficult infectious agents to treat because they share substantial biochemical similarity to mammals, unlike bacteria or protozoa [15], and fungal infections cause over 1.5 million deaths a year [13]. However the majority of fungi cannot grow above around 32 °C, and so cannot colonize mammalian hosts [13,16]. If any organism, including fungi but also other classes, can adapt to higher temperatures, then they become potential opportunistic pathogens [13,17]. This has already been observed with Candida albicans, a commensal organism that historically has been identified as only growing in human body niches that maintain a temperature well below 37 °C, such as the vaginal passage, where it causes thrush [18], but increasingly has been suggested to be adapting to the higher-temperature bloodstream and causing systemic infection [19], and for Cryptococcus gattii, which occupies an ecological niche in rotting trees but in recent decades has become an increasingly frequent pathogen in humans [20]. With global climate change resulting in some entire ecosystems shifting to higher maximum temperatures, there is concern that the adaptation of previously benign fungal species to new temperature regimes could result in ‘new’ infectious species [21,22,23,24].
Therefore, there are several reasons to want to understand the causes of the temperature limits of fungal growth, and how temperature limits might change under selection for growth at higher temperatures. The dataset reported here was conceived as a basis for the exploration of whether genome structure and function are related to the maximum and minimum growth temperatures of fungi, so that the potential for a species to evolve a higher (or lower) growth temperature limit could be inferred without the need for long-term in vitro evolution experiments [25]. While a number of such studies have been performed (e.g., [26,27,28,29,30]), they all relate to specific groups of fungi or to very limited comparisons between related species. Extensive databases of genome sequences are available (referenced below in Section 2), but there is no comprehensive listing of species for which growth temperature limits have been reported in the literature.
I therefore sought to identify all the fungal species for which growth temperature limits and genome sequences are known. To that end, an extensive literature search identified 699 fungal species for which both maximum and minimum growth temperatures and genome sequences are available. This substantially extends the previous compilation of fungal cardinal temperatures. I have also included 31 Oomycetes in the dataset as an outgroup. While morphologically similar to fungi, Oomycetes are taxonomically distant from the opisthokonts (fungi, animals and related taxa), and so probe whether patterns seen might be fungi specific.
This paper describes this dataset in the hope that other researchers can bring their own analytical methods to bear on the question of what limits eukaryotic growth temperature, for basic and applied goals.
This is a preliminary version of this dataset. In particular, it assumes that the growth temperature limits of a fungal species are characteristic of that species, as they are of mammals and birds. However, different isolates of the same fungus can show different maximum growth temperatures (e.g., [13,31,32,33]), and maximum growth temperature can depend on growth conditions [13,34] and whether bimorphic species’ growth as a yeast or hyphal form is considered [13]. I have taken the highest recorded maximum temperature (and the lowest minimum temperature) here as representing the limits for any species, but a future dataset could probe the growth temperatures of each isolate under a range of conditions and compare them to the genome sequence of that isolate.

2. Data Description

The data comprise genome and growth temperature information on 699 fungal and 31 Oomycete species, of which 505 have translated proteomes available. The data are available as an Excel spreadsheet, with columns as described in Table 1. The spreadsheet file has five sheets: a ‘Read Me’, one sheet containing the data as listed in Table 1, one sheet implementing the growth model as described in Section 3.4, one sheet with the references for Table 1 and one for the compiled and processed Togashi dataset [35,36]. This last sheet is included for ‘future proofing’ for future genome project outputs. A ‘Read Me’ summarizes this information, and provides detail on how to run the temperature estimate model.
The downloaded proteome and genome sequence files are not provided here, as they are the property of other institutions. However, they can easily be downloaded from the websites listed in Table 2 using the species names given.
Data are only for species for which there is a literature citation for the maximum and the minimum growth temperature and extensive genome sequence data. I note that data on the rate of growth of fungal species at different temperatures probably exist for many more species than I have been able to include here. Many papers report that they grew a fungal species at many temperatures, but only report the temperature for optimal growth, not what the growth rate was at other temperatures. Future work could attempt to contact the many hundreds of workers reporting data in this way to extend the dataset. For other species, no growth data were traceable at all. For example, of the 59 species analyzed from [37], 41 had no traceable growth temperature data. Future experimental programmes might rectify this gap.
Species are listed by their names as listed in nmdc.cn/fungalnames. This is not to claim that this source of nomenclature is authoritative, but is simply used to remove duplications in the dataset, as fungi can be known by several names, either because the classification of the species has changed with greater knowledge or because, for historical reasons, the same organism in different parts of its life cycle has acquired different names. Alternative names are listed in the dataset for search purposes.
All species have genome sequence data available, and the size, fraction of the bases in the genome that are Guanine or Cytosine (GC content) and (where annotated by the genome sequence depositor or by NCBI) number of coding sequences are described. For species where the complete proteome is available for download, the source of that proteome is listed. Summary proteome characteristics computed from these files are provided.
The maximum and minimum temperatures have been identified either from the literature search or from the Togashi database [35], as updated and digitized by [36]. Togashi entries are listed separately, and, if temperature estimates from both the literature and Togashi are present, the minimum and maximum temperatures are shown in the summary column.
A summary of higher-level taxonomic classification is also provided.
The sources of the data are summarized in Figure 1. Note that the ’Uniprot’ set and ‘Literature set’ (species identified through searching Uniprot fungal species and those identified through the literature search) are mutually exclusive, as any species identified by the literature search was then not searched as part of the Uniprot search (see Methods below for more details).
The taxonomic distribution of species is summarized in Figure 2. The study included a small number of Oomycetes, which have a morphology and lifestyle similar to some fungi but are actually quite distantly related (fungi are more closely related to animals than to Oomycetes).
The relative numbers of species in the different classification groups shown in Figure 2 roughly match the known number of genera of fungi identified from large-scale genomic comparisons [38].
Some summary statistics on the dataset, broken down by classification category, are provided in Appendix A.

3. Methods

Data were collected through three routes: a literature search, a Uniprot search and a Togashi dataset analysis. Sources for the data are listed in Table 2.

3.1. Literature Search

The primary source of growth temperature data was a literature search. The literature was searched for papers through a search of Google Scholar using the keyword combinations in Table 3. The term ‘cardinal temperature’ is used in the literature for the range of temperatures over which a phenomenon is observed. In the study reported here, the phenomenon being observed at different temperatures was the rate of growth of mycelia. All hits identified by Google Scholar search where the full text was available to the author (around 95% of the references were accessible) were then searched for any data relevant to the rate of growth of the organism reported in the paper.
For a broad search for any data on any fungal species for which growth temperature was known, the ‘Literature first’ search strategy was adopted. For a narrower search for growth temperatures for fungi whose proteome was in the UniProt reference collection or for which genome data were available, the ‘Uniprot first’ and ‘Genome’ strategies were adopted. The same search strategy of reading accessible papers for temperature information was used in these searches.
Papers identified by any of these searches were then searched for relevant data, and, regardless of whether they contained such data, were searched for references to other papers on fungal growth, and the citations to that paper as identified in Google Scholar were also searched for potential data. This process was repeated until no new papers could be found.

3.2. Genome Data Search

For every fungal species identified through searching for temperature data, the NCBI Genome data resource and the JGI Mycocosm portal were searched for genome sequence information. If no entries were found in those resources, or if the data were incomplete (for example, lacking GC content or coding region data), the literature was again searched for genome sequence data on that species. The concern of this project was not for a complete, gap-free, chromosomal-level sequence assembly, but for a sufficiently complete set of genome sequence data to be listed in the NCBI or JGI as a genome sequence, or to be reported as a draft sequence in the literature.

3.3. Undermind Search

The manual search of the literature was complemented with a search using the AI search tool Undermind. Undermind is an AI-enabled literature search engine that takes a search query, uses a language model to recognize key concepts in the search query and in an initial literature search, and then uses the concepts educed from that analysis to perform repeated searches, adapting its search strategy as the repeated searches are conducted. In this, it claims to mimic how a human researcher would search the literature. Within the study reported here, the search phrase used at the start was “Synthesizing a dataset of fungal species by integrating genome data (from any source, with preference for genomes annotated with coding regions numbers) with experimentally determined minimum and maximum growth temperatures, prioritizing completeness across fungal diversity and actively seeking extremophilic species”. This identified a few additional species for which both temperature and genome data were available.

3.4. Uniprot Search

As one goal of the overall project was to compare the proteomes of species with different temperature tolerances, I downloaded all the reference proteomes from the Uniprot database and identified fungi in this set, as these all have extensive genome sequence information listed by NCBI in their ‘genomes’ datasets or in JGI in the Mycocosm portal. I then searched the literature for growth temperature data using the more relaxed ‘Uniprot Search’ criteria shown in Table 3.

3.5. Maximum and Minimum Temperature Estimate

The way that growth is reported varies hugely in different studies. For those that show a detailed curve of growth rate vs. temperature, the maximum growth temperature is easily identified, and the minimum is either clear or is the temperature at which growth is <0.05 times the peak growth rate. A number of papers stated the maximum and minimum growth temperatures but without providing supporting data, and these values were transcribed directly into the database. However, a number of papers reported growth at a number of temperatures which included a temperature above the maximum growth temperature (at which no growth happens) but not one approaching the minimum. For these papers, the parameters in Equation (1) were optimized to fit the observed growth data.
G = M a x i m u m 0 , Q 10 · k · T · e A T f
where G is the growth rate (in arbitrary units), Q10, k, A and f are arbitrary constants to be optimized and T is the temperature in degrees Celsius. The predictions from Equation (1) are scaled so that the maximum growth rate is the same as the maximum growth rate as reported in the literature report, so the two can be compared. The equation assumes that growth rates below zero are meaningless, so a minimum rate of 0 is imposed on the calculation.
The formula is derived empirically, derived from two parts. The first set of square brackets is from the observation that the growth rate of a wide range of fungi for which detailed growth vs. temperature curves are available is linearly proportional to temperature (not, as might be expected, exponentially proportional) for temperatures well below their maximum growth temperature. The second term in square brackets is derived from the exponential decrease in growth rate at greater-then-optimal temperatures, caused by increased protein denaturation, metabolite reaction and other processes that limit cellular viability.
Note that this is not meant to be a ‘model’ of fungal growth. Rather, it is a numerical match for empirical observations about the shapes of the growth rate vs. temperature curves for those species where such curves were reported.
The methods are summarized in Table 4, together with the code used in the dataset, to show which method was used.

3.6. Togashi Analysis

Kogo Togashi compiled a set of individual observations of the growth temperatures of fungi and oomycetes that are pathogens in plants in 1949 [35] from a manual reading of the primary literature. Ref. [36] has digitized and extended this dataset, and their digitized version was further analyzed for this study. The set of species reported in [36] was searched for species for which a maximum temperature and a minimum growth temperature were recorded. All species for which a maximum and a minimum were recorded were then searched in NCBI and JGI for genome information, and those with genome information were added to the dataset reported here.
Where growth temperatures were available from both the literature and the Togashi dataset, a ‘Combined Data’ temperature was calculated as the lower of the two minimum growth temperature and the higher of the two maximum growth temperatures. The reason for this is that if one study does not find growth at a specific temperature but another does, then that species has been demonstrated to grow at that temperature under at least one set of circumstances.

3.7. Naming and De-Duplicatiom

Because the same fungus can appear in the literature under several names, the name by which a species was reported in the literature was searched in the Fungal Names database, and the name listed as ‘Current name’ in the Fungal Names database was extracted. This name was used as a unique identifier for the species to ensure a unique species per row.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/data10040042/s1.

Funding

This research received no external funding.

Data Availability Statement

All data developed in this study are available for download, as above.

Acknowledgments

I am grateful to the referees for this paper, whose constructive comments improved its clarity significantly.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A

In this Appendix I provide some summary statistics for the dataset, to illustrate the breadth and limitations of the data.
Figure A1. Frequency distribution of genome sizes in the database. X axis: genome size in 2 megabase increments. Y axis: number of entries in the database with that genome size. The major categories of taxonomy as described in Figure 2 of the main text are colored separately. Note that the yeasts (Saccharomycotine) have much smaller genomes than other fungi, but they are not uniquely small.
Figure A1. Frequency distribution of genome sizes in the database. X axis: genome size in 2 megabase increments. Y axis: number of entries in the database with that genome size. The major categories of taxonomy as described in Figure 2 of the main text are colored separately. Note that the yeasts (Saccharomycotine) have much smaller genomes than other fungi, but they are not uniquely small.
Data 10 00042 g0a1
Figure A2. Frequency distribution of proteome sizes in the database. X axis: genome size in 500 amino acid increments. Y axis: number of entries in the database with that genome size. The major categories of taxonomy as described in Figure 2 of the main text are coloured separately. Note that the yeasts (Saccharomycotine) have much smaller proteomes than other fungi, in line with their smaller genomes, but not uniquely small.
Figure A2. Frequency distribution of proteome sizes in the database. X axis: genome size in 500 amino acid increments. Y axis: number of entries in the database with that genome size. The major categories of taxonomy as described in Figure 2 of the main text are coloured separately. Note that the yeasts (Saccharomycotine) have much smaller proteomes than other fungi, in line with their smaller genomes, but not uniquely small.
Data 10 00042 g0a2
Figure A3. Frequency distribution of genome G+C content in the database. G+C content as a percentage of bases in the genome. Y axis: number of entries in the database with that genome size. The major categories of taxonomy as described in Figure 2 of the main text are colored separately.
Figure A3. Frequency distribution of genome G+C content in the database. G+C content as a percentage of bases in the genome. Y axis: number of entries in the database with that genome size. The major categories of taxonomy as described in Figure 2 of the main text are colored separately.
Data 10 00042 g0a3
Figure A4. Plot of genome size vs. temperature limits for the species in the dataset. Y axis: genome size in megabases, plotted on a log scale. Y axis: (A) minimum growth temperature. (B) maximum growth temperature. There is a weak tendency for the largest genomes to belong to mesophilic fungi (i.e., species minimum growth temperatures between 0 °C and 20 °C and maximum growth temperatures between 25 °C and 40 °C), but the linear correlation coefficients (R2 = 0.016 for maximum temperature and R2 = 0.001 for minimum temperature) are not significant.
Figure A4. Plot of genome size vs. temperature limits for the species in the dataset. Y axis: genome size in megabases, plotted on a log scale. Y axis: (A) minimum growth temperature. (B) maximum growth temperature. There is a weak tendency for the largest genomes to belong to mesophilic fungi (i.e., species minimum growth temperatures between 0 °C and 20 °C and maximum growth temperatures between 25 °C and 40 °C), but the linear correlation coefficients (R2 = 0.016 for maximum temperature and R2 = 0.001 for minimum temperature) are not significant.
Data 10 00042 g0a4

References

  1. Slesarev, A.I.; Mezhevaya, K.V.; Makarova, K.S.; Polushin, N.N.; Shcherbinina, O.V.; Shakhova, V.V.; Belova, G.I.; Aravind, L.; Natale, D.A.; Rogozin, I.B.; et al. The complete genome of hyperthermophile Methanopyrus kandleri AV19 and monophyly of archaeal methanogens. Proc. Nat. Acad. Sci. USA 2002, 99, 4644–4649. [Google Scholar] [CrossRef] [PubMed]
  2. Takai, K.; Nakamura, K.; Toki, T.; Tsunogai, U.; Miyazaki, M.; Miyazaki, J.; Hirayama, H.; Nakagawa, S.; Nunoura, T.; Horikoshi, K. Cell proliferation at 122 °C and isotopically heavy CH4 production by a hyperthermophilic methanogen under high-pressure cultivation. Proc. Nat. Acad. Sci. USA 2008, 105, 10949–10954. [Google Scholar]
  3. Holden, J. Some like it hot: Understanding the limits of life using hyperthermophilic microbes. In Proceedings of the ABSTRACT 37th COSPAR Scientific Assembly, Montreal, QC, Canada, 13–20 July 2008; p. 1259. [Google Scholar]
  4. Clarke, A. The thermal limits to life on Earth. Int. J. Astrobiol. 2014, 13, 141–154. [Google Scholar] [CrossRef]
  5. de Oliveira, T.B.; Gomes, E.; Rodrigues, A. Thermophilic fungi in the new age of fungal taxonomy. Extremophiles 2015, 19, 31–37. [Google Scholar] [CrossRef] [PubMed]
  6. Tendulkar, S.; Hattiholi, A.; Chavadar, M.; Dodamani, S. Psychrophiles: A journey of hope. J. Biosci. 2021, 46, 64. [Google Scholar] [CrossRef]
  7. Gostinčar, C.; Grube, M.; De Hoog, S.; Zalar, P.; Gunde-Cimerman, N. Extremotolerance in fungi: Evolution on the edge. FEMS Microbiol. Ecol. 2009, 71, 2–11. [Google Scholar] [CrossRef]
  8. Saccomori, F.; Wigmann, É.F.; Bernardi, A.O.; de Jesús Alcano-González, M.; Copetti, M.V. Influence of storage temperature on growth of Penicillium polonicum and Penicillium glabrum and potential for deterioration of frozen chicken nuggets. Int. J. Food Microbiol. 2015, 200, 1–4. [Google Scholar]
  9. Bains, W.; Schulze-Makuch, D. The cosmic zoo: The (near) inevitability of the evolution of complex, macroscopic life. Life 2016, 6, 25. [Google Scholar] [CrossRef]
  10. Maheshwari, R.; Bharadwaj, G.; Bhat, M.K. Thermophilic fungi: Their physiology and enzymes. Microbiol. Mol. Biol. Rev. 2000, 64, 461–488. [Google Scholar]
  11. Morgenstern, I.; Powlowski, J.; Ishmael, N.; Darmond, C.; Marqueteau, S.; Moisan, M.-C.; Quenneville, G.; Tsang, A. A molecular phylogeny of thermophilic fungi. Fungal Biol. 2012, 116, 489–502. [Google Scholar] [CrossRef]
  12. Unsworth, L.D.; van der Oost, J.; Koutsopoulos, S. Hyperthermophilic enzymes − stability, activity and implementation strategies for high temperature applications. FEBS J. 2007, 274, 4044–4056. [Google Scholar] [CrossRef]
  13. Vieille, C.; Zeikus, G.J. Hyperthermophilic Enzymes: Sources, Uses, and Molecular Mechanisms for Thermostability. Microbiol. Mol. Biol. Rev. 2001, 65, 1–43. [Google Scholar] [CrossRef] [PubMed]
  14. Abo Nouh, F.A.; Gezaf, S.A.; Abo Nahas, H.H.; Abo Nahas, Y.H.; Vargas-De-La-Cruz, C.; Solorzano Acosta, R.A.; Landa-Acuña, D.; Luis-Alaya, B.; Abdel-Azeem, A.M. Bioprospecting for biomolecules from different fungal communities: An introduction. In Industrially Important Fungi for Sustainable Development: Volume 2: Bioprospecting for Biomolecules; Springer: Berlin/Heidelberg, Germany, 2021; pp. 1–71. [Google Scholar]
  15. Mota Fernandes, C.; Dasilva, D.; Haranahalli, K.; McCarthy, J.B.; Mallamo, J.; Ojima, I.; Del Poeta, M. The Future of Antifungal Drug Therapy: Novel Compounds and Targets. Antimicrob. Agents Chemother. 2021, 65, 10-1128. [Google Scholar] [CrossRef] [PubMed]
  16. Casadevall, A. Fungi and the rise of mammals. PLoS Pathog. 2012, 8, e1002808. [Google Scholar]
  17. Robert, V.; Cardinali, G.; Casadevall, A. Distribution and impact of yeast thermal tolerance permissive for mammalian infection. BMC Biol. 2015, 13, 18. [Google Scholar] [CrossRef]
  18. Truss, C.O. The role of Candida albicans in human illness. J. Orthomol. Psychiatry 1981, 10, 228–238. [Google Scholar]
  19. Concia, E.; Azzini, A.M.; Conti, M. Epidemiology Incidence and Risk Factors for Invasive Candidiasis in High-Risk Patients. Drugs 2009, 69, 5–14. [Google Scholar] [CrossRef]
  20. Chen, S.C.-A.; Meyer, W.; Sorrell, T.C. Cryptococcus gattii infections. Clin. Microbiol. Rev. 2014, 27, 980–1024. [Google Scholar]
  21. Van Rhijn, N.; Bromley, M. The consequences of our changing environment on life threatening and debilitating fungal diseases in humans. J. Fungi 2021, 7, 367. [Google Scholar] [CrossRef]
  22. Nnadi, N.E.; Carter, D.A. Climate change and the emergence of fungal pathogens. PLoS Pathog. 2021, 17, e1009503. [Google Scholar] [CrossRef]
  23. Seidel, D.; Wurster, S.; Jenks, J.D.; Sati, H.; Gangneux, J.-P.; Egger, M.; Alastruey-Izquierdo, A.; Ford, N.P.; Chowdhary, A.; Sprute, R. Impact of climate change and natural disasters on fungal infections. Lancet Microbe 2024, 5, e594–e605. [Google Scholar] [CrossRef]
  24. Frías-De-León, M.G.; Brunner-Mendoza, C.; del Rocío Reyes-Montes, M.; Duarte-Escalante, E. The Impact of Climate Change on Fungal Diseases; Springer: Berlin/Heidelberg, Germany, 2022. [Google Scholar]
  25. Dallinger, W.H. The president’s address. J. R. Microsc. Soc. 1887, 7, 185–199. [Google Scholar] [CrossRef]
  26. Hu, E.-Z.; Lan, X.-R.; Liu, Z.-L.; Gao, J.; Niu, D.-K. A positive correlation between GC content and growth temperature in prokaryotes. BMC Genom. 2022, 23, 110. [Google Scholar] [CrossRef] [PubMed]
  27. Sauer, D.B.; Wang, D.-N. Predicting the optimal growth temperatures of prokaryotes using only genome derived features. Bioinformatics 2019, 35, 3224–3231. [Google Scholar] [CrossRef]
  28. Liu, H.; Zheng, G.; Chen, Z.; Ding, X.; Wu, J.; Zhang, H.; Jia, S. Psychrophilic Yeasts: Insights into Their Adaptability to Extremely Cold Environments. Genes 2023, 14, 158. [Google Scholar] [CrossRef]
  29. van Noort, V.; Bradatsch, B.; Arumugam, M.; Amlacher, S.; Bange, G.; Creevey, C.; Falk, S.; Mende, D.R.; Sinning, I.; Hurt, E.; et al. Consistent mutational paths predict eukaryotic thermostability. BMC Evol. Biol. 2013, 13, 7. [Google Scholar] [CrossRef] [PubMed]
  30. Baeza, M.; Sepulveda, D.; Cifuentes, V.; Alcaíno, J. Codon usage bias in yeasts and its correlation with gene expression, growth temperature, and protein structure. Front. Microbiol. 2024, 15, 1414422. [Google Scholar] [CrossRef]
  31. Colhoun, J. Effects of environmental factors on plant diseases. Annu. Rev. Phytopathol. 1973, 11, 343–364. [Google Scholar] [CrossRef]
  32. Ellison, C.E.; Hall, C.; Kowbel, D.; Welch, J.; Brem, R.B.; Glass, N.; Taylor, J.W. Population genomics and local adaptation in wild isolates of a model microbial eukaryote. Proc. Natl. Acad. Sci. USA 2011, 108, 2831–2836. [Google Scholar] [CrossRef]
  33. Zimmermann, G. The entomopathogenic fungi Isaria farinosa (formerly Paecilomyces farinosus) and the Isaria fumosorosea species complex (formerly Paecilomyces fumosoroseus): Biology, ecology and use in biological control. Biocontrol Sci. Technol. 2008, 18, 865–901. [Google Scholar] [CrossRef]
  34. Solieri, L.; Gullo, M.; Giudici, P. Traditional balsamic vinegar: A microbiological overview. In Handbook of Plant-Based Fermented Food and Beverage Technology, 2nd ed.; CRC Press, Taylor and Francis Group: New York, NY, USA, 2012; pp. 609–630. [Google Scholar]
  35. Togashi, K. Biological Characters of Plant Pathogens: Temperature Relations; Meibundo: Ishikawa, Japan, 1949. [Google Scholar]
  36. Bebber, D.; Chaloner, T.; Gurr, S. Fungal and Oomycete cardinal temperatures (the Togashi dataset). Dryad 2020. [Google Scholar] [CrossRef]
  37. Li, H.; Wu, S.; Ma, X.; Chen, W.; Zhang, J.; Duan, S.; Gao, Y.; Kui, L.; Huang, W.; Wu, P.; et al. The Genome Sequences of 90 Mushrooms. Sci. Rep. 2018, 8, 9982. [Google Scholar] [CrossRef]
  38. Li, Y.; Steenwyk, J.L.; Chang, Y.; Wang, Y.; James, T.Y.; Stajich, J.E.; Spatafora, J.W.; Groenewald, M.; Dunn, C.W.; Hittinger, C.T.; et al. A genome-scale phylogeny of the kingdom Fungi. Curr. Biol. 2021, 31, 1653–1665.e1655. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Sources of data for entries in the dataset. Shown are the number of entries identified by the literature search (“Literature”), those not identified by the literature search but that were identified by searching for Uniprot species names (“Uniprot”), those identified from the Togashi dataset (“Togashi”) and those identified by an AI-assisted search using Undermind (“Undermind”).
Figure 1. Sources of data for entries in the dataset. Shown are the number of entries identified by the literature search (“Literature”), those not identified by the literature search but that were identified by searching for Uniprot species names (“Uniprot”), those identified from the Togashi dataset (“Togashi”) and those identified by an AI-assisted search using Undermind (“Undermind”).
Data 10 00042 g001
Figure 2. Taxonomic distribution of the species included in the dataset. The numbers under each taxonomic class show the number of species in that class in the database. Note that the majority of the fungi are in the Sacchaomycotina (yeasts), Pezizomycotina and Agaricomyces orders, reflecting both larger number of species and greater industrial interest in these groups of organisms.
Figure 2. Taxonomic distribution of the species included in the dataset. The numbers under each taxonomic class show the number of species in that class in the database. Note that the majority of the fungi are in the Sacchaomycotina (yeasts), Pezizomycotina and Agaricomyces orders, reflecting both larger number of species and greater industrial interest in these groups of organisms.
Data 10 00042 g002
Table 1. Data columns in the dataset.
Table 1. Data columns in the dataset.
GroupColumnDescriptionValues
SPECIESName from Fungal NamesUniform name from Fungal Names (https://nmdc.cn/fungalnames/) Text
Source nameName of species as described in the source materialText
Other name(s)Other names by which this species is referred to in the relevant literature, if anyText
GENOMESGenome lengthReported length of the genome, in megabasesReal number
Protein numberReported number of proteins in the genome annotationInteger
GC contentG+C percentage of the genomePercentage number
ReferenceLiterature reference for additional genome data, if not included in NCBI or Mycocosm databasesText
PROTEOMESProteome file sourceSource of the full proteome file, if one is available: Uniprot reference dataset, and, if not available there, the NCBI genome resource, and, if not available there, the JGI Mycocosm databaseText
number of proteinsNumber of proteins in the proteome fileInteger
total number of amino acidsTotal number of amino acids in the proteins in the proteome fileInteger
TEMPERATURE RANGEMinMinimum growth temperature as reported in the literature, as °CReal number
MaxMaximum growth temperature as reported in the literature, as °CReal number
BasisBasis for estimating minimum and maximum temperature (see Section 3.5)Text
ReferenceReference for source of minimum and maximum growth temperatureText
TOGASHI TEMPERATURE RANGEMinMinimum growth temperature as reported in the Togashi database, as °CReal number
MaxMaximum growth temperature as reported in the Togashi database, as °CReal number
Number of entries in Togashi dbNumber of entries in the Togashi database from which minimum and maximum growth temperatures were educedInteger
COMBINED TEMPERATURESMINMinimum of the estimates of minimum growth temperature, as °CReal number
MAXMaximum of the estimates of maximum growth temperature, as °CReal number
TAXONOMYComplete taxonomyComplete taxonomic description as listed in NBCI Taxonomy databaseText
CladeTop-level cladeText
SubkingdomSubkingdom (for fungi)Text
DivisionDivision (for fungi)Text
OrderOrder where specified (for fungi)Text
Table 2. Sources of data used.
Table 2. Sources of data used.
SourceRelevant DataURLAccess Dates
Google ScholarResearch-specific search engine https://google.scholar.comVaried dates in 2023 and 2024
NCBI Genome databaseData on genome sequence and coding capacity, proteomeshttps://www.ncbi.nlm.nih.gov/datasets/
(used to be https://www.ncbi.nlm.nih.gov/genome/)
Varied dates in 2023 and 2024
Uniprot proteome databaseReference proteome datahttps://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/reference_proteomes/Eukaryota/20 June 2024
JGI Mycocosm Fungal genome statistics, proteomeshttps://mycocosm.jgi.doe.gov/mycocosm/homeVaried dates in 2023 and 2024
Fungal namesFungal names and synonymshttps://nmdc.cn/fungalnames/Varied dates in 2023 and 2024
UndermindAI-enabled literature search enginehttps://www.undermind.ai/27 December 2024
Togashi databaseDetailed database on growth temperatures of fungihttps://datadryad.org/stash/dataset/doi:10.5061/dryad.tqjq2bvw628 August 2024
Table 3. Search terms used in Google Scholar.
Table 3. Search terms used in Google Scholar.
Search TypeRelevant Data
Literature first(“fungi” OR “fungal”) AND “mycelial growth” AND (“maximum temperature” OR “minimum temperature” OR “cardinal temperature”)
(“fungi” OR “fungal”) AND (“maximum growth temperature” OR “minimum growth temperature”)
Uniprot first[fungal name] AND (“growth temperature” OR “cardinal temperature”)
Genome[fungal name] AND “genome sequence”
Table 4. Maximum and minimum temperature identification method.
Table 4. Maximum and minimum temperature identification method.
Search TypeRelevant Data
GraphRead from a graph of growth vs. temperature provided in the paper
StatedMaximum and minimum growth temperature stated in a paper, with or without accompanying graphical data
ModelTemperature limits derived from Equation (1), as described above
TogashiIdentified from the Togashi dataset, as described in Section 3.5
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bains, W. Linking Fungal Genomics to Thermal Growth Limits: A Dataset of 730 Sequenced Species. Data 2025, 10, 42. https://doi.org/10.3390/data10040042

AMA Style

Bains W. Linking Fungal Genomics to Thermal Growth Limits: A Dataset of 730 Sequenced Species. Data. 2025; 10(4):42. https://doi.org/10.3390/data10040042

Chicago/Turabian Style

Bains, William. 2025. "Linking Fungal Genomics to Thermal Growth Limits: A Dataset of 730 Sequenced Species" Data 10, no. 4: 42. https://doi.org/10.3390/data10040042

APA Style

Bains, W. (2025). Linking Fungal Genomics to Thermal Growth Limits: A Dataset of 730 Sequenced Species. Data, 10(4), 42. https://doi.org/10.3390/data10040042

Article Metrics

Back to TopTop