Next Article in Journal
MultimodalGasData: Multimodal Dataset for Gas Detection and Classification
Previous Article in Journal
Grapevine Plant Image Dataset for Pruning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Data Descriptor

Tropical Wood Species Recognition: A Dataset of Macroscopic Images

by
Daniel Alejandro Cano Saenz
1,
Carlos Felipe Ordoñez Urbano
1,
Holman Raul Gaitan Mesa
2 and
Rubiel Vargas-Cañas
1,*
1
Physics Department, Universidad del Cauca, Popayan 190002, Colombia
2
Environmental Management and Forest Governance, Regional Autonomous Corporation of Cauca, Popayan 190002, Colombia
*
Author to whom correspondence should be addressed.
Data 2022, 7(8), 111; https://doi.org/10.3390/data7080111
Submission received: 30 June 2022 / Revised: 24 July 2022 / Accepted: 30 July 2022 / Published: 11 August 2022

Abstract

:
Forests are of incalculable value due to the ecosystem services they provide to humanity such as carbon storage, climate regulation and participation in the hydrological cycle. The threat to forests grows as the population increases and the activities that are carried out in it, such as: cattle rearing, illegal trafficking, deforestation and harvesting. Moreover, the environmental authorities do not have sufficient capacity to exercise strict control over wood production due to the vast variety of timber species within the countries, the lack of tools to verify timber species in the supply chain and the limited available and labelled digital data of the forest species. This paper presents a set of digital macroscopic images of eleven tropical forest species, which can be used as support at checkpoints, to carry out studies and research based on macroscopic analysis of cross-sectional images of tree species such as: dendrology, forestry, as well as algorithms of artificial intelligence. Images were acquired in wood warehouses with a digital magnifying glass following a protocol used by the Colombian Ministry of Environment, as well as the USA Forest Services and the International Association of Wood Anatomists. The dataset contains more than 8000 images with resolution of 640 × 480 pixels which includes 3.9 microns per pixel, and an area of (2.5 × 1.9) square millimeters where the anatomical features are exposed. The dataset presents great usability for academics and researchers in the forestry sector, wood anatomists and personnel who work with computational models, without neglecting forest surveillance institutions such as regional autonomous corporations and the Ministry of the Environment.
Dataset License: Creative Commons Attribution 4.0 International.

1. Summary

Forests have incalculable value due to the ecosystem services that they provide to humanity which transcend the tangible elements that can be extracted from them. Among those services are: carbon storage, climate regulation and participation in the hydrological cycle [1]. Globally, the threat to forests grows as the population increases because larger areas are required for agriculture, land conversion for urbanization and timber demand for paper and lumber. Such wood demand results in higher activity of wood extraction and commercialization, both legal and illegal [2,3]. In Colombia, more than half of its surface is covered by natural forests (51.35%) and of this area, 50% is located in zones where forest extraction activities are carried out [4,5]. According to the Ministry of the Environment [6] and ONF Andina [7] in their report “Use and legality of wood in Colombia”, they estimate that 47% of the wood consumed within the country comes from unknown or illegal sources; such a critical aspect generates a negative environmental and socioeconomic impact.
There is a wide variety of native and planted wood species of great interest to the scientific forestry community, and to governments seeking to regulate and protect their exploitation. Timber species differ slightly around the world and show variability in their visible features due to environmental conditions and other factors. In order to carry out studies on dendrology, forestry, as well as for algorithms on artificial intelligence, research is primarily based on macroscopic analysis of cross-sectional images of tree species [8,9], but despite the high biodiversity in the region, there is a limited availability of labeled digital data of forest species, on which studies of algorithms can be carried out to apply recognition and identification [10,11,12]. In addition, in several countries there is a lack of tools to help verify timber species [13,14,15,16] during any step of the supply chain; and the xylotheques that have large taxonomic samples of species are only physical, not digital which makes field identification difficult. The same is also true for universities that carry out different forestry studies and do not have these wood enclosures.
Therefore, this paper provides a set of digital macroscopic images of eleven tropical forest species existing in warehouses and describes the collecting process carried out to set up the digital repository. Images are in RGB format and from timber cross sections with a resolution of 640 × 480 pixels. Image acquisition was performed with a digital magnifying glass which was set at 3.9 microns per pixel, and an area of 2.5 × 1.9 square millimeters where the anatomical features are exposed for macroscopic analysis. The sample preparation process included the cross-section, which is a standard protocol used by the Colombian ministry of environment, as well as internationally practiced by USA forest services and the International Association of Wood Anatomists (IAWA) [17]; moreover, to improve and regularize color contrast, a soft application of water over the cross section was performed. The generated dataset presents great usability for academics in the forestry sector [18], for teaching and research, for wood anatomists and for personnel who work with computational models [19,20], without neglecting the forest surveillance institutions such as regional autonomous corporations and the environmental authorities.

2. Data Description

Out of all the possible existing timber species or those that can be accessed, eleven (11) forest species were selected, which are the most regularly present in the commercialization processes in the region and come from the Amazonian and Pacific regions of Colombia. These species have been rated as being at high risk for illegal timber trade, based on information from the Cauca Regional Autonomous Corporation and the Ministry of the Environment, which have personnel working in the monitoring process of illegal wood trafficking in those areas. Therefore, the dataset is composed of eleven folders, each one named using the scientific names of each forest species (Table 1), and within each folder, there is a ZIP file, named with the Colombian common name, containing macroscopic digital raw RGB images, which are in jpg format. Every picture name has the date of acquisition, and another zip file has also been added with raw images to complement the dataset.
For completeness, the description of the data is composed of three key names, which are useful to recognize the species within the region (Colombia) and around the world: the scientific name, the common Colombian name/and the Global Trade name (Table 1). The description also includes the family, wood type and the number of images in each folder. The specialists mainly use the scientific name and trade names that can be found in timber databases such as InsideWood [21], International Tropical Timber Organization (ITTO) [22] and others; however, the trade name has several variations within the countries.
This dataset shows different types of RGB images with varied textures that expose important features of each species such as fibers, pores, vessels, and parenchyma (Figure 1), which are essential in the identification and diagnosis of timber species. Pictures are from different parts of the log and the database is being populated with pictures of the same tree species coming from different locations, i.e., Amazon rain forest, Pacific rain forest, among others. Moreover, as shown in the examples, the captured sections have random orientations to add variability and make them more usable in producing automatic learning models. In addition, there are two planted species (Figure 1j,k) that do not exhibit the characteristics of native species.

Parameters of the Data Collection

  • Image Resolution: 640 × 480 pixels, based on the CMOS sensor size;
  • Magnification: 3.9 µm/pixel (microns per pixel);
  • Area of interest: 2.5 mm × 1.9 mm;
  • Lighting: artificial provided by the capturing device.
The quality of each image was rated on a scale of one to five according to the number of visible features, the focus of the photograph and the cutting technique quality. Some of the defects that are taken as poor quality criteria in the scale ranking are capturing boundaries or borders in the wood, images with a lack of visible features or focusing problems (Figure 2). Finally, images were ordered by date of capture and their names were generated automatically by the acquisition software, as shown below:
format 1: Day MM DD HH-MM-SS or YYMMDD_HHMM.
format 2: IPC_YYYY-MM-DD-HH-MM-SS.
Figure 2. Some defects found in the images. (a) Dialyanthera gracilipes, (b) Handroanthus chrysanthus, (c) Campnosperma panamensis, (d) Fraxinus uhdei. Cutting problems, boundaries, diffuse, lack of features.
Figure 2. Some defects found in the images. (a) Dialyanthera gracilipes, (b) Handroanthus chrysanthus, (c) Campnosperma panamensis, (d) Fraxinus uhdei. Cutting problems, boundaries, diffuse, lack of features.
Data 07 00111 g002

3. Methods

Image acquisition followed an adaptation of the standard protocol used by the Colombian Ministry of Environment, as well as that internationally practiced by USA forest services and the International Association of Wood Anatomists (IAWA) [17], which was redesigned to the conditions under which local authorities inspect wood in situ.

3.1. Place of Collection

The selection criteria were implemented by the experts of the environmental division of the state public agency, based on species more commercialized or rated as high risk for illegal extraction in the country.
Samples and images were acquired in warehouses, where timber from the Pacific and Colombian Amazon region is aggregated, commercialized and distributed. Samples were taken by an expert from the environmental and forest governance department of the regional autonomous corporation of Cauca, which facilitated access to warehouses. In addition, a set of validation images of the same species was collected in the xylotheque of the University of Cauca to increase variability in the dataset.

3.2. Sample Preparation

For practical purposes, the sample preparation process consisted of cutting the transverse plane of the timber blocks with a small blade and moistening the sample with water to increase contrast and reduce the variation of physical properties such as color due to the moisture content of the wood, which affects multiple physical properties of timber [23]. The cross-section of the wood was selected as the study surface since its cut exposed most of the anatomical characteristics relevant for its discrimination by macroscopic analysis [24].

3.3. Image Adquisition and Labeling

Macroscopic image acquisition followed a capturing protocol that ensures the display of most of the anatomical characteristics of the timber and, hence, their later usability. Here, area, camera and resolution are some of the most important aspects to safeguard image quality [25]. In this sense, macroscopic images were acquired using a digital magnifying glass with artificial lighting and a fixed magnification of 3.9 microns per pixel which covers an area of 2.5 mm × 1.9 mm, a scale suitable for the observation of pores, fibers and parenchyma (Figure 3).
The acquisition was performed using two software applications: MScopes for mobile applications, and AMcap for USB cameras for PC. Image filenames were created automatically by the software. During collection, species were labeled for species and dates in the warehouses. The quality of each image was also rated on a scale of one to five according to the number of visible features, depending on the focus of the photography and the cutting technique. Debugging based on these criteria allowed the construction of a working dataset with macroscopic images of the cross sections. Then, the dataset was separated into folders by species.

4. User Notes

The database contains hundreds of images of both native and planted forest species which allows macroscopic analyses as well as studies of anatomical characteristics of species (Figure 4). It serves for comparative analyses for species variation and identification and as a support for the development of computerized wood recognition systems based on artificial vision and machine learning.
Users can select the set of images of species that are of interest, including various defects present that may or may not help their specific purposes.
Beneficiaries of the data are wood anatomists interested in analyzing variation inter and intraspecies; forest authorities involved in controlling extraction, identification and the trafficking of timber in the laboratory and the field, up to groups, genera and families; and professionals working in artificial intelligence and machine learning, where the data can serve as training sets and for the testing of computational techniques [26].
Data might be used to conduct dendrological research, studies on environmental aspects and climate variability, and their effects on the anatomical structure of trees. The data may become the beginning of a worldwide digital xylotheque (dataset) with hundreds of samples to access easily, where researchers can broaden taxonomic knowledge. The data may also serve as an input to measure the performance of classification algorithms to build specialized forestry software.

Author Contributions

Conceptualization, R.V.-C., C.F.O.U. and H.R.G.M.; methodology, D.A.C.S. and C.F.O.U.; validation, D.A.C.S.; formal analysis, R.V.-C. and C.F.O.U.; investigation, D.A.C.S., C.F.O.U., R.V.-C. and H.R.G.M.; resources, H.R.G.M.; data curation, D.A.C.S.; writing—original draft preparation, R.V.-C. and C.F.O.U.; writing—review and editing, R.V.-C. and C.F.O.U. All authors have read and agreed to the published version of the manuscript.

Funding

This research no received funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data regarding images and annotations can be accessed at repository name: Tropical Forest Species (DOI 10.17632/yzzcbyvgmh.3); direct URL to data: http://dx.doi.org/10.17632/yzzcbyvgmh.3 or http://www.unicauca.edu.co/laboratorios-fisica/maderas_cauca/ (accessed on 15 May 2022).

Acknowledgments

Authors express their gratitude to the managers of the wood warehouses, who allowed the sample collection, the personal assistance of the Regional Autonomous Corporation of Cauca, and the University of Cauca. The data collection was achieved with the effort of the authors and without funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cristiano, P.M.; Campanello, P.I.; Bucci, S.J.; Rodriguez, S.A.; Lezcano, O.A.; Scholz, F.G.; Goldstein, G. Evapotranspiration of subtropical forests and tree plantations: A comparative analysis at different temporal and spatial scales. Agric. For. Meteorol. 2015, 203, 96–106. [Google Scholar] [CrossRef]
  2. World Wild Life, Forest. 2019. Available online: https://www.worldwildlife.org/initiatives/forests (accessed on 10 February 2022).
  3. Koch, G.; Haag, V.; Heinz, I.; Richter, H.-G.; Schmitt, U. Control of internationally traded timber-the role of macroscopic and microscopic wood identification against illegal logging. J. Forensic. Res. 2015, 6, 2–4. [Google Scholar] [CrossRef]
  4. WWF-Colombia-Programa Subregional Amazonas Norte & Chocó Darién; Maderas de Colombia. 2013; pp. 3–8. ISBN 978-958-8353-53-1. Available online: www.wwf.org.co/?213040/Maderas-de-Colombia (accessed on 10 June 2022).
  5. López Camacho, R.; Pulido Rodríguez, E.N.; González Martínez, R.O.; Nieto Vargas, J.E.; Vásquez, M.Y. Especies comercializadas en el Territorio, Guía para su identificación, Corporación Autónoma Regional de Cundinamarca-CAR; Editorial Universidad Distrital Francisco José de Caldas: Bogota, Colombia, 2014; pp. 15–17. ISBN 978-958-8897-1-10. [Google Scholar]
  6. MinAmbiente, CARDER, WWF, FedeMaderas. Pacto Intersectorial por la Madera legal en Colombia. 2009. Available online: https://www.ica.gov.co/areas/agricola/servicios/pacto-interseccional-de-madera/pactomadera/pacto_intersectorial_maderalegal.aspx (accessed on 20 February 2022).
  7. MinAmbiente, ONF Andina. Uso y Legalidad de la Madera en Colombia-Análisis Parcial; 2015; pp. 154–196; ISBN: 978-958-8901-09-1. Available online: https://www.minambiente.gov.co/wp-content/uploads/2021/10/Uso-y-Legalidad-de-la-Madera.pdf (accessed on 10 June 2022).
  8. Ravindran, P.; Costa, A.; Soares, R.; Wiedenhoeft, A.C. Classification of CITES-listed and other neotropical Meliaceae wood images using convolutional neural networks. Plant Methods 2018, 14, 25. [Google Scholar] [CrossRef] [PubMed]
  9. Ravindran, P.; Thompson, B.J.; Soares, R.K.; Wiedenhoeft, A.C. The XyloTron: Flexible, Open-Source, Image-Based Macroscopic Field Identification of Wood Products. Front. Plant Sci. 2020, 11, 1015. [Google Scholar] [CrossRef] [PubMed]
  10. Figueroa-Mata, G.; Mata-Montero, E.; Valverde-Otárola, J.C.; Arias-Aguilar, D.; Zamora-Villalobos, N. Using Deep Learning to Identify Costa Rican Native Tree Species From Wood Cut Images. Front. Plant Sci. 2022, 13, 789227. [Google Scholar] [CrossRef] [PubMed]
  11. Duarte, P.; Borges, C.; Ferreira, C.; Cruz, T.; de Souza, W.; Mori, F. Anatomical Identification of tropical woods traded in lavras, brazil. J. Trop. For. Sci. 2021, 33, 95–104. [Google Scholar] [CrossRef]
  12. Arévalo, R.E.; Pulido, E.N.; Solórzano, J.F.; Soares, R.; Ruffinatto, F.; Ravindran, P.; Wiedenhoeft, A.C. Imaged based identification of Colombian timbers using the xylotron: A proof of concept international partnership. Colomb. For. 2021, 24, 5–16. [Google Scholar] [CrossRef]
  13. Wiedenhoeft, A.C. The xylophone: Toward democratizing access to high-quality macroscopic imaging for wood and other substrates. IAWA J. 2020, 41, 699–719. [Google Scholar] [CrossRef]
  14. Tang, X.J.; Tay, Y.H.; Siam, N.A.; Lim, S.C. MyWood-ID: Automated Macroscopic Wood Identification System using Smartphone and macro-lens. In Proceedings of the 2018 International Conference on Computational Intelligence and Intelligent Systems, New York, NY, USA, 17–19 November 2018; pp. 37–43. [Google Scholar] [CrossRef]
  15. Tay, Y.H. XYLORIX: An AI-as-a-Service Platform for Wood Identification. In Proceedings of the IAWA-IUFRO International Symposium for Updating Wood Identification, Beijing, China, 20–22 May 2019. [Google Scholar] [CrossRef]
  16. Ravindran, P.; Owens, F.C.; Wade, A.C.; Vega, P.; Montenegro, R.; Shmulsky, R.; Wiedenhoeft, A.C. Field-Deployable Computer Vision Wood Identification of Peruvian Timbers. Front. Plant Sci. 2021, 12, 647515. [Google Scholar] [CrossRef] [PubMed]
  17. Wiedenhoeft, A.C. Identification of Central American Woods; Center for Wood Anatomy Research, USDA Forest Service: Madison, WI, USA, 2011; pp. 23–26. ISBN 978-1-892529-58-9.
  18. Wiedenhoeft, A.C.; Simeone, J.; Smith, A.; Parker-Forney, M.; Soares, R.; Fishman, A. Fraud and misrepresentation in retail forest products exceeds U.S. forensic wood science capacity. PLoS ONE 2019, 14, e0219917. [Google Scholar] [CrossRef]
  19. Figueroa, G.; Mata, E.; Valverde, J.C.; Arias, D. Using deep convolutional networks for species identification of xylotheque samples. In Proceedings of the 2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI), San Carlos, Costa Rica, 18–20 July 2018; pp. 1–9. [Google Scholar] [CrossRef]
  20. De Geus, A.R.; Silva, S.F.D.; Gontijo, A.B.; Silva, F.O.; Batista, M.A.; Souza, J.R. An analysis of timber sections and deep learning for wood species classification. Multimed. Tools Appl. 2020, 79, 34513–34529. [Google Scholar] [CrossRef]
  21. InsideWood Working Group (IWG). Available online: https://insidewood.lib.ncsu.edu/ (accessed on 13 July 2022).
  22. International Tropical Timber Organization (ITTO). Available online: http://www.tropicaltimber.info/ and https://www.itto.int/ (accessed on 13 July 2022).
  23. Simpson, W.; Ten Wolde, A. Physical Properties and Moisture Relations of Wood. In Chapter 3 Wood Handbook—Wood as an Engineering Material; Report FPL–GTR-113; US Department of Agriculture, Forest Service, Forest Products Laboratory: Madison, WI, USA, 1999. [Google Scholar]
  24. Jones, P.D. Basic Guide to Identification of Hardwoods and Softwoods Using Anatomical Characteristics; FAO of the United Nations: Mississippi State University Extension Service: Jackson, MI, USA, 2010. [Google Scholar]
  25. Laboratório Visão Robótica e Imagem. Available online: https://web.inf.ufpr.br/vri/databases/forest-species-database-macroscopic/ (accessed on 20 June 2022).
  26. Oliveira, L.S.; Nisgoski, S.; Britto, A.S. Forest species recognition using macroscopic images. Mach. Vis. Appl. 2014, 25, 1019–1031. [Google Scholar] [CrossRef]
Figure 1. Macroscopic images of timber species. (a) Campnosperma panamensis, (b) Cedrela odorata, (c) Cedrelinga cateniformis, (d) Cordia alliodora. (e) Dialyanthera gracilipes, (f) Eucalyptus globulus, (g) Handroanthus chrysanthus, (h) Humiriastrum procerum, (i) Fraxinus uhdei, (j) Cupresus lusitanica, (k) Pinus patula.
Figure 1. Macroscopic images of timber species. (a) Campnosperma panamensis, (b) Cedrela odorata, (c) Cedrelinga cateniformis, (d) Cordia alliodora. (e) Dialyanthera gracilipes, (f) Eucalyptus globulus, (g) Handroanthus chrysanthus, (h) Humiriastrum procerum, (i) Fraxinus uhdei, (j) Cupresus lusitanica, (k) Pinus patula.
Data 07 00111 g001
Figure 3. Cutting and image acquisition procedure.
Figure 3. Cutting and image acquisition procedure.
Data 07 00111 g003
Figure 4. Timber anatomical feature identification.
Figure 4. Timber anatomical feature identification.
Data 07 00111 g004
Table 1. Contents of the database.
Table 1. Contents of the database.
Scientific Name/Folder NameFamilyCommon Colombian Name/Global Trade NameWood Type Number of Images
Campnosperma panamensisAnacardiaceaeSajo/Orey WoodHardwood823
Cedrela odorataMeliaceae Cedro costeño/ Cigarbox cedar Hardwood1128
Cedrelinga cateniformisFabaceaeAchapo/CedroranaHardwood1189
Cordia alliodoraBoraginaceaeNogal cafetero/LaurelHardwood929
Dialyanthera gracilipesMyristicaceaeCuángare/Virola/White CedarHardwood1100
Eucalyptus globulusMyrtaceaeEucalipto blanco/Blue gumHardwood1105
Handroanthus chrysanthusBignoniaceaeGuayacán amarillo/Roble amarillo/Trumpet Tree Hardwood1106
Humiriastrum procerumHumiriaceaeChanul/CorozoHardwood1001
Fraxinus uhdeiOleaceaeUrapan/fresno/Shamel ashHardwood1025
Cupresus lusitanicaCupressaceaeCipres/Pino CipresSoftwood815
Pinus patulaPinaceaePino patula/OcoteSoftwood571
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Cano Saenz, D.A.; Ordoñez Urbano, C.F.; Gaitan Mesa, H.R.; Vargas-Cañas, R. Tropical Wood Species Recognition: A Dataset of Macroscopic Images. Data 2022, 7, 111. https://doi.org/10.3390/data7080111

AMA Style

Cano Saenz DA, Ordoñez Urbano CF, Gaitan Mesa HR, Vargas-Cañas R. Tropical Wood Species Recognition: A Dataset of Macroscopic Images. Data. 2022; 7(8):111. https://doi.org/10.3390/data7080111

Chicago/Turabian Style

Cano Saenz, Daniel Alejandro, Carlos Felipe Ordoñez Urbano, Holman Raul Gaitan Mesa, and Rubiel Vargas-Cañas. 2022. "Tropical Wood Species Recognition: A Dataset of Macroscopic Images" Data 7, no. 8: 111. https://doi.org/10.3390/data7080111

Article Metrics

Back to TopTop