Next Article in Journal
Exploring Heavy Metal and Metalloid Exposure in Children: A Pilot Biomonitoring Study near a Sugarcane Mill
Previous Article in Journal
The Development of a Rapid, Cost-Effective, and Green Analytical Method for Mercury Speciation
Previous Article in Special Issue
Advancing Endocrine Disruptors via In Vitro Evaluation: Recognizing the Significance of the Organization for Economic Co-Operation and Development and United States Environmental Protection Agency Guidelines, Embracing New Assessment Methods, and the Urgent Need for a Comprehensive Battery of Tests
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Models for the No-Observed-Effect Concentration (NOEC) and Maximal Half-Effective Concentration (EC50)

by
Nadia Iovine
,
Alla P. Toropova
,
Andrey A. Toropov
,
Alessandra Roncaglioni
and
Emilio Benfenati
*
Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156 Milan, Italy
*
Author to whom correspondence should be addressed.
Toxics 2024, 12(6), 425; https://doi.org/10.3390/toxics12060425
Submission received: 17 April 2024 / Revised: 4 June 2024 / Accepted: 11 June 2024 / Published: 12 June 2024
(This article belongs to the Special Issue New Models and Applications in Predictive Toxicology)

Abstract

:
Typical in silico models for ecotoxicology focus on a few endpoints, but there is a need to increase the diversity of these models. This study proposes models using the NOEC for the harlequin fly (Chironomus riparius) and EC50 for swollen duckweed (Lemna gibba) for the first time. The data were derived from the EFSA OpenFoodTox database. The models were based on the correlation weights of molecular features used to calculate the 2D descriptor in CORAL software. The Monte Carlo method was used to calculate the correlation weights of the algorithms. The determination coefficients of the best models for the external validation set were 0.74 (NOAEC) and 0.85 (EC50).

1. Introduction

The aquatic ecosystem has an important role for humans, animals, plants, and aquatic organisms. Water pollution resulting from humans as consumers and other anthropogenic activities such as agriculture and industry have negative impacts on aquatic organisms. Hazard and risk assessments are needed to investigate the adverse effects of chemicals on water life such as aquatic plants, fish, and algae [1,2,3].
Considering the amount of chemicals released into the environment, fast, accurate, and cost-effective assessments are required. Several organizations and regulations such as the United States Environmental Protection Agency (US EPA), REACH regulation (Registration, Evaluation, Authorization and Restriction of Chemicals) in Europe, the European Center for Validation of Alternative Methods (ECVAM) of the European Union, and the European Union Commission Scientific Committee on Toxicity, Ecotoxicity, and Environment (CSTEE) suggest and encourage an in silico approach for risk assessment [1]. Among the in silico approaches, the quantitative structure–activity relationship (QSAR) is a rapid and low-cost method to create models to test a large number of chemicals, reducing the number of time-consuming in vivo tests [4].
There are several assays to investigate aquatic ecotoxicity, and some are validated and standardized [5,6,7,8,9,10]. Swollen duckweed (Lemna gibba) has many attributes that make it useful for ecosystem health assessment [11,12,13]. The OECD Test Guidelines (TG) 221 are designed to assess the toxicity of substances to freshwater aquatic plants of the genus Lemna (duckweed). Plant cultures are exposed to different concentrations of the test substance for seven days to test the substance-related effects on vegetative growth. The effects are quantified by comparing growth in test solutions to the control and the concentration that causes x% inhibition of growth is determined (e.g., EC50).
The harlequin fly (Chironomus riparius) is used for ecotoxicological studies to assess acute and sub-lethal effects of chemicals in soil and water [14,15,16]. The OECD TG 218, 219, and 233 [5,6,8] are designed to assess the effects of prolonged exposure to chemicals on the lifecycle of the sediment-dwelling freshwater dipteran Chironomus sp. The instar chironomid larvae are placed in a sediment–water system together with the test substance. Chironomid emergence and development rates are measured. This test shows the concentration that caused x% reduction in emergence, larvae survival and growth (e.g., EC50), and/or the no-observed-effect concentration (NOEC). The OECD TG 233 [8], compared to the former, covers all the lifecycles of the first generation and the early part of the second generation, and effects are measured as the TG described previously.
The NOEC and EC50 are endpoints largely used in ecotoxicology to define the critical effects caused by long-term exposure (NOEC) [17] and acute exposure (EC50) [18] to substances for non-target organisms. In this paper, we collected NOEC and EC50 data from the EFSA OpenFoodTox database [19] and developed two QSAR models for the prediction of ecotoxicological endpoints: NOEC for the harlequin fly and EC50 for swollen duckweed. These models are based on the correlation weights of molecular features used to calculate the 2D descriptor in the CORAL software (http://www.insilico.eu/coral/ accessed on 11 June 2024). The Monte Carlo method has been used for many other models [20,21,22,23,24,25,26,27].

2. Materials and Methods

2.1. Data

Experimental NOEC for harlequin fly (n = 122) and EC50 swollen duckweed (n = 94) were collected from the EFSA OpenFoodTox database version 6 (https://zenodo.org/records/8120114, accessed on 16 April 2024). These values were converted into the logarithmic scale. The chemical substances with the experimental data for the two endpoints we studied are pesticides; for other families of substances, it is not common to perform this kind of experimental study. The datasets with the information on the compound structures and experimental values are reported in Table S1 (EC50 for swollen duckweed) and Table S2 (NOEC for harlequin fly). The folder “2D chemical structures”, available in Supplementary Materials, reports the 2D chemical structures of the compounds for both datasets.

2.2. Model

In this work, models of endpoints are regression relations of the following form [28,29]:
E n d p o i n t = C 0 + C 1 × D C W T ; N
The DCW(T, N) is the SMILES-based optimal descriptor, which is the sum of the so-called correlation weights. The numerical data on correlation weights were calculated with the Monte Carlo method. T and N are parameters of the Monte Carlo optimization procedure. T is the threshold to define active molecular features extracted from a simplified molecular input-line entry system (SMILES). T = 5 means that all molecular features present in the training set at least five times are active; otherwise, they are inactive. N is the number of epochs (iterations) of the Monte Carlo procedure. Starting from the same dataset, the substances were assigned to active training, passive training, calibration, or validation subsets. This splitting was repeated three times; thus, we developed three models for each endpoint.

2.3. Optimal SMILES-Based Descriptors

The descriptor used here was calculated as follows:
D C W T , N = C W ( S k ) + C W ( S S k )
where Sk is the SMILES atom, i.e., the undivided part of the SMILES. In other words, Sk is a single symbol (‘C’, ‘O’, etc.) or a group of symbols that cannot be examined separately because they define the atom jointly (‘Cl’, ‘Br’, etc.). The CW(x) are the correlation weights of the corresponding molecular features.

2.4. Monte Carlo Optimization

The optimization of the correlation weights is based on the search for the maximum of the target function (TF) calculated as follows:
T F   = r A T + r P T r A T r P T × 0.1 + ( I I C + C I I ) × 0.3
where IIC is the index of the ideality of correlation. CII is the correlation intensity index. The application of IIC and CII improves the predictive potential of models (i.e., the statistical quality of the validation set) but to the detriment of values on the training set [30,31].
The use of IIC and CII is actually “statistical forcing”; the correlation weights are optimized to be more favorable for the calibration set than for the training sets. It is hoped (and often justified) that a favorable effect on the calibration set will be accompanied by a favorable effect on the external validation set. This way, the system prefers a general model rather than a model that is too close to the training set—in this last case, there is the risk of sticking to a model that is overfitting.

2.5. Validation of the Models

There are a number of conceptually different criteria for the predictive potential of QSAR models. Some of them are based on calculations with the training set only. Other criteria are calculated using an external validation set. In our case, the models were validated using the following criteria for the predictive potential:
  • The determination coefficient (R2);
  • The correlation ideality index (IIC) [30,31];
  • The correlation intensity index (CII) [30,31];
  • RMSE, which is the root mean squared error;
  • MAE, which is the mean absolute error;
  • F, which is the Fischer F ratio.
In this way, we calculated statistics suitable both for the internal and the eternal validation approaches.

3. Results

3.1. The New Models for the Environmental Protection

Our environment needs better protection. Unfortunately, this calls for a huge effort, to cover an incredibly large number of species. In silico models may be a valid help in this effort, but so far, most of the models relate to a few endpoints; typically, in silico models address fish, daphnia, and algae, partly because these are commonly requested by current regulation. Here, we introduce new models for other species, exploiting data from the OpenFoodTox database. In particular, we addressed the harlequin fly (Chironomus riparius) and swollen duckweed (Lemna gibba). Collections of experimental values from sound databases offer a powerful driver for the development of transparent and reproducible in silico models. Thus, this paper promotes this vision. Using the datasets from authoritative databases boosts the development of in silico models, and it is convenient to identify algorithms that can quickly process datasets and develop models. CORAL, the software applied in our study, is an example.

3.2. Models for NOEC for Harlequin Fly

Table 1 contains the main statistical characteristics of models for logNOEC for the harlequin fly. The statistical values are for each subset (active training, passive training, calibration, and validation), also considering the leave-one-out statistics, indicated as Q2. More details on the statistical parameters are given in Table S3 in Supplementary Materials. These models, for each of the three splits starting from the same overall dataset, are the following:
LogNOEC = −1.3840 (±0.1057) + 0.1873 (±0.0066) × DCW(5; 15)
LogNOEC = −1.5626 (±0.0983) + 0.2218 (±0.0093) × DCW(5; 15)
LogNOEC = −2.3440 (±0.0752) + 0.2848 (±0.0075) × DCW(5; 15)
Table S4 in Supplementary Materials shows the experimental and calculated values and the applicability domain (AD) of logNOEC for the model obtained in the case of split 1.

3.3. Models for EC50 for Swollen Duckweed

Table 2 gives the statistical characteristics of models for logEC50. More details on the statistical parameters are given in Table S5 in Supplementary Materials. These models are the following, for the three splits:
LogEC50 = −1.1669 (±0.1576) + 0.1228 (±0.0100) × DCW(5; 15)
LogEC50 = −1.8297 (±0.1470) + 0.4278 (±0.0189) × DCW(5; 15)
LogEC50 = −1.2497 (±0.1670) + 0.1006 (±0.0079) × DCW(5; 15)
Considering the results on the two endpoints, there is a similar situation. The results for the validation sets are better than those for the active and passive training sets. For instance, in the model for NOEC (Table 1), the R2 values for the validation sets are in the range of 0.72–0.75, while the R2 values for the training sets are in the range of 0.26–0.58. The situation is similar for the second endpoint, EC50 (Table 2). In this case, the R2 values for the training sets are in the range of 0.26–0.49, always lower than the R2 values of the validation sets, which are in the range of 0.77–0.85. This is due to the mechanism of the CORAL model which forces the optimization of the values of the calibration sets.
The situation is similar for RMSE. The ranges for training sets are 1.07–2.03 (NOEC) and 1.55–1.67 (EC50), while for calibration sets, the ranges are 0.449–0.968 (NOEC) and 0.726–0.631 (EC50). This is due to the influence of the factors IIC and CII on the Monte Carlo optimization method [20,27,30,31].
Table S6 in Supplementary Materials shows the experimental and calculated values and the applicability domain (AD) of EC50 for swollen duckweed for the model obtained in the case of split 1.
Compared with the common practice of splitting the set of substances into training and validation sets, our approach is more elaborate. We further split the set of substances to be used for the model development into three subsets, namely active training, passive training, and calibration, as described in the Materials and Methods section. In practice, the initial steps of model building, when active and passive training sets are used, are preliminary phases to identify the components of the models. The structure of the proposed model is decided in the calibration phase. Thus, the results on the calibration set are conceptually closer to those in the usual process where only one training set is used. For this reason, the values of the validation set should be compared only with the calibration set, which in fact are much more similar than those in the preliminary phases of the model using the active and passive training sets, which are related to a not fully developed model.

4. Discussion

Our models were developed using SMILES as the format to represent the chemical structures. This is one of the most commonly used formats for in silico models. There are several advantages associated with the use of this format. It is quite compact; for instance, the structure of 1-tert-butyl-3,5-dimethyl-2,4,6-trinitrobenzene requires 62 bytes when represented with the SMILES, 998 bytes using a connected table, and 2066 bytes using and MDL MOL file. This speeds up the calculation and makes the use of the developed models more convenient and faster. The SMILES format is somehow more readable, at least for expert users. The SMILES structures can be easily found on the internet, for instance, from PubChem (https://pubchem.ncbi.nlm.nih.gov/ accessed on 11 June 2024). There are disadvantages too. The SMILES format does not address the 3D structure, which is, however, not used in most in silico models for ecotoxicity. Indeed, 3D models require more complexity in handling the structure, which has to be optimized, and the benefits resulting from the improved description of the chemical structure are usually not appreciable due to the uncertainty of the experimental data. Furthermore, the SMILES format is not unique, even if we used the canonical SMILES. Different formats may be used, for instance, for the description of the nitro groups, and this has to be standardized before modeling.
The SMILES format can cover chirality. However, in our case, we did not use the chiral representation, since in our datasets, there was no information about the toxicity values specific to individual enantiomers. This is a common situation, and pure enantiomers are only used within specific sectors when the price of the product can afford the physical separation of less active enantiomers.
Beyond these general considerations regarding the use of SMILES for the representation of the substances, there is a particular advantage related to the use of the CORAL software. In CORAL, the SMILES is directly used by the software, without the need to calculate molecular descriptors starting from the 2D or 3D structure, which is typical in most in silico models. Indeed, the software processes directly the SMILES code through the identification of the sequence of characters. This represents an important simplification of the modeling scheme. This is convenient from the point of view of using and comparing models. The obtained results also allow us to conclude that the correlation ideality index can be a useful measure of the predictive potential of models.
The Monte Carlo method involves the use of some random process. The figures (Figure 1 and Figure 2) representing the constructed models as well as their statistical quality (Table 1 and Table 2) show that, nevertheless, in both cases, both the determination coefficients and the mean square errors are reproduced during the mentioned random processes. Thus, the method under consideration can be considered useful from both practical and heuristic points of view.
This study was carried out to assess the feasibility of the approach used for modeling experimental data for two endpoints, which have not been studied so far. The only example we found in the literature refers to a model for swollen duckweed, using 13 substances in total; thus, it can be considered a preliminary study, without a proper validation of the results [32]. The paucity of existing models demonstrates the need to generate new models for these poorly represented endpoints. However, it should be noted that it is possible to expand the search space of this modeling system by assessing mechanistic interpretations and deeper analysis of the reproducibility of the results of the described random processes on large datasets. This is what is planned to be carried out in the near future.

5. Conclusions

We introduced new QSAR models for two endpoints of ecotoxicological interest for the first time. The data came from the EFSA database OpenFoodTox. This illustrates the easy way to exploit data that are more and more frequently available to protect our environment by better considering a larger set of species than those usually addressed. We used CORAL software to obtain models, which simply need the SMILES, without calculating molecular descriptors. We investigated different ways to optimize the results using new features of the CORAL software. The IIC and CII components of the target function are significant, since without these, the models are not satisfactory (their predictive potentials are low). These models will be implemented in the VEGAHUB website (www.vegahub.eu; accessed on 11 June 2024) for free use.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/toxics12060425/s1, Table S1: information on the compounds (ID, CAS number, name, and SMILES) and the experimental values used for the EC50 model of swollen duckweed. Table S2: information on the compounds (ID, CAS number, name, and SMILES) and the experimental values used for the NOEC model of the harlequin fly. Table S3: the statistical details on the model for NOEC for harlequin fly (Chirinomus riparius). Table S4: the experimental and calculated values and the applicability domain of NOEC for the harlequin fly (Chirinomus riparius) for the model obtained in the case of split 1. Table S5: the statistical details on the model for EC50 for swollen duckweed (Lemna gibba). Table S6: the experimental and calculated values and the applicability domain of EC50 for swollen duckweed (Lemna gibba) for the model obtained in the case of split 1. The folder “2D chemical structures” contains the 2D chemical structures of the compounds used for the development of the models. Each pdf file has the name of the compound ID.

Author Contributions

Conceptualization, N.I., A.P.T., A.A.T., A.R. and E.B.; data curation, N.I., A.P.T., A.A.T., A.R. and E.B.; writing—original draft preparation, N.I., A.P.T., A.A.T., A.R. and E.B.; writing—review and editing, N.I., A.P.T., A.A.T., A.R. and E.B.; supervision, A.R. and E.B.; project administration, E.B. All authors have read and agreed to the published version of the manuscript.

Funding

We acknowledge EFSA for the financial contribution within the project sOFT-ERA, OC/EFSA/IDATA/2022/02.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data regarding the models are available in Supplementary Materials.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Hamadache, M.; Benkortbi, O.; Amrane, A.; Hanini, S. QSAR approaches and ecotoxicological risk assessment. In Ecotoxicological QSARs. Methods in Pharmacology and Toxicology; Roy, K., Ed.; Humana: New York, NY, USA, 2020. [Google Scholar] [CrossRef]
  2. Häder, D.P.; Banaszak, A.T.; Villafañe, V.E.; Narvarte, M.A.; González, R.A.; Helbling, E.W. Anthropogenic pollution of aquatic ecosystems: Emerging problems with global implications. Sci. Total Environ. 2020, 713, 136586. [Google Scholar] [CrossRef]
  3. Toma, C.; Cappelli, C.I.; Manganaro, A.; Lombardo, A.; Arning, J.; Benfenati, E. New models to predict the acute and chronic toxicities of representative species of the main trophic levels of aquatic environments. Molecules 2021, 26, 6983. [Google Scholar] [CrossRef]
  4. de Roode, D.; Hoekzema, C.; de Vries-Buitenweg, S.; van de Waart, B.; van der Hoeven, J. QSARs in ecotoxicological risk assessment. Regul. Toxicol. Pharmacol. 2006, 45, 24–35. [Google Scholar] [CrossRef]
  5. OECD. Test No. 218. Sediment-Water Chironomid Toxicity Using Spiked Sediment, OECD Guidelines for the Testing of Chemicals, Section 2; OECD Publishing: Paris, France, 2004. [Google Scholar] [CrossRef]
  6. OECD. Test No. 219. Sediment-Water Chironomid Toxicity Using Spiked Water, OECD Guidelines for the Testing of Chemicals, Section 2; OECD Publishing: Paris, France, 2004. [Google Scholar] [CrossRef]
  7. OECD. Test No. 221. Lemna sp. Growth Inhibition Test, OECD Guidelines for the Testing of Chemicals, Section 2; OECD Publishing: Paris, France, 2006. [Google Scholar] [CrossRef]
  8. OECD. Test No. 233. Sediment-Water Chironomid Life-Cycle Toxicity Test Using Spiked Water or Spiked Sediment, OECD Guidelines for the Testing of Chemicals, Section 2; OECD Publishing: Paris, France, 2010. [Google Scholar] [CrossRef]
  9. OECD. Test No. 201. Freshwater Alga and Cyanobacteria, Growth Inhibition Test, OECD Guidelines for the Testing of Chemicals, Section 2; OECD Publishing: Paris, France, 2011. [Google Scholar] [CrossRef]
  10. OECD. Test No. 239. Water-Sediment Myriophyllum Spicatum Toxicity Test, OECD Guidelines for the Testing of Chemicals, Section 2; OECD Publishing: Paris, France, 2014. [Google Scholar] [CrossRef]
  11. Greenberg, B.M.; Huang, X.D.; Dixon, D.G. Applications of the aquatic higher plant Lemna gibba for ecotoxicological assessment. Aquat. Ecosyst. Health 1992, 1, 147–155. [Google Scholar] [CrossRef]
  12. Sobrero, M.C.; Rimoldi, F.; Ronco, A.E. Effects of the Glyphosate Active Ingredient and a Formulation on Lemna gibba L. at Different Exposure Levels and Assessment End-Points. Bull. Environ. Contam. Toxicol. 2007, 79, 537–543. [Google Scholar] [CrossRef]
  13. Marques, M.B.; Brunetti, I.A.; Faleiros, C.A.; da Cruz, C.; Iqbal, H.; Bilal, M.; Américo-Pinheiro, J. Ecotoxicological assessment and environmental risk of the insecticide chlorpyrifos for aquatic neotropical indicators. Water Air Soil. Pollut. 2021, 232, 428. [Google Scholar] [CrossRef]
  14. Crane, M.; Sildanchandra, W.; Kheir, R.; Callaghan, A. Relationship between biomarker activity and developmental endpoints in Chironomus riparius Meigen exposed to an organophosphate insecticide. Ecotoxicol. Environ. Saf. 2002, 53, 361–369. [Google Scholar] [CrossRef]
  15. Rodrigues, A.C.; Gravato, C.; Quintaneiro, C.; Golovko, O.; Žlábek, V.; Barata, C.; Soares, A.M.; Pestana, J.L. Life history and biochemical effects of chlorantraniliprole on Chironomus riparius. Sci. Total Environ. 2015, 508, 506–513. [Google Scholar] [CrossRef]
  16. Campos, D.; Gravato, C.; Quintaneiro, C.; Soares, A.M.; Pestana, J.L. Responses of the aquatic midge Chironomus riparius to DEET exposure. Aquat. Toxicol. 2016, 172, 80–85. [Google Scholar] [CrossRef]
  17. Cappelli, C.I.; Toropov, A.A.; Toropova, A.P.; Benfenati, E. Ecosystem ecology: Models for acute toxicity of pesticides towards Daphnia magna. Environ. Toxicol. Pharmacol. 2020, 80, 103459. [Google Scholar] [CrossRef]
  18. Lavado, G.J.; Baderna, D.; Carnesecchi, E.; Toropova, A.P.; Toropov, A.A.; Dorne, J.L.C.M.; Benfenati, E. QSAR models for soil ecotoxicity: Development and validation of models to predict reproductive toxicity of organic chemicals in the collembola Folsomia candida. J. Hazard. Mater. 2022, 423, 127236. [Google Scholar] [CrossRef]
  19. Kovarich, S.; Ciacci, A.; Baldin, R.; Roncaglioni, A.; Mostrag, A.; Tarkhov, A.; Carnesecchi, E.; Gibin, D.; Di Piazza, G.; Pasinato, L.; et al. OpenFoodTox: EFSA’s Chemical Hazards Database (Version 5) [Data Set]. Zenodo; EFSA: Parma, Italy. [CrossRef]
  20. Kumar, P.; Kumar, A.; Singh, D. CORAL: Development of a hybrid descriptor based QSTR model to predict the toxicity of dioxins and dioxin-like compounds with correlation intensity index and consensus modelling. Environ. Toxicol. Pharmacol. 2022, 93, 103893. [Google Scholar] [CrossRef]
  21. Yuan, B.; Wang, Y.; Zong, C.; Sang, L.; Chen, S.; Liu, C.; Pan, Y.; Zhang, H. Modeling study for predicting altered cellular activity induced by nanomaterials based on Dlk1-Dio3 gene expression and structural relationships. Chemosphere 2023, 335, 139090. [Google Scholar] [CrossRef]
  22. Živadinović, B.; Stamenović, J.; Živadinović, J.; Živadinović, L.; Sokolović, M.; Filipović, S.S.; Sokolović, D.; Veselinović, A.M. QSAR modelling, molecular docking studies and ADMET predictions of polysubstituted pyridinylimidazoles as dual inhibitors of JNK3 and p38α MAPK. J. Mol. Struct. 2022, 1265, 133504. [Google Scholar] [CrossRef]
  23. Zivkovic, M.; Zlatanovic, M.; Zlatanovic, N.; Golubović, M.; Veselinović, A.M. A QSAR model for predicting the corneal permeability of drugs—The application of the Monte Carlo optimization method. New J. Chem. 2022, 47, 224–230. [Google Scholar] [CrossRef]
  24. Azimi, A.; Ahmadi, S.; Kumar, A.; Qomi, M.; Almasirad, A. SMILES-based QSAR and molecular docking study of oseltamivir derivatives as influenza inhibitors. Polycycl. Aromat. Comp. 2023, 43, 3257–3277. [Google Scholar] [CrossRef]
  25. Goyal, S.; Rani, P.; Chahar, M.; Hussain, K.; Kumar, P.; Sindhu, J. Quantitative structure activity relationship studies of androgen receptor binding affinity of endocrine disruptor chemicals with index of ideality of correlation, their molecular docking, molecular dynamics and ADME studies. J. Biomol. Struct. Dyn. 2023, 1, 13616–13631. [Google Scholar] [CrossRef]
  26. Hamzehali, H.; Lotfi, S.; Ahmadi, S.; Kumar, P. Quantitative structure–activity relationship modeling for predication of inhibition potencies of imatinib derivatives using SMILES attributes. Sci. Rep. 2022, 12, 21708. [Google Scholar] [CrossRef]
  27. Soleymani, N.; Ahmadi, S.; Shiri, F.; Almasirad, A. QSAR and molecular docking studies of isatin and indole derivatives as SARS 3CLpro inhibitors. BMC Chem. 2023, 17, 32. [Google Scholar] [CrossRef]
  28. Toropov, A.A.; Toropova, A.P.; Benfenati, E.; Leszczynska, D.; Leszczynski, J. SMILES-based optimal descriptors: QSAR analysis of fullerene-based HIV-1 PR inhibitors by means of balance of correlations. J. Comput. Chem. 2010, 31, 381–392. [Google Scholar] [CrossRef]
  29. Toropov, A.A.; Toropova, A.P.; Marzo, M.; Dorne, J.L.; Georgiadis, N.; Benfenati, E. QSAR models for predicting acute toxicity of pesticides in rainbow trout using the CORAL software and EFSA’s OpenFoodTox database. Environ. Toxicol. Pharmacol. 2017, 53, 158–163. [Google Scholar] [CrossRef]
  30. Toropova, A.P.; Toropov, A.A.; Fjodorova, N. Quasi-SMILES for predicting toxicity of nano-mixtures to Daphnia magna. NanoImpact 2022, 28, 100427. [Google Scholar] [CrossRef]
  31. Toropova, A.P.; Toropov, A.A.; Roncaglioni, A.; Benfenati, E. Monte Carlo technique to study the adsorption affinity of azo dyes by applying new statistical criteria of the predictive potential. SAR QSAR Environ. Res. 2022, 33, 621–630. [Google Scholar] [CrossRef]
  32. Galimberti, F.; Moretto, A.; Papa, E. Application of chemometric methods and QSAR models to support pesticide risk assessment starting from ecotoxicological datasets. Water Res. 2020, 174, 115583. [Google Scholar] [CrossRef]
Figure 1. Splitting correlations for the active and passive training sets for logNOEC models for the harlequin fly. Black dots represent the substances in the subset.
Figure 1. Splitting correlations for the active and passive training sets for logNOEC models for the harlequin fly. Black dots represent the substances in the subset.
Toxics 12 00425 g001
Figure 2. Splitting correlations for the active and passive training, calibration, and validation sets for logEC50 for swollen duckweed. Black dots represent the substances in the subset.
Figure 2. Splitting correlations for the active and passive training, calibration, and validation sets for logEC50 for swollen duckweed. Black dots represent the substances in the subset.
Toxics 12 00425 g002
Table 1. The statistical characteristics of model logNOEC for the harlequin fly. A, P, C, and V are active training, passive training, calibration, and validation sets.
Table 1. The statistical characteristics of model logNOEC for the harlequin fly. A, P, C, and V are active training, passive training, calibration, and validation sets.
SplitSetnR2 *IICCIIQ2RMSEMAEF
1A310.43960.54600.73860.36941.511.3423
P300.41860.56300.70600.31221.641.3820
C310.61660.78470.83290.53280.8330.65047
V300.7449---0.686--
2A320.44270.66540.69270.35901.431.2624
P310.46110.65570.72810.37531.411.2825
C300.80370.89630.86840.78250.4490.360115
V290.7234---0.563--
3A310.58740.71850.73390.53501.070.90541
P300.26010.47730.72010.15102.031.7410
C300.66270.81400.78290.61160.9680.72755
V310.7295---0.833--
* Please see Section 2.3 for the description of the statistical parameters.
Table 2. The statistical characteristics on model logEC50. A, P, C, and V are active training, passive training, calibration, and validation sets.
Table 2. The statistical characteristics on model logEC50. A, P, C, and V are active training, passive training, calibration, and validation sets.
SplitSetnR2 *IICCIIQ2RMSEMAEF
1A240.26170.43290.70680.12101.671.458
P240.35960.59310.71660.26601.571.3912
C230.83460.91310.90730.79800.5140.424106
V230.8500---0.631--
2A240.31250.47300.73860.22291.491.2710
P230.49220.49690.73200.41041.631.4320
C230.81420.90200.87670.77930.8700.63092
V240.7723---1.113--
3A230.23190.44140.77510.09771.691.536
P240.21280.24350.73300.06331.551.346
C240.82420.90780.90210.78880.7190.587103
V230.7973---0.726--
* Please see Section 2.3 for the description of the statistical parameters.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Iovine, N.; Toropova, A.P.; Toropov, A.A.; Roncaglioni, A.; Benfenati, E. Models for the No-Observed-Effect Concentration (NOEC) and Maximal Half-Effective Concentration (EC50). Toxics 2024, 12, 425. https://doi.org/10.3390/toxics12060425

AMA Style

Iovine N, Toropova AP, Toropov AA, Roncaglioni A, Benfenati E. Models for the No-Observed-Effect Concentration (NOEC) and Maximal Half-Effective Concentration (EC50). Toxics. 2024; 12(6):425. https://doi.org/10.3390/toxics12060425

Chicago/Turabian Style

Iovine, Nadia, Alla P. Toropova, Andrey A. Toropov, Alessandra Roncaglioni, and Emilio Benfenati. 2024. "Models for the No-Observed-Effect Concentration (NOEC) and Maximal Half-Effective Concentration (EC50)" Toxics 12, no. 6: 425. https://doi.org/10.3390/toxics12060425

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop