Next Article in Journal
Drug-Induced Conformational Population Shifts in Topoisomerase-DNA Ternary Complexes
Previous Article in Journal
The Fruits of Wampee Inhibit H2O2-Induced Apoptosis in PC12 Cells via the NF-κB Pathway and Regulation of Cellular Redox Status
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Molecular Classification of Pesticides Including Persistent Organic Pollutants, Phenylurea and Sulphonylurea Herbicides

by
Francisco Torrens
1,* and
Gloria Castellano
2
1
Institut Universitari de Ciència Molecular, Universitat de València, Edifici d'Instituts de Paterna, P.O. Box 22085, E-46071 València, Spain
2
Facultad de Veterinaria y Ciencias Experimentales, Universidad Católica de Valencia San Vicente Mártir, Guillem de Castro-94, E-46001 València, Spain
*
Author to whom correspondence should be addressed.
Molecules 2014, 19(6), 7388-7414; https://doi.org/10.3390/molecules19067388
Submission received: 8 April 2014 / Revised: 29 May 2014 / Accepted: 30 May 2014 / Published: 5 June 2014

Abstract

:
Pesticide residues in wine were analyzed by liquid chromatography–tandem mass spectrometry. Retentions are modelled by structure–property relationships. Bioplastic evolution is an evolutionary perspective conjugating effect of acquired characters and evolutionary indeterminacy–morphological determination–natural selection principles; its application to design co-ordination index barely improves correlations. Fractal dimensions and partition coefficient differentiate pesticides. Classification algorithms are based on information entropy and its production. Pesticides allow a structural classification by nonplanarity, and number of O, S, N and Cl atoms and cycles; different behaviours depend on number of cycles. The novelty of the approach is that the structural parameters are related to retentions. Classification algorithms are based on information entropy. When applying procedures to moderate-sized sets, excessive results appear compatible with data suffering a combinatorial explosion. However, equipartition conjecture selects criterion resulting from classification between hierarchical trees. Information entropy permits classifying compounds agreeing with principal component analyses. Periodic classification shows that pesticides in the same group present similar properties; those also in equal period, maximum resemblance. The advantage of the classification is to predict the retentions for molecules not included in the categorization. Classification extends to phenyl/sulphonylureas and the application will be to predict their retentions.

1. Introduction

Twenty-six billion litres of wine were produced worldwide and 24 billion litres, consumed in 2010 according to the International Organization of Vine and Wine. Wine, especially red wine, is rich in polyphenols (e.g., resveratrol, catechin, epicatechin), which are antioxidants that protect cells from oxidative damage caused by free radicals. Red-wine antioxidants inhibit cancer development, e.g., that of prostate cancer. Red-wine consumption presents heart-health benefits. Application of pesticides (e.g., fungicides, insecticides) to improve grape yields is common. However, pesticides permeate via the plant tissues and remain in harvested grapes/processed products (e.g., grape juice, wine). Because pesticides are a source of toxicants that are harmful to human beings it is important to test for their levels in grapes, juice and wine. Although EU has set maximum residue levels (MRLs) for pesticides in wine grapes of 0.01–10 mg·kg−1 it has not done so for wine. An EU-wine study revealed that 34 out of 40 bottles contained at least one pesticide. Average number was >4 pesticides per bottle while the highest number was 10. Pesticide analysis in red wine is challenging because of the complexity of the matrix that contains alcohol, organic acids, sugars, phenols and pigments, e.g., anthocyanins. Traditional red-wine sample preparation methods include liquid–liquid extraction (LLE) with organic solvents [1,2] and solid-phase extraction (SPE) with reversed-phase C18/polymeric sorbents [3,4,5]. However, LLE is labour-intensive, consumes large amounts of organic solvents and forms emulsions making difficult to separate organic/aqueous phases. In contrast, SPE demands more development. Solid-phase microextraction (SPME) [6,7], hollow-fibre liquid-phase microextraction [8] and stir-bar sorptive extraction (SBSE) [9] are lesser reproducible. Typical detections incorporate gas chromatography (GC), GC coupled to mass spectrometry (MS) (GC–MS) and liquid chromatography coupled to tandem MS (LC–MS–MS).
Quick, easy, cheap, effective, rugged and safe (QuEChERS) is a sample preparation method that was reported for pesticide-residue determination in vegetables/fruits [10]; it was used for pesticide/compound analysis in various food, oil and beverage matrices [11,12,13]; QuEChERS involves pesticide extraction from a sample with high water content into acetonitrile, with addition of salts to separate phases and partition the pesticides into the organic layer, which is followed by dispersive SPE (dSPE) to clean up various matrix co-extractives and achieve mixing of an aliquot of sample extract with sorbents prepacked in a centrifuge tube. Pesticide determination in red wine was reported [14]. Eight pesticides belonging to the insecticide (methamidophos, diazinone, pyrazophos, chlorpyrifos), fungicide (carbendazim, thiabendazole, pyrimethanil, cyprodinil, pyrazophos) and parasiticide (thiabendazole) classes were selected. Their polarities are different. Some are planar (carbendazim, thiabendazole, pyrimethanil, cyprodinil). Cyprodinil was most usually detected on grapes with chlorpyrifos, diazinone and methamidophos, frequent. Carbendazim was detected in three out of six red-wine samples. Occurrence and removal efficiency of pesticides in sewage treatment plants from Spanish, Mediterranean, Brazilian and other rivers were reviewed [15,16] and reported [17,18]. Transport of organic persistent microcontaminants associated with suspended particulate material in the Ebro River Basin was described [19,20]. Several researchers have reported the quantitative structure–activity/property relationships (QSAR/QSPR) of pesticides. The Benfenati group modelled the QSPR of the octanol/water partition coefficient of organometallic substances by optimal SMILES-based descriptors [21], QSAR of the toxicity of organic substances to Daphnia magna via freeware CORAL [22], and optimal descriptor as a translator of eclectic data into endpoint prediction and mutagenicity of fullerene as a mathematical function of conditions [23]. The Roy group modelled predictive chemometrics and three-dimensional toxicophore mapping of diverse organic chemicals causing bioluminescent repression of the bacterium genus Pseudomonas [24] and QSAR for toxicity of ionic liquids to D. magna analyzing aromaticity vs. lipophilicity [25].
The chromatographic retention time was correlated to the stationary and mobile phases of the system. In earlier publications the free energy of solvation and partition coefficients in methanol–water binary mixtures were analized [26]. Stationary phase was modelled in size-exclusion chromatography with binary eluents as a strategy in size-exclusion chromatography [27]. Stationary–mobile phase distribution coefficient for polystyrene standards was represented [28]. A new chemical index inspired by plastic evolution was presented [29] and applied to valence-isoelectronic series of aromatics [30]. QSPR of retention times of phenylureas [31,32] and pesticides [33] was described by plastic evolution. A simple computerized algorithm was proposed to be useful for establishing relationships between chemical structures and biosignificance [34,35]. Starting point is to use information entropy for pattern recognition. Entropy is formulated on basis of similarity matrix between two biochemical species. As entropy is weakly discriminating for classification, the more powerful concepts of entropy production and equipartition conjecture were introduced [36]. The aim of the present report is to find properties that distinguish pesticide structures according to retention times. The study applies a chemical index to pesticides. The goal is index usefulness validation via the capability to distinguish between pesticides, and interest as a predictive index for retention as compared with fractal dimensions and partition coefficients. Section 2 illustrates and discusses the results. Section 3 presents the computational method, including classification algorithm, information entropy, equipartition conjecture of entropy production and learning procedure. Finally, the last section summarizes our conclusions.

2. Results and Discussion

For pesticides, LC–MS–MS retention times Rt were taken from Wang and Telepchak. Methamidophos was taken as the reference Rt (Rt°) because of its least Rt (cf. Table 1). Internal standard (IS) triphenyl phosphate (TPP) was included in the classification. The (RtRt°)/Rt° ratios were calculated. Molecular fractal dimensions were computed with our program TOPO [37].
Variations of (RtRt°)/Rt° vs. 1-octanol–water partition coefficient and fractal dimension averaged for nonburied atoms minus molecular fractal dimension D’D show fit. The regression turns out to be:
(Rt−Rt°)/Rt°= − 0.188 + 0.367logP + 19.6(D’−D), n = 9, r = 0.973, s = 0.337, F = 53.3 MAPE = 8.77% AEV = 0.0533
where mean absolute percentage error (MAPE) is 8.77% and approximation error variance (AEV), 0.0533. If IS TPP is excluded the results are improved:
(Rt−Rt°)/Rt°= − 0.299 + 0.420logP + 19.1(D’−D), n = 8, r = 0.982, s = 0.295, F = 67.5 MAPE = 7.17% AEV = 0.0357
and AEV decays by 33%. When D’ is included in the fit the correlation is bettered:
(Rt−Rt°)/Rt°= − 11.6 + 0.272logP + 9.44D’, n = 8, r = 0.987, s = 0.253, F = 93.2 MAPE = 5.81% AEV = 0.0261
and AEV drops by 51%. The best quadratic model vs. D’ improves the fit:
(Rt−Rt°)/Rt°= − 112 + 151 D’−49.3 D’2), n = 9, r = 0.988, s = 0.224, F = 125.2
MAPE = 5.39% AEV = 0.0234
and AEV decreases by 56%. If IS TPP is excluded the results are bettered:
(Rt−Rt°)/Rt°= − 107 + 144 D’−46.7 D’2), n = 8, r = 0.989, s = 0.228, F = 114.4
MAPE = 5.84% AEV = 0.0214
and AEV decays by 60%. Model (3) is linear and expected to perform better than Equations (4) and (5) for extrapolation. However, the latter are nonlinear and could function better than Equation (3) for intrapolation. Additional fitting parameters were tested: absolute/differential formation enthalpies, molecular dipole moment, organic solvent/water partition coefficients, free energies of solvation and water → organic solvent transfer, molecular volume, surface area, globularity, rugosity, hydrophobic, hydrophilic and total solvent accessible surfaces, and numbers of P and total atoms. However, the results do not improve Equations (3)–(5).
Table 1. Vector property (cyc123, O0345, NP, S=, N13, Cl3), retention, logP, pKa and dimensions for pesticides.
Table 1. Vector property (cyc123, O0345, NP, S=, N13, Cl3), retention, logP, pKa and dimensions for pesticides.
CompoundRt
(min)
RtRt°
(min)
(RtRt°)/Rt°logPpKaDD’
1. Methamidophos C2H8NO2PS <001010>2.780.000.00000−0.779−0.581.2351.266
2. Carbendazim C9H9N3O2 <100010>6.483.701.330941.525.661.2841.332
3. Thiabendazole C10H7N3S <110010>6.914.131.485612.473.401.2881.331
4. Pyrimethanil C12H13N3 <110010>10.437.652.751802.5584.411.3141.407
5. Cyprodinil C14H15N3 <110010>11.448.663.115113.0124.221.3441.470
6. TPP (IS) C18H15O4P <111000>11.789.003.237414.63−51.3941.504
7. Diazinone C12H21N2O3PS <111100>11.929.143.287773.7661.211.3981.509
8. Pyrazophos C14H20N3O5PS <111110>12.249.463.402882.810−1.371.4031.505
9. Chlorpyrifos C9H11NO3PSCl3 <111111>13.4210.643.827345.004−5.281.3941.494
Pearson correlation coefficient matrix R was calculated between pairs of vector properties <i1, i2, i3,i4, i5, i6> for nine pesticides. Intercorrelations are illustrated in the partial correlation diagram, which contains high (r ≥ 0.75), medium (0.50 ≤ r < 0.75), low (0.25 ≤ r < 0.50) and zero (r < 0.25) partial autocorrelations. Pairs of molecules with higher partial correlations show similar vector property. However, results should be taken with care, because Entry 9 with constant vector <111111> shows null standard deviation, causing greatest partial correlations r = 1 with any compound, which is an artefact. With the equipartition conjecture the upper triangle of R resulted:
Molecules 19 07388 i001
Some correlations are high, e.g., R3,4 = R3,5 = R4,5 = 0.984. They are illustrated in the partial correlation diagram, which contains 21 high (cf. Figure 1, red lines), seven medium (orange), one low (yellow) and seven zero (black) partial correlations. Two out of eight high partial correlations of Entry 9 are corrected: its correlation with Entry 2 is medium and its correlation with Entry 1 is zero partial correlation. For instance, pesticide 2 (carbendazim) shows medium partial correlations with molecules 3–9 (0.50 ≤ r < 0.75, orange) and low partial correlation with compound 1 (0.25 ≤ r < 0.50, yellow).
The grouping rule in the case with equal weights ak = 0.5 for b1 = 0.93 allows the classes:
C b1 = (1)(2)(3,4,5)(6)(7,8,9)
Five clusters are obtained with the associated entropy h R b1 = 10.70 matching to <i1,i2,i3,i4,i5,i6> and C b1 [38,39,40]; the binary taxonomy (Table 1) separates the classes 1, 2, 3, 4 and 5 with 1, 1, 3, 1 and 3 pesticides, respectively [41]. The planar molecules 3–5 with low retention are grouped into the same class; nonplanar thiophosphates 7–9 with the greatest retention are aggregated into the same cluster. Substances belonging to the same grouping appear highly correlated in the partial correlation diagram (Figure 1). However, C b1 results should be taken with care because classes (1), (2) and (6) with only one substance could be outliers. At level b2 with 0.74 ≤ b2 ≤ 0.76, the set of groupings turns out to be:
C − b2 = (1)(2)(3,4,5,6,7,8,9)
Three clusters result and entropy decays to h – R – b2 = 3.71 going with <i1,i2,i3,i4,i5,i6> and C − b2 dividing classes: 1–3 with 1, 1 and 7 pesticides. Again, nonplanar thiophosphates 7–9 with the greatest retention are aggregated into the same class. Compounds in the same cluster appear highly correlated in partial correlation diagram (Figure 1). Notwithstanding, C – b2 results should be taken with caution because groupings (1) and (2) with a unique compound could be outliers. Table 2 shows comparative analysis of the set containing 1–9 classes in agreement with partial correlation diagram (Figure 1).
From the previous partial correlation diagram (Figure 1) and set of nine classifications (Table 2), we suggest splitting the data into three groupings:
(1,2)(3,4,5)(6,7,8,9)
The pesticides dendrogram (cf. Figure 2) shows different behaviour depending on the number of cyles. One more time, the planar molecules 3–5 with low retention are grouped into the same class and nonplanar thiophosphates 6–9 with the greatest retention are aggregated into the same cluster.
Figure 1. Partial correlation diagram: High (red), medium (orange) and low (yellow) correlations of pesticides.
Figure 1. Partial correlation diagram: High (red), medium (orange) and low (yellow) correlations of pesticides.
Molecules 19 07388 g001
Table 2. Classification level, number of classes and entropy for vector property of pesticides.
Table 2. Classification level, number of classes and entropy for vector property of pesticides.
Classification Level bNumber of ClassesEntropy h
1.00932.49
0.98720.01
0.96615.13
0.93510.70
0.8746.77
0.7633.71
0.5121.47
0.1010.08
Figure 2. Dendrogram of pesticides according, topbottom, to: (1,2)(3,4,5)(6,7,8,9).
Figure 2. Dendrogram of pesticides according, topbottom, to: (1,2)(3,4,5)(6,7,8,9).
Molecules 19 07388 g002
The illustration of the classification above in a radial tree (cf. Figure 3) shows the different behaviour of the pesticides depending on the number of cyles. The same classes above are recognized, in qualitative agreement with partial correlation diagram and dendrogram (Figure 1 and Figure 2). One more time, planar molecules 3–5 with low retention are grouped into the same class, and nonplanar thiophosphates 6–9 with the greatest retention are aggregated into identical cluster.
Figure 3. Radial tree of pesticides according, from top to bottom, to: (1,2)(3,4,5)(6,7,8,9).
Figure 3. Radial tree of pesticides according, from top to bottom, to: (1,2)(3,4,5)(6,7,8,9).
Molecules 19 07388 g003
Program SplitsTree allows examining cluster analysis (CA) data [42]. Based on split decomposition it takes as input a distance matrix and produces as output a graph, which represents relations between taxa. For ideal data the graph is a tree whereas less ideal data will give rise to a tree-like net, which is interpreted as possible evidence for conflicting data. As split decomposition does not attempt to force data on to a tree it can provide a good indication of how tree-like are given data. In the splits graph for nine pesticides (cf. Figure 4), points 4 and 5 are superimposed on 3, and 9 on 8. It reveals conflicting relationship between class 1, and groupings 2 and 3 because of interdependences. It indicates spurious relation resulting from base-composition effects. It shows different pesticides behaviour depending on number of cycles in agreement with partial correlation diagram, binary and radial trees (Figure 1,Figure 2 and Figure 3).
Figure 4. Splits graph of pesticides according, topbottom, to: (1,2)(3,4,5)(6,7,8,9).
Figure 4. Splits graph of pesticides according, topbottom, to: (1,2)(3,4,5)(6,7,8,9).
Molecules 19 07388 g004
In QSPR, the data file contains less than 100 objects and thousands of X‑variables. So many X‑variables exist that no one can discover by inspection patterns, trends, clusters, etc. in objects. Principal components analysis (PCA) is a technique useful to summarize all information contained in X‑matrix and put it understandable [43,44,45,46,47,48]. The PCA works decomposing X‑matrix as the product of two smaller matrices P and T. Loading matrix (P) with information about the variables contains few vectors, the principal components (PCs), which are obtained as linear combinations of the original X-variables. In the score matrix (T) with information about objects, every object is described in terms of the projections on to PCs instead of the original variables: X = TP’ + E, where denotes the transpose matrix. The information not contained in the matrices remains as unexplained X-variance in the residual matrix (E). Every PCi is a new co-ordinate expressed as linear combination of old features xj: PCi = ∑jbijxj. The new co‑ordinates PCi are named scores/factors while the coefficients bij are called loadings. The scores are ordered according to their information content with regard to the total variance among all objects. Score–score plots show positions of compounds in the new co-ordinate system while loading–loading plots indicate the locations of the features that represent the compounds in the new co-ordinates. The PCs present two interesting properties. (1) They are extracted in decaying order of importance. First PC F1 always contains more information than the second F2 does, F2 more than the third F3, etc. (2) Every PC is orthogonal to one another. There is no correlation between the information contained in different PCs. A PCA was performed for the vector properties. The importance of PCA factors F1–F6 for {i1,i2,i3,i4,i5,i6} is collected in Table 3. The use of the first factor F1 explains 39% of variance (61% error), combined application of two factors F1/2 accounts for 66% of variance (34% error), utilization of factors F1–3 justifies 87% of variance (13% error), etc.
Table 3. Importance of the principal component analysis factors for vector property of pesticides.
Table 3. Importance of the principal component analysis factors for vector property of pesticides.
FactorEigenvaluePercentage of VarianceCumulative Percentage of Variance
F12.3310982938.8538.85
F21.6299831827.1766.02
F31.2548274620.9186.93
F40.385177516.4293.35
F50.335187185.5998.94
F60.063726371.06100.00
The PCA factor loadings are shown in Table 4.
Table 4. Principal component analysis loadings for the vector property of pesticides.
Table 4. Principal component analysis loadings for the vector property of pesticides.
Property PCA Factor Loadings a
F1 F2 F3 F4 F1 F6
i1 0.308227660.644747010.02673332−0.045947970.50138891−0.48485077
i2 0.448047950.45046774−0.057316130.08927231−0.760020340.08629170
i3 0.40956062−0.56947041−0.185745290.04102918−0.15327503−0.66954134
i4 0.55772042−0.179165160.178925390.595257990.325113660.40595876
i5 −0.315885770.042538380.728523100.44514023−0.20425092−0.35748077
i6 0.35450381−0.152229720.63145758−0.660116610.008227150.12881324
a Loadings greater than 0.7 are boldfaced.
The PCA F1–F3 profile for the vector property is listed in Table 5. For F1, variable i4 shows the greatest weight in the profile; however, F1 cannot be reduced to two variables {i2,i4} without 49% error. For F2, variable i1 presents the greatest weight; notwithstanding, F2 cannot be reduced to two variables {i1,i3} without 26% error. For F3, variable i5 displays the greatest weight; nevertheless, F3 cannot be reduced to two variables {i5,i6} without 7% error. For F4, variable i6 exhibits the greatest weight; however, F4 cannot be reduced to two variables {i4,i6} without 21% error. For F5, variable i2 reveals the greatest weight; notwithstanding, F5 cannot be reduced to two variables {i1,i2} without 17% error. For F6, variable i3 bares the greatest weight; nevertheless, F6 cannot be reduced to two variables {i1,i3} without 32% error. Factors F1–6 can be considered as the linear combinations of {i2,i4}, {i1,i3}, {i5,i6}, {i4,i6}, {i1,i2} and {i1,i3} with 49%, 26%, 7%, 21%, 17% and 32% errors.
Table 5. Profile of the principal component analysis factors for the vector property of pesticides.
Table 5. Profile of the principal component analysis factors for the vector property of pesticides.
Percentage of i1 a% of i2% of i3% of i4% of i5% of i6
F19.5020.0716.7731.119.9812.57
F241.5720.2932.433.210.182.32
F30.070.333.453.2053.0739.87
F40.210.800.1735.4319.8143.58
F525.1457.762.3510.574.170.01
F623.510.7444.8316.4812.781.66
a Percentages greater than 50% are boldfaced.
In PCA F2–F1 scores plot (cf. Figure 5), points 4 and 5 appear superimposed on 3. It shows different behaviour depending on number of cyles. It distinguishes three clusters: class 1 (two molecules, F1 < F2, left), grouping 2 (three compounds, F1 << F2, top) and cluster 3 (four units, F1 >> F2, right).
Figure 5. F2 versus F1 scores plot of the principal component analysis for the pesticides.
Figure 5. F2 versus F1 scores plot of the principal component analysis for the pesticides.
Molecules 19 07388 g005
From PCA factor loadings of pesticides, F2–F1 loadings plot (cf. Figure 6) depicts the six properties. In addition as a complement to the scores plot (Figure 5) for the loadings (Figure 6), it is confirmed that pesticide 2 located on the left side presents a contribution of cyc123 situated near the same side of Figure 5. Class 2 on the top shows more pronounced contribution of O0345 placed in the same position (Figure 6). Two classes of properties are clearly distinguished in the loadings plot: class 1 {cyc123,O0345,N13} (F1 < F2, top) and grouping 2 {NP,S=,Cl3} (F1 >> F2, bottom).
Figure 6. F2 versus F1 loadings plot of the principal component analysis for the pesticides.
Figure 6. F2 versus F1 loadings plot of the principal component analysis for the pesticides.
Molecules 19 07388 g006
Instead of nine pesticides in the ℜ6 space of six vector properties, consider six properties in the ℜ9 space of nine molecules. The upper triangle of matrix R between pairs of properties resulted in:
Molecules 19 07388 i002
Correlations are high, e.g., R4,6 = 0.986. Properties dendrogram (cf. Figure 7) separates cyc123 and O0345 from N13 (class 1), and Cl3 from NP/S= (cluster 2) in agreement with PCA loadings plot (Figure 6).
The radial tree for the vector properties (cf. Figure 8) separates the same two classes as PCA loadings plot and dendrogram (Figure 6 and Figure 7). Splits graph for properties (cf. Figure 9) reveals conflicting relation between classes because of interdependences. It is in agreement with PCA loadings plot and binary/radial trees (Figure 6, Figure 7 and Figure 8).
A PCA was performed for the vector properties. Factor F1 explains 50% of variance (50% error), factors F1/2 account for 69% of variance (31% error), factors F1–3 rationalize 82% of variance (18% error), etc. In PCA F2–F1 scores plot, the same two groupings of properties are distinguished: class 1 {cyc123,O0345,N13} (F1 >> F2, cf. Figure 10, right) and grouping 2 {NP,S=,Cl3} (F1 << F2, left) in qualitative agreement with PCA loadings plot, binary/radial trees and splits graph (Figure 6, Figure 7, Figure 8 and Figure 9).
Figure 7. Dendrogram for the vector properties corresponding to the pesticides.
Figure 7. Dendrogram for the vector properties corresponding to the pesticides.
Molecules 19 07388 g007
Figure 8. Radial tree for the vector properties corresponding to the pesticides.
Figure 8. Radial tree for the vector properties corresponding to the pesticides.
Molecules 19 07388 g008
Figure 9. Splits graph for the vector properties corresponding to the pesticides.
Figure 9. Splits graph for the vector properties corresponding to the pesticides.
Molecules 19 07388 g009
Figure 10. PCA F2 vs. F1 scores plot for the vector properties corresponding to pesticides.
Figure 10. PCA F2 vs. F1 scores plot for the vector properties corresponding to pesticides.
Molecules 19 07388 g010
The recommended format of the pesticides periodic table (PT, cf. Table 6) shows that they are classified first by i1, i2, i3, i4, i5 and, finally, by i6. Vertical groups are defined by <i1,i2,i3,i4,i5> and horizontal periods, by <i6>. Periods of eight units are assumed; e.g., group g00101 stands for <i1,i2,i3,i4,i5> = <00101>: <001010> (cyc0,O2,NP,S=0,N1,Cl0), etc. Pesticides in the same column appear close in partial correlation diagram, binary/radial trees, splits graph and PCA scores (Figure 1, Figure 2, Figure 3, Figure 4 and Figure 5). Phenylurea herbicides were determined in tap water/soft drink samples by HPLC–UV [49]. Table 6 includes five phenylureas: metazachlor is similar to carbendazim. Can et al. determined sulphonyl/phenylurea herbicides toxicities [50]. Table 6 includes 27 sulphonyl/phenylurea herbicides: (1) phenylureas are similar to metoxuron, monuron, diuron and linuron; (2) sulphonylureas flazasulphuron, triasulphuron, azimsulphuron and chlorsulphuron go with TPP. High-resolution and ultratrace analyses of pesticides were reported via silica (SiO2) monoliths [51]. Table 6 takes in six new pesticides: (1) metamitron and phenylurea isoproturon match metoxuron; (2) metolachlor goes with carbendazim; (3) carbofuran agrees with thiabendazole. Qualitative LC–MS analysis of pesticides was informed via monolithic SiO2 capillaries [52]. Table 6 contains two novel pesticides: phenylurea pencycuron tallies metamitron. Analytical standards were provided for persistent organic pollutants (POPs) [53]. Table 6 embraces five POPs: lindane and pentachlorobenzene equal carbetamide
Property P variation of vector <i1,i2,i3,i4,i5,i6> (cf. Figure 11) is expressed in decimal system, P = 105i1 + 104i2 + 103i3 + 102i4 + 10i5 + i6, vs. structural parameters {i1,i2,i3,i4,i5,i6} for the pesticides. Most points and lines i3/i5 collapse. For instance, for molecule 1 (methamidophos) <001010>, vector property P = 105·0 + 104·0 + 103·1 + 102·0 + 10·1 + 0 = 1010 where the structural parameters are 0 and 1, and the corresponding points are (i1 = i2 = i4 = i6 = 0, P = 1010) and (i3 = i5 = 1, P = 1010). The results show parameters hierarchy: i1 > i2 > i3 > i4 > i5 > i6 in agreement with PT of properties (Table 6) with vertical groups defined by {i1,i2,i3,i4,i5} and horizontal periods described by {i6}. The property was not used in PT development and validates it.
Figure 11. Variation of vector property P(p) of the pesticides versus counts of {i1,i2,i3}.
Figure 11. Variation of vector property P(p) of the pesticides versus counts of {i1,i2,i3}.
Molecules 19 07388 g011
Table 6. Table of periodic properties for pesticides, persistent organic pollutants, phenylureas and sulphonylureas.
Table 6. Table of periodic properties for pesticides, persistent organic pollutants, phenylureas and sulphonylureas.
P.g00100g00101g01100g10000g10001g10100g11000g11001g11100g11110g11111
p0Chlordecone**MethamidophosPFOS**Metamitron*
BDE‑99**
Metoxuron
Monuron
Diuron
Linuron
Buturon
Chlorotoluron
Daimuron
Fenuron
Methyldimuron
Fluometuron
Siduron
Neburon
Isoproturon
Pencycuron
Carbendazim
Metolachlor*
Metazachlor
AMS
BSM
CME
CNS
EMS
MSM
NCS
OXS
PSE
TFS
TBM
TFO
3FS
RMS
IDS
Carbetamide*
Prometryne*
Lindane**
PCB**
Thiabendazole
Pyrimethanil
Cyprodinil
Carbofuran*
TPP
Flazasulphuron
Triasulphuron
Azimsulphuron
Chlorsulphuron
DiazinonePyrazophos
p1 Chlorfenvinphos** Chlorpyrifos
PFOS: perfluoroctane sulphonate. BDE‑99: 2,2',4,4',5‑pentabromodiphenylether. AMS: amidosulphuron. BSM: bensulphuron‑methyl. CME: Chlorimuron‑ethyl. CNS: Cinosulphuron. EMS: ethametsulphuron‑methyl. MSM: metsulphuron‑methyl. NCS: nicosulphuron. OXS: oxasulphuron. PSE: pyrazosulphuron‑ethyl. TFS: Thifensulphuron‑methyl. TBM: Tribenuron‑methyl. TFO: Trifloxysulphuron‑Na. 3FS: triflusulphuron‑methyl. RMS: rimsulphuron. IDS: iodosulphuron. PCB: pentachlorobenzene. Regular typeface: pesticides (this work). Regular typeface *: pesticides taken from Ref. [51]. Regular typeface **: persistent organic pollutants. Italics: phenylureas. Bold: sulphonylureas.
Property P change of vector <i1,i2,i3,i4,i5,i6> in base 10 (cf. Figure 12) is represented vs. number of group in PT, for pesticides (Table 1 subset of Table 6). It reveals minima corresponding to compounds with <i1,i2,i3,i4,i5> ca. <00101> (group g00101) and maxima with <i1,i2,i3,i4,i5> ca. <11111> (group g11111). For group 6, period 2 is superimposed on 1. For instance, for group g001010 and period p0, molecule 1 (methamidophos) <001010> lies in the first group in the subset with P = 1010 and the point is (group = 1, P = 1010). Periods p0 and p1 represent rows 1 and 2, respectively, in Table 6. Function P(i1,i2,i3,i4,i5,i6) denotes two periodic waves clearly limited by two maxima, which suggest a periodic behaviour that recalls form of a trigonometric function. For <i1,i2,i3,i4,i5,i6>, a maximum is shown. Distance in <i1,i2,i3,i4,i5,i6> units between each pair of consecutive maxima is six, which coincides with pesticide sets in successive periods. The maxima occupy analogous positions and are in phase. The representative points in phase should correspond to elements in the same group in PT. For both maxima, <i1,i2,i3,i4,i5,i6> some coherence exists between two representations; however, the consistency is not general. Waves comparison shows two differences: period 1 is somewhat step-like and period 2 is incomplete. The most characteristic points are maxima, which lie about group g11111. The values of <i1,i2,i3,i4,i5,i6> are repeated as the periodic law (PL) states.
Figure 12. Variation of the vector property P(p) of the pesticides versus group number.
Figure 12. Variation of the vector property P(p) of the pesticides versus group number.
Molecules 19 07388 g012
An empirical function P(p) reproduces the different <i1,i2,i3,i4,i5,i6> values. The minimum of P(p) has meaning only if it is compared with former P(p – 1) and later P(p + 1) points needing to fulfill:
Pmin (P) < P(P−1)
Pmin (P) < P(P+1)
The order relations (6) should repeat at determined intervals equal to period size and are equivalent to:
Pmin (P) − P(P − 1) <0
P(P+1) − Pmin (P) >0
As relations (7) are valid only for minima, more general ones are desired for all values of p. Differences D(p) = P(p + 1) − P(p) are calculated assigning every value to pesticide p:
D(P) = P(P+1) − P(P)
Instead of D(p), R(p) = P(p + 1)/P(p) is taken, assigning them to pesticide p. If PL were general, elements in the same group in analogous positions in different periodic waves would satisfy:
either D(P)>0 or D(P)<0
and either R(P) >1 or R(P) <1
However, the results show that this is not the case so that PL is not general, existing some anomalies; e.g., D(p) variation vs. group number (cf. Figure 13) presents lack of coherence between <i1,i2,i3,i4,i5,i6> Cartesian and PT representations. For instance, for group g001010 and period p0, pesticide 1 (methamidophos) <001010> (group = 1, P = 1010) presents, in the next PT position, molecule 2 (carbendazim) <100010> (g100010, group = 2, P = 100010), D = 100010 − 1010 = 99000 and the point is (group = 1, D = 99000). If consistency were rigorous, all points in each period would have the same sign. In general, a trend exists in points to give D(p) > 0, especially for lower groups.
Figure 13. Variation of property D(p) = P(p + 1) − P(p) versus group. P: vector property.
Figure 13. Variation of property D(p) = P(p + 1) − P(p) versus group. P: vector property.
Molecules 19 07388 g013
The change of R(p) vs. group number (cf. Figure 14) confirms the lack of constancy between Cartesian and PT charts. For instance, for group g001010 and period p0, pesticide 1 (methamidophos) <001010> (group = 1, P = 1010) shows, in the next PT cell, molecule 2 (carbendazim) <100010> (g100010, group = 2, P = 100010), R = 100010/1010 = 99.0198 and the point is (group = 1, R = 99.0198). If the steadiness were exact, all points in each period would show R(p) either lesser or greater than one. A trend exists to give R(p) > 1, especially for the lower groups.
Figure 14. Variation of property R(p) = P(p + 1)/P(p) vs. group number. P: vector property.
Figure 14. Variation of property R(p) = P(p + 1)/P(p) vs. group number. P: vector property.
Molecules 19 07388 g014

3. Experimental

The key problem in classification studies is to define similarity indices when several criteria of comparison are involved. The first step in quantifying similarity concept for pesticides is to list the most important chemical characteristics of molecules. The vector of properties Molecules 19 07388 i013 = <i1,i2,…ik,…> should be associated with every pesticide i, whose components correspond to different molecular features in a hierarchical order according to their expected importance in retention. If characteristic m-th is chromatographically more significant for retention than k-th then m < k. Components ik are either “1” or “0”, according to whether a similar characteristic of rank k is either present or absent in pesticide i compared to a reference. Analysis includes six structural and constitutional characteristics: presence of cycle (cyc123), occurrence of either none or 3–5 O atoms (O0345), nonplanarity (NP), double-bonded S atom (S=), incidence of either one or three N atoms (N13) and existence of three Cl atoms (Cl3, cf. Figure 15). It is assumed that the chemical characteristics can be ranked according to their contribution to retention in the following order of decaying importance: cyc123 > O0345 > NP > S= > N13 > Cl3. Index i1 = 1 denotes cyc123 (i1 = 0 for cyc0), i2 = 1 means O0345 (i2 = 0 for O2), i3 = 1 signifies NP, i4 = 1 indicates S=, i5 = 1 stands for N13 (i5 = 0 for N0 or N2) and i6 = 1 represents Cl3 (i6 = 0 for Cl0). In chlorpyrifos number of cycles is one, O is three, it is NP and S=, number of N is one and number of Cl atoms is three; obviously its associated vector is <111111>. In this study chlorpyrifos was selected as reference because of its greatest retention. Table 1 contains vectors associated with nine pesticides. Vector <001010> is associated with methamidophos since it shows cyc0, O2, NP, not S=, N1 and Cl0.
Let us denote by rij(0 ≤ rij≤ 1) similarity index of two pesticides associated with vectors and , respectively. Similitude relation is characterized by similarity matrixR = [rij]. Similarity index between two pesticides Molecules 19 07388 i013 = <i1,i2,…ik…> and Molecules 19 07388 i014 = <j1,j2,…jk…> is defined as:
Molecules 19 07388 i003
where 0 ≤ ak ≤ 1 and tk = 1 if ik = jk but tk = 0 if ikjk. Definition assigns a weight (ak)k to any property involved in description of molecule i or j.
Figure 15. Pesticides: (a) methamidophos, (b) carbendazim, (c) thiabendazole, (d) pyrimethanil, (e) cyprodinil, (f) diazinone, (g) pyrazophos and (h) chlorpyrifos.
Figure 15. Pesticides: (a) methamidophos, (b) carbendazim, (c) thiabendazole, (d) pyrimethanil, (e) cyprodinil, (f) diazinone, (g) pyrazophos and (h) chlorpyrifos.
Molecules 19 07388 g015

3.1. Classification Algorithm

Grouping algorithm uses the stabilized similarity matrix by applying max–min composition rule o:
(RoS)ij = maxk [mink(rik,skj)]
where R = [rij] and S = [sij] are matrices of equal type and (RoS)ij is the (i,j)‑th element of matrix RoS[54,55,56,57]. When applying composition rule max–min iteratively so that R(n + 1) = R(n) o R, an integer n exists such that: R(n) = R(n + 1) = … Matrix R(n) is called stabilized similarity matrix. Its importance lies in fact that in classification it generates partition into disjoint classes. Stabilized matrix is designated by R(n) = [rij(n)]. Grouping rule follows: i and j are assigned to the same class if rij(n) ≥ b. Class of i noted Molecules 19 07388 i005 is set of species j that satisfies rule: rij(n) ≥ b. Matrix of classes is:
Molecules 19 07388 i004
where s stands for any index of species belonging to class Molecules 19 07388 i005(similarly for t and Molecules 19 07388 i006). Rule (13) means finding largest similarity index between species of two different classes.

3.2. Information Entropy

In information theory, information entropy h measures the surprise that source emitting sequences, e.g., cannon-shots, can give [58,59]. Consider use of qualitative spot test to determine the presence of Fe in a water sample. Without any history of testing the analyst must begin by assuming that the two outcomes 0/1 (Fe absent/present) are equiprobable with probabilities 1/2. When up to two metals may be present in sample, e.g., Fe or Ni, four possible outcomes exist, ranging from neither (0,0) to both present (1,1) with equal probabilities 1/4. Which of four possibilities turns up can be determined via two tests each having two observable states. Similarly with three elements eight possibilities exist each with probability of 1/8 = 1/23; three tests are needed. Pattern relates uncertainty and information needed to resolve it. Number of possibilities is expressed to power of 2. Power to which 2 must be raised to give number of possibilities N is defined as logarithm to base 2 of that number. Information/uncertainty can be defined in terms of logarithm to base 2 of number of possible analytical outcomes: I = H = log2 N = log2 1/p = –log2 p, where I is information contained in answer given that N possibilities existed, H, initial uncertainty resulting from need to consider N possibilities and p, probability of each outcome if all N possibilities are equally likely to occur. The expression is generalized to a situation in which the probability of every outcome is unequal. If one knows from past experience that some elements are more likely to be present than others, the equation is adjusted so that logarithms of individual probabilities suitable weighted are summed: H = – Σ pi log2 pi, where Σ pi = 1. Consider original example except that now past experience showed that 90% of samples contained no Fe. Degree of uncertainty is calculated using: H = –(0.9 log2 0.9 + 0.1 log2 0.1) = 0.469 bits. For a single event occurring with probability p degree of surprise is proportional to −ln p. Generalizing result to random variable X (which can take N possible values x1, …, xN with probabilities p1, …, pN) average surprise received on learning X value is: – Σ pi ln pi. Information entropy associated with similarity matrix R is:
Molecules 19 07388 i007
Denote by Cb set of classes and by Molecules 19 07388 i008 similarity matrix at grouping level b. Information entropy satisfies following properties. (1) h(R) = 0 if either rij = 0 or rij = 1; (2) h(R) is maximum if rij = 0.5,i.e., when imprecision is maximum; (3) h( Molecules 19 07388 i008)≤h(R) for any b, i.e., classification leads to entropy loss; (4) h Molecules 19 07388 i009h Molecules 19 07388 i010 if b1 < b2, i.e., entropy is monotone function of grouping level b.

3.3. Equipartition Conjecture of Entropy Production

In classification algorithm every hierarchical tree corresponds to entropy dependence on grouping level and diagram h − b is obtained. Tondeur and Kvaalen equipartition conjecture of entropy production is proposed as selection criterion among hierarchical trees. According to conjecture for given charge, dendrogram (binary tree) with best configuration is that in which entropy production is most uniformly distributed. One proceeds by analogy using information instead of thermodynamic one. Equipartition implies linear dependence so that equipartition line results:
heqp = hmax b
Since classification is discrete, way of expressing equipartition would be regular staircase function. Best variant is chosen to be that minimizing sum of squares of deviations:
Molecules 19 07388 i011

3.4. Learning Procedure

Learning procedures were implemented similar to those encountered in stochastic methods[60]. Consider a given partition into classes as good from practical observations, which corresponds to reference similarity matrix S = [sij] obtained for equal weights a1 = a2 = … = a and for an arbitrary number of fictious properties. Next consider the same set of species as in the good classification and actual properties. Degree of similarity rij is computed with Equation (11) giving matrix R. Number of properties for R and S differs. Learning procedure consists in finding classification results for R as close as possible to good classification. First weight a1 is taken constant and only following weights a2, a3,… are subjected to random variations. New similarity matrix is obtained using Equation (11) and new weights. Distance between partitions into classes characterized by R and S is:
Molecules 19 07388 i012
Definition was suggested by that introduced in information theory to measure distance between two probability distributions [61]. In the present case it is distance between matrices R and S. Since for every matrix a corresponding classification exists two classifications will be compared by distance, which is nonnegative quantity that approaches zero as resemblance between R and S increases. The algorithm result is a set of weights allowing classification. The procedure was applied to the synthesis of complex dendrograms using information entropy [62,63,64,65]. Our program MolClas is simple, reliable, efficient and fast procedure for molecular classification, based on equipartition conjecture of entropy production according to Equations (11)–(17); it reads number of properties and molecular properties; it allows optimization of coefficients; it optionally reads starting coefficients and number of iteration cycles. Correlation matrix can be either calculated by program or read from input file. The MolClas calculates property similarity matrix in symmetric storage mode; it applies graphical correlation model for partial correlation diagram; it computes classifications, calculates distances between clusters, computes groupings similarity matrices, works out classifications information entropy, optimizes coefficients, performs single/complete-linkage hierarchical cluster analyses and plots cluster diagrams; it was written not only to analyze the equipartition conjecture of entropy production but also to explore the world of molecular classification.

4. Conclusions

From the present results and discussion the following conclusions can be drawn:
(1)
The objective was to develop a structure–property relation for qualitative and quantitative prediction of chromatographic retention times of pesticides. Results of the present work contribute to relation prediction of pesticide residues, in food and environmental samples. Code TOPO allows fractal dimensions, and SCAP, solvation free energies and partition coefficient, which show that for a given atom energies and partitions are sensitive to the presence in the molecule of other atoms and functional groups. Fractal dimensions, partition coefficient, etc. differentiated pesticides. Parameters needed for co-ordination index are molar formation enthalpy, molecular weight and surface area. The morphological and co-ordination indices barely improved equations. Correlation between molecular area and weight points not only to a homogeneous molecular structure of pesticides, but also to the ability to predict and tailor their properties; the latter is nontrivial in environmental toxicology.
(2)
Several criteria selected to reduce the analysis to a manageable quantity of pesticides, referred to structural and constituent characteristics related to nonplanarity, and the number of rings, and O, double-bonded S, N and Cl atoms. Classification was in agreement with the principal component analyses. Program MolClas is a simple, reliable, efficient and fast procedure for molecular classification based on equipartition conjecture of entropy production. It was written to analyze equipartition conjecture of entropy production and explore molecular-classification world.
(3)
Periodic law does not satisfy physics-law status: (a) pesticides retentions are not repeated; perhaps chemical character; (b) order relations are repeated with exceptions. Analysis forces statement: Relations that any compound p has with its neighbour, p + 1, are approximately repeated for each period. Periodicity is not general; however, if substance natural order is accepted law must be phenomenological. Retention is not used in periodic-table generation and serves to validate it. The analysis of other properties would give an insight into the possible generality of the periodic table. The periodic classification was extended to phenylureas and sulphonylureas.

Acknowledgments

This work was funded by the own resources of our universities.

Author Contributions

Both authors designed and performed research, and analyzed the data; FT wrote the report. Both authors read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. De Melo Abreu, S.; Caboni, P.; Cabras, P.; Garau, V.L.; Alves, A. Validation and global uncertainty of a liquid chromatographic with diode array detection method for the screening of azoxystrobin, kresoxim-methyl, trifloxystrobin, famoxadone, pyraclostrobin and fenamidone in grapes and wine. Anal. Chim. Acta 2006, 291, 573–574. [Google Scholar]
  2. Oliva, J.; Navarro, S.; Barba, A.; Navarro, G. Determination of chlorpyrifos, penconazole, fenarimol, vinclozolin and metalaxyl in grapes, must and wine by on-line microextraction and gas chromatography. J. Chromatogr. A 1999, 833, 43–51. [Google Scholar] [CrossRef]
  3. Jiménez, J.J.; Bernal, J.L.; del Nozal, M.J.; Toribio, L.; Arias, E. Analysis of pesticide residues in wine by solid-phase extraction and gas chromatography with electron capture and nitrogen–phosphorus detection. J. Chromatogr. A 2001, 919, 147–156. [Google Scholar] [CrossRef]
  4. Wang, J.F.; Luan, L.; Wang, Z.Q.; Jiang, S.R.; Pan, C.P. Determination of 19 multi-residue pesticides in grape wine by gas chromatography-mass spectrometry with micro liquid-liquid extraction and solid phase extraction. Chin. J. Anal. Chem. 2007, 35, 1430–1434. [Google Scholar]
  5. Economou, A.; Botitsi, H.; Antoniou, S.; Tsipi, D. Determination of multi-class pesticides in wines by solid-phase extraction and liquid chromatography-tandem mass spectrometry. J. Chromatogr. A 2009, 1216, 5856–5867. [Google Scholar] [CrossRef]
  6. Hu, Y.; Liu, W.M.; Zhou, Y.M.; Guan, Y.F. Determination of organophosphorous pesticide residues in red wine by solid phase microextraction-gas chromatography. Chin. J. Chromatogr. 2006, 24, 290–293. [Google Scholar]
  7. Wu, J.; Tragas, C.; Lord, H.; Pawliszyn, J. Analysis of polar pesticides in water and wine samples by automated in-tube solid-phase microextraction coupled with high-performance liquid chromatography–mass spectrometry. J. Chromatogr. A 2002, 976, 357–367. [Google Scholar] [CrossRef]
  8. Bolaños, P.P.; Romero-González, R.; Frenich, A.G.; Vidal, J.L.M. Application of hollow fibre liquid phase microextraction for the multiresidue determination of pesticides in alcoholic beverages by ultra-high pressure liquid chromatography coupled to tandem mass spectrometry. J. Chromatogr. A 2008, 1208, 16–24. [Google Scholar] [CrossRef]
  9. Vinas, P.; Aguinaga, N.; Campillo, N.; Hernández-Córdoba, M. Comparison of stir bar sorptive extraction and membrane-assisted solvent extraction for the ultra-performance liquid chromatographic determination of oxazole fungicide residues in wines and juices. J. Chromatogr. A 2008, 1194, 178–183. [Google Scholar] [CrossRef]
  10. Anastassiades, M.; Lehotay, S.J.; Stajnbaher, D.; Schenck, F.J. Fast and easy multiresidue method employing acetonitrile extraction/partitioning and “dispersive solid-phase extraction” for the determination of pesticide residues in produce. J. AOAC Int. 2003, 86, 412–431. [Google Scholar]
  11. Lehotay, S.J. Mass Spectrometry in Food Safety; Zweigenbaum, J., Ed.; Methods in Molecular Biology No. 747; Humana: Totowa, NJ, USA, 2011; pp. 65–91. [Google Scholar]
  12. Cunha, S.C.; Lehotay, S.J.; Mastovska, K.; Fernandes, J.O.; Beatriz, M.; Oliveira, P.P. Evaluation of the QuEChERS sample preparation approach for the analysis of pesticide residues in olives. J. Sep. Sci. 2007, 30, 620–632. [Google Scholar] [CrossRef]
  13. Whelan, M.; Kinsella, B.; Furey, A.; Moloney, M.; Cantwell, H.; Lehotay, S.J.; Danaher, M. Determination of anthelmintic drug residues in milk using ultra high performance liquid chromatography–tandem mass spectrometry with rapid polarity switching. J. Chromatogr. A 2010, 1217, 4612–4622. [Google Scholar] [CrossRef]
  14. Wang, X.; Telepchak, M.J. Determination of pesticides in red wine by QuEChERS extraction, rapid mini-cartridge cleanup and LC–MS–MS detection. LC·GC Eur. 2013, 26, 66–76. [Google Scholar]
  15. Blasco, C.; Picó, Y. Prospects for combining chemical and biological methods for integrated environmental assessment. Trends Anal. Chem. 2009, 28, 745–757. [Google Scholar] [CrossRef]
  16. de Umbuzeiro, A.G. (Ed.) Guia de Potabilidade para Substàncias Químicas. Limiar: São Paulo, Brazil, 2012. Available online: http://www.abes-sp.org.br/arquivos/ctsp/guia_potabilidade.pdf (accessed on 5 June 2014).
  17. Moganti, S.; Richardson, B.J.; McClellan, K.; Martin, M.; Lam, P.K.S.; Zheng, G.J. Use of the clam Asaphis deflorata as a potential indicator of organochlorine bioaccumulation in Hong Kong coastal sediments. Mar. Pollut. Bull. 2008, 57, 672–680. [Google Scholar] [CrossRef]
  18. Barco-Bonilla, N.; Romero-González, R.; Plaza-Bolaños, P.; Garrido Frenich, A.; Martínez Vidal, J.L. Analysis and study of the distribution of polar and non–polar pesticides in wastewater effluents from modern and conventional treatments. J. Chromatogr. A 2010, 1217, 7817–7825. [Google Scholar] [CrossRef]
  19. Navarro, A.; Tauler, R.; Lacorte, S.; Barceló, D. Chemometrical investigation of the presence and distribution of organochlorine and polyaromatic compounds in sediments of the Ebro River Basin. Anal. Bioanal. Chem. 2006, 385, 1020–1030. [Google Scholar] [CrossRef]
  20. Navarro-Ortega, A.; Tauler, R.; Lacorte, S.; Barceló, D. Occurrence and transport of PAHs, pesticides and alkylphenols in sediment samples along the Ebro River Basin. J. Hydrol. 2010, 383, 5–17. [Google Scholar] [CrossRef]
  21. Toropov, A.A.; Toropova, A.P.; Benfenati, E. QSPR modelling of the octanol/water partition coefficient of organometallic substances by optimal SMILES-based descriptors. Cent. Eur. J. Chem. 2009, 7, 846–856. [Google Scholar] [CrossRef]
  22. Toropova, A.P.; Toropov, A.A.; Benfenati, E.; Gini, G. QSAR models for toxicity of organic substances to Daphnia magna built up by using the CORAL freeware. Chem. Biol. Drug Des. 2011, 79, 332–338. [Google Scholar]
  23. Toropov, A.A.; Toropova, A.P. Optimal descriptor as a translator of eclectic data into endpoint prediction: Mutagenicity of fullerene as a mathematical function of conditions. Chemosphere 2014, 104, 262–264. [Google Scholar] [CrossRef]
  24. Kar, S.; Roy, K. Predictive chemometric modeling and three-dimensional toxicophore mapping of diverse organic chemicals causing bioluminescent repression of the bacterium genus Pseudomonas. Ind. Eng. Chem. Res. 2013, 52, 17648–17657. [Google Scholar] [CrossRef]
  25. Roy, K.; Das, R.N.; Popelier, P.L.A. Quantitative structure–activity relationship for toxicity of ionic liquids to Daphnia magna: Aromaticity vs. lipophilicity. Chemosphere 2014, 112, 120–127. [Google Scholar] [CrossRef]
  26. Torrens, F. Free energy of solvation and partition coefficients in methanol–water binary mixtures. Chromatographia 2001, 53, S199–S203. [Google Scholar] [CrossRef]
  27. Soria, V.; Campos, A.; Figueruelo, J.E.; Gómez, C.; Porcar, I.; García, R. Modelling of stationary phase in size-exclusion chromatography with binary eluents. In Strategies in Size Exclusion Chromatography; Potschka, M., Dubin, P.L., Eds.; ACS Symposium Series. No. 635; American Chemical Society: Washington, DC, USA, 1996; Volume Cnapter 7, pp. 103–126. [Google Scholar]
  28. Torrens, F.; Soria, V. Stationary-mobile phase distribution coefficient for polystyrene standards. Sep. Sci. Technol. 2002, 37, 1653–1665. [Google Scholar] [CrossRef]
  29. Torrens, F. A new chemical index inspired by biological plastic evolution. Indian J. Chem. Sect. A 2003, 42, 1258–1263. [Google Scholar]
  30. Torrens, F. A chemical index inspired by biological plastic evolution: Valence-isoelectronic series of aromatics. J. Chem. Inf. Comput. Sci. 2004, 44, 575–581. [Google Scholar] [CrossRef]
  31. Torrens, F.; Castellano, G. QSPR prediction of retention times of phenylurea herbicides by biological plastic evolution. Curr. Drug Saf. 2012, 7, 262–268. [Google Scholar] [CrossRef]
  32. Torrens, F.; Castellano, G. Molecular categorization of phenylurea and sulphonylurea herbicides, pesticides and persistent organic pollutants. In QSAR in Drug and Environmental Research; Roy, K., Ed.; IGI Global: Hershey, PA, USA, 2015; in press.
  33. Torrens, F.; Castellano, G. QSPR prediction of chromatographic retention times of pesticides: Partition and fractal indices. J. Environ. Sci. Health Part B 2014, 49, 400–407. [Google Scholar] [CrossRef]
  34. Varmuza, K. Pattern Recognition in Chemistry; Springer: New York, NY, USA, 1980. [Google Scholar]
  35. Benzecri, J.P. L’Analyse des Données; Dunod: Paris, France, 1984; volume 1. [Google Scholar]
  36. Tondeur, D.; Kvaalen, E. Equipartition of entropy production. An optimality criterion for transfer and separation processes. Ind. Eng. Chem. Fundam. 1987, 26, 50–56. [Google Scholar] [CrossRef]
  37. Torrens, F. Characterizing cavity-like spaces in active-site models of zeolites. Comput. Mater. Sci. 2003, 27, 96–101. [Google Scholar] [CrossRef]
  38. IMSL. In Integrated Mathematical Statistical Library (IMSL); IMSL: Houston, TX, USA, 1989.
  39. Tryon, R.C. A multivariate analysis of the risk of coronary heart disease in Framingham. J. Chronic Dis. 1939, 20, 511–524. [Google Scholar]
  40. Jarvis, R.A.; Patrick, E.A. Clustering using a similarity measure based on shared nearest neighbors. IEEE Trans. Comput. 1973, C22, 1025–1034. [Google Scholar] [CrossRef]
  41. Page, R.D.M. Program TreeView; Universiy of Glasgow: Glasgow, UK, 2000. [Google Scholar]
  42. Huson, D.H. SplitsTree: Analizing and visualizing evolutionary data. Bioinformatics 1998, 14, 68–73. [Google Scholar] [CrossRef]
  43. Hotelling, H. Analysis of a complex of statistical variables into principal components. J. Educ. Psychol. 1933, 24, 417–441. [Google Scholar] [CrossRef]
  44. Kramer, R. Chemometric Techniques for Quantitative Analysis; Marcel Dekker: New York, NY, USA, 1998. [Google Scholar]
  45. Patra, S.K.; Mandal, A.K.; Pal, M.K. State of aggregation of bilirubin in aqueous solution: Principal component analysis approach. J. Photochem. Photobiol. A 1999, 122, 23–31. [Google Scholar] [CrossRef]
  46. Jolliffe, I.T. Principal Component Analysis; Springer: Berlin, Germany, 2002. [Google Scholar]
  47. Xu, J.; Hagler, A. Chemoinformatics and drug discovery. Molecules 2002, 7, 566–600. [Google Scholar] [CrossRef]
  48. Shaw, P.J.A. Multivariate Statistics for the Environmental Sciences; Hodder-Arnold: New York, NY, USA, 2003. [Google Scholar]
  49. Kaur, M.; Malik, A.K.; Singh, B. Determination of phenylurea herbicides in tap water and soft drink samples by HPLC–UV and solid-phase extraction. LC·GC Eur. 2013, 25, 120–129. [Google Scholar]
  50. Can, A.; Yildiz, I.; Guvendik, G. The determination of toxicities of sulphonylurea and phenylurea herbicides with quantitative structure–toxicity relationship (QSTR) studies. Environ. Toxicol. Pharmacol. 2013, 35, 369–379. [Google Scholar] [CrossRef]
  51. Cabrera, K.; Altmaier, S. High-resolution and ultra trace analysis of pesticides using silica monoliths. Int. Labmate 2013, 38, 4–5. [Google Scholar]
  52. Forster, S.; Altmaier, S. Qualitative LC–MS analysis of pesticides using monolithic silica capillaries and potential for assay of pesticides in kidney. LC·GC Eur. 2013, 26, 488–496. [Google Scholar]
  53. Nold, M. Analytical standards for persistent organic pollutants. Analytix 2009, 2009, 11–12. [Google Scholar]
  54. Kaufmann, A. Introduction à la Théorie des Sous-ensembles Flous; Masson: Paris, France, 1975; volume 3. [Google Scholar]
  55. Cox, E. The Fuzzy Systems Handbook; Academic: New York, NY, USA, 1994. [Google Scholar]
  56. Kundu, S. The min–max composition rule and its superiority over the usual max–min composition rule. Fuzzy Sets Sys. 1998, 93, 319–329. [Google Scholar] [CrossRef]
  57. Lambert-Torres, G.; Pereira Pinto, J.O.; Borges da Silva, L.E. Minmax techniques. In Wiley Encyclopedia of Electrical and Electronics Engineering; Wiley: New York, NY, USA, 1999. [Google Scholar]
  58. Shannon, C.E. A mathematical theory of communication: Part I, discrete noiseless systems. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  59. Shannon, C.E. A mathematical theory of communication: Part II, the discrete channel with noise. Bell Syst. Tech. J. 1948, 27, 623–656. [Google Scholar] [CrossRef]
  60. White, H. Neural network learning and statistics. AI Expert 1989, 4, 48–52. [Google Scholar]
  61. Kullback, S. Information Theory and Statistics; Wiley: New York, NY, USA, 1959. [Google Scholar]
  62. Iordache, O.; Corriou, J.P.; Garrido-Sánchez, L.; Fonteix, C.; Tondeur, D. Neural network frames. application to biochemical kinetic diagnosis. Comput. Chem. Eng. 1993, 17, 1101–1113. [Google Scholar] [CrossRef]
  63. Iordache, O. Modeling Multi-Level Systems; Springer: Berlin, Germany, 2011. [Google Scholar]
  64. Iordache, O. Self-Evolvable Systems: Machine Learning in Social Media; Springer: Berlin, Germany, 2012. [Google Scholar]
  65. Iordache, O. Polytope Projects; CRC: Boca Raton, FL, USA, 2014. [Google Scholar]
  • Sample Availability: Not available.

Share and Cite

MDPI and ACS Style

Torrens, F.; Castellano, G. Molecular Classification of Pesticides Including Persistent Organic Pollutants, Phenylurea and Sulphonylurea Herbicides. Molecules 2014, 19, 7388-7414. https://doi.org/10.3390/molecules19067388

AMA Style

Torrens F, Castellano G. Molecular Classification of Pesticides Including Persistent Organic Pollutants, Phenylurea and Sulphonylurea Herbicides. Molecules. 2014; 19(6):7388-7414. https://doi.org/10.3390/molecules19067388

Chicago/Turabian Style

Torrens, Francisco, and Gloria Castellano. 2014. "Molecular Classification of Pesticides Including Persistent Organic Pollutants, Phenylurea and Sulphonylurea Herbicides" Molecules 19, no. 6: 7388-7414. https://doi.org/10.3390/molecules19067388

Article Metrics

Back to TopTop