3.1. Molecular Dynamics Simulations Results
MD simulations provided detailed insights into the thermodynamic and dynamic properties of the bulk liquid systems of G-series and A-series agents. For the G-series agents, the simulated densities were validated through comparison with available experimental data from previous studies [
22,
23]. The experimentally reported densities for Sarin, Soman, and Tabun were 1.10 g/cm
3, 1.02 g/cm
3, and 1.07 g/cm
3, respectively [
2,
13]. As shown in
Table 2 and
Figure 2, the MD-derived values exhibit good agreement with these references across both charge models (eQeQ and ML), supporting the accuracy of the employed force fields.
Sarin and Soman show minimal variation between the two charge models, with calculated densities closely matching experimental values within 0.01–0.02 g/cm3. This suggests that both the eQeQ and ML charge assignment schemes are suitable for simulating the bulk-phase behavior of these agents. In contrast, Tabun shows a more significant discrepancy between the two MD charge models, with the ML-based simulation yielding a higher density (1.16 g/cm3), deviating from both the experimental value (1.07 g/cm3) and the ChemSpider in silico estimation (1.08 ± 0.1 g/mL). This may indicate that the ML-derived partial charges overestimate intermolecular interactions for Tabun, possibly due to the specific electronic environment of the agent’s substituents. In general, the eQeQ model results in calculated densities within the expected experimental uncertainty of 0.01–0.005 g/mL.
Overall, the consistency between MD-derived, experimental, and in silico densities confirms the reliability of the modeling approach for capturing key thermophysical properties of the G-series compounds. The comparison further underscores the importance of evaluating multiple charge models, especially for compounds with distinct functional groups or electronic characteristics.
For the A-series agents, the calculated density values were derived from extensive NPT molecular dynamics (MD) simulations using two distinct structural models: the phosphoramidofluoridate structures proposed by Mirzayanov and the phosphorofluoridate structures proposed by Ellison and Hoenig. The simulations employed two charge assignment schemes (eQeQ and ML) and were benchmarked against both experimental data and ChemSpider-derived in silico estimates to evaluate consistency and predictive accuracy.
As shown in
Table 3 and
Figure 3, the experimental densities exhibit a decreasing trend from A230 (1.612 g/cm
3) to A234 (1.414 g/cm
3), likely reflecting structural modifications and reduced molecular packing efficiency with increasing molecular mass and steric complexity.
The MD simulations using Mirzayanov’s structures consistently yielded lower density values, regardless of the charge assignment method, with results falling between 1.05 and 1.17 g/cm3. These values are significantly below experimental references and suggest that Mirzayanov’s structures may underestimate the degree of molecular packing in the liquid phase. This is likely due to the presence of the direct phosphorus–nitrogen (P–N) bond, which modifies the electron distribution around the phosphorus center and alters intermolecular interaction potentials.
In contrast, simulations based on the Ellison–Hoenig structures produced densities much closer to experimental values. Particularly, under the eQeQ model, the calculated values show excellent agreement: e.g., A230 (1.608 g/cm3) compared to 1.612 g/cm3 experimentally. These results support the idea that the phosphorofluoridate representation better captures the structural and electronic environment of these agents, especially with respect to packing efficiency and cohesive energy density. The in silico values retrieved from ChemSpider were relatively consistent across all three agents, around 1.1 ± 0.1 g/mL. Summarizing results for A-agents, it is noticeable that Ellison–Hoenig structures exhibit calculated densities close to the experimental ones, whilst Mirzayanov structures are close to those calculated via Chemspider. Since experimental values are not reportedly attached to either Mirzayanov or Ellison–Hoenig structures, we conclude that they represent measurements of the Ellison–Hoenig structures.
The heat capacity (Cp) calculated values ranged from 258 to 263 J/mol·K for Sarin models to 343 to 350 J/mol·K for Soman models, with Tabun being in the middle range (
Table 4). In general, eQeQ and ML models predict quite close values to each other for the same substance [
26], values with differences inside the statistical noise window.
The specific heat capacity (Cp) for the A-series agents varied between 366.19 J/mol·K for A230 and 462.58 J/mol·K for A234 (
Table 5). The values calculated for the A-series agents were relatively higher across the groups in comparison with those of the G-series, independently of the proposed structure. Furthermore, the differences between the Cp calculated values for the two alternative proposed structures for each of the A-series agents were also relatively significantly different for each compound (
Table 5).
The self-diffusion coefficient was calculated with the Einstein relationship (Equation (1)) using data from mean square displacement of molecules over a wide range of time.
Molecular diffusivity values obtained from this mean square displacement analysis demonstrated that Sarin has the highest diffusivity (0.82 × 10
−9 m
2/s), followed by Tabun and Soman (
Table 6). The A-series agents’ molecular diffusivity values were comparable to those calculated for Tabun when the Mirzayanov molecular structures were considered. The results for the A-series agents with the Ellison–Hoenig-proposed structure were significantly lower, with the minimum values presented for the A232 agent (
Table 7).
In the following, pair radial distribution functions (pRDFs) are presented and discussed. A pRDF, often denoted as g(r), is a statistical measurement of the average intemolecular distances in a distance r, between a reference particle and another particle, belonging to adjacent molecules. Therefore, in small distances, g(r) values are almost always zero, due to stereochemical reasons and repulsive forces, whilst in large distances, values equal unity, since the local density of specific particles equals the corresponding bulk density in the simulation cell. In intermediate distances, peaks over 1 signify a stronger connection of specific particles (i.e., selected molecular sites are found in the particular distance with higher probability), and the sharper a selected peak is, the more distinct the intermolecular structure can be considered.
The RDFs for G-series agents (
Figure 4,
Figure 5 and
Figure 6) displayed distinct and well-defined peaks. Sarin (
Figure 4) exhibited the most pronounced first peak at approximately 3.4 Å, indicative of strong local ordering and short-range interactions between methyl carbon and the phosphoryl oxygen. Soman (
Figure 5) showed a sharper, slightly shifted peak around 4.1 Å, particularly under the ML-derived charge model, suggesting tighter local packing due to the presence of its bulkier alkyl substituent. Tabun (
Figure 6) presented a dual-peak structure of moderate intensity, reflecting a more complex spatial organization influenced by the nitrile group, which modifies the electronic distribution around the phosphorus atom.
For A-series agents in the Ellison–Hoenig configuration (
Figure 7,
Figure 8 and
Figure 9), RDFs demonstrated highly ordered first peaks. In particular, A232eh (
Figure 7) displayed a sharp and narrow peak with g(r) values exceeding 4.0, suggesting strong directional interactions and tight molecular packing similar to classical G-series agents. The other Ellison–Hoenig structures, A230eh (
Figure 7) and A234eh (
Figure 9), also showed well-defined short-range order, consistent with their phosphorofluoridate bonding and tetrahedral geometry.
In contrast, the Mirzayanov structures (
Figure 10,
Figure 11 and
Figure 12) exhibited broader, less intense RDF peaks. A230m (
Figure 10) and A232m (
Figure 10) revealed weaker, more diffuse first coordination shells, beginning around 4.0 Å and peaking at lower g(r) values than their Ellison and Hoenig counterparts. A234m (
Figure 12) displayed a slightly sharper first peak, though still less structured compared to the EH configuration. These differences suggest a more relaxed and disordered local environment, likely a result of the direct phosphorus–nitrogen bonding in the Mirzayanov structures, which modifies both electron distribution and steric interactions.
Across all systems, the choice of partial charge model (eQeQ vs. ML) significantly influenced RDF behavior. ML-derived charges generally led to broader and lower-intensity g(r) peaks, indicating a prediction of reduced local density clustering and weaker intermolecular correlations. This effect was especially pronounced in A230m and A232m, where the ML curves produced smoother transitions between coordination shells and lower peak intensities overall.
These RDF results reinforce the conclusion that both structural representation and charge model substantially affect the predicted liquid-phase behavior of nerve agents. These distinctions are not merely computational artifacts—they may have real implications for molecular diffusion, solubility, biological target interaction, and environmental persistence of these highly toxic compounds.
Molecular dynamics simulations of G- and A-type nerve agents demonstrated consistent behavior of density, diffusivity, and intermolecular structure consistent with past computational results. The density of the two agent classes was in close accordance with past MD studies, with G-agents having values close to previously determined [
14,
22], but A-agents such as A230, A232, and A234 revealed comparable behavior consistent with Mirzayanov’s model of structure [
13]. Diffusivity analysis revealed greater molecular motion of G-agents due to more supple conformational moieties, compared to sterically hindered A-series molecules. RDF analysis further revealed characteristic short-range ordering of active molecular sites, consistent with RDF patterns of other MD simulations of polar molecular liquids [
13,
14]
Moreover, density values for the A-series nerve agents A-230, A-232, and A-234, calculated with the molecular dynamics (MD) simulations, revealed significant variations depending on the structural models employed. Based on Ellison and Hoenig’s phosphorofluoridate structures, the model produced density values closely aligned with the experimental results. This may suggest that these proposed molecular structures can accurately represent the packing efficiency and intermolecular interactions of these agents [
1,
2]. The simulations implementing Mirzayanov’s phosphoramidofluoridate structures produced consistently lower densities. This discrepancy can be attributed to the presence of a direct phosphorus–nitrogen (P-N) bond in Mirzayanov’s structures, which alters the electronic environment around phosphorus, reducing molecular packing efficiency compared to the phosphorus–oxygen (P=O) bond proposed by Ellison and Hoenig.
3.2. QSAR Modeling Results
Quantitative Structure–Activity Relationship (QSAR) modeling was employed to evaluate key physicochemical and toxicokinetic properties of selected G-series and A-series nerve agents, with an emphasis on parameters relevant to environmental fate and biological behavior. One of the primary physicochemical descriptors analyzed was vapor pressure at 20 °C, which is directly related to the volatility and environmental dispersal potential of these compounds. The G-series agents Sarin and Soman demonstrated relatively high vapor pressures (2.10 mmHg and 0.40 mmHg, respectively), indicative of their known volatility. In contrast, the A-series compounds—particularly A232 and A234—displayed substantially lower vapor pressures (0.01–0.03 mmHg), suggesting a markedly reduced tendency to vaporize under ambient conditions (
Table 8).
These findings suggest that A-series agents may display greater environmental persistence due to their reduced volatility. Boiling point estimates further support this interpretation, with values ranging from 158 °C for Sarin to approximately 260 °C for A234, independently of the proposed molecular structure. Differences between the Mirzayanov and Ellison–Hoenig structural variants of the A-series agents reflect a shift in both vapor pressure and boiling point, with the Ellison–Hoenig structures generally exhibiting slightly higher boiling points and lower volatilities. These results underscore the influence of molecular structure on the thermodynamic behavior of nerve agents and support the utility of MD simulations in differentiating between proposed isomeric forms.
In parallel, QSAR analysis was extended to include lipophilicity (LogP), a pivotal descriptor influencing a compound’s bioavailability, tissue distribution, and potential for bioaccumulation. The modeling results revealed significant divergence in LogP values between the G-series and A-series agents, consistent with their differing absorption profiles and mechanisms of toxicological action [
9]. When considered alongside vapor pressure and boiling point data, the LogP findings contribute to a multidimensional understanding of the agents’ environmental mobility and biological interaction potential, reinforcing the value of integrated QSAR approaches in the early-stage hazard characterization of chemical warfare agents.
The calculated LogP values were 0.3 for Sarin, 2.10 for Soman, and 1.70 for Tabun (
Table 9). Soman’s higher lipophilicity suggests a greater potential for membrane penetration and systemic absorption, consistent with its higher toxicity. LD
50 values predicted by the QSAR models showed Soman as the most toxic agent (LD
50 = 0.012 mg/kg), followed by Sarin (LD
50 = 0.014 mg/kg) and Tabun (LD
50 = 0.017 mg/kg) [
4] (
Table 9).
Additional descriptors such as molecular weight, hydrogen bond donors, and polar surface area (PSA) were analyzed to correlate with the agents’ biological activity. Sarin’s lower PSA suggests its rapid systemic absorption, while Tabun’s slightly higher PSA may reduce its overall bioavailability.
According to the QSAR model’s results, LogP values ranged from 1.70 for Tabun to 2.10 for Soman. These values closely resemble those reported in previous QSAR studies on structurally related compounds, including VX (LogP ~ 2.20) and paraoxon (LogP ~ 1.85) [
33]. The correlation between LogP and toxicity observed in this study is also consistent with findings from studies on pesticide-related organophosphates, where increased lipophilicity was linked to higher bioaccumulation potential and AChE inhibition potency [
15,
17,
19].
Furthermore, the predicted LD
50 values of Sarin (0.014 mg/kg), Soman (0.012 mg/kg), and Tabun (0.017 mg/kg) were in strong agreement with experimental data from the TOXNET and DSSTox databases, reinforcing the reliability of the QSAR model applied. These results align with previous QSAR analyses of nerve agents, which demonstrated that LD50 values could be accurately predicted using molecular descriptors such as hydrogen bond acceptors, molecular weight, and electronic properties [
30].
The calculated LogP values for the A-series agents ranged from 2.14 (A-230) to 2.97 (A-234), indicating higher lipophilicity compared to the G-series compounds. The elevated LogP values are consistent with the more complex and hydrophobic substituents found in the A-series molecular structures, which could enhance membrane permeability and tissue distribution potential. A-234 exhibited the highest lipophilicity, which aligns with its larger molecular weight and surface area, potentially contributing to prolonged biological retention. Recent studies employing cheminformatics and predictive toxicology tools, such as those by Noga and Opravil [
8,
11], have reported LogP values within a similar range, reinforcing the conclusion that A-series agents exhibit high lipophilicity. This property is associated with increased membrane permeability and the potential for bioaccumulation, particularly for A-234.
The predicted LD50 values for the A-series agents were 0.51 mg/kg for A-234 and around 1.22–1.23 mg/kg for A-230 and A-232, suggesting lower acute toxicity than their G-series counterparts. While these values may seem relatively high for these agents, it is important to note that these predictions are based on extrapolation from a model primarily trained on G-series and related organophosphorus structures.
The results highlight structural features influencing QSAR-predicted toxicity, such as molecular size, lipophilicity, and polar surface area. A-234, which has the highest PSA (51.7 Å2) among the A-agents, also shows the lowest predicted LD50 within its group, potentially reflecting an increased likelihood of biological interaction. These observations offer useful insight into how computational models might differentiate between subtle structural variants, even within a class of closely related compounds.