A new approach in topological descriptors usage. Iterated line graphs in the theoretical prediction of physico-chemical properties of saturated hydrocarbons

Keywords: topological descriptors, line (edge) graph, regression analysis, determination coefficient, leave-one-out cross-validation, Y-scrambling, “forgotten” index

Abstract

A new look on the problem of the molecular systems index description is presented. The capabilities of iterated line (edge) graphs in characterization of saturated hydrocarbons properties were investigated. It was demonstrated that single selected molecular (graph-theoretical (topological) or informational) descriptor calculated for the sequence of nested line graphs provides quite reliable progressive set of regression equations. Hence, the problem of descriptor set reduction is solved in the presented approach at list partially. Corresponding program complex (QUASAR) has been implemented with Python 3 program language. As the test example physico-chemical properties of octane isomers have been chosen. Among the properties under investigation there are boiling point, critical temperature, critical pressure, enthalpy of vaporization, enthalpy of formation, surface tension and viscosity. The corresponding rather simple linear regression equations which include one, two or three parameters correspondingly have been obtained. The predictive ability of the equations has been investigated using internal validation tests. The test by leave-one-out (LOO) validation and Y‑scrambling evaluate the obtained equations as adequate. For instance, for the regression model for boiling point the best equation characterizes by determination coefficients R2 = 0.943, with LOO procedure – Q2 = 0.918, while for the Y-scrambling test Q2y-scr<0.3 basically.

It is shown that all the abovementioned molecular properties in iterated line graph approach can be effectively described by commonly used topological indices. Namely almost every randomly selected topological index can give adequate equation. Effectiveness is demonstrated on the example of Zagreb group indices. Also essential effectiveness and rather universal applicability of the so-called “forgotten” index (ZM3) was demonstrated.

Downloads

Download data is not yet available.

References

Kubinyi H. QSAR: Hansch analysis and related approaches. VCH: 1993.

Roy K., Kar S., Das N.R. A Primer on QSAR/QSPR Modeling. Fundamental Concepts. Springer: 2015.

Hastie T., Tibshirani R., Wainwright M. Statistical Learning with Scarcity. The Lasso and Generalizations. CRC Press: 2015.

Talete SRL Homepage [online] http://www.talete.mi.it/products/dragon_description.htm (ac-cessed March 10, 2019)

Wiener H. Structural determination of paraffin boiling points. J. Am. Chem. Soc. 1947, 69(1), 17-20.

Randić M. On Characterization of Molecular Branching. J. Am. Chem. Soc. 1975, 97(23), 6609-6615.

Randić M. On Characterization of Chemical Structure. J. Chem. Inf. Comput. Sci. 1997, 37(4), 672-672.

Todeschini R., Consonni V. Handbook of Molecular Descriptors. New York, Wiley-VCH Verlag: 2000, 667.

Stankevic M.I., Stankevic I.V., Zefirov N.S. Topologicheskie indeksy v organicheskoy khimii. Uspechi khimii. 1988, 57(3), 337-366.

Ali A., Trinajstic A. Novel/Old Modification of the first Zagreb Index. Mol. Inf. 2018, 37, 1800008.

Furtula B, Gutman I. A forgotten topological index. J. Math. Chem. 2015, 53, 1184-1190.

Pogliani L. From Molecular Connectivity Indices to Semiempirical Connectivity Terms: Re-cent Trends in Graph Theoretical Descriptors. Chem. Rev. 2000, 100, 3827-3858.

Zakharov A.B., Dyachenko A.V., Ivanov V.V. Topological characteristics of iterated line graphs in QSAR problem: Octane numbers of saturated hydrocarbons // Journal of Chemomet-rics. 2019, e3169, 1-10.

Yaws C.L.. Yaws' Handbook of Thermodynamic and Physical Properties of Chemical Com-pounds: Physical, Thermodynamic and Transport Properties for 5,000 Organic Chemical Compounds. McGraw-Hill: 2003.

Estrada E. Spectral Moments of the Edge Adjacency Matrix in Molecular Graphs. 1. Defini-tion and Applications to the Prediction of Physical Properties of Alkanes. J. Chem. Inf. Com-put. Sci. 1996, 36, 844-849.

Diudea M.V., Horvath D., Graovac A. Molecular Topology. 15. 3D Distance Matrixes and Related Topological Indices. J. Chem. Inf Comput. Sci. 1995, 35, 129-135.

Diudea M.V., Horvath D., Bonchev D. Molecular Topology. 14. Molord Algorithm and Real Number Subgraph Invariants. Croat. Chem. Acta. 1995, 68, 131-148.

Tropsha A., Gramatica P., Gombar V.K. The Importance of Being Earnest: Validation is the Absolute Essential for Successful Application and Interpretation of QSPR. QSAR Comb. Sci. 2003, 22, 69-77.

Hosamani S., Perigidad D., Jamagoud S. QSPR Analysis of Certain Degree Based Topological Indices // J. Stat. Appl. Pro. 2017, 6(2), 361-371.

Published
2019-06-14
Cited
How to Cite
Zakharov, A. B., & Ivanov, V. V. (2019). A new approach in topological descriptors usage. Iterated line graphs in the theoretical prediction of physico-chemical properties of saturated hydrocarbons. Kharkiv University Bulletin. Chemical Series, (32), 38-45. https://doi.org/10.26565/2220-637X-2019-32-02