Abstract
Principal component-genetic algorithm-multiparameter linear regression (PC-GA-MLR) and principal component-genetic algorithm-artificial neural network (PC-GA-ANN) models were applied for prediction of the basicity constant (pK b ) for various pyridines (91 compounds) dissolved in water at 25°C. A large number of theoretical descriptors were calculated for each compound. The first 54 principal components (PCs) were found to explain more than 99.9% of variances in the original data matrix. From the pool of these PCs, the genetic algorithm was employed for selection of the best set of extracted PCs for PC-MLR and PC-ANN models. The models were generated using eight principal components as variables. For evaluation of the predictive power of the models, pK b values of 18 compounds in the prediction set were calculated. Mean percentage deviation (MPD) for PC-GA-MLR and PC-GA-ANN models are 21.096 and 3.541. Comparison of the results obtained by the models reveals superiority of the PC-GA-ANN model relative to the PC-GA-MLR model. The improvements are due to the fact that the pK b of the pyridines demonstrate non-linear correlations with the principal components.
Similar content being viewed by others
References
XJ Yao YW Wang XY Zhang RS Zhang MC Liu ZD Hu BT Fan (2002) Chemom Intell Lab Syst 62 217 Occurrence Handle10.1016/S0169-7439(02)00017-5 Occurrence Handle1:CAS:528:DC%2BD38XjsFyht7w%3D
R Guha JR Serra PC Jurs (2004) J Mol Graph Model 23 1 Occurrence Handle10.1016/j.jmgm.2004.03.003 Occurrence Handle1:CAS:528:DC%2BD2cXntV2msrY%3D
P Krogsgaard-Larsen T Liljefors U Madsen (2002) Textbook of Drug Design and Discovery Taylor & Francis London
V Consonni R Todeschini M Pavan P Gramatica (2002) J Chem Inf Comput Sci 42 693 Occurrence Handle1:CAS:528:DC%2BD38XivFCgtrc%3D
M Karthikeyan RC Glen A Bender (2005) J Chem Inf Model 45 581 Occurrence Handle10.1021/ci0500132 Occurrence Handle1:CAS:528:DC%2BD2MXjslKls7g%3D
AA Melnikov VA Palyulin NS Zefirov (2007) J Chem Inf Model 47 2077 Occurrence Handle10.1021/ci700156f Occurrence Handle1:CAS:528:DC%2BD2sXht1Wmt7vN
S Ajmani SC Rogers MH Barley DJ Livingstone (2006) J Chem Inf Model 46 2043 Occurrence Handle10.1021/ci050559o Occurrence Handle1:CAS:528:DC%2BD28XmvFWiu7k%3D
AR Katritzky IB Stoyanova-Slavova DA Dobchev (2007) J Mol Graph Model 26 529 Occurrence Handle10.1016/j.jmgm.2007.03.006 Occurrence Handle1:CAS:528:DC%2BD2sXpvFKhu70%3D
M Shamsipur A Siroueinejad B Hemmateenejad A Abbaspour H Sharghi K Alizadeh S Arshadi (2007) J Electranal Chem 600 345 Occurrence Handle10.1016/j.jelechem.2006.09.006 Occurrence Handle1:CAS:528:DC%2BD2sXpslGntA%3D%3D
R Todeschini V Consonni (2000) Handbook of Molecular Descriptors Wiley-VCH Weinheim, Germany
JM Sutter JH Kalivas PM Lang (1992) J Chemometr 6 217 Occurrence Handle10.1002/cem.1180060406 Occurrence Handle1:CAS:528:DyaK38XmsFSks7o%3D
R Vendrame RS Braga Y Takahata DS Galvao (1999) J Chem Inf Comput Sci 39 1094 Occurrence Handle1:CAS:528:DyaK1MXms1eju74%3D
ER Malinowski (2002) Factor Analysis in Chemistry Wiley-Interscience New York
AR Katritzky I Tulp DC Fara A Lauria U Maran WE Acree (2005) J Chem Inf Model 45 913 Occurrence Handle10.1021/ci0496189 Occurrence Handle1:CAS:528:DC%2BD2MXkvVSjtLs%3D
B Hemmateenejad M Akhond R Miri M Shamsipur (2003) J Chem Inf Comput Sci 43 1328 Occurrence Handle1:CAS:528:DC%2BD3sXkvVaiur4%3D
B Hemmateenejad M Shamsipur (2004) Internet Electron, J Mol Des 3 316 Occurrence Handle1:CAS:528:DC%2BD2cXms1Okt74%3D
M Jalali-Heravi A Kyani (2004) J Chem Inf Comput Sci 44 1328 Occurrence Handle1:CAS:528:DC%2BD2cXksVKhsb0%3D
B Hemmateenejad MA Safarpour R Miri N Nesari (2005) J Chem Inf Model 45 190 Occurrence Handle10.1021/ci049766z Occurrence Handle1:CAS:528:DC%2BD2cXhtVGhs7fN
B Hemmateenejad M Safarpour R Miri F Taghavi (2004) J Comput Chem 25 1495 Occurrence Handle10.1002/jcc.20066 Occurrence Handle1:CAS:528:DC%2BD2cXmtFKgt7o%3D
U Depczynski VJ Frost K Molt (2000) Anal Chim Acta 420 217 Occurrence Handle10.1016/S0003-2670(00)00893-X Occurrence Handle1:CAS:528:DC%2BD3cXmtVamsrw%3D
B Hemmateenejad (2005) Chemom Intell Lab Syst 75 231 Occurrence Handle10.1016/j.chemolab.2004.09.005 Occurrence Handle1:CAS:528:DC%2BD2MXmtFOrsA%3D%3D
DE Goldberg (2000) Genetic Algorithm in Search, Optimization and Machine Learning Addison-Wesley-Longman Reading, MA, USA
SJ Cho MA Hermsmeier (2002) J Chem Inf Comput Sci 42 927 Occurrence Handle1:CAS:528:DC%2BD38XjvVaju7o%3D
F Despagne DL Massart (1998) Analyst 123 157 Occurrence Handle10.1039/a805562i
J Zupan J Gasteiger (1999) Neural Networks in Chemistry and Drug Design Wiley-VCH Germany
J Meiler R Meusinger M Will (2000) J Chem Inf Comput Sci 40 1169 Occurrence Handle1:CAS:528:DC%2BD3cXlvFSru7s%3D
A Habibi-Yangjeh M Nooshyar (2005) Phys Chem Liq 43 239 Occurrence Handle10.1080/00319100500061233 Occurrence Handle1:CAS:528:DC%2BD2MXkslOlsbY%3D
A Habibi-Yangjeh M Nooshyar (2005) Bull Korean Chem Soc 26 139 Occurrence Handle1:CAS:528:DC%2BD2MXhsVCksr8%3D
A Habibi-Yangjeh M Danandeh-Jenagharad M Nooshyar (2005) Bull Korean Chem Soc 26 2007 Occurrence Handle1:CAS:528:DC%2BD28XosFSmsQ%3D%3D
A Habibi-Yangjeh (2007) Phys Chem Liq 45 471 Occurrence Handle10.1080/00319100601089679 Occurrence Handle1:CAS:528:DC%2BD2sXotV2ksLk%3D
R Tabaraki T Khayamian AA Ensafi (2006) J Mol Graph Model 25 46 Occurrence Handle10.1016/j.jmgm.2005.10.012 Occurrence Handle1:CAS:528:DC%2BD28XoslOgs7k%3D
A Habibi-Yangjeh M Danandeh-Jenagharad (2007) Indian J Chem 46B 478 Occurrence Handle1:CAS:528:DC%2BD2sXjvVSmtrs%3D
A Habibi-Yangjeh M Esmailian (2007) Bull Korean Chem Soc 28 1477 Occurrence Handle1:CAS:528:DC%2BD2sXhsValu7rF
A Habibi-Yangjeh E Pourbasheer M Danandeh-Jenagharad (2008) Bull Korean Chem Soc 29 833 Occurrence Handle1:CAS:528:DC%2BD1cXmtlaqtrs%3D Occurrence Handle10.5012/bkcs.2008.29.4.833
J Jover R Bosque J Sales (2007) QSAR Comb Sci 26 385 Occurrence Handle10.1002/qsar.200610088 Occurrence Handle1:CAS:528:DC%2BD2sXjs1Kjur0%3D
AA Ivanova II Baskin VA Palyulin ANS Zefirov (2007) Doklady Chem 413 90 Occurrence Handle10.1134/S0012500807040040 Occurrence Handle1:CAS:528:DC%2BD2sXksFelsrs%3D
F Luan W Ma H Zhang X Zhang M Liu Z Hu B Fan (2005) Pharmaceut Res 22 1454 Occurrence Handle10.1007/s11095-005-6246-8 Occurrence Handle1:CAS:528:DC%2BD2MXptVWlurc%3D
A Habibi-Yangjeh M Danandeh-Jenagharad M Nooshyar (2006) J Mol Model 12 338 Occurrence Handle10.1007/s00894-005-0050-6 Occurrence Handle1:CAS:528:DC%2BD28XktVGjs78%3D
Dean JA (1999) Lange’s Handbook of Chemistry, 15th edn. McGraw-Hill Inc
HyperChem Release 7, HyperCube Inc., http://www.hyper.com.
Todeschini R, Milano Chemometrics and QSPR Group, http://www.disat.unimib.it/vhm.
Matlab 6.5. Mathworks, 1984–2002
SPSS for Windows, Statistical Package for IBM PC, SPSS Inc., http://www.spss.com
SJ Cho MA Hermsmeier (2002) J Chem Inf Comput Sci 42 927 Occurrence Handle1:CAS:528:DC%2BD38XjvVaju7o%3D
K Baumann H Albert M Von Korff (2002) J Chemometr 16 339 Occurrence Handle10.1002/cem.730 Occurrence Handle1:CAS:528:DC%2BD38XlslOlsb8%3D
Q Lu G Shen R Yu (2002) J Comput Chem 23 1357 Occurrence Handle10.1002/jcc.10149 Occurrence Handle1:CAS:528:DC%2BD38XnslSnsr4%3D
S Ahmad MM Gromiha (2003) J Comput Chem 24 1313 Occurrence Handle10.1002/jcc.10298 Occurrence Handle1:CAS:528:DC%2BD3sXlsFGgs74%3D
O Deeb B Hemmateenejad A Jaber R Garduno-Juarez R Miri (2007) Chemosphere 67 2122 Occurrence Handle10.1016/j.chemosphere.2006.12.098 Occurrence Handle1:CAS:528:DC%2BD2sXjsVGjtLw%3D
The Mathworks Inc (2002) Genetic Algorithm and Direct Search Toolbox User’s Guide, Massachusetts
The Mathworks Inc (2002) Neural Network Toolbox User’s Guide, Massachusetts
Author information
Authors and Affiliations
Corresponding author
Additional information
Correspondence: Eslam Pourbasheer, Department of Chemistry, Faculty of Science, University of Mohaghegh Ardabili, Ardabil, Iran.
Rights and permissions
About this article
Cite this article
Habibi-Yangjeh, A., Pourbasheer, E. & Danandeh-Jenagharad, M. Prediction of basicity constants of various pyridines in aqueous solution using a principal component-genetic algorithm-artificial neural network. Monatsh Chem 139, 1423–1431 (2008). https://doi.org/10.1007/s00706-008-0951-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00706-008-0951-z