Feature construction with inductive logic programming: A study of quantitative predictions of biological activity by structural attributes

Srinivasan, Ashwin; King, Ross.D.

doi:10.1007/3-540-63494-0_50

Ashwin Srinivasan¹ &
Ross.D. King²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1314))

Included in the following conference series:

International Conference on Inductive Logic Programming

127 Accesses
13 Citations

Abstract

Recently, computer programs developed within the field of Inductive Logic Programming (ILP) have received some attention for their ability to construct restricted first-order logic solutions using problemspecific background knowledge. Prominent applications of such programs have been concerned with determining “structure-activity” relationships in the areas of molecular biology and chemistry. Typically the task here is to predict the “activity” of a compound, like toxicity, from its chemical structure. Research in the area shows that: (a) ILP programs have been restricted to qualitative predictions of activity (“high”, “low” etc.); (b) When appropriate attributes are available, ILP programs have not been able to better the performance of standard quantitative analysis techniques like linear regression. However ILP programs perform creditably when such attributes are unavailable; and (c) When both are applicable, ILP programs are usually slower than their propositional counterparts. This paper examines the use of ILP programs, not for obtaining theories complete for the sample, but as a method of “discovering” new attributes. These could then be used by methods like linear regression, thus allowing for quantitative predictions and the ability to use structural information as background knowledge. Using structure-activity tasks as a test-bed the utility of ILP programs in constructing new features was evaluated by examining the prediction of chemical activity using linear regression, with and without the aid of ILP learnt logical attributes. In three out of the five datasets examined the addition of ILP attributes produced statistically better results (P 10.01). In addition six important structural features that have escaped the attention of the expert chemists were discovered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Inductive Logic Programming

Identification

The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules

Article Open access 01 May 2020

References

T.A. Andrea and H. Kalayeh. Applications of Neural Networks in Quantitative Structure-Activity Relationship of Dihydrofolate Reductase. Journal of Medicinal Chemistry, 34:2824–2836, 1991.
Google Scholar
I. Bratko and M.Grobelnik. Inductive learning applied to program construction and verification. In Third International Workshop on Inductive Logic Programming, pages 279–292, 1993. Available as Technical Report IJS-DP-6707, J. Stefan Inst., Ljubljana, Slovenia.
Google Scholar
W. Cohen and C.D. Page. Polynomial learnability and inductive logic programming: Methods and results. New Generation Computing, 13(3,4):369–409, 1995.
Google Scholar
J.S. Collins. A regression analysis program incorporating heuristic term selection. In E. Dale and D. Michie, editors, Machine Intelligence 2. Oliver and Boyd, 1968.
Google Scholar
A.M. Davis, N.P. Gensmantel, E. Johansson, and D.P. Marriott. The Use of the GRID Program in the 3-D QSAR Analysis of a series of Calcium-Channel Agonists. Journal of Medicinal Chemistry, 37:963–972, 1994.
Google Scholar
A.K. Debnath, R.L Lopez de Compadre, G. Debnath, A.J. Schusterman, and C. Hansch. Structure-Activity Relationship of Mutagenic Aromatic and Heteroaromatic Nitro compounds. Correlation with molecular orbital energies and hydrophobicity. Journal of Medicinal Chemistry, 34(2):786–797, 1991.
Google Scholar
B. Dolsak and S. Muggleton. The application of Inductive Logic Programming to finite element mesh design. In S. Muggleton, editor, Inductive Logic Programming, pages 453–472. Academic Press, London, 1992.
Google Scholar
S. Dzeroski. Numerical Constraints and Learnability in Inductive Logic Programming. University of Ljubljana, (PhD. Thesis), Ljubljana, 1995.
Google Scholar
S. Dzeroski, L. Dehaspe, B. Ruck, and W. Walley. Classification of river water quality data using machine learning. In Proceedings of the Fifth International Conference on the Development and Application of Computer Techniques Environmental Studies, 1994.
Google Scholar
C. Feng. Inducing temporal fault dignostic rules from a qualitative model. In S. Muggleton, editor, Inductive Logic Programming, pages 473–486. Academic Press, London, 1992.
Google Scholar
C. Hansch, R.Li, J.M. Blaney, and R. Langridge. Comparison of the inhibition of Escherichia coli and Lactobacillus casei Dihydrofolate Reductase by 2,4-Diamino-5-(Substituted-benzyl) pyrimidines: Quantitative Structure-Activity Relationships,X-ray Crystallography, and Computer Graphics in StructureActivity Analysis. Journal of Medicinal Chemistry, 25:777–784, 1982.
Google Scholar
A. Karalic. Relational regression: first steps. Technical report ijs-dp-7001, J. Stefan Institute, Ljubljana, Yugoslavia, 1994.
Google Scholar
R.D. King, A. Srinivasan, and M.J.E. Sternberg. Relating chemical activity to structure: an examination of ILP successes. New Gen. Comput., 13(3,4), 1995.
Google Scholar
R.D. King, S.H. Muggleton, A. Srinivasan, and M.J.E. Sternberg. Structureactivity relationships derived by machine learning: The use of atoms and their bond connectivities to predict mutagenicity by inductive logic programming. Proc. of the National Academy of Sciences, 93:438–442, 1996.
Google Scholar
R.D. King, S.H. Muggleton, and M.J.E. Sternberg. Drug design by machine learning: The use of inductive logic programming to model the structure-activity relationships of trimethoprim analogues binding to dihydrofolate reductase. Proc. of the National Academy of Sciences, 89(23):11322–11326, 1992.
Google Scholar
N. Lavrac and S. Dzeroski. ILP: Techniques and Applications. Ellis Horwood, London, 1994.
Google Scholar
R. Michalski, I. Mozetic, J. Hong, and N. Lavrac. The AQ15 inductive learning system: an overview and experiments. In Proceedings of IMAL 1986, Orsay, 1986. Université de Paris-Sud.
Google Scholar
R.S. Michalski. Understanding the nature of learning: issues and research directions. In R. Michalski, J. Carbonnel, and T. Mitchell, editors, Machine Learning: An Artificial Intelligence Approach, volume 2, pages 3–25. Kaufmann, Los Altos, CA, 1986.
Google Scholar
D. Michie, D.J. Spiegelhalter, and C.C. Taylor, editors. Machine Learning, Neural and Statistical classification. Ellis-Horwood, New York, 1994.
Google Scholar
S. Muggleton. Inverse Entailment and Progol. New Gen. Comput., 13:245–286, 1995.
Google Scholar
S. Muggleton, R. King, and M. Sternberg. Predicting protein secondary structure using inductive logic programming. Protein Engineering, 5:647–657, 1992.
Google Scholar
S.H. Muggleton and C. Feng. Efficient induction of logic programs. In Proceedings of the First Conference on Algorithmic Learning Theory, Tokyo, 1990. Ohmsha.
Google Scholar
M.J. Norusis. SPSS: Base System User Guide. Release 6.0. SPSS Inc., 444 N Michigan Ave, Chicago, Illinois 60611, 1994.
Google Scholar
C. Silipo and C. Hansch. Correlation analysis. its Application to the StructureActivity Relationship of Triazines Inhibiting Dihydrofolate Reductase. Journal of Medicinal Chemistry, 19:6849–6861, 1976.
Google Scholar
A. Srinivasan and R.C. Camacho. Experiments in numerical reasoning with inductive logic programming. In D. Michie S. Muggleton and K. Furukawa, editors, Machine Intelligence 15. Oxford University Press, Oxford, 1996. to appear.
Google Scholar
A. Srinivasan, S.H. Muggleton, R.D. King, and M.J.E. Sternberg. Mutagenesis: ILP experiments in a non-determinate biological domain. In S. Wrobel, editor, Proceedings of the Fourth International Inductive Logic Programming Workshop. Gesellschaft fur Mathematik and Datenverarbeitung MBH, 1994. GMD-Studien Nr 237.
Google Scholar
A. Srinivasan, S.H. Muggleton, R.D. King, and M.J.E. Sternberg. Theories for mutagenicity: a study of first-order and feature based induction. Artificial Intelligence, 1995. to appear.
Google Scholar
S. Wold. Cross-validatory estimation of the number of components in factor and principal components models. Technometrics, 20:397–404, 1978.
Google Scholar
J. Zelle and R. Mooney. Learning semantic grammars with constructive inductive logic programming. In Proceedings of the Eleventh National Conference on Artificial Intelligence, pages 817–822. Morgan Kaufmann, 1993.
Google Scholar

Download references

Author information

Authors and Affiliations

Oxford University Computing Laboratory, Wolfson Building, Parks Road, Oxford
Ashwin Srinivasan
Biomolecular Modelling Laboratory, Imperial Cancer Research Fund, Lincoln's Inn Fields, London
Ross.D. King

Authors

Ashwin Srinivasan
View author publications
You can also search for this author in PubMed Google Scholar
Ross.D. King
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Stephen Muggleton

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Srinivasan, A., King, R. (1997). Feature construction with inductive logic programming: A study of quantitative predictions of biological activity by structural attributes. In: Muggleton, S. (eds) Inductive Logic Programming. ILP 1996. Lecture Notes in Computer Science, vol 1314. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63494-0_50

Download citation

DOI: https://doi.org/10.1007/3-540-63494-0_50
Published: 18 August 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63494-2
Online ISBN: 978-3-540-69583-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Feature construction with inductive logic programming: A study of quantitative predictions of biological activity by structural attributes

Abstract

Access this chapter

Preview

Similar content being viewed by others

Inductive Logic Programming

Identification

The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Feature construction with inductive logic programming: A study of quantitative predictions of biological activity by structural attributes

Abstract

Access this chapter

Preview

Similar content being viewed by others

Inductive Logic Programming

Identification

The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation