research-article

Maximum Common Subgraph based locally weighted regression

Authors:
Madeleine Seeland

Technische Universität München, Garching, Germany

Technische Universität München, Garching, Germany
View Profile

,
Fabian Buchwald

Technische Universität München, Garching, Germany

Technische Universität München, Garching, Germany
View Profile

,
Stefan Kramer

Johannes Gutenberg-Universität Mainz, Mainz, Germany

Johannes Gutenberg-Universität Mainz, Mainz, Germany
View Profile

,
Bernhard Pfahringer

The University of Waikato, Hamilton, New Zealand

The University of Waikato, Hamilton, New Zealand
View Profile

SAC '12: Proceedings of the 27th Annual ACM Symposium on Applied ComputingMarch 2012Pages 165–172https://doi.org/10.1145/2245276.2245309

Published:26 March 2012Publication History

SAC '12: Proceedings of the 27th Annual ACM Symposium on Applied Computing

Pages 165–172

ABSTRACT

This paper investigates a simple, yet effective method for regression on graphs, in particular for applications in chem-informatics and for quantitative structure-activity relationships (QSARs). The method combines Locally Weighted Learning (LWL) with Maximum Common Subgraph (MCS) based graph distances. More specifically, we investigate a variant of locally weighted regression on graphs (structures) that uses the maximum common subgraph for determining and weighting the neighborhood of a graph and feature vectors for the actual regression model. We show that this combination, LWL-MCS, outperforms other methods that use the local neighborhood of graphs for regression. The performance of this method on graphs suggests it might be useful for other types of structured data as well.

References

E. Alphonse, T. Girschick, F. Buchwald, and S. Kramer. A numerical refinement operator based on multi-instance learning. In Proceedings of the 20th International Conference on Inductive Logic Programming, pages 14--21. Springer, 2011. Google ScholarDigital Library
C. Atkeson, A. Moore, and S. Schaal. Locally weighted learning. AI Review, 11: 11--73, 1997. Google ScholarDigital Library
S. Bickel and T. Scheffer. Multi-view clustering. In Proceedings of the Fourth IEEE International Conference on Data Mining, ICDM '04, pages 19--26, Washington, DC, USA, 2004. IEEE Computer Society. Google ScholarDigital Library
F. Buchwald, T. Girschick, M. Seeland, and S. Kramer. Using local models to improve (Q)SAR predictivity. Molecular Informatics, 30(2--3): 205--218, 2011.Google Scholar
Y. Cao, T. Jiang, and T. Girke. A maximum common substructure-based algorithm for searching and predicting drug-like compounds. Bioinformatics, 24(13): i366--i374, 2008. Google ScholarDigital Library
W. Cleveland. Robust locally weighted regression and smoothing scatterplots. Journal of the American Statistical Association, 74: 829--836, 1979.Google ScholarCross Ref
D. Conte, P. Foggia, C. Sansone, and M. Vento. Thirty years of graph matching in pattern recognition. International Journal of Pattern Recognition and Artificial Intelligence, pages 265--298, 2004.Google ScholarCross Ref
L. De Raedt. Logical and Relational Learning. Springer, 2008. Google ScholarDigital Library
T. Gärtner. Kernels for Structured Data. PhD thesis, Universität Bonn, 2005.Google Scholar
M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The WEKA data mining software: an update. SIGKDD Explorations, 11(1): 10--18, 2009. Google ScholarDigital Library
M. Kloft, U. Rückert, and P. L. Bartlett. A unifying view of multiple kernel learning. In Proceedings of the 2010 European Conference on Machine Learning and Knowledge Discovery in Databases, pages 66--81, 2010. Google ScholarDigital Library
Y. C. Martin, J. L. Kofron, and L. M. Traphagen. Do structurally similar molecules have similar biological activity? Journal of Medicinal Chemistry, 45(19): 4350--4358, 2002.Google ScholarCross Ref
U. Rückert, T. Girschick, F. Buchwald, and S. Kramer. Adapted transfer of distance measures for quantitative structure-activity relationships. In Proceedings of the 13th International Conference on Discovery Science, DS'10, pages 341--355, 2010. Google ScholarDigital Library
S. Rüping. Globalization of local models with SVMs. In LeGo-08 - From Local Patterns to Global Models, Workshop at ECML/PKDD, 2008.Google Scholar
L. Schietgat, F. Costa, J. Ramon, and L. De Raedt. Effective feature construction by maximum common subgraph sampling. Machine Learning, 83: 137--161, 2011. Google ScholarDigital Library
M. Seeland, S. A. Berger, A. Stamatakis, and S. Kramer. Parallel structural graph clustering. In Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases, ECML PKDD '11, pages 256--272, 2011. Google ScholarDigital Library
K. Tsuda. Support vector classifier with asymmetric kernel functions. In Proceedings of the Seventh European Symposium on Artificial Neural Networks, ESANN'99, pages 183--188, 1999.Google Scholar
W. D. Wallis, P. Shoubridge, M. Kraetz, and D. Ray. Graph distances using graph union. Pattern Recognition Letters, 22: 701--704, 2001. Google ScholarDigital Library

Recommendations

Approximating the maximum common subgraph isomorphism problem with a weighted graph

The maximum common subgraph isomorphism problem is a difficult graph problem, and the problem of finding the maximum common subgraph isomorphism problem is NP-hard. This means there is likely no algorithm that will be able to find the maximal isomorphic ...
Read More
Mean and maximum common subgraph of two graphs

A mean of a pair of graphs, g1 and g2, is formally defined as a graph that minimizes the sum of edit distances to g1 and g2. The edit distance of two graphs g and g' is the minimum cost taken over all sequences of edit operations that transform g into g'...
Read More
Using locally weighted learning to improve SMOreg for regression
PRICAI'06: Proceedings of the 9th Pacific Rim international conference on Artificial intelligence

Shevade et al.[1] are successful in extending some improved ideas to Smola and Scholkopf's SMO algorithm[2] for solving regression problems, simply named SMOreg. In this paper, we use SMOreg in exactly the same way as linear regression(LR) is used in ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SAC '12: Proceedings of the 27th Annual ACM Symposium on Applied Computing
March 2012
2179 pages
ISBN:9781450308571
DOI:10.1145/2245276
Conference Chairs:
Sascha Ossowski
University Rey Juan Carlos, Spain
,
Paola Lecca
The Microsoft Research - University of Trento COSBI, Italy
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 26 March 2012
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
applications
bioinformatics
clustering
graph-based learning methods
lazy learning
regression
Qualifiers
- research-article
Conference

Acceptance Rates
SAC '12 Paper Acceptance Rate270of1,056submissions,26%Overall Acceptance Rate1,650of6,669submissions,25%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 143
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Maximum Common Subgraph based locally weighted regression

SAC '12: Proceedings of the 27th Annual ACM Symposium on Applied Computing

ABSTRACT

References

Cited By

Recommendations

Approximating the maximum common subgraph isomorphism problem with a weighted graph

Mean and maximum common subgraph of two graphs

Using locally weighted learning to improve SMOreg for regression