Skip to main content

Clustering of sequences using a minimum grammar complexity criterion

  • Session: Natural Language and Pattern Recognition
  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1147))

Abstract

The problem of cluster analysis of syntactic patterns is addressed. A new clustering method is proposed based on a minimum grammar complexity criterion. Grammars, describing the structure of patterns, are used as kernel functions in a hierarchical agglomerative clustering algorithm, taking a ratio of decrease in grammar complexity as criterion for cluster association. Cluster analysis of a set of contour images is performed to illustrate the proposed approach.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. K. S. Fu. Syntactic pattern recognition. In Handbook of Pattern Recognition and Image Processing, chapter 4, pages 85–117. Academic Press, 1986.

    Google Scholar 

  2. Michael G. Thomason. Syntactic pattern recognition: Stochastic languages. In Handbook of Pattern Recognition and Image Processing, chapter 5, pages 119–142. Academic Press, 1986.

    Google Scholar 

  3. R. N. Moll, M. A. Arbib and A. J. Kfoury, An Introduction to Formal Language Theory. Springer Verlag, 1988.

    Google Scholar 

  4. A. V. Aho and J. D. Ullman. The Theory of Parsing, Translation and Compiling, volume I. Englewood Cliffs, New Jersey, 1972.

    Google Scholar 

  5. A. L. N. Fred and J. M. N. Leitão. Improving sentence recognition in stochastic context-free grammars. In Proc, of the Int'l Conference on Acoustics, Speech and Signal Processing, ICASSP-93, pages 9–12, August 1994.

    Google Scholar 

  6. A. L. N. Fred and J. M. N. Leitão. Parsing in attributed context-free grammars: application to automatic sleep analysis. In Proc, of the Int'l Conference on Acoustics, Speech and Signal Processing, ICASSP-96, May 1996.

    Google Scholar 

  7. H. Bunke, String matching for structural pattern recognition, in Syntactic and Structural Pattern Recognition, Theory and Applications, H. Bunke and A. Sanfeliu (Eds.), World Scientific, pp 119–144, 1990.

    Google Scholar 

  8. H. Bunke, Recent advance in string matching, in Advances in Structural and Syntactic Pattern Recognition, H. Bunke (Ed.), World Scientific, pp 107–116, 1992.

    Google Scholar 

  9. R. L. Kashiap and B. J. Oommen. String correction using probabilistic models. Pattern Recog. Letters, pp 147–154, 1984

    Google Scholar 

  10. A. L. N. Fred and J. M. N. Leitão. A minimum code length technique for clustering of syntactic patterns. In Proc, of the 13th IAPR Int'l Conference on Pattern Recognition, Vienna, August 1996.

    Google Scholar 

  11. R. Duda and P. E. Hart, Pattern Classification and Scene Analysis, John Wiley and Sons, 1973.

    Google Scholar 

  12. A. K. Jain and R. C. Dubes. Algorithms for Clustering Data. Prentice Hall, Englewood Cliffs, 1988.

    Google Scholar 

  13. S. Y. Lu and K. S. Fu. A Sentence-to-Sentence Clustering Procedure for Pattern Analysis. IEEE Trans. Systems Man Cybernetics, SMC-8, 5, pp 381–389, 1978.

    Google Scholar 

  14. K. S. and S. Y. Lu, A clustering procedure for syntactic patterns, IEEE Trans. Systems, Man Cybernetics, SMC-7, No. 7, 537–541, 1977.

    Google Scholar 

  15. K. S. Fu and T. L. Booth. Grammatical inference: introduction and survey — part 1 and II. IEEE Trans. Pattern Anal. and Machine Intell., PAMI-8, pp 343–359, 1986.

    Google Scholar 

  16. L. Miclet, Grammatical inference. In Syntactic and Structural Pattern Recognition — Theory and Applications, Scientific Publishing, pp 237–290, 1990.

    Google Scholar 

  17. R. Gonzalez and R. E. Woods, Digital Image Processing, Addison-Wesley, 1992.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Laurent Miclet Colin de la Higuera

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fred, A.L.N. (1996). Clustering of sequences using a minimum grammar complexity criterion. In: Miclet, L., de la Higuera, C. (eds) Grammatical Interference: Learning Syntax from Sentences. ICGI 1996. Lecture Notes in Computer Science, vol 1147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0033346

Download citation

  • DOI: https://doi.org/10.1007/BFb0033346

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-61778-5

  • Online ISBN: 978-3-540-70678-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics