
Spelling Correction for Kazakh

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2014)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8404)

Abstract

Because Kazakh is an agglutinative language, both recognizing correct words and generating candidate corrections for misspelled words are difficult. In this paper we describe a spelling correction method for Kazakh that combines morphological analysis with a noisy channel-based model. Our method outperforms both open-source and commercial analogues in overall accuracy. We also present a comparative analysis of spelling correction tools and point out some problems of spelling correction for agglutinative languages in general and for Kazakh in particular.
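The abstract does not spell out the model, so the following is only a generic sketch of noisy-channel candidate ranking, not the authors' method. It assumes a tiny hypothetical lexicon with invented counts, generates candidates within one Damerau-Levenshtein edit, and uses a uniform error probability (the `error_prob` parameter is an assumption). The morphological analysis step is omitted entirely.

```python
import math

# Hypothetical toy lexicon: word -> corpus count (invented numbers, not from the paper).
LEXICON = {"бала": 40, "қала": 30, "дала": 10, "балалар": 20}
# Characters used to build candidates; derived from the toy lexicon for simplicity.
ALPHABET = sorted({ch for word in LEXICON for ch in word})


def edits1(word):
    """Return all strings one Damerau-Levenshtein edit away from `word`."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [a + b[1:] for a, b in splits if b]
    transposes = [a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1]
    replaces = [a + c + b[1:] for a, b in splits if b for c in ALPHABET]
    inserts = [a + c + b for a, b in splits for c in ALPHABET]
    return set(deletes + transposes + replaces + inserts)


def rank_candidates(typo, error_prob=0.05):
    """Rank lexicon words by log P(candidate) + log P(typo | candidate).

    The channel model is uniform (every single edit costs `error_prob`),
    so the ranking here is driven by the unigram prior alone.
    """
    total = sum(LEXICON.values())
    scored = []
    for candidate in edits1(typo):
        if candidate in LEXICON:
            prior = math.log(LEXICON[candidate] / total)
            channel = math.log(error_prob)
            scored.append((candidate, prior + channel))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for word, score in rank_candidates("пала"):
        print(word, round(score, 3))
```

A real system for an agglutinative language could not enumerate every surface form in a lexicon like this, which is why a morphological analyzer would be needed to recognize and generate valid word forms.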

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Makazhanov, A., Makhambetov, O., Sabyrgaliyev, I., Yessenbayev, Z. (2014). Spelling Correction for Kazakh. In: Gelbukh, A. (ed.) Computational Linguistics and Intelligent Text Processing. CICLing 2014. Lecture Notes in Computer Science, vol 8404. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54903-8_44

  • DOI: https://doi.org/10.1007/978-3-642-54903-8_44

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-54902-1

  • Online ISBN: 978-3-642-54903-8

  • eBook Packages: Computer Science, Computer Science (R0)
