NTPC: N-fold Templated Piped Correction

Wu, Dekai; Ngai, Grace; Carpuat, Marine

doi:10.1007/978-3-540-30211-7_50

Dekai Wu²²,
Grace Ngai²³ &
Marine Carpuat²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3248))

Included in the following conference series:

International Conference on Natural Language Processing

1579 Accesses
1 Citations

Abstract

We describe a broadly-applicable conservative error correcting model, N-fold Templated Piped Correction or NTPC (“nitpick”), that consistently improves the accuracy of existing high-accuracy base models. Under circumstances where most obvious approaches actually reduce accuracy more than they improve it, NTPC nevertheless comes with little risk of accidentally degrading performance. NTPC is particularly well suited for natural language applications involving high-dimensional feature spaces, such as bracketing and disambiguation tasks, since its easily customizable template-driven learner allows efficient search over the kind of complex feature combinations that have typically eluded the base models. We show empirically that NTPC yields small but consistent accuracy gains on top of even high-performing models like boosting. We also give evidence that the various extreme design parameters in NTPC are indeed necessary for the intended operating range, even though they diverge from usual practice.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Brill, E.: Transformation-based error-driven learning and natural language processing: A case study in part of speech tagging. Computational Linguistics 21(4), 543–565 (1995)
Google Scholar
Carreras, X., Màrques, L., Padró, L.: Named entity extraction using AdaBoost. In: Roth, D., van den Bosch, A. (eds.) Proceedings of CoNLL 2002, Taipei, Taiwan, pp. 167–170 (2002)
Google Scholar
Escudero, G., Marquez, L., Rigau, G.: Boosting applied to word sense disambiguation. In: European Conference on Machine Learning, pp. 129–141 (2000)
Google Scholar
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences 55(1), 119–139 (1997)
Article MATH MathSciNet Google Scholar
Rivest, R.L.: Learning decision lists. Machine Learning 2(3), 229–246 (1987)
Google Scholar
Schapire, R.E., Singer, Y.: Boostexter: A boosting-based system for text categorization. Machine Learning 2(3), 135–168 (2000)
Article Google Scholar
Sang, E.T.K., Meulder, F.: Introduction to the CoNLL 2003 shared task: Language-independent named entity recognition. In: Daelemans, W., Osborne, M. (eds.) Proceedings of CoNLL 2003, Edmonton, Canada (2003)
Google Scholar
Sang, E.T.K.: Introduction to the CoNLL 2002 shared task: Languageindependent named entity recognition. In: Roth, D., van den Bosch, A. (eds.) Proceedings of CoNLL 2002, Taipei, Taiwan, pp. 155–158 (2002)
Google Scholar
Tsukamoto, K., Mitsuishi, Y., Sassano, M.: Learning with multiple stacking for named entity recognition. In: Roth, D., van den Bosch, A. (eds.) Proceedings of CoNLL 2002, Taipei, Taiwan, pp. 191–194 (2002)
Google Scholar
Wu, D., Ngai, G., Carpuat, M., Larsen, J., Yang, Y.: Boosting for named entity recognition. In: Roth, D., van den Bosch, A. (eds.) Proceedings of CoNLL 2002, Taipei, Taiwan, pp. 195–198 (2002)
Google Scholar
Wu, D., Ngai, G., Carpuat, M.: A stacked, voted, stacked model for named entity recognition. In: Daelemans, W., Osborne, M. (eds.) Proceedings of CoNLL 2003, Edmonton, Canada, pp. 200–203 (2003)
Google Scholar
Wu, D., Ngai, G., Carpuat, M.: Raising the bar: Stacked conservative error correction beyond boosting. In: Fourth International Conference on Language Resources and Evaluation (LREC 2004), Lisbon (May 2004)
Google Scholar
Wu, D., Ngai, G., Carpuat, M.: Why nitpicking works: Evidence for Occam’s Razor in error correctors. In: 20th International Conference on Computational Linguistics (COLING 2004), Geneva (August 2004)
Google Scholar

Download references

Author information

Authors and Affiliations

HKUST, Human Language Technology Center, Dept. of Computer Science, University of Science and Technology, Clear Water Bay, Hong Kong
Dekai Wu & Marine Carpuat
Dept. of Computing, Hong Kong Polytechnic University, Kowloon, Hong Kong
Grace Ngai

Authors

Dekai Wu
View author publications
You can also search for this author in PubMed Google Scholar
Grace Ngai
View author publications
You can also search for this author in PubMed Google Scholar
Marine Carpuat
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Behavior Design Corporation, IV Science-Based Industrial Park Hsinchu, 2F, No.5, Industry E. Rd, Taiwan
Keh-Yih Su
University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, JST CREST, Honcho 4-1-8, Kawaguchi-shi,, 332-0012, Saitama,
Jun’ichi Tsujii
Pohang University of Science and Technology (POSTECH), AITrc, Republic of Korea
Jong-Hyeok Lee
Language Information Sciences Research Centre, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
Oi Yee Kwong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wu, D., Ngai, G., Carpuat, M. (2005). NTPC: N-fold Templated Piped Correction. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. IJCNLP 2004. Lecture Notes in Computer Science(), vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_50

Download citation

DOI: https://doi.org/10.1007/978-3-540-30211-7_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24475-2
Online ISBN: 978-3-540-30211-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics