Skip to main content
Log in

TDC: Typed Dependencies-Based Chunking Model

  • Research Article - Computer Engineering and Computer Science
  • Published:
Arabian Journal for Science and Engineering Aims and scope Submit manuscript

Abstract

Chunking is considered as one of the very important problems in Natural Language Processing. The chunking models developed so far usually are unable to extract the relationships among the chunks. In this paper, we present a typed dependency-based chunking model (TDC), which is based on stanford typed dependencies, which attains the highest f-score. Besides attaining highest f-score, unique feature of the proposed chunking model is the extraction of semantic relationships among the chunks. Hence, TDC can easily be utilized for the tasks which require the semantics of the text. TDC is evaluated on the training and test sets provided for the CoNLL-2000 chunking shared task (Tjong et al., in: Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational Natural Language Learning, Association for Computational Linguistics, 2000). The results show that TDC achieves highest f-score than the top scoring model of CoNLL-2000 (shared task), and the models developed after the shared task.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Tjong Kim Sang, E.F.; Buchholz, S.: Introduction to the CoNLL-2000 shared task: chunking. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning

  2. Punyakanok, V.; Roth, D.: The Use of Classifiers in Sequential Inference. arXiv preprint arXiv:cs/0111003 (2001)

  3. Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002)

    Article  Google Scholar 

  4. Surdeanu, M.; Harabagiu, S.; Williams, J.; Aarseth, P.: Using predicate-argument structures for information extraction. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, vol. 1, pp. 8–15. Association for Computational Linguistics (2003)

  5. Echizen-ya, H.; Araki, K.: Automatic evaluation method for machine translation using noun-phrase chunking. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 108–117. Association for Computational Linguistics (2010)

  6. Shen, D.; Lapata, M.; Using semantic roles to improve question answering. In: EMNLP-CoNLL, pp. 12–21 (2007)

  7. De Marneffe, M.-C.; Manning, C.D.: Stanford Typed Dependencies Manual. http://nlp.stanford.edu/software/dependenciesmanual.pdf (2008)

  8. Abney, S.P.: Parsing by Chunks. Springer, Berlin (1992)

    Google Scholar 

  9. Vilain, M.; Day, D.: Phrase parsing with rule sequence processors: an application to the shared conll task. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 160–162. Association for Computational Linguistics (2000)

  10. Johansson, C.: A context sensitive maximum likelihood approach to chunking. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 136–138. Association for Computational Linguistics (2000)

  11. Déjean, H.: Learning syntactic structures with xml. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 133–135. Association for Computational Linguistics (2000)

  12. Van Halteren, H.: Chunking with WPDV models. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 154–156. Association for Computational Linguistics (2000)

  13. Tjong Kim Sang, E.F.: Text chunking by system combination. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 151–153. Association for Computational Linguistics (2000)

  14. Kudoh, T.; Matsumoto, Y.: Use of support vector learning for chunk identification. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 142–144. Association for Computational Linguistics (2000)

  15. Zhang, T.; Damerau, F.; Johnson, D.: Text chunking based on a generalization of winnow. J. Mach. Learn. Res. 2, 615–637 (2002)

    MATH  Google Scholar 

  16. Kivinen, J.; Warmuth, M.K.; Auer, P.: The perceptron algorithm versus winnow: linear versus logarithmic mistake bounds when few input variables are relevant. Artif. Intell. 97(1), 325–343 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  17. Ando, R.K.; Zhang, T.: A high-performance semi-supervised learning method for text chunking. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 1–9. Association for Computational Linguistics (2005)

  18. Wu, Y.-C.; Lee, Y.-S.; Yang, J.-C.: Robust and efficient multiclass svm models for phrase pattern recognition. Pattern Recognit. 41(9), 2874–2889 (2008)

    Article  MATH  Google Scholar 

  19. Zhou, J.; Qu, W.; Zhang, F.: Exploiting chunk-level features to improve phrase chunking. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 557–567. Association for Computational Linguistics (2012)

  20. Ramshaw, L.A.; Marcus, M.P.: Text chunking using transformation-based learning. In: Natural Language Processing Using Very Large Corpora, pp. 157–176. Springer, Netherlands (1999)

  21. Cer, D.M.; De Marneffe, M.-C.; Jurafsky, D.; Manning, C.D.: Parsing to stanford dependencies: trade-offs between speed and accuracy. In: LREC (2010)

  22. Marcus, M.; Kim, G.; Marcinkiewicz, M.A.; MacIntyre, R.; Bies, A.; Ferguson, M.; Katz, K.; Schasberger, B.: The penn treebank: annotating predicate argument structure. In: Proceedings of the Workshop on Human Language Technology, pp. 114–119. Association for Computational Linguistics (1994)

  23. Punyakanok, V.; Roth, D.; Yih, W.: The importance of syntactic parsing and inference in semantic role labeling. Comput. Linguist. 34(2), 257–287 (2008)

    Article  Google Scholar 

  24. Chunking: http://www.cnts.ua.ac.be/conll2000/chunking/ (2011)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sarwat Nizamani.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nizamani, S., Memon, N., Nizamani, S. et al. TDC: Typed Dependencies-Based Chunking Model. Arab J Sci Eng 42, 3585–3595 (2017). https://doi.org/10.1007/s13369-017-2587-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13369-017-2587-y

Keywords

Navigation