TDC: Typed Dependencies-Based Chunking Model

Nizamani, Sarwat; Memon, Nasrullah; Nizamani, Saad; Nizamani, Sehrish

doi:10.1007/s13369-017-2587-y

Research Article - Computer Engineering and Computer Science
Published: 01 June 2017

Volume 42, pages 3585–3595, (2017)

Arabian Journal for Science and Engineering

Sarwat Nizamani^1,2, Nasrullah Memon^1,3, Saad Nizamani^2 & Sehrish Nizamani^2

Abstract

Chunking is considered one of the most important problems in Natural Language Processing. The chunking models developed so far are usually unable to extract the relationships among the chunks. In this paper, we present a typed dependencies-based chunking model (TDC), which is built on Stanford typed dependencies. Besides attaining the highest F-score among the compared models, the distinguishing feature of the proposed model is that it extracts semantic relationships among the chunks. Hence, TDC can readily be used for tasks that require the semantics of the text. TDC is evaluated on the training and test sets provided for the CoNLL-2000 chunking shared task (Tjong Kim Sang and Buchholz, in: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, Association for Computational Linguistics, 2000). The results show that TDC achieves a higher F-score than the top-scoring model of the CoNLL-2000 shared task and than the models developed after the shared task.
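
The F-score used to rank systems in the CoNLL-2000 shared task is computed over whole chunks: a predicted chunk counts as correct only when its type and both boundaries match the gold annotation. The following is a minimal sketch of that measure, assuming IOB2-style tags such as B-NP, I-NP and O. The function names extract_chunks and chunk_f_score are hypothetical, and this is neither the authors' evaluation code nor the official conlleval script.

```python
from typing import List, Set, Tuple

# A chunk is (chunk type, start index, end index exclusive).
Chunk = Tuple[str, int, int]


def extract_chunks(tags: List[str]) -> Set[Chunk]:
    """Collect (type, start, end) spans from a sequence of IOB2 tags."""
    chunks: Set[Chunk] = set()
    start, ctype = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing chunk
        prefix, _, label = tag.partition("-")
        # A chunk continues only on an I- tag of the same type.
        continues = prefix == "I" and ctype is not None and label == ctype
        if not continues:
            if ctype is not None:
                chunks.add((ctype, start, i))
                start, ctype = None, None
            if prefix in ("B", "I"):  # assumption: a stray I- tag opens a chunk
                start, ctype = i, label
    return chunks


def chunk_f_score(gold: List[str], predicted: List[str]) -> Tuple[float, float, float]:
    """Return chunk-level (precision, recall, F1) for one tag sequence."""
    gold_chunks = extract_chunks(gold)
    pred_chunks = extract_chunks(predicted)
    correct = len(gold_chunks & pred_chunks)
    precision = correct / len(pred_chunks) if pred_chunks else 0.0
    recall = correct / len(gold_chunks) if gold_chunks else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    gold = ["B-NP", "I-NP", "B-VP", "B-NP", "I-NP"]
    pred = ["B-NP", "I-NP", "B-VP", "B-NP", "B-NP"]
    print(chunk_f_score(gold, pred))
```

In the example, the prediction splits the final noun phrase in two, so only two of its four chunks match the three gold chunks. The sketch scores a single tag sequence; scoring a whole test set would pool the chunk counts over all sentences before computing the ratios.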

References

Tjong Kim Sang, E.F.; Buchholz, S.: Introduction to the CoNLL-2000 shared task: chunking. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning. Association for Computational Linguistics (2000)
Punyakanok, V.; Roth, D.: The Use of Classifiers in Sequential Inference. arXiv preprint arXiv:cs/0111003 (2001)
Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002)
Surdeanu, M.; Harabagiu, S.; Williams, J.; Aarseth, P.: Using predicate-argument structures for information extraction. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, vol. 1, pp. 8–15. Association for Computational Linguistics (2003)
Echizen-ya, H.; Araki, K.: Automatic evaluation method for machine translation using noun-phrase chunking. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 108–117. Association for Computational Linguistics (2010)
Shen, D.; Lapata, M.: Using semantic roles to improve question answering. In: EMNLP-CoNLL, pp. 12–21 (2007)
De Marneffe, M.-C.; Manning, C.D.: Stanford Typed Dependencies Manual. http://nlp.stanford.edu/software/dependenciesmanual.pdf (2008)
Abney, S.P.: Parsing by Chunks. Springer, Berlin (1992)
Vilain, M.; Day, D.: Phrase parsing with rule sequence processors: an application to the shared CoNLL task. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 160–162. Association for Computational Linguistics (2000)
Johansson, C.: A context sensitive maximum likelihood approach to chunking. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 136–138. Association for Computational Linguistics (2000)
Déjean, H.: Learning syntactic structures with XML. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 133–135. Association for Computational Linguistics (2000)
Van Halteren, H.: Chunking with WPDV models. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 154–156. Association for Computational Linguistics (2000)
Tjong Kim Sang, E.F.: Text chunking by system combination. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 151–153. Association for Computational Linguistics (2000)
Kudoh, T.; Matsumoto, Y.: Use of support vector learning for chunk identification. In: Proceedings of the 2nd Workshop on Learning Language in Logic and the 4th Conference on Computational Natural Language Learning, vol. 7, pp. 142–144. Association for Computational Linguistics (2000)
Zhang, T.; Damerau, F.; Johnson, D.: Text chunking based on a generalization of winnow. J. Mach. Learn. Res. 2, 615–637 (2002)
Kivinen, J.; Warmuth, M.K.; Auer, P.: The perceptron algorithm versus winnow: linear versus logarithmic mistake bounds when few input variables are relevant. Artif. Intell. 97(1), 325–343 (1997)
Ando, R.K.; Zhang, T.: A high-performance semi-supervised learning method for text chunking. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 1–9. Association for Computational Linguistics (2005)
Wu, Y.-C.; Lee, Y.-S.; Yang, J.-C.: Robust and efficient multiclass SVM models for phrase pattern recognition. Pattern Recognit. 41(9), 2874–2889 (2008)
Zhou, J.; Qu, W.; Zhang, F.: Exploiting chunk-level features to improve phrase chunking. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 557–567. Association for Computational Linguistics (2012)
Ramshaw, L.A.; Marcus, M.P.: Text chunking using transformation-based learning. In: Natural Language Processing Using Very Large Corpora, pp. 157–176. Springer, Netherlands (1999)
Cer, D.M.; De Marneffe, M.-C.; Jurafsky, D.; Manning, C.D.: Parsing to Stanford dependencies: trade-offs between speed and accuracy. In: LREC (2010)
Marcus, M.; Kim, G.; Marcinkiewicz, M.A.; MacIntyre, R.; Bies, A.; Ferguson, M.; Katz, K.; Schasberger, B.: The Penn Treebank: annotating predicate argument structure. In: Proceedings of the Workshop on Human Language Technology, pp. 114–119. Association for Computational Linguistics (1994)
Punyakanok, V.; Roth, D.; Yih, W.: The importance of syntactic parsing and inference in semantic role labeling. Comput. Linguist. 34(2), 257–287 (2008)
Chunking: http://www.cnts.ua.ac.be/conll2000/chunking/ (2011)

Author information

Authors and Affiliations

The Maersk McKinney Moller Institute, University of Southern Denmark, Odense, Denmark
Sarwat Nizamani & Nasrullah Memon
University of Sindh, Jamshoro, Pakistan
Sarwat Nizamani, Saad Nizamani & Sehrish Nizamani
Mehran University of Engineering and Technology, Jamshoro, Pakistan
Nasrullah Memon

Corresponding author

Correspondence to Sarwat Nizamani.

About this article

Cite this article

Nizamani, S., Memon, N., Nizamani, S. et al. TDC: Typed Dependencies-Based Chunking Model. Arab J Sci Eng 42, 3585–3595 (2017). https://doi.org/10.1007/s13369-017-2587-y

Received: 05 May 2014
Accepted: 08 May 2017
Published: 01 June 2017
Issue Date: August 2017
DOI: https://doi.org/10.1007/s13369-017-2587-y
