Abstract
Extracting Lives_In relations between bacteria and their locations involves two steps, namely bacteria/location entity recognition and Lives_In relation classification. Previous work solved this task by pipeline models, which may suffer error propagation and cannot utilize the interactions between these steps. We follow the line of work using joint models, which perform two subtasks simultaneously to obtain better performances. A state-of-the-art neural joint model for relation extraction in the Automatic Content Extraction (ACE) task is adapted to our task. Furthermore, we propose two strategies to improve this model. First, a novel relation is suggested in the second step to detect the errors in the first step, thus this relation can correct some errors in the first step. Second, we replace the original greedy-search decoding with beam-search, and train the model with early-update techniques. Experimental results on a standard dataset for this task show that our adapted model achieves better precisions than other systems. After adding the novel relation, we gain a nearly 2% improvement of F1 for Lives_In relation extraction. When beam-search is used, the F1 is further improved by 6%. These demonstrate that our proposed strategies are effective for this task. However, additional experiments show that the performance improvement in another dataset of bacteria and location extraction is not significant. Therefore, whether our methods are effective for other relation extraction tasks needs to be further investigated.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Bossy, R., Golik, W., Ratkovic, Z., Valsamou, D., Bessières, P., Nédellec, C.: Overview of the gene regulation network and the bacteria biotope tasks in BioNLP 2013 shared task. BMC Bioinform. 16(10), S1 (2015)
Claveau, V.: IRISA participation to BioNLP-ST13: lazy-learning and information retrieval for information extraction tasks. In: Proceedings of the BioNLP Shared Task 2013 Workshop (2013)
Deléger, L., Bossy, R., Chaix, E., Ba, M., Ferré, A., Bessières, P., Nédellec, C.: Overview of the bacteria biotope task at BioNLP shared task 2016. In: Proceedings of the 4th BioNLP Shared Task Workshop (2016)
Grouin, C.: Identification of mentions and relations between bacteria and biotope from PubMed abstracts. In: Proceedings of the 4th BioNLP Shared Task Workshop (2016)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Kordjamshidi, P., Roth, D., Moens, M.F.: Structured learning for spatial information extraction from biomedical text: bacteria biotopes. BMC Bioinform. 16, 129 (2015)
Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: Proceedings of the 2016 Conference of the NAACL, pp. 260–270 (2016)
Li, H., Zhang, J., Wang, J., Lin, H., Yang, Z.: DUTIR in BioNLP-ST 2016: utilizing convolutional network and distributed representation to extract complicate relations. In: Proceedings of the 4th BioNLP Shared Task Workshop (2016)
Li, Q., Ji, H.: Incremental joint extraction of entity mentions and relations. In: Proceedings of the 52nd ACL, pp. 402–412 (2014)
Lin, Y., Shen, S., Liu, Z., Luan, H., Sun, M.: Neural relation extraction with selective attention over instances. In: Proceedings of the 54th Annual Meeting of the ACL, pp. 2124–2133 (2016)
Manning, C.D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S.J., McClosky, D.: The Stanford coreNLP natural language processing toolkit. In: Proceedings of 52nd ACL, pp. 55–60, September 2014
Mehryary, F., Björne, J., Pyysalo, S., Salakoski, T., Ginter, F.: Deep learning with minimal training data: TurkuNLP entry in the BioNLP shared task 2016. In: Proceedings of the 4th BioNLP Shared Task Workshop (2016)
Miwa, M., Bansal, M.: End-to-end relation extraction using LSTMs on sequences and tree structures. In: Proceedings of the 54th Annual Meeting of the ACL, pp. 1105–1116 (2016)
Nguyen, N., Tsuruoka, Y.: Extracting bacteria biotopes with semi-supervised named entity recognition and coreference resolution. In: Proceedings of the BioNLP Shared Task 2011 Workshop, pp. 94–101 (2011)
Pyysalo, S., Ginter, F., Moen, H., Salakoski, T., Ananiadou, S.: Distributional semantics resources for biomedical text processing. In: LBM (2013)
Roth, D., Yih, W.: Global inference for entity and relation identification via a linear programming formulation. In: Introduction to Statistical Relational Learning (2007)
Wang, L., Cao, Z., de Melo, G., Liu, Z.: Relation classification via multi-level attention CNNs. In: Proceedings of the 54th Annual Meeting of the ACL, pp. 1298–1307 (2016)
Xu, Y., Mou, L., Li, G., Chen, Y., Peng, H., Jin, Z.: Classifying relations via long short term memory networks along shortest dependency paths. In: Proceedings of the EMNLP, pp. 1785–1794 (2015)
Zeng, D., Liu, K., Lai, S., Zhou, G., Zhao, J.: Relation classification via convolutional deep neural network. In: Proceedings of the 25th COLING, pp. 2335–2344 (2014)
Zhang, M., Yang, J., Teng, Z., Zhang, Y.: LibN3L: a lightweight package for neural NLP. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation, pp. 23–28 (2016)
Zhang, M., Zhang, Y., Fu, G.: Transition-based neural word segmentation. In: Proceedings of the 54th Annual Meeting of the ACL, pp. 421–431 (2016)
Acknowledgments
This work is supported by the National Natural Science Foundation of China (No. 61373108), the National Philosophy Social Science Major Bidding Project of China (No. 11&ZD189). This work is also supported by Humanities and Social Science Foundation of Ministry of Education of China (16YJCZH004), and the China Postdoctoral Science Foundation (2014T70722).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Li, F., Zhang, M., Fu, G., Ji, D. (2017). A Neural Joint Model for Extracting Bacteria and Their Locations. In: Kim, J., Shim, K., Cao, L., Lee, JG., Lin, X., Moon, YS. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2017. Lecture Notes in Computer Science(), vol 10235. Springer, Cham. https://doi.org/10.1007/978-3-319-57529-2_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-57529-2_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-57528-5
Online ISBN: 978-3-319-57529-2
eBook Packages: Computer ScienceComputer Science (R0)