A Neural Joint Model for Extracting Bacteria and Their Locations

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 10235)

Abstract

Extracting Lives_In relations between bacteria and their locations involves two steps: bacteria/location entity recognition and Lives_In relation classification. Previous work solved this task with pipeline models, which may suffer from error propagation and cannot exploit the interactions between the two steps. We follow the line of work on joint models, which perform the two subtasks simultaneously to obtain better performance. We adapt a state-of-the-art neural joint model for relation extraction in the Automatic Content Extraction (ACE) task to our task and propose two strategies to improve it. First, we introduce a novel relation in the second step that detects errors made in the first step, so that some of those errors can be corrected. Second, we replace the original greedy-search decoding with beam search and train the model with early-update techniques. Experimental results on a standard dataset for this task show that our adapted model achieves higher precision than other systems. Adding the novel relation yields an F1 improvement of nearly 2% for Lives_In relation extraction, and using beam search improves F1 by a further 6%. These results demonstrate that the proposed strategies are effective for this task. However, additional experiments show that the improvement on another dataset for bacteria and location extraction is not significant, so whether our methods are effective for other relation extraction tasks needs further investigation.
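The decoding change described in the abstract can be illustrated with a generic sketch of beam search with early update. This is not the paper's model: the abstract does not specify the action set or the scoring network, so the function `beam_search`, the scorer `toy_score`, and the actions "A" and "B" are all hypothetical stand-ins. The sketch only shows how a beam of size 1 (greedy search) can miss a sequence that a wider beam recovers, and how training could stop at the first step where the gold prefix falls off the beam.

```python
from typing import Callable, List, Optional, Sequence, Tuple

# (cumulative score, actions chosen so far); a hypothetical representation.
Hypothesis = Tuple[float, Tuple[str, ...]]


def beam_search(
    n_steps: int,
    actions: Sequence[str],
    score: Callable[[Tuple[str, ...], str], float],
    beam_size: int,
    gold: Optional[Sequence[str]] = None,
) -> Tuple[Tuple[str, ...], Optional[int]]:
    """Return (best action sequence, early-update step or None).

    If `gold` is given and its prefix drops out of the beam at some step,
    decoding stops there and that step index is returned, which is where
    an early-update trainer would apply its parameter update.
    """
    beam: List[Hypothesis] = [(0.0, ())]
    for step in range(n_steps):
        # Expand every hypothesis in the beam with every action.
        candidates = [
            (total + score(prefix, action), prefix + (action,))
            for total, prefix in beam
            for action in actions
        ]
        # Keep only the beam_size highest-scoring hypotheses.
        candidates.sort(key=lambda h: h[0], reverse=True)
        beam = candidates[:beam_size]
        if gold is not None:
            gold_prefix = tuple(gold[: step + 1])
            if all(prefix != gold_prefix for _, prefix in beam):
                return beam[0][1], step  # gold fell off the beam
    return beam[0][1], None


def toy_score(prefix: Tuple[str, ...], action: str) -> float:
    """Hypothetical scorer: "A" looks best first, but "B", "B" pays off later."""
    if not prefix:
        return 1.0 if action == "A" else 0.9
    if prefix[-1] == "B" and action == "B":
        return 2.0
    return 0.0


greedy_result, _ = beam_search(2, ["A", "B"], toy_score, beam_size=1)
beam_result, _ = beam_search(2, ["A", "B"], toy_score, beam_size=2)
_, early_step = beam_search(2, ["A", "B"], toy_score, beam_size=1, gold=["B", "B"])
```

With this toy scorer, the greedy run commits to "A" at the first step and cannot recover, while the beam of size 2 keeps "B" alive long enough to find the higher-scoring sequence. In the gold-guided greedy run, the gold prefix is lost at step 0, so an early-update trainer would update on that one-step prefix instead of the full sequence.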

Notes

  1. https://www.ldc.upenn.edu/collaborations/past-projects/ace

  2. http://bibliome.jouy.inra.fr/demo/BioNLP-ST-2016-Evaluation/index.html

Acknowledgments

This work is supported by the National Natural Science Foundation of China (No. 61373108) and the National Philosophy Social Science Major Bidding Project of China (No. 11&ZD189). It is also supported by the Humanities and Social Science Foundation of the Ministry of Education of China (16YJCZH004) and the China Postdoctoral Science Foundation (2014T70722).

Author information

Corresponding author

Correspondence to Donghong Ji.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Li, F., Zhang, M., Fu, G., Ji, D. (2017). A Neural Joint Model for Extracting Bacteria and Their Locations. In: Kim, J., Shim, K., Cao, L., Lee, JG., Lin, X., Moon, YS. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2017. Lecture Notes in Computer Science, vol 10235. Springer, Cham. https://doi.org/10.1007/978-3-319-57529-2_2

  • DOI: https://doi.org/10.1007/978-3-319-57529-2_2

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-57528-5

  • Online ISBN: 978-3-319-57529-2

  • eBook Packages: Computer Science, Computer Science (R0)
