Skip to main content

Text Mining in Scientific Literature Evaluation: Extraction of Keywords for Describing Content

  • Chapter
  • First Online:
Apply Data Science

Abstract

Keywords should represent the content of documents in a compact form. They serve to assign suitable publications to a search query in the context of a scientific literature search. If a literature search is carried out as part of a scientific work, the found publications must be analyzed and their content evaluated. This can be a large number of publications, so that the analysis of the content can be extremely time-consuming. This article describes how the analysis of publications can be supported by text mining in the context of “Explainable AI” literature evaluation. Keywords are extracted from the abstracts of the found publications by text mining.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Vom Brocke J, Simons A, Niehaves B, Riemer K, Plattfaut R, Cleven A (2009) Reconstructing the giant: on the importance of rigour in documenting the literature search process. In: Newell S, Whitley E, Pouloudi N, Wareham J, Mathiassen L (Eds) Proceedings of the 17. European Conference on Information Systems (ECIS). AIS eLibrary, Verona

    Google Scholar 

  2. Tauchert C, Bender M, Mesbah N, Buxmann P (2020) Towards an integrative approach for automated literature reviews using machine learning. In: Bui TX (Eds) Proceedings of the 53rd Hawaii international conference on system sciences. AIS eLibrary, Maui

    Google Scholar 

  3. Ribeiro MT, Singh S, Guestrin, C (2016) Why should I trust you? Explaining the predictions of any classifier. In: Ghani R, Senator TE, Bradley P, Parekh R, He J (Eds) Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, 1135–1144

    Google Scholar 

  4. Hind M (2019) Explaining explainable AI. XRDS 25(3):16–19

    Article  Google Scholar 

  5. Holzinger A (2018) Explainable AI. Informatik Spektrum 41:138–143

    Article  Google Scholar 

  6. Rose S, Engel D, Cramer N, Cowley W (2010) Automatic keyword extraction from individual documents. In: Berry MW, Kogan J (Eds) Text mining: applications and theory, pp 1–20. Wiley, Chichester

    Google Scholar 

  7. ACM Digital Library (2001). https://dl.acm.org/. Accessed: 2. Jan 2021

  8. Ihaka R (1998) R: Past and future history. In: Weisberg S (Eds) Proceedings of the 30th symposium on the interface, the interface foundation of North America, pp 392–396. Fairfax Station, VA

    Google Scholar 

  9. Straka M, Strakov J (2017) Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe. In: Levy L, Specia L (Eds) Proceedings of the 21st conference on Computational Natural Language Learning (CoNLL 2017). The Association for Computational Linguistics, Stroudsburg

    Google Scholar 

  10. Hausmann G, Lämmel U (2021) Künstliche Intelligenz in der automatisierten Dokumentenverarbeitung am Beispiel von Krankenversicherungen. In: Barton T, Muller C (Hrsg) Künstliche Intelligenz in der Anwendung: Rechtliche Aspekte, Anwendungspotentiale und Einsatzszenarien. Springer Vieweg Wiesbaden, pp 177–193

    Google Scholar 

  11. Goodfellow I, Bengio Y, Courville A (2016) Deep learning. The MIT Press, Cambridge

    MATH  Google Scholar 

  12. Kannan A, Kurach K, Ravi S, Kaufmann T, Tomkins A, Miklos B, Corrado G, Lukacs L, Ganea M, Young P, Ramavajjala V (2016) Smart reply: automated response suggestion for email. In: Aggarwal C, Krishnapuram B, Rastogi R, Shah M, Shen D, Smola A (Eds) Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, pp 955–964. ACM, New York

    Google Scholar 

  13. An L, Jeng J-J, Lee YM, Ren C (2007) Effective workforce lifecycle management via system dynamics modeling and simulation. In: Henderson SG, Biller B, Hsieh M-H, Shortle J, Tew JD, Barton RR (Eds) Proceedings of the 39th winter simulation conference, Piscataway, 2187–2195

    Google Scholar 

  14. Shu K, Sliva A, Wang S, Tang J, Liu H (2017) Fake news detection on social media: a data mining perspective. ACM SIGKDD Explor Newsl 19(1):22–36

    Article  Google Scholar 

  15. Lämmel U, Cleve C (2020) Künstliche Intelligenz, 5th eds. Hanser, München.

    Google Scholar 

  16. Gao S, Newsam S, Zhao L, Lunga D, Hu Y, Martins B, Zhou X, Chen F (2019) Proceedings of the 3rd ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery, Chicago. ACM, New York

    Google Scholar 

  17. Zhao W, Xia G, Dong C, Li W, Ren S, Xue Y, Chen S, Chen X (2019) Immune and genetic hybrid optimization algorithm for data relay satellite with microwave and laser links. In: Auger A, Stützle T (Eds) Proceedings of the genetic and evolutionary computation conference companion, Prague, pp 2008–2015. ACM, New York

    Google Scholar 

  18. Wang Y, Chen B, Zhu Z, AI C (2019) Strategy of hybrid optimization algorithms for source parameters estimation of hazardous gas in field cases. In: Zhang H, Huang Y, Thill J-C (Eds) Proceedings of the 5th ACM SIGSPATIAL international workshop on the use of GIS in emergency management, Chicago, pp 1–6. ACM, New York

    Google Scholar 

  19. Biwer S, Filipek M, Arikan E, Jammernegg W (2018) Capacity planning challenges in a global production network with an example from the semiconductor industry. In: Johansson BJI, Jain S (Eds) Proceedings of the 2018 winter simulation conference, Gothenburg, pp 3639–3650. ACM, New York

    Google Scholar 

  20. Barton T, Peuker A (2022) Extraktion und Analyse von Schlüsselwörtern für eine automatisierte Literaturauswertung zum Thema Empfehlungssysteme. HMD Praxis der Wirtschaftsinformatik. to be published

    Google Scholar 

  21. Copurkuyu M, Barton T (2022) Extraktion und Analyse von Schlüsselwörtern in einer Literaturrecherche zu Quantum Computing. AKWI-Tagungsband zur 35. AKWI-Jahrestagung, pp 245–260. https://doi.org/10.30844/AKWI_2022_16

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thomas Barton .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Fachmedien Wiesbaden GmbH, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Barton, T., Kokoev, A. (2023). Text Mining in Scientific Literature Evaluation: Extraction of Keywords for Describing Content. In: Barton, T., Müller, C. (eds) Apply Data Science. Springer Vieweg, Wiesbaden. https://doi.org/10.1007/978-3-658-38798-3_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-658-38798-3_11

  • Published:

  • Publisher Name: Springer Vieweg, Wiesbaden

  • Print ISBN: 978-3-658-38797-6

  • Online ISBN: 978-3-658-38798-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics