Skip to main content

Feature Selection and Extraction for Dogri Text Summarization

  • Conference paper
  • First Online:
Rising Threats in Expert Applications and Solutions

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1187))

Abstract

Text summarization is defined as the process of condensing information from the source text into a shorter form without affecting the context of the information. Based on the summary generated by the summarization system, it is classified into abstractive and extractive summarization. Extractive summarization is the technique of extracting important sentences from the document that delivers the logical summary of the document. The candidate sentences for summary generation are decided by using statistical and linguistic features of the given source text. The proposed approach for Extractive Dogri Text summarization is presented in this paper. Various statistical and linguistic features that can contribute to the selection of appropriate sentences for Dogri text summarization are also illustrated in the paper. Statistical features like term frequency, length of a sentence, position of a sentence and term frequency-inverse sentence frequency (TF-ISF) are taken into consideration. And the linguistic features like presence of proper noun, numerical information, English-Dogri words are also considered for determining the candidature of the sentences for inclusion in the final summary generation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. H.P. Luhn, The automatic creation of literature abstracts. IBM J. Res. Dev. 2(2), 159–165 (1958)

    Article  MathSciNet  Google Scholar 

  2. M. Gambhir, V. Gupta, Recent automatic text summarization techniques. Artif. Intell. Rev. 47(1), 1–66 (2017)

    Article  Google Scholar 

  3. N. Munot, S.S. Govilkar, Comparative study of text summarization methods. Int. J. Comput. Appl. 102(12), 975–8887 (2014)

    Google Scholar 

  4. J.-M. Torres-Moreno, Automatic Text Summarization (Wiley-ISTE, 2014)

    Google Scholar 

  5. N. Moratanch, S. Chitrakala, A survey on extractive text summarization, in International Conference on Computer, Communication, and Signal Processing: Special Focus on IoT, ICCCSP 2017 (2017)

    Google Scholar 

  6. L. Suanmali, N. Salim, M.S. Binwahlan, Fuzzy logic based method for improving text summarization. IJCSIS Int. J. Comput. Sci. Inf. Secur. 2(1) (2009)

    Google Scholar 

  7. Languages Included in the Eighth Schedule of the Indian Constitution|Department of Official Language |Ministry of Home Affairs|GoI. http://rajbhasha.nic.in/en/languages-includedeighth-schedule-indian-constution

  8. Dogri language—Wikipedia. https://en.wikipedia.org/wiki/Dogri_language

  9. Dogri language|Britannica.com. https://www.britannica.com/topic/Dogri-language

  10. S.S. Naik, M.N. Gaonkar, Extractive text summarization by feature-based sentence extraction using rule-based concept, in RTEICT 2017—2nd IEEE International Conference on Recent Trends in Electronics, Information and Communication Technology, Proceedings, vol. 2018 (2017), pp. 1364–1368

    Google Scholar 

  11. M. Afsharizadeh, H. Ebrahimpour-Komleh, A. Bagheri, Query-oriented text summarization using sentence extraction technique, in 2018 4th International Conference on Web Research, ICWR 2018 (2018), pp. 128–132

    Google Scholar 

  12. R. Ahuja, W. Anand, Multi-document text summarization using sentence extraction. Adv. Intell. Syst. Comput. 517, 235–242 (2017)

    Google Scholar 

  13. V. Gupta, G.S. Lehal, Features selection and weight learning for Punjabi text summarization. Int. J. Eng. Trends Technol. 2, 45–48 (2011)

    Google Scholar 

  14. A.N. Gulati, S.D. Sawarkar, A novel technique for multidocument Hindi text summarization, in 2017 International Conference on Nascent Technologies in Engineering ICNTE 2017—Proceedings, (2017)

    Google Scholar 

  15. R. Ferreira et al., Assessing sentence scoring techniques for extractive text summarization. Expert Syst. Appl. 40, 5755–5764 (2013)

    Article  Google Scholar 

  16. Jammu Prabhat: First and Only Dogri News Paper, Dogri Newspaper. http://www.jammuprabhat.com/

  17. J.L. Neto et al., Document clustering and text summarization (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sonam Gandotra .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gandotra, S., Arora, B. (2021). Feature Selection and Extraction for Dogri Text Summarization. In: Rathore, V.S., Dey, N., Piuri, V., Babo, R., Polkowski, Z., Tavares, J.M.R.S. (eds) Rising Threats in Expert Applications and Solutions. Advances in Intelligent Systems and Computing, vol 1187. Springer, Singapore. https://doi.org/10.1007/978-981-15-6014-9_65

Download citation

Publish with us

Policies and ethics