Skip to main content

N-Gram Classifier System to Filter Spam Messages from OSN User Wall

  • Conference paper
  • First Online:
Innovations in Computer Science and Engineering

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 413))

Abstract

Online Social Network (OSN) allows users to create, comment, post, and read articles of their own interest within virtual communities. They may allow forming mini-networks within the bigger, more diverse social network service. But still, improper access management of the shared contents on the network may give rise to security and privacy problems like spam messages being generated on someone’s public or private wall through people like friends, unknown persons, and friends of friends. This may also reduce the interest of Internet data surfing and may cause damage to less secure data. To avoid this, there was a need of a system that could remove such unwanted contents, particularly the messages from OSN. Here in this paper, for secure message delivery I have presented a classifier system based on N-gram generated profile. This system consists of ML technique using soft classifier, that is, N-gram which will automatically label the received messages from users in support of content-based filtering. Effectiveness of N-grams is studied in this paper for the purpose of measuring the similarity between test documents and trained classified documents. As an enhancement, N-gram method can also be used to detect and prevent leakage of very sensitive data by using N-grams frequency for document classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Rula Sayaf and Dave Clarke, “Access Control Models For Online Social Networks”, in book, internationally recognised scientific publisher, IGI Global, [2012].

    Google Scholar 

  2. Mrs. Sayantani Ghosh, Mr. Sudipta Roy, and Prof. Samir K. Bandyopadhyay, “A tutorial review on Text Mining Algorithms”, in International Journal of Advanced Research in Computer and Communication Engineering, Vol. 1, Issue 4, June [2012].

    Google Scholar 

  3. Nicholas J. Belkin and W. Bruce Croft, “Information filtering and information retrieval: Two sides of the same coin?”, in Communications of the ACM v35 n12p29(10), Dec [1992].

    Google Scholar 

  4. Zakaria Elberrichi & Badr Aljohar, “N-grams in Texts Categorization”, in Scientific Journal of King Faisal University Vol. 8 [2007].

    Google Scholar 

  5. Xin Jin, Cindy Xide Lin, Jiebo Luo and Jiawei Han, “A Data Mining based Spam Detection System for Social Media Networks”, in Proceedings of the VLDB Endowment, Vol. 4, No. 12, August 29th - September 3rd [2011].

    Google Scholar 

  6. Sultan Alneyadi, Elankayer Sithirasenan and Vallipuram Muthukkumarasamy, “Word N-gram Based Classification for Data Leakage Prevention”, in 12th IEEE interanational conference July [2013].

    Google Scholar 

  7. Marco Vanetti, Elisabetta Binaghi, Elena Ferrari, Barbara Carminati, and Moreno Carullo, “A System to Filter Unwanted Messages from OSN UserWalls”, in IEEE Transactions On Knowledge And Data Engineering, Vol. 25, No. 2, February [2013].

    Google Scholar 

  8. Salwa Adriana Saab, Nicholas Mitri and Mariette Awad, “Ham or Spam? A comparative study for some Content-based Classification Algorithms for Email Filtering”, in 17th IEEE Mediterraneaan Electronical Conference,Beirut,April [2014].

    Google Scholar 

  9. Christina V, Karpagavalli S and Suganya G, “A Study on Email Spam Filtering Techniques”, International Journal of Computer Applications Vol. 12– No.1, December [2010].

    Google Scholar 

  10. RoissAlhutaish and Nazlia Omar, “Arabic Text Classification Using K-Nearest Neighbour Algorithm”, in The International Arab Journal of Information,vol.12,No.2,March [2015].

    Google Scholar 

  11. Presentation,Porter Stemmer Daniel Waegel CISC889/ Fall [2011].

    Google Scholar 

  12. Taher Zaki, Youssef Es-saady, Driss Mammass, Abdellatif Ennaji and Stéphane Nicolas, “A Hybrid Method N-Grams-TFIDF with radial basis for indexing and classification of Arabic documents”, in International Journal of Software Engineering and Its ApplicationsVol.8, No.2, pp.127-144,[2014].

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sneha R. Harsule .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Science+Business Media Singapore

About this paper

Cite this paper

Harsule, S.R., Nighot, M.K. (2016). N-Gram Classifier System to Filter Spam Messages from OSN User Wall. In: Saini, H., Sayal, R., Rawat, S. (eds) Innovations in Computer Science and Engineering. Advances in Intelligent Systems and Computing, vol 413. Springer, Singapore. https://doi.org/10.1007/978-981-10-0419-3_3

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-0419-3_3

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-0417-9

  • Online ISBN: 978-981-10-0419-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics