Abstract
Online Social Network (OSN) allows users to create, comment, post, and read articles of their own interest within virtual communities. They may allow forming mini-networks within the bigger, more diverse social network service. But still, improper access management of the shared contents on the network may give rise to security and privacy problems like spam messages being generated on someone’s public or private wall through people like friends, unknown persons, and friends of friends. This may also reduce the interest of Internet data surfing and may cause damage to less secure data. To avoid this, there was a need of a system that could remove such unwanted contents, particularly the messages from OSN. Here in this paper, for secure message delivery I have presented a classifier system based on N-gram generated profile. This system consists of ML technique using soft classifier, that is, N-gram which will automatically label the received messages from users in support of content-based filtering. Effectiveness of N-grams is studied in this paper for the purpose of measuring the similarity between test documents and trained classified documents. As an enhancement, N-gram method can also be used to detect and prevent leakage of very sensitive data by using N-grams frequency for document classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rula Sayaf and Dave Clarke, “Access Control Models For Online Social Networks”, in book, internationally recognised scientific publisher, IGI Global, [2012].
Mrs. Sayantani Ghosh, Mr. Sudipta Roy, and Prof. Samir K. Bandyopadhyay, “A tutorial review on Text Mining Algorithms”, in International Journal of Advanced Research in Computer and Communication Engineering, Vol. 1, Issue 4, June [2012].
Nicholas J. Belkin and W. Bruce Croft, “Information filtering and information retrieval: Two sides of the same coin?”, in Communications of the ACM v35 n12p29(10), Dec [1992].
Zakaria Elberrichi & Badr Aljohar, “N-grams in Texts Categorization”, in Scientific Journal of King Faisal University Vol. 8 [2007].
Xin Jin, Cindy Xide Lin, Jiebo Luo and Jiawei Han, “A Data Mining based Spam Detection System for Social Media Networks”, in Proceedings of the VLDB Endowment, Vol. 4, No. 12, August 29th - September 3rd [2011].
Sultan Alneyadi, Elankayer Sithirasenan and Vallipuram Muthukkumarasamy, “Word N-gram Based Classification for Data Leakage Prevention”, in 12th IEEE interanational conference July [2013].
Marco Vanetti, Elisabetta Binaghi, Elena Ferrari, Barbara Carminati, and Moreno Carullo, “A System to Filter Unwanted Messages from OSN UserWalls”, in IEEE Transactions On Knowledge And Data Engineering, Vol. 25, No. 2, February [2013].
Salwa Adriana Saab, Nicholas Mitri and Mariette Awad, “Ham or Spam? A comparative study for some Content-based Classification Algorithms for Email Filtering”, in 17th IEEE Mediterraneaan Electronical Conference,Beirut,April [2014].
Christina V, Karpagavalli S and Suganya G, “A Study on Email Spam Filtering Techniques”, International Journal of Computer Applications Vol. 12– No.1, December [2010].
RoissAlhutaish and Nazlia Omar, “Arabic Text Classification Using K-Nearest Neighbour Algorithm”, in The International Arab Journal of Information,vol.12,No.2,March [2015].
Presentation,Porter Stemmer Daniel Waegel CISC889/ Fall [2011].
Taher Zaki, Youssef Es-saady, Driss Mammass, Abdellatif Ennaji and Stéphane Nicolas, “A Hybrid Method N-Grams-TFIDF with radial basis for indexing and classification of Arabic documents”, in International Journal of Software Engineering and Its ApplicationsVol.8, No.2, pp.127-144,[2014].
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media Singapore
About this paper
Cite this paper
Harsule, S.R., Nighot, M.K. (2016). N-Gram Classifier System to Filter Spam Messages from OSN User Wall. In: Saini, H., Sayal, R., Rawat, S. (eds) Innovations in Computer Science and Engineering. Advances in Intelligent Systems and Computing, vol 413. Springer, Singapore. https://doi.org/10.1007/978-981-10-0419-3_3
Download citation
DOI: https://doi.org/10.1007/978-981-10-0419-3_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0417-9
Online ISBN: 978-981-10-0419-3
eBook Packages: EngineeringEngineering (R0)