Abstract
Event detection from social media is a highly important field of study. The main goal of the event detection is to detect events (e.g., traffic, disaster) automatically from social media like Twitter. In the area of supervised event detection from Twitter, word2vec model is not yet used by the researchers as a feature extraction technique. Since word2vec introduces a new approach on feature extraction from words and documents, in this work, we propose a near real-time traffic congestion detection system from the Twitter data stream using word2vec model as our feature extraction method. The system fetches the continuous stream of tweets from Twitter, preprocess and extract features from tweets using word2vec, classifies traffic congestion related tweets and notifies the occurrence of traffic congestion in a particular region. For the classification task, we compared the performance of Support Vector Machine (SVM), Logistic Regression (LR) and Naive Bayes (NB) algorithms. Our experimental results have shown that SVM outperforms the other two algorithms achieving an accuracy of 91.73%. Later on, the SVM has been selected to build the actual system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Farzindar, A., & Wael, K. (2015). A survey of techniques for event detection in Twitter. Computational Intelligence, 31, 132–164.
Gutierrez, C., Figuerias, P., Oliveira, P., Costa, R., & Jardim-Goncalves, R. (2015). Twitter mining for traffic events detection. In Proceedings of the 2015 Science and Information Conference. SAI (pp. 371–378). London, UK.
Lilleberg, J., Zhu, Y., & Zhang, Y. (2015). Support vector machines and word2vec for text classification with semantic features. In Proceedings of 2015 IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing, ICCI*CC 2015 (pp. 136–140). Beijing, China.
Gaglio, S., Lo Re, G., & Morana, M. (2016). A framework for real-time Twitter data analysis. Computer Communications, 73, 236–242.
Tan, Z., Zhang, P., Tan, J., & Guo, L. (2014). A multi-layer event detection algorithm for detecting global and local hot events in social networks. Procedia Computer Science, 29, 2080–2089.
Rogstadius, J., Vukovic, M., Teixeira, C. A., Kostakos, V., Karapanos, E., & Laredo, J. A. (2013). CrisisTracker: Crowdsourced social media curation for disaster awareness. IBM Journal of Research and Development, 57, 1–13.
Sankaranarayanan, J., Samet, H., Teitler, B. E., Lieberman, M. D., & Sperling, J. (2009). TwitterStand. In Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems—GIS 09 (pp. 42–51). Seattle, WA, USA.
Ramachandran, D., & Parvathi, R. (2019). Enhanced event detection in Twitter through feature analysis. International Journal of Information Technology and Web Engineering, 14, 1–15.
Laylavi, F., Rajabifard, A., & Kalantari, M. (2017). Event relatedness assessment of Twitter messages for emergency response. Information Processing and Management, 53, 266–280.
Taufiq, M. (2019). Detection traffic congestion based on Twitter data using machine learning. Procedia Computer Science, 157, 118–124.
D’Andrea, E., Ducange, P., Lazzerini, B., & Marcelloni, F. (2015). Real-time detection of traffic from Twitter stream analysis. IEEE Transactions on Intelligent Transportation Systems, 16, 2269–2283.
Alkhatib, M., El Barachi, M., & Shaalan, K. (2019). An Arabic social media based framework for incidents and events monitoring in smart cities. Journal of Cleaner Production, 220, 771–785.
Odea, B., Wan, S., Batterham, P. J., Calear, A. L., Paris, C., & Christensen, H. (2015). Detecting suicidality on Twitter. Internet Interventions, 2, 183–188.
Cui, M., Bai, R., Lu, Z., Li, X., Aickelin, U., & Ge, P. (2019). Regular expression based medical text classification using constructive heuristic approach. IEEE Access, 7, 147892–147904.
Grover, A., Kumar, P., & Garg, K. (2020). Improvisation of cleaning process on tweets for opinion mining. International Journal of Big Data and Analytics in Healthcare, 5, 49–59.
Yao, Z., & Ze-Wen, C. (2011). Research on the construction and filter method of stop-word list in text preprocessing. In Proceedings—4th International Conference on Intelligent Computation Technology and Automation, ICICTA 2011 (Vol. 1, pp. 217–221). Shenzhen, Guangdong, China.
Word2Vec. https://spark.apache.org/docs/latest/mllib-feature-extraction.html.
Apache Spark. https://spark.apache.org.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Noori, M.A.R., Mehra, R. (2021). Traffic Congestion Detection from Twitter Using word2vec. In: Fong, S., Dey, N., Joshi, A. (eds) ICT Analysis and Applications. Lecture Notes in Networks and Systems, vol 154. Springer, Singapore. https://doi.org/10.1007/978-981-15-8354-4_52
Download citation
DOI: https://doi.org/10.1007/978-981-15-8354-4_52
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-8353-7
Online ISBN: 978-981-15-8354-4
eBook Packages: EngineeringEngineering (R0)