DOI QR코드

DOI QR Code

Classification of Seoul Metro Stations Based on Boarding/ Alighting Patterns Using Machine Learning Clustering

기계학습 클러스터링을 이용한 승하차 패턴에 따른 서울시 지하철역 분류

  • Min, Meekyung (Dept. of Computer Science, Seokyeong University)
  • 민미경 (서경대학교 컴퓨터과학과)
  • Received : 2018.06.14
  • Accepted : 2018.08.10
  • Published : 2018.08.31

Abstract

In this study, we classify Seoul metro stations according to boarding and alighting patterns using machine earning technique. The target data is the number of boarding and alighting passengers per hour every day at 233 subway stations from 2008 to 2017 provided by the public data portal. Gaussian mixture model (GMM) and K-means clustering are used as machine learning techniques in order to classify subway stations. The distribution of the boarding time and the alighting time of the passengers can be modeled by the Gaussian mixture model. K-means clustering algorithm is used for unsupervised learning based on the data obtained by GMM modeling. As a result of the research, Seoul metro stations are classified into four groups according to boarding and alighting patterns. The results of this study can be utilized as a basic knowledge for analyzing the characteristics of Seoul subway stations and analyzing it economically, socially and culturally. The method of this research can be applied to public data and big data in areas requiring clustering.

본 연구에서는 기계학습을 이용하여 서울시 지하철역의 승하차 패턴에 따라 지하철역을 분류한다. 대상 데이터는 공공데이터 포탈에서 제공하는 2008년부터 2017년까지 서울 지하철 233개 역에서의 매일 매시간별 승차객 숫자와 하차객 숫자이다. 기계학습 기법으로는 가우시안 혼합 모델(GMM)과 K-평균 클러스터링을 사용한다. 이용객의 승차시간과 하차시간의 분포는 가우시안 혼합 모델로 모델링할 수 있으며, 이를 K-평균 클러스터링을 이용하여 비지도 학습시킨다. 학습결과 서울시 지하철역은 승하차 패턴에 따라 4개의 그룹으로 분류되었다. 본 연구의 결과는 서울시 지하철역의 특성을 파악하여 경제, 사회, 문화적으로 분석하기 위한 주요 기반 지식으로 활용될 수 있다. 본 연구의 방법은 클러스터링이 필요한 모든 공공데이터나 빅데이터에 적용할 수 있다.

Keywords

References

  1. BongHyun Back, Il-Kyu Ha, "A Method for Selective Storing and Visualization of Public Big Data Using XML Structure", Journal of the Korea Institute of Information and Communication Engineering, Vol. 21, No. 12, pp. 2305-2311, Dec 2017. DOI: https://doi.org/10.6109/jkiice.2017.21.12.2305.
  2. Jae-Young Chang, "An Experimental Evaluation of Box office Revenue Prediction through Social Bigdata Analysis and Machine Learning", The Journal of the Institute of Internet, Broadcasting and Communication(JIIBC), Vol. 17, No. 3, pp. 167-173, Jun 2017. DOI: https://doi.org/10.7236/JIIBC.2017.17.3.167.
  3. Min-Soo Kang, Yong-Gyu Jung, Du-Hwan Jang, "A Study on the Search of Optimal Aquaculture farm condition based on Machine Learning", The Journal of the Institute of Internet, Broadcasting and Communication(JIIBC), Vol. 17, No. 4, pp. 135-140, Apr 2017. DOI: https://doi.org/10.7236/JIIBC.2017.17.2.135.
  4. Jin-su Kim, "Subway Congestion Prediction and Recommendation System using Big Data Analysis", Journal of Digital Convergence, Vol. 14, No. 11, pp. 289-295, Nov 2016. DOI: https://doi.org/10.14400/JDC.2016.14.11.289.
  5. Minwoo Kim, "Predicting Subway Passengers Flows By Spatio-Temporal Modeling", Master Thesis, Seoul National University, Aug 2017.
  6. R. S. Michalski, J. G. Carbonell, T. M. Mitchell, Machine Learning: An Artificial Intelligence Approach, Springer Science & Business Media, 2013.
  7. https://www.data.go.kr