Abstract
A huge volume of digitized clinical data is generated and accumulated rapidly since the widespread adoption of Electronic Medical Records (EMRs). These big data in healthcare hold the promise of propelling healthcare evolving from a proficiency-based art to a data-driven science, from a reactive mode to a proactive mode, from one-size-fits-all medicine to personalized medicine. This paper first discusses the research background - big data analytics in healthcare, the research framework of big data analytics in healthcare, analysis of medical process, and the literature summary of treatment pattern mining. Then the challenges for data-driven typical treatment pattern mining are highlighted, including similarity measure between treatment records, typical treatment pattern extraction, evaluation and recommendation, when considering the rich temporal and heterogeneous medical information in EMRs. Furthermore, three categories of typical treatment patterns are mined from doctor order content, duration, and sequence view respectively, which can provide a data-driven guideline to achieve the “5R” goal for rational drug use and clinical pathways.
Similar content being viewed by others
References
Ainsworth J, Buchan I (2012). COCPIT: A tool for integrated care pathway variance analysis. Studies in Health Technology and Informatics 180: 995–999.
Auffray C, Chen Z, Hood L (2009). Systems medicine: The future of medical genomics and healthcare. Genome Medicine 1(1): 2.
Bakker M, Tsui K L (2017). Dynamic resource allocation for efficient patient scheduling: A data-driven approach. Journal of Systems Science and Systems Engineering 26(4): 448–462.
Bouarfa L, Dankelman J (2012). Workflow mining and outlier detection from clinical activity logs. Journal of Biomedical Informatics 45(6): 1185–1190.
Bricage P (2017). Use of chronolithotherapy for better individual healthcare and welfare. Journal of Systems Science and Systems Engineering 26(3): 336–358.
Chen H, Chiang R H L, Storey V C (2012). Business intelligence and analytics: From big data to big impact. MIS Quarterly 36(4): 1165–1188.
Chen G, Wu G, Gu Y, Lu B, Wei Q (2018). The challenges for big data driven research and applications in the context of managerial decision-making-paradigm shift and research directions. Journal of Management Science in China 169(7): 6–15. (In Chinese)
Chen J, Li K, Rong H, Bilal K, Yang N, Li K (2018). A disease diagnosis and treatment recommendation system based on big data mining and cloud computing. Information Sciences 435: 124–149.
Chen J, Guo C, Sun L, Lu M (2018). Mining typical drug use patterns based on patient similarity from electronic medical records. In International Symposium on Knowledge and Systems Sciences, Tokyo, Japan, Nov 25–27, 2018.
Chen J, Guo C, Sun L, Lu M (2019). Mining typical treatment duration patterns for rational drug use from electronic medical records. Journal of Systems Science and Systems Engineering 28(5): 602–620.
Chen J, Sun L, Guo C, Wei W, Xie Y (2018). A data-driven framework of typical treatment process extraction and evaluation. Journal of Biomedical Informatics 83: 178–195.
Chen J, Wei W, Guo C, Tang L, Sun L (2017). Textual analysis and visualization of research trends in data mining for electronic health records. Health Policy and Technology 6(4): 389–400.
Chen J, Yuan P, Zhou X, Tang X (2016). Performance comparison of TF*IDF, LDA and paragraph vector for document classification. In International Symposium on Knowledge and Systems Sciences, Kobe, Japan, Nov 4–6, 2016.
Cho SG, Kim SB (2017). Feature network-driven quadrant mapping for summarizing customer reviews. Journal of Systems Science and Systems Engineering 26(5): 646–664.
Dang TT, Ho TB (2017). Sequence-based measure for assessing drug-side effect causal relation from electronic medical records. In International Symposium on Knowledge and Systems Sciences, Bangkok, Thailand, Nov 17–19, 2017.
Esfandiari N, Babavalian M R, Moghadam A M E, Tabar V K (2014). Knowledge discovery in medicine: Current issue and future trend. Expert Systems with Applications 41(9): 4434–4463.
Frankel F, Reid R (2008). Big data: Distilling meaning from data. Nature 455(7209): 30.
Groves P, Kayyali B, Knott D, Kuiken SV (2013). The “big data” revolution in healthcare: Accelerating value and innovation. McKinsey Quarterly 2(3): 1–19.
Guo C, Du Z, Kou X (2018). Products ranking through aspect-based sentiment analysis of online heterogeneous reviews. Journal of Systems Science and Systems Engineering 27(5): 542–558.
Han J, Kamber M, Pei J (2011). Data Mining: Concepts and Techniques(3ed). Morgan Kaufmann Publishers Inc., San Mateo, USA.
Hey T, Tansley S, Tolle K (2009). The Fourth Paradigm: Data-Intensive Scientific Discovery. Redmond, WA: Microsoft Research, Washington, USA.
Hirano S, Tsumoto S (2014). Mining Typical Order Sequences from EHR for Building Clinical Pathways. In Pacific-Asia Conference on Knowledge Discovery and Data Mining, Taiwan, China, May 13–16, 2014.
Hoang K H, Ho T B (2019). Learning and recommending treatments using electronic medical records. Knowledge-Based Systems 181: 104788.
Hopp W J, Li J, Wang G (2018). Big data and the precision medicine revolution. Production and Operations Management 27(9): 1647–1664.
Htun H H, Sornlertlamvanich V (2017). Concept name similarity measure on SNOMED CT. In International Symposium on Knowledge and Systems Sciences, Bangkok, Thailand, Nov 17–19, 2017.
Huang Z, Dong W, Ji L, Gan C, Lu X, Duan H (2014). Discovery of clinical pathway patterns from event logs using probabilistic topic models. Journal of Biomedical Informatics 47: 39–57.
Huang Z, Dong W, Bath P, Ji L, Duan H (2015). On mining latent treatment patterns from electronic medical records. Data Mining and Knowledge Discovery 29(4): 914–949.
Huang Z, Lu X, Duan H, Fan W (2013). Summarizing clinical pathways from event logs. Journal of Biomedical Informatics 46(1): 111–127.
Jensen PB, Jensen LJ, Brunak S (2012). Mining electronic health records: Towards better research applications and clinical care. Nature Reviews Genetics 13(6): 395–405.
Ji G, Hu L, Tan K H (2017). A study on decision-making of food supply chain based on big data. Journal of Systems Science and Systems Engineering 26(2): 183–198.
Jin B, Yang H, Sun L, Liu C, Qu Y, Tong J (2018). A treatment engine by predicting next-period prescriptions. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, London, United Kingdom. August 19–23, 2018.
Johnson AEW, Pollard TJ, Shen L, Lehman LWH, Mark RG (2016). MIMIC-III a freely accessible critical care database. Scientific Data 3: 160035.
Lakshmanan G T, Rozsnyai S, Wang F (2013). Investigating clinical care pathways correlated with outcomes. In Business Process Management 8094:323–338. Springer, Berlin, Heidelberg, Germany.
Li X, Mei J, Liu H, Yu Y, Xie G, Hu J, Wang F (2015). Analysis of care pathway variation patterns in patient records. Studies in Health Technology and Informatics 210: 692–696.
Lynch C (2008). Big data: How do your data grow? Nature 455(7209): 28–29.
Mans R, Schonenberg H, Leonardi G, Panzarasa S, Cavallini A, Quaglini S, Van Der Aalst W (2008). Process mining techniques: An application to stroke care. Studies in Health Technology and Informatics 136: 573–578.
Mayer-Schönberger V, Cukier K (2013). “Big Data: A revolution that will transform how we live, work, and think”. Houghton Mifflin Harcourt, Boston, USA.
MIT Critical Data (2016). Secondary Analysis of Electronic Health Records. Springer, Berlin, Germany.
Miller K (2012). Big data analytics in biomedical research. Biomedical Computation Review 2: 14–21.
Niaksu O (2015). CRISP data mining methodology extension for medical domain. Baltic Journal of Modern Computing 3(2): 92.
Perer A, Wang F, Hu J (2015). Mining and exploring care pathways from electronic medical records with visual analytics. Journal of Biomedical Informatics 56: 369–378.
Rebuge Á, Ferreira D R (2012). Business process analysis in healthcare environments: A methodology based on process mining. Information Systems 37(2):99–116.
Shi Y (2014). Big data: History, current status, and challenges going forward. The Bridge 44(4): 6–11.
Shortliffe E H, Cimino J J (2006). Biomedical informatics: Computer applications in health care and biomedicine (3ed). Springer Science+Business Media, LLC, New York, USA.
Staff, S. (2011). Challenges and opportunities. Science 331(6018): 692–693.
Sun L, Chen G, Xiong H, Guo C (2017). Cluster analysis in data-driven management and decisions. Journal of Management Science and Engineering 2(4): 227–251.
Sun L, Guo C, Liu C, Xiong H (2017). Fast affinity propagation clustering based on incomplete similarity matrix. Knowledge and Information Systems 51(3): 941–963.
Sun L, Jin B, Yang H, Tong J, Liu C, Xiong H (2019). Unsupervised EEG feature extraction based on echo state network. Information Sciences 475: 1–17.
Sun L, Liu C, Guo C, Xiong H, Xie Y (2016). Data-driven automatic treatment regimen development and recommendation. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, USA, August 13–17, 2016.
Tien J M, Goldschmidt-Clermont P J (2009). Healthcare: A complex service system. Journal of Systems Science and Systems Engineering 18(3): 257–282.
Topol E (2015). The Patient Will See You Now: The future of medicine is in your hands. Basic Books, New York, USA.
Wang Y, Qian L, Li F, Zhang L (2018). A comparative study on shilling detection methods for trustworthy recommendations. Journal of Systems Science and Systems Engineering 27(4): 458–478.
Wei W, Guo C (2019). A text semantic topic discovery method based on the conditional co-occurrence degree. Neurocomputing 368: 11–24.
World Health Organization (2012). The Pursuit of Responsible Use of Medicines: Sharing and Learning from Country Experiences. WHO/EMP/MAR/2012.3. Geneva Switzerland: WHO.
Wright A P, Wright A T, McCoy A B, Sittig D F (2015). The use of sequential pattern mining to predict next prescribed medications. Journal of Biomedical Informatics 53: 73–80.
Wu X, Chen H, Wu G, Liu J, Zheng Q, He X, Zhou A, Zhao Z, Wei B, Gao M, Li Y, Zhang Q, Zhang S, Lu R, Li Y (2015). Knowledge engineering with big data. IEEE Intelligent Systems 30(5): 46–55.
Xu N, Tang X (2018). Generating risk maps for evolution analysis of societal risk events. In International Symposium on Knowledge and Systems Sciences, Tokyo, Japan, Nov 25–27, 2018.
Yadav P, Steinbach M, Kumar V, Simon G (2018). Mining electronic health records (EHRs): A survey. ACM Computing Surveys 50(6): 85.
Yang S, Dong X, Sun L, Zhou Y, Farneth RA, Xiong H, Burd RS, Marsic I (2017). A data-driven process recommender framework. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, Canada, August 13–17, 2017.
Acknowledgments
The authors would like to thank the anonymous referees for their help to improve the quality of the paper. This research was supported in part by the National Natural Science Foundation of China under Grant Nos. 71771034 and 71421001, and Science and Technology Program of Jieyang under Grant No. 2017xm041.
Author information
Authors and Affiliations
Corresponding author
Additional information
The main contents of this paperwere originally presented as the invite keynote speech at the 19th International Symposium on Knolwdge and Systems Sciences in Tokyo during Nov 25–27, 2018.
Chonghui Guo is a professor of the Institute of Systems Engineering, Dalian University of Technology, Dalian, China. He received the B.S. degree in mathematics from Liaoning University in 1995, M.S. degree in operational research and control theory in 1999 and Ph.D. degree in management science and engineering from Dalian University of Technology in 2002. He was a postdoctoral research fellow in the Department of Computer Science in Tsinghua University, Beijing, China. His studies concentrate on data mining and knowledge discovery. He has published over 100 peer-reviewed papers in academic journals and conferences, besides 5 text-books and 2 monographs. He has been the Principal Investigator on over 10 research projects from the Government and the Industry.
Jingfeng Chen received the B.S. degree in mathematics from Henan University of Economics and Law in 2012, M.S. degree from Dongbei University of Finance and Economics in 2015, and Ph.D. degree from Institute of Systems Engineering, Dalian University of Technology, in 2019. His research interests include medical data mining, business intelligence. His papers have been published and presented on journals and conferences such as Journal of Biomedical Informatics, Health Policy and Technology, Soft Computing, Journal of Systems Science and Systems Engineering, and the 19th International Symposium on Knowledge and Systems Sciences (KSS2018).
Rights and permissions
About this article
Cite this article
Guo, C., Chen, J. Big Data Analytics in Healthcare: Data-Driven Methods for Typical Treatment Pattern Mining. J. Syst. Sci. Syst. Eng. 28, 694–714 (2019). https://doi.org/10.1007/s11518-019-5437-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11518-019-5437-5