Skip to main content

Advertisement

Log in

Big Data Analytics in Healthcare: Data-Driven Methods for Typical Treatment Pattern Mining

  • Published:
Journal of Systems Science and Systems Engineering Aims and scope Submit manuscript

Abstract

A huge volume of digitized clinical data is generated and accumulated rapidly since the widespread adoption of Electronic Medical Records (EMRs). These big data in healthcare hold the promise of propelling healthcare evolving from a proficiency-based art to a data-driven science, from a reactive mode to a proactive mode, from one-size-fits-all medicine to personalized medicine. This paper first discusses the research background - big data analytics in healthcare, the research framework of big data analytics in healthcare, analysis of medical process, and the literature summary of treatment pattern mining. Then the challenges for data-driven typical treatment pattern mining are highlighted, including similarity measure between treatment records, typical treatment pattern extraction, evaluation and recommendation, when considering the rich temporal and heterogeneous medical information in EMRs. Furthermore, three categories of typical treatment patterns are mined from doctor order content, duration, and sequence view respectively, which can provide a data-driven guideline to achieve the “5R” goal for rational drug use and clinical pathways.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ainsworth J, Buchan I (2012). COCPIT: A tool for integrated care pathway variance analysis. Studies in Health Technology and Informatics 180: 995–999.

    Google Scholar 

  • Auffray C, Chen Z, Hood L (2009). Systems medicine: The future of medical genomics and healthcare. Genome Medicine 1(1): 2.

    Article  Google Scholar 

  • Bakker M, Tsui K L (2017). Dynamic resource allocation for efficient patient scheduling: A data-driven approach. Journal of Systems Science and Systems Engineering 26(4): 448–462.

    Article  Google Scholar 

  • Bouarfa L, Dankelman J (2012). Workflow mining and outlier detection from clinical activity logs. Journal of Biomedical Informatics 45(6): 1185–1190.

    Article  Google Scholar 

  • Bricage P (2017). Use of chronolithotherapy for better individual healthcare and welfare. Journal of Systems Science and Systems Engineering 26(3): 336–358.

    Article  Google Scholar 

  • Chen H, Chiang R H L, Storey V C (2012). Business intelligence and analytics: From big data to big impact. MIS Quarterly 36(4): 1165–1188.

    Article  Google Scholar 

  • Chen G, Wu G, Gu Y, Lu B, Wei Q (2018). The challenges for big data driven research and applications in the context of managerial decision-making-paradigm shift and research directions. Journal of Management Science in China 169(7): 6–15. (In Chinese)

    Google Scholar 

  • Chen J, Li K, Rong H, Bilal K, Yang N, Li K (2018). A disease diagnosis and treatment recommendation system based on big data mining and cloud computing. Information Sciences 435: 124–149.

    Article  Google Scholar 

  • Chen J, Guo C, Sun L, Lu M (2018). Mining typical drug use patterns based on patient similarity from electronic medical records. In International Symposium on Knowledge and Systems Sciences, Tokyo, Japan, Nov 25–27, 2018.

    Chapter  Google Scholar 

  • Chen J, Guo C, Sun L, Lu M (2019). Mining typical treatment duration patterns for rational drug use from electronic medical records. Journal of Systems Science and Systems Engineering 28(5): 602–620.

    Article  Google Scholar 

  • Chen J, Sun L, Guo C, Wei W, Xie Y (2018). A data-driven framework of typical treatment process extraction and evaluation. Journal of Biomedical Informatics 83: 178–195.

    Article  Google Scholar 

  • Chen J, Wei W, Guo C, Tang L, Sun L (2017). Textual analysis and visualization of research trends in data mining for electronic health records. Health Policy and Technology 6(4): 389–400.

    Article  Google Scholar 

  • Chen J, Yuan P, Zhou X, Tang X (2016). Performance comparison of TF*IDF, LDA and paragraph vector for document classification. In International Symposium on Knowledge and Systems Sciences, Kobe, Japan, Nov 4–6, 2016.

    Chapter  Google Scholar 

  • Cho SG, Kim SB (2017). Feature network-driven quadrant mapping for summarizing customer reviews. Journal of Systems Science and Systems Engineering 26(5): 646–664.

    Article  Google Scholar 

  • Dang TT, Ho TB (2017). Sequence-based measure for assessing drug-side effect causal relation from electronic medical records. In International Symposium on Knowledge and Systems Sciences, Bangkok, Thailand, Nov 17–19, 2017.

    Google Scholar 

  • Esfandiari N, Babavalian M R, Moghadam A M E, Tabar V K (2014). Knowledge discovery in medicine: Current issue and future trend. Expert Systems with Applications 41(9): 4434–4463.

    Article  Google Scholar 

  • Frankel F, Reid R (2008). Big data: Distilling meaning from data. Nature 455(7209): 30.

    Article  Google Scholar 

  • Groves P, Kayyali B, Knott D, Kuiken SV (2013). The “big data” revolution in healthcare: Accelerating value and innovation. McKinsey Quarterly 2(3): 1–19.

    Google Scholar 

  • Guo C, Du Z, Kou X (2018). Products ranking through aspect-based sentiment analysis of online heterogeneous reviews. Journal of Systems Science and Systems Engineering 27(5): 542–558.

    Article  Google Scholar 

  • Han J, Kamber M, Pei J (2011). Data Mining: Concepts and Techniques(3ed). Morgan Kaufmann Publishers Inc., San Mateo, USA.

    MATH  Google Scholar 

  • Hey T, Tansley S, Tolle K (2009). The Fourth Paradigm: Data-Intensive Scientific Discovery. Redmond, WA: Microsoft Research, Washington, USA.

    Google Scholar 

  • Hirano S, Tsumoto S (2014). Mining Typical Order Sequences from EHR for Building Clinical Pathways. In Pacific-Asia Conference on Knowledge Discovery and Data Mining, Taiwan, China, May 13–16, 2014.

    Google Scholar 

  • Hoang K H, Ho T B (2019). Learning and recommending treatments using electronic medical records. Knowledge-Based Systems 181: 104788.

    Article  Google Scholar 

  • Hopp W J, Li J, Wang G (2018). Big data and the precision medicine revolution. Production and Operations Management 27(9): 1647–1664.

    Article  Google Scholar 

  • Htun H H, Sornlertlamvanich V (2017). Concept name similarity measure on SNOMED CT. In International Symposium on Knowledge and Systems Sciences, Bangkok, Thailand, Nov 17–19, 2017.

    Google Scholar 

  • Huang Z, Dong W, Ji L, Gan C, Lu X, Duan H (2014). Discovery of clinical pathway patterns from event logs using probabilistic topic models. Journal of Biomedical Informatics 47: 39–57.

    Article  Google Scholar 

  • Huang Z, Dong W, Bath P, Ji L, Duan H (2015). On mining latent treatment patterns from electronic medical records. Data Mining and Knowledge Discovery 29(4): 914–949.

    Article  MathSciNet  Google Scholar 

  • Huang Z, Lu X, Duan H, Fan W (2013). Summarizing clinical pathways from event logs. Journal of Biomedical Informatics 46(1): 111–127.

    Article  Google Scholar 

  • Jensen PB, Jensen LJ, Brunak S (2012). Mining electronic health records: Towards better research applications and clinical care. Nature Reviews Genetics 13(6): 395–405.

    Article  Google Scholar 

  • Ji G, Hu L, Tan K H (2017). A study on decision-making of food supply chain based on big data. Journal of Systems Science and Systems Engineering 26(2): 183–198.

    Article  Google Scholar 

  • Jin B, Yang H, Sun L, Liu C, Qu Y, Tong J (2018). A treatment engine by predicting next-period prescriptions. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, London, United Kingdom. August 19–23, 2018.

    Google Scholar 

  • Johnson AEW, Pollard TJ, Shen L, Lehman LWH, Mark RG (2016). MIMIC-III a freely accessible critical care database. Scientific Data 3: 160035.

    Article  Google Scholar 

  • Lakshmanan G T, Rozsnyai S, Wang F (2013). Investigating clinical care pathways correlated with outcomes. In Business Process Management 8094:323–338. Springer, Berlin, Heidelberg, Germany.

    Chapter  Google Scholar 

  • Li X, Mei J, Liu H, Yu Y, Xie G, Hu J, Wang F (2015). Analysis of care pathway variation patterns in patient records. Studies in Health Technology and Informatics 210: 692–696.

    Google Scholar 

  • Lynch C (2008). Big data: How do your data grow? Nature 455(7209): 28–29.

    Article  Google Scholar 

  • Mans R, Schonenberg H, Leonardi G, Panzarasa S, Cavallini A, Quaglini S, Van Der Aalst W (2008). Process mining techniques: An application to stroke care. Studies in Health Technology and Informatics 136: 573–578.

    Google Scholar 

  • Mayer-Schönberger V, Cukier K (2013). “Big Data: A revolution that will transform how we live, work, and think”. Houghton Mifflin Harcourt, Boston, USA.

    Google Scholar 

  • MIT Critical Data (2016). Secondary Analysis of Electronic Health Records. Springer, Berlin, Germany.

    Book  Google Scholar 

  • Miller K (2012). Big data analytics in biomedical research. Biomedical Computation Review 2: 14–21.

    Google Scholar 

  • Niaksu O (2015). CRISP data mining methodology extension for medical domain. Baltic Journal of Modern Computing 3(2): 92.

    Google Scholar 

  • Perer A, Wang F, Hu J (2015). Mining and exploring care pathways from electronic medical records with visual analytics. Journal of Biomedical Informatics 56: 369–378.

    Article  Google Scholar 

  • Rebuge Á, Ferreira D R (2012). Business process analysis in healthcare environments: A methodology based on process mining. Information Systems 37(2):99–116.

    Article  Google Scholar 

  • Shi Y (2014). Big data: History, current status, and challenges going forward. The Bridge 44(4): 6–11.

    Google Scholar 

  • Shortliffe E H, Cimino J J (2006). Biomedical informatics: Computer applications in health care and biomedicine (3ed). Springer Science+Business Media, LLC, New York, USA.

    Book  Google Scholar 

  • Staff, S. (2011). Challenges and opportunities. Science 331(6018): 692–693.

    Article  Google Scholar 

  • Sun L, Chen G, Xiong H, Guo C (2017). Cluster analysis in data-driven management and decisions. Journal of Management Science and Engineering 2(4): 227–251.

    Article  Google Scholar 

  • Sun L, Guo C, Liu C, Xiong H (2017). Fast affinity propagation clustering based on incomplete similarity matrix. Knowledge and Information Systems 51(3): 941–963.

    Article  Google Scholar 

  • Sun L, Jin B, Yang H, Tong J, Liu C, Xiong H (2019). Unsupervised EEG feature extraction based on echo state network. Information Sciences 475: 1–17.

    Article  Google Scholar 

  • Sun L, Liu C, Guo C, Xiong H, Xie Y (2016). Data-driven automatic treatment regimen development and recommendation. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, USA, August 13–17, 2016.

    Google Scholar 

  • Tien J M, Goldschmidt-Clermont P J (2009). Healthcare: A complex service system. Journal of Systems Science and Systems Engineering 18(3): 257–282.

    Article  Google Scholar 

  • Topol E (2015). The Patient Will See You Now: The future of medicine is in your hands. Basic Books, New York, USA.

    Google Scholar 

  • Wang Y, Qian L, Li F, Zhang L (2018). A comparative study on shilling detection methods for trustworthy recommendations. Journal of Systems Science and Systems Engineering 27(4): 458–478.

    Article  Google Scholar 

  • Wei W, Guo C (2019). A text semantic topic discovery method based on the conditional co-occurrence degree. Neurocomputing 368: 11–24.

    Article  Google Scholar 

  • World Health Organization (2012). The Pursuit of Responsible Use of Medicines: Sharing and Learning from Country Experiences. WHO/EMP/MAR/2012.3. Geneva Switzerland: WHO.

    Google Scholar 

  • Wright A P, Wright A T, McCoy A B, Sittig D F (2015). The use of sequential pattern mining to predict next prescribed medications. Journal of Biomedical Informatics 53: 73–80.

    Article  Google Scholar 

  • Wu X, Chen H, Wu G, Liu J, Zheng Q, He X, Zhou A, Zhao Z, Wei B, Gao M, Li Y, Zhang Q, Zhang S, Lu R, Li Y (2015). Knowledge engineering with big data. IEEE Intelligent Systems 30(5): 46–55.

    Article  Google Scholar 

  • Xu N, Tang X (2018). Generating risk maps for evolution analysis of societal risk events. In International Symposium on Knowledge and Systems Sciences, Tokyo, Japan, Nov 25–27, 2018.

    Google Scholar 

  • Yadav P, Steinbach M, Kumar V, Simon G (2018). Mining electronic health records (EHRs): A survey. ACM Computing Surveys 50(6): 85.

    Article  Google Scholar 

  • Yang S, Dong X, Sun L, Zhou Y, Farneth RA, Xiong H, Burd RS, Marsic I (2017). A data-driven process recommender framework. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, Canada, August 13–17, 2017.

    Google Scholar 

Download references

Acknowledgments

The authors would like to thank the anonymous referees for their help to improve the quality of the paper. This research was supported in part by the National Natural Science Foundation of China under Grant Nos. 71771034 and 71421001, and Science and Technology Program of Jieyang under Grant No. 2017xm041.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chonghui Guo.

Additional information

The main contents of this paperwere originally presented as the invite keynote speech at the 19th International Symposium on Knolwdge and Systems Sciences in Tokyo during Nov 25–27, 2018.

Chonghui Guo is a professor of the Institute of Systems Engineering, Dalian University of Technology, Dalian, China. He received the B.S. degree in mathematics from Liaoning University in 1995, M.S. degree in operational research and control theory in 1999 and Ph.D. degree in management science and engineering from Dalian University of Technology in 2002. He was a postdoctoral research fellow in the Department of Computer Science in Tsinghua University, Beijing, China. His studies concentrate on data mining and knowledge discovery. He has published over 100 peer-reviewed papers in academic journals and conferences, besides 5 text-books and 2 monographs. He has been the Principal Investigator on over 10 research projects from the Government and the Industry.

Jingfeng Chen received the B.S. degree in mathematics from Henan University of Economics and Law in 2012, M.S. degree from Dongbei University of Finance and Economics in 2015, and Ph.D. degree from Institute of Systems Engineering, Dalian University of Technology, in 2019. His research interests include medical data mining, business intelligence. His papers have been published and presented on journals and conferences such as Journal of Biomedical Informatics, Health Policy and Technology, Soft Computing, Journal of Systems Science and Systems Engineering, and the 19th International Symposium on Knowledge and Systems Sciences (KSS2018).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Guo, C., Chen, J. Big Data Analytics in Healthcare: Data-Driven Methods for Typical Treatment Pattern Mining. J. Syst. Sci. Syst. Eng. 28, 694–714 (2019). https://doi.org/10.1007/s11518-019-5437-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11518-019-5437-5

Keywords

Navigation