Abstract
Comparing care trajectories helps improve health services. Medico-administrative databases are useful for automatically reconstructing the patients’ history of care. Care trajectories can be compared by determining their overlapping parts. This comparison relies on both semantically-rich representation formalism for care trajectories and an adequate similarity measure. The longest common subsequence (LCS) approach could have been appropriate if representing complex care trajectories as simple sequences was expressive enough. Furthermore, by failing to take into account similarities between different but semantically close medical events, the LCS overestimates differences. We propose a generalization of the LCS to a more expressive representation of care trajectories as sequences of sets. A set represents a medical episode composed by one or several medical events, such as diagnosis, drug prescription or medical procedures. Moreover, we propose to take events’ semantic similarity into account for comparing medical episodes. To assess our approach, we applied the method on a care trajectories’ sample from patients who underwent a surgical act among three kinds of acts. The formalism reduced calculation time, and introducing semantic similarity made the three groups more homogeneous.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Tuppin, P., de Roquefeuil, L., Weill, A., Ricordeau, P., Merlière, Y.: French national health insurance information system and the permanent beneficiaries sample. Rev. Epidemiol. Sante Publique 58, 286–290 (2010)
Moulis, G., Lapeyre-Mestre, M., Palmaro, A., Pugnet, G., Montastruc, J.-L., Sailler, L.: French health insurance databases: what interest for medical research? Rev. Med. Interne 36, 411–417 (2015)
Jay, N., Nuemi, G., Gadreau, M., Quantin, C.: A data mining approach for grouping and analyzing trajectories of care using claim data: the example of breast cancer. BMC Med. Inform. Decis. Mak. 13, 130 (2013)
Le Meur, N., Gao, F., Bayat, S.: Mining care trajectories using health administrative information systems: the use of state sequence analysis to assess disparities in prenatal care consumption. BMC Health Serv. Res. 15, 200 (2015)
Pesquita, C., Faria, D., Falcão, A.O., Lord, P., Couto, F.M.: Semantic similarity in biomedical ontologies. PLoS Comput. Biol. 5, e1000443 (2009)
Girardi, D., Wartner, S., Halmerbauer, G., Ehrenmüller, M., Kosorus, H., Dreiseitl, S.: Using concept hierarchies to improve calculation of patient similarity. J. Biomed. Inform. 63, 66–73 (2016)
Studer, M., Ritschard, G.: A comparative review of sequence dissimilarity measures. LIVES Work. Pap. 2014, 1–47 (2014)
Hirschberg, D.S.: A linear space algorithm for computing maximal common subsequences. Commun. ACM 18, 341–343 (1975)
Bellman, R.: The theory of dynamic programming. Bull. Amer. Math. Soc. 60, 503–515 (1954)
Wu, Z., Palmer, M.: Verbs semantics and lexical selection. In: Presented at the Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics, 27 June 1994
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Rivault, Y., Le Meur, N., Dameron, O. (2017). A Similarity Measure Based on Care Trajectories as Sequences of Sets. In: ten Teije, A., Popow, C., Holmes, J., Sacchi, L. (eds) Artificial Intelligence in Medicine. AIME 2017. Lecture Notes in Computer Science(), vol 10259. Springer, Cham. https://doi.org/10.1007/978-3-319-59758-4_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-59758-4_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59757-7
Online ISBN: 978-3-319-59758-4
eBook Packages: Computer ScienceComputer Science (R0)