Abstract
The analysis of clinical time series is currently a key topic in biostatistics and machine learning applications to medical research. The extraction of relevant features from longitudinal patients data brings several problems for which novel algorithms are warranted. It is usually impossible to measure many data points due to practical and also ethical restrictions, which leads to short time series (STS) data. The sampling might also be at unequally spaced time-points and many of the predicted measurements are often missing. These problems constitute the rationale of the present work, where we present two methods to deal with missing data in STS using fuzzy clustering analysis. The methods are tested and compared using data with equal and varying time sampling interval lengths, with and without missing data. The results illustrate the potential of these methods in clinical studies for patient classification and feature selection using biomarker time series data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cismondi, F., et al.: Missing data in medical databases: Impute, delete or classify? Artificial Intelligence in Medicine 58(1), 63–72 (2013)
Westhoff, P.G., et al.: An Easy Tool to Predict Survival in Patients Receiving Radiation Therapy for Painful Bone Metastases. International Journal of Radiation Oncology*Biology*Physics 90(4), 739–747 (2014)
Harries, M., et al.: Incidence of bone metastases and survival after a diagnosis of bone metastases in breast cancer patients. Cancer Epidemiology 38(4), 427–434 (2014)
Hathaway, R.J., Bezdek, J.C.: Fuzzy c-means clustering of incomplete data. IEEE Transactions on Systems, Man, and Cybernetics. Part B, Cybernetics : a Publication of the IEEE Systems, Man, and Cybernetics Society 31(5), 735–744 (2001)
Dixon, J.K.: Pattern recognition with partly missing data. IEEE Transactions on Systems, Man and Cybernetics 9(10), 617–621 (1979)
Warren Liao, T.: Clustering of time series data - A survey. Pattern Recognition 38(11), 1857–1874 (2005)
Möller-Levet, C.S., Klawonn, F., Cho, K.-H., Wolkenhauer, O.: Fuzzy clustering of short time-series and unevenly distributed sampling points. In: Berthold, M., Lenz, H.-J., Bradley, E., Kruse, R., Borgelt, C. (eds.) IDA 2003. LNCS, vol. 2810, pp. 330–340. Springer, Heidelberg (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Cruz, L.P., Vieira, S.M., Vinga, S. (2015). Fuzzy Clustering for Incomplete Short Time Series Data. In: Pereira, F., Machado, P., Costa, E., Cardoso, A. (eds) Progress in Artificial Intelligence. EPIA 2015. Lecture Notes in Computer Science(), vol 9273. Springer, Cham. https://doi.org/10.1007/978-3-319-23485-4_36
Download citation
DOI: https://doi.org/10.1007/978-3-319-23485-4_36
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23484-7
Online ISBN: 978-3-319-23485-4
eBook Packages: Computer ScienceComputer Science (R0)