Abstract
The unbalanced of data is a common problem in many domains. Using unbalanced data for standard machine learning classifiers significantly affect the obtained performance. In this paper is presented a description of the problem and a review of the main alternatives to solve it. It is also proposed an alternative model, illustrating its application through a case of the medical field. The proposed model manages to get a balanced distribution of instances per class. It is based on the automatic selection of a subset of cases from the majority classes, using the natural groupings of these classes through self-organizing maps. The model is applied to the recognition of heartbeat types and the results are compared with others methods. The results show the feasibility of using this model to address this problem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Hadad, A.J., Evin, D., Drozdowicz, B. (2015). Preprocessing Unbalanced Data Set Based on Self-organizing Neural Networks. In: Braidot, A., Hadad, A. (eds) VI Latin American Congress on Biomedical Engineering CLAIB 2014, Paraná, Argentina 29, 30 & 31 October 2014. IFMBE Proceedings, vol 49. Springer, Cham. https://doi.org/10.1007/978-3-319-13117-7_198
Download citation
DOI: https://doi.org/10.1007/978-3-319-13117-7_198
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13116-0
Online ISBN: 978-3-319-13117-7
eBook Packages: EngineeringEngineering (R0)