Abstract
Success of machine learning algorithms is usually dependent on a quality of a dataset they operate on. For datasets containing noisy, inadequate or irrelevant information these algorithms may produce less accurate results. Therefore a common pre-processing step in data mining domain is a selection of highly predictive attributes. In this case study we select subsets of attributes from medical data using filter feature selection algorithms. To validate the algorithms we induce decision rules from the selected subsets of attributes and compare classification accuracy on both training and test datasets. Additionally medical relevance of the selected attributes is checked with help of domain experts.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Rich, E., Knight, K.: Artificial Intelligence. McGraw-Hill Science, New York (1990)
Ghiselli, E.: Theory of Psychological Measurement. McGraw-Hill Book, New York (1964)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, Los Altos (1993)
Kononenko, I.: On Biases in Estimating Multi-Valued Attributes. In: International Joint Conference on Artificial Intelligence, Montreal, pp. 1034–1040 (1995)
Li, J., Cercone, N.: Introducing A Rule Importance Measure. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets V. LNCS, vol. 4100, pp. 171–194. Springer, Heidelberg (2006)
Quinlan, J.R.: Induction of Decision Trees. Mach. Learn., 81–106 (2003)
Hall, M.: Correlation-based Feature Selection for Machine Learning. Ph.D diss. Hamilton, NZ: Waikato University, Department of Computer Science (1998)
Kohavi, R., John, G.: Wrappers for Feature Subset Selection. Artif. Intell. 97, 273–324
Kohavi, R.: The Power of Decision Tables. In: Hsiang, J. (ed.) RTA 1995. LNCS, vol. 914, pp. 174–189. Springer, Heidelberg (1995)
John, G., Kohavi, R., Pfleger, K.: Irrelevant Features and the Subset Selection Problem. In: International Conference on Machine Learning, New Jersey, pp. 121–129 (1994)
Pawlak, Z.: Rough sets. International Journal of Computer and Information Science 11, 341–356 (1982)
Everitt, B.S.: The analysis of contingency tables. Chapman and Hall, London (1977)
Pawlak, Z.: Knowledge and Uncertainty: A Rough Set Approach. In: SOFTEKS Workshop on Incompleteness and Uncertainty in Information Systems, pp. 34–42 (1993)
Pawlak, Z., et al.: Rough Sets. Commun. ACM 38, 88–95 (1995)
Ilczuk, G., Wakulicz-Deja, A.: Rough Sets Approach to Medical Diagnosis System. In: Szczepaniak, P.S., Kacprzyk, J., Niewiadomski, A. (eds.) AWIC 2005. LNCS (LNAI), vol. 3528, pp. 204–210. Springer, Heidelberg (2005)
Ilczuk, G., Wakulicz-Deja, A.: Attribute Selection and Rule Generation Techniques for Medical Diagnosis Systems. In: Ślęzak, D., et al. (eds.) RSFDGrC 2005. LNCS (LNAI), vol. 3641, pp. 352–361. Springer, Heidelberg (2005)
Wakulicz-Deja, A., Paszek, P.: Applying Rough Set Theory to Multi Stage Medical Diagnosing. Fundam. Inform. 54, 387–408 (2003)
Grzymala-Busse, J.W.: MLEM2 - Discretization During Rule Induction. In: IIS 2003, Zakopane, pp. 499–508 (2003)
Ilczuk, G., et al.: Rough Sets Techniques for Medical Diagnosis Systems. In: Computers in Cardiology 2005, Lyon, pp. 837–840 (2005)
Mlynarski, R., et al.: Automated Decision Support and Guideline Verification in Clinical Practice. In: Computers in Cardiology 2005, Lyon, pp. 375–378 (2005)
Chan, C.C., Grzymala-Busse, J.W.: On the two local inductive algorithms: PRISM and LEM2. Foundations of Computing and Decision Sciences 19, 185–203 (1994)
Chan, C.C., Grzymala-Busse, J.W.: On the attribute redundancy and the learning programs ID3, PRISM, and LEM2. Department of Computer Science, University of Kansas, TR-91-14 (1991)
Grzymala-Busse, J.W.: A new version of the rule induction system LERS. Fundam. Inform. 31, 27–39 (1997)
Komorowski, H.J., et al.: Rough Sets: A Tutorial. Springer, Singapore (1999)
Paszek, P., Wakulicz-Deja, A.: The Application of Support Diagnose in Mitochondrial Encephalomyopathies. In: Alpigini, J.J., et al. (eds.) RSCTC 2002. LNCS (LNAI), vol. 2475, pp. 586–593. Springer, Heidelberg (2002)
Ziarko, W.: The discovery, analysis and representation of data dependencies in databases. In: Piatesky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases, MIT Press, Cambridge (1991)
Modrzejewski, M.: Feature selection using rough sets theory. In: Brazdil, P.B. (ed.) ECML 1993. LNCS, vol. 667, pp. 213–226. Springer, Heidelberg (1993)
Bazan, J., et al.: Rough set algorithms in classification problems. In: Polkowski, L., Lin, T.Y., Tsumoto, S. (eds.) Rough Set Methods and Applications: New Developments in Knowledge Discovery in Information Systems. Studies in Fuzziness and Soft Computing, vol. 56, pp. 49–88. Springer, Heidelberg (2000)
Polkowski, L., Lin, T.Y., Tsumoto, S. (eds.): Rough Set Methods and Applications: New Developments in Knowledge Discovery in Information Systems. Studies in Fuzziness and Soft Computing, vol. 56. Springer, Heidelberg (2000)
Pal, S.K., Polkowski, L., Skowron, A. (eds.): Rough-Neural Computing: Techniques for Computing with Words. Cognitive Technologies. Springer, Heidelberg (2004)
Wróblewski, J.: Adaptive aspects of combining approximation spaces. In: Pal, S.K., Polkowski, L., Skowron, A. (eds.) Rough-Neural Computing: Techniques for Computing with Words. Cognitive Technologies, pp. 139–156. Springer, Heidelberg (2004)
Bazan, J.G.: A comparison of dynamic and nondynamic rough set methods for extracting laws from decision tables. In: Polkowski, L., Skowron, A. (eds.) Rough Sets in Knowledge Discovery 1: Methodology and Applications. Studies in Fuzziness and Soft Computing, vol. 18, pp. 321–365. Physica, Heidelberg (1998)
Polkowski, L., Skowron, A. (eds.): Rough Sets in Knowledge Discovery 1: Methodology and Applications. Studies in Fuzziness and Soft Computing, vol. 18. Physica, Heidelberg (1998)
Pawlak, Z., Skowron, A.: Rudiments of rough sets. Information Sciences 177(1), 3–27 (2007)
Pawlak, Z., Skowron, A.: Rough sets: Some extensions. Information Sciences 177(1), 28–40 (2007)
Pawlak, Z., Skowron, A.: Rough sets and Boolean reasoning. Information Sciences 177(1), 41–73 (2007)
Swiniarski, R., Skowron, A.: Rough set methods in feature selection and extraction. Pattern Recognition Letters 24(6), 833–849 (2003)
Swiniarski, R.W., Skowron, A.: Independent component analysis, principal component analysis and rough sets in face recognition. In: Peters, J.F., et al. (eds.) Transactions on Rough Sets I. LNCS, vol. 3100, pp. 392–404. Springer, Heidelberg (2004)
Peters, J.F., et al. (eds.): Transactions on Rough Sets I. LNCS, vol. 3100. Springer, Heidelberg (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this chapter
Cite this chapter
Ilczuk, G., Wakulicz-Deja, A. (2007). Selection of Important Attributes for Medical Diagnosis Systems. In: Peters, J.F., Skowron, A., Marek, V.W., Orłowska, E., Słowiński, R., Ziarko, W. (eds) Transactions on Rough Sets VII. Lecture Notes in Computer Science, vol 4400. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71663-1_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-71663-1_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71662-4
Online ISBN: 978-3-540-71663-1
eBook Packages: Computer ScienceComputer Science (R0)