Abstract
In the past decade, the amount of data in biological field has become larger and larger; Bio-techniques for analysis of biological data have been developed and new tools have been introduced. Several computational methods are based on unsupervised neural network algorithms that are widely used for multiple purposes including clustering and visualization, i.e. the Self Organizing Maps (SOM). Unfortunately, even though this method is unsupervised, the performances in terms of quality of result and learning speed are strongly dependent from the neuron weights initialization. In this paper we present a new initialization technique based on a totally connected undirected graph, that report relations among some intersting features of data input. Result of experimental tests, where the proposed algorithm is compared to the original initialization techniques, shows that our technique assures faster learning and better performance in terms of quantization error.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Berlin (2001)
Varsta, M.: Self organizing maps in sequence processing. Dissertation, Department of Electrical and Communications Engineering, Helsinki University of Technology (2002)
Vesanto, J., Alhoniemi, E.: Clustering of the Self-Organizing Map. J. IEEE-NN 11(3), 586–600 (2000)
Zupan, J., Gasteiger, J.: Neural Networks for Chemists: An Introduction, pp. 121–122. VCH, Weinheim (1993)
Manallack, D.T., Livingstone, D.J.: Neural networks in drug discovery: have they lived up to their promise? European Journal of Medicinal Chemistry 34, 95–208 (1999)
Nikkilä, J., Törönen, P., Kaski, S., Venna, J., Castrén, E., Wong, G.: Analysis and visualization of gene expression data using Self-Organizing Maps Neural Networks. Special issue on New Developments on Self-Organizing Maps 15(8-9), 953–966 (2002)
Fort, J., Letrémy, P., Cottrell, M.: Advantages and drawbacks of the Batch Kohonen algorithm. In: ESANN 2002, pp. 223–230 (2002)
Van Hulle, M.: Faithful Representations and Topographic Maps: From Distortion- to Information-Based Self-Organization. John Wiley, New York (2000)
Fiannaca, A., Di Fatta, G., Gaglio, S., Rizzo, R., Urso, A.: Improved SOM Learning using Simulated Annealing. In: de Sá, J.M., Alexandre, L.A., Duch, W., Mandic, D.P. (eds.) ICANN 2007. LNCS, vol. 4668, pp. 279–288. Springer, Heidelberg (2007)
Golub, G.H., Van Loan, C.F.: Matrix Computations, 3rd edn. Johns Hopkins University Press, Baltimore (1996)
Wall, M.E., Rechtsteiner, A., Rocha, L.M.: Singular value decomposition and principal component analysis. In: Berrar, D.P., Dubitzky, W., Granzow, M. (eds.) A Practical Approach to Microarray Data Analysis, pp. 91–109. Kluwer, Norwell (2003)
Di Fatta, G., Fiannaca, A., Rizzo, R., Urso, A., Berthold, M., Gaglio, S.: Context-Aware Visual Exploration of Molecular Datab. In: ICDM Workshops 2006. Sixth IEEE International Conference on Data Mining, Hong Kong, pp. 136–141 (2006)
Papadimitriou, C.H., Raghavan, P., Tamaki, H., Vempala, S.: Latent semantic indexing: A probabilistic analysis. In: Proc. 17th ACM Symp. on the Principles of Database Systems, pp. 159–168 (1998)
Cullum, J.K., Willoughby, R.A.: Real rectangular matrices. In: Lancozos algorithms for large symmetric eigenvalue computations, Theory, vol. 1. Birkhäuser, Boston (1985)
Holt, F.B.: Subspace Representations of Unstructured Text. In: IEEE ICDM Workshop on Text Mining (TextDM 2001), San Jose, California, USA (2001)
Mangasarian, O.L., Street, W.N., Wolberg, W.H.: Breast cancer diagnosis and prognosis via linear programming. Operations Research 43(4), 570–577 (1995)
National Cancer Institute, Self-organized map (som) of compounds tested in the nci anti-hiv screen, http://cactus.nci.nih.gov/services/somqsar/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fiannaca, A., Di Fatta, G., Urso, A., Rizzo, R., Gaglio, S. (2009). A New Linear Initialization in SOM for Biomolecular Data. In: Masulli, F., Tagliaferri, R., Verkhivker, G.M. (eds) Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2008. Lecture Notes in Computer Science(), vol 5488. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02504-4_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-02504-4_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02503-7
Online ISBN: 978-3-642-02504-4
eBook Packages: Computer ScienceComputer Science (R0)