Abstract
With more than 200 different types of “-omic” data [1] spanning from submolecular, through molecular, cell, cell-systems, tissues, organs, phenotypes, gene-environment interactions, and ending at ecology and organism communities, the problem and complexity of bioscientific data processing has never been greater. Often data are generated in high-throughput studies with the aim to have a sufficient volume to find patterns and detect rare events. For these highthroughput approaches new methods have to be developed in order to assure integrity of the volume of data that is produced. At the same time efforts to integrate these widely-varying data types are underway in research fields such as systems biology. Systems-level research requires yet additional methodologies to pipeline, process, query, and interpret data, and such pipelines are, themselves, objects of scientific value if they can be re-used or re-purposed by other researchers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
McDonald, D., Clemente, J., Kuczynski, J., Rideout, J., Stombaugh, J., Wendel, D., Wilke, A., Huse, S., Hufnagle, J., Meyer, F., Knight, R., Caporaso, J.: The Biological Observation Matrix (BIOM) format or: how I learned to stop worrying and love the ome-ome. GigaScience 1(1), 7 (2012)
Stoop, E., Schipper, T., Rosendahl Huber, S., Nezhinsky, A., Verbeek, F., Gurcha, S., Besra, G., Vandenbroucke-Grauls, C., Bitter, W., van der Sar, A.: Zebrafish embryo screen for mycobacterial genes involved in the initiation of granuloma formation reveals a newly identified ESX-1 component. Disease Model Mechanisms, 526–536 (2011)
Nezhinsky, A.E., Verbeek, F.J.: Pattern Recognition for High Throughput Zebrafish Imaging Using Genetic Algorithm Optimization. In: Dijkstra, T.M.H., Tsivtsivadze, E., Marchiori, E., Heskes, T. (eds.) PRIB 2010. LNCS, vol. 6282, pp. 301–312. Springer, Heidelberg (2010)
Larios, E., Zhang, Y., Yan, K., Di, Z., LeDévédec, S., Groffen, F., Verbeek, F.J.: Automation in Cytomics: A Modern RDBMS Based Platform for Image Analysis and Management in High-Throughput Screening Experiments. In: He, J., Liu, X., Krupinski, E.A., Xu, G. (eds.) HIS 2012. LNCS, vol. 7231, pp. 76–87. Springer, Heidelberg (2012)
LeDévédec, S., Yan, K., de Bont, H., Ghotra, V., Truong, H., Danen, E., Verbeek, F., van de Water, B.: A Systems Microscopy Approach to Understand Cancer Cell Migration and Metastasis. Journal Cellular and Molecular Life Sciences 67(19), 3219–3240 (2010)
Yan, K., Larios, E., LeDevedec, S., van de Water, B., Verbeek, F.J.: Automation in Cytomics: Systematic Solution for Image Analysis and Management in High Throughput Sequences. In: Proceedings IEEE Conf. Engineering and Technology (CET 2011), vol. 7 (2011)
Hawrylycz, M., Baldock, R.A., Burger, A., Hashikawa, T., Johnson, G.A., Martone, M., Ng, L., Lau, C., Larsen, S.D., Nissanov, J., Puelles, L., Ruffins, S., Verbeek, F., Zaslavsky, I., Boline, J.: Digital Atlasing and Standardization in the Mouse Brain. PLoS Comput. Biol. 7(2), e1001065+ (2011)
Belmamoune, M., Potikanond, D., Verbeek, F.: Mining and analysing spatio-temporal patterns of gene expression in an integrative database framework. Journal of Integrative Bioinformatics 7(3)(128), 1–10 (2010)
Verbeek, F., Boon, P., Sloetjes, H., van der Velde, R., de Vos, N.: Visualization of complex data sets over Internet: 2D and 3D visualization of the 3D digital atlas of zebrafish development. In: Proc. SPIE 4672, Internet Imaging III, pp. 20–29 (2002)
Slob, J., Kallergi, A., Verbeek, F.J.: Observations on Semantic Annotation of Microscope Images for Life Sciences. In: Marshall, M.S., Burger, A., Romano, P., Paschke, A., Splendiani, A. (eds.) SWAT4LS. CEUR Workshop Proceedings, vol. 559, CEUR-WS.org (2009)
Sanchez-Faddeev, H., Emmerich, M.T., Verbeek, F.J., Henry, A.H., Grimshaw, S., Spaink, H.P., van Vlijmen, H.W., Bender, A.: Using Multiobjective Optimization and Energy Minimization to Design an Isoform-Selective Ligand of the 14-3-3 Protein. In: Margaria, T., Steffen, B. (eds.) ISoLA 2012, Part II. LNCS, vol. 7610, pp. 12–24. Springer, Heidelberg (2012)
Yan, K., Verbeek, F.J.: Segmentation for High-throughput Image Analysis: Watershed Masked Clustering. In: Margaria, T., Steffen, B. (eds.) ISoLA 2012, Part II. LNCS, vol. 7610, pp. 25–41. Springer, Heidelberg (2012)
Nezhinsky, A.E., Verbeek, F.J.: Efficient and Robust Shape Retrieval from Deformable Templates. In: Margaria, T., Steffen, B. (eds.) ISoLA 2012, Part II. LNCS, vol. 7610, pp. 42–55. Springer, Heidelberg (2012)
Taylor, I.: Workflows for E-Science: Scientific Workflows for Grids. Springer (2007)
Wikipedia: Bioinformatics workflow management systems — Wikipedia, The Free Encyclopedia (2012) (Online; last accessed June 25, 2012)
Kok, J.N., Lamprecht, A.-L., Wilkinson, M.D.: Tools in Scientific Workflow Composition. In: Margaria, T., Steffen, B. (eds.) ISoLA 2010, Part I. LNCS, vol. 6415, pp. 258–260. Springer, Heidelberg (2010)
Chen, L., Shadbolt, N.R., Goble, C.A., Tao, F., Cox, S.J., Puleston, C., Smart, P.R.: Towards a Knowledge-Based Approach to Semantic Service Composition. In: Fensel, D., Sycara, K., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, pp. 319–334. Springer, Heidelberg (2003)
Lord, P., Bechhofer, S., Wilkinson, M.D., Schiltz, G., Gessler, D., Hull, D., Goble, C.A., Stein, L.: Applying Semantic Web Services to Bioinformatics: Experiences Gained, Lessons Learnt. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, pp. 350–364. Springer, Heidelberg (2004)
Lamprecht, A.L., Margaria, T., Steffen, B.: Bio-jETI: a framework for semantics-based service composition. BMC Bioinformatics 10(suppl. 10), S8 (2009)
Lamprecht, A.L., Naujokat, S., Margaria, T., Steffen, B.: Semantics-based composition of EMBOSS services. Biomedical Semantics 2(suppl. 1), S5 (2011)
Lamprecht, A.L., Naujokat, S., Steffen, B., Margaria, T.: Constraint-Guided Workflow Composition Based on the EDAM Ontology. In: Burger, A., Marshall, M.S., Romano, P., Paschke, A., Splendiani, A. (eds.) Proceedings of the 3rd Workshop on Semantic Web Applications and Tools for Life Sciences (SWAT4LS 2010), vol. 698, CEUR Workshop Proceedings (December 2010)
Martín-Requena, V., Ríos, J., García, M., Ramírez, S., Trelles, O.: jORCA: easily integrating bioinformatics Web Services. Bioinformatics 26(4), 553–559 (2010)
Karlsson, J., Martín-Requena, V., Ríos, J., Trelles, O.: Workflow Composition and Enactment Using jORCA. In: Margaria, T., Steffen, B. (eds.) ISoLA 2010, Part I. LNCS, vol. 6415, pp. 328–339. Springer, Heidelberg (2010)
Wilkinson, M.D., Vandervalk, B., McCarthy, L.: SADI Semantic Web Services - ’cause you can’t always GET what you want! In: Proceedings of the IEEE Services Computing Conference, APSCC 2009, December 7-11, pp. 13–18. IEEE Asia-Pacific, Singapore (2009)
Vandervalk, B.P., McCarthy, E.L., Wilkinson, M.D.: SHARE: A Semantic Web Query Engine for Bioinformatics. In: Gómez-Pérez, A., Yu, Y., Ding, Y. (eds.) ASWC 2009. LNCS, vol. 5926, pp. 367–369. Springer, Heidelberg (2009)
Gil, Y., Ratnakar, V., Deelman, E., Mehta, G., Kim, J.: Wings for Pegasus: creating large-scale scientific applications using semantic representations of computational workflows. In: Proceedings of the 19th National Conference on Innovative Applications of Artificial Intelligence, vol. 2, pp. 1767–1774. AAAI Press (2007)
Deelman, E., Singh, G., Hui Su, M., Blythe, J., Gil, A., Kesselman, C., Mehta, G., Vahi, K., Berriman, G.B., Good, J., Laity, A., Jacob, J.C., Katz, D.S.: Pegasus: a framework for mapping complex scientific workflows onto distributed systems. Scientific Programming Journal 13, 219–237 (2005)
Withers, D., Kawas, E., McCarthy, L., Vandervalk, B., Wilkinson, M.: Semantically-Guided Workflow Construction in Taverna: The SADI and BioMoby Plug-Ins. In: Margaria, T., Steffen, B. (eds.) ISoLA 2010, Part I. LNCS, vol. 6415, pp. 301–312. Springer, Heidelberg (2010)
Wood, I., Vandervalk, B., McCarthy, L., Wilkinson, M.D.: OWL-DL Domain Models as Abstract Workflows. In: Margaria, T., Steffen, B. (eds.) ISoLA 2012, Part II. LNCS, vol. 7610, pp. 56–66. Springer, Heidelberg (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kok, J., Lamprecht, AL., Verbeek, F.J., Wilkinson, M.D. (2012). Bioscientific Data Processing and Modeling. In: Margaria, T., Steffen, B. (eds) Leveraging Applications of Formal Methods, Verification and Validation. Applications and Case Studies. ISoLA 2012. Lecture Notes in Computer Science, vol 7610. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34032-1_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-34032-1_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34031-4
Online ISBN: 978-3-642-34032-1
eBook Packages: Computer ScienceComputer Science (R0)