Skip to main content

Enabling the Definition and Reuse of Multi-Domain Workflow-Based Data Analysis

  • Conference paper
  • First Online:
Intelligent Systems Design and Applications (ISDA 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 557))

  • 1628 Accesses

Abstract

Data analysis applications have become essential to extract significant insight from heterogeneous data sources. However, their development requires technical expertise in computer science techniques like data mining, making its broad adoption by non-experts difficult. In this context, workflows have emerged as a high-level solution to define and automate the sequence of steps involved in the data analysis process, hiding the low-level computational requirements. Existing workflow systems have some difficulties related to their complexity to adapt the provided elements and their inability to reuse workflow definitions. To address these problems, a novel framework for creating customized, ready-to-use and interoperable workflow systems is proposed and prototyped in this paper. Its multi-layer architecture has been designed on the basis of the separation of concerns and the reuse of knowledge assets. As a result, the presented approach allows reducing the time-to-market and saving development costs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Terminology & glossary. Technical report, WFMC-TC-1011, Workflow Management Coalition (1999)

    Google Scholar 

  2. Berthold, M.R., et al.: KNIME: the Konstanz information miner. In: Preisach, C., Burkhardt, H., Schmidt-Thieme, L., Decker, R. (eds.) Data Analysis, Machine Learning and Applications. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin (2007)

    Google Scholar 

  3. Elmroth, E., Hernández, F., Tordsson, J.: Three fundamental dimensions of scientific workflow interoperability: model of computation, language, and execution environment. Future Gener. Comput. Syst. 26(2), 245–256 (2010)

    Article  Google Scholar 

  4. Fowler, M.: Domain Specific Languages, 1st edn. Addison-Wesley, Boston (2010)

    Google Scholar 

  5. Hofmann, M., Klinkenberg, R.: RapidMiner: Data Mining Use Cases and Business Analytics Applications. Chapman & Hall/CRC, Boca Raton (2013)

    Google Scholar 

  6. Liu, J., Pacitti, E., Valduriez, P., Mattoso, M.: A survey of data-intensive scientific workflow management. J. Grid Comput. 13(4), 457–493 (2015)

    Article  Google Scholar 

  7. Loukides, M.: What is Data Science?. O’Reilly Radar, Sebastopol (2010)

    Google Scholar 

  8. Recker, J., Rosa, M.L.: Understanding user differences in open-source workflow management system usage intentions. Inf. Syst. 37(3), 200–212 (2012)

    Article  Google Scholar 

  9. Roure, D.D., Goble, C., Bhagat, J., Cruickshank, D., Goderis, A., Michaelides, D., Newman, D.: myExperiment: defining the social virtual research environment. In: 4th IEEE International Conference on e-Science, pp. 182–189. IEEE Press (2008)

    Google Scholar 

  10. Schmidt, D.C.: Guest editor’s introduction: model-driven engineering. Computer 39, 25–31 (2006)

    Article  Google Scholar 

  11. Weske, M., van der Aalst, W., Verbeek, H.: Advances in business process management. Data Knowl. Eng. 50(1), 1–8 (2004)

    Article  Google Scholar 

  12. Wolstencroft, K., Haines, R., Fellows, D., Williams, A., Withers, D., Owen, S., Soiland-Reyes, S., Dunlop, I., Nenadic, A., Fisher, P., Bhagat, J., Belhajjame, K., Bacall, F., Hardisty, A., de la Hidalga, A.N., Balcazar Vargas, M.P., Sufi, S., Goble, C.: The taverna workflow suite: designing and executing workflows of web services on the desktop, web or in the cloud. Nucleic Acids Research 41(W1), W557–W561 (2013)

    Article  Google Scholar 

  13. Yu, J., Buyya, R.: A taxonomy of workflow management systems for grid computing. J. Grid Comput. 3(3), 171–200 (2006)

    Google Scholar 

Download references

Acknowledgments

Work supported by the Spanish Government, project TIN2014-55252-P.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rubén Salado-Cid .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Salado-Cid, R., Romero, J.R. (2017). Enabling the Definition and Reuse of Multi-Domain Workflow-Based Data Analysis. In: Madureira, A., Abraham, A., Gamboa, D., Novais, P. (eds) Intelligent Systems Design and Applications. ISDA 2016. Advances in Intelligent Systems and Computing, vol 557. Springer, Cham. https://doi.org/10.1007/978-3-319-53480-0_68

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-53480-0_68

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-53479-4

  • Online ISBN: 978-3-319-53480-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics