A Data Utility Model for Data-Intensive Applications in Fog Computing Environments

Cappiello, Cinzia; Plebani, Pierluigi; Vitali, Monica

doi:10.1007/978-3-319-94890-4_9


Abstract

Sensors, smart devices, and wearables have been widely adopted in recent years, leading to the production of a vast amount of data that can be shared among several applications as input for their analyses. Data-intensive applications can benefit from these data, but only if the data are reliable and timely and fit the requirements of the application. Designing data-intensive applications requires a trade-off between the value obtained from analysing the data, which is affected by their quality and volume, and the performance of the analysis, which can be affected by delays in accessing the data and by the availability of the data source. In this chapter, we present a Data Utility model to assess the fitness of a data source with respect to its usage in a data-intensive application running in a Fog Computing environment. In this context, data are provided using a Data-as-a-Service (DaaS) approach, and both data storage and data processing can be placed on a cloud resource as well as on an edge device. The placement of a resource affects both the quality of service and the data quality. On this basis, the Data Utility model supports decisions on the deployment of data-intensive applications according to the impact of the task location, and on the selection of suitable data sources as input according to the application requirements. It takes into account that both tasks and data can be moved from the edge to the cloud, and vice versa, to improve the efficiency and effectiveness of applications.
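The trade-off described in the abstract can be made concrete with a small sketch. The following Python fragment is purely illustrative and is not the chapter's formulation: the `SourceProfile` fields, the linear weighting, the default weights, and the 500 ms latency bound are all assumptions chosen for the example. It scores a data source as seen from a given deployment location and picks the candidate with the highest score.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SourceProfile:
    """Hypothetical view of a data source from one task location."""
    name: str
    quality: float       # e.g. accuracy/completeness, normalised to [0, 1]
    timeliness: float    # freshness of the data, normalised to [0, 1]
    availability: float  # fraction of time the source is reachable, in [0, 1]
    latency_ms: float    # access delay observed from the task's location


def utility(profile, max_latency_ms=500.0, weights=(0.4, 0.2, 0.2, 0.2)):
    """Assumed weighted-sum utility; higher means a better fit.

    Latency is mapped to a [0, 1] score that reaches 0 at max_latency_ms.
    The weights (quality, timeliness, availability, latency) are
    illustrative and would come from the application requirements.
    """
    latency_score = max(0.0, 1.0 - profile.latency_ms / max_latency_ms)
    w_q, w_t, w_a, w_l = weights
    return (w_q * profile.quality
            + w_t * profile.timeliness
            + w_a * profile.availability
            + w_l * latency_score)


def best_source(profiles):
    """Return the candidate with the highest assumed utility."""
    return max(profiles, key=utility)


# Two made-up candidates: the same data served from an edge device
# and from a cloud resource.
edge = SourceProfile("edge-copy", quality=0.7, timeliness=0.9,
                     availability=0.9, latency_ms=50.0)
cloud = SourceProfile("cloud-copy", quality=0.9, timeliness=0.6,
                      availability=0.99, latency_ms=400.0)

chosen = best_source([edge, cloud])
```

With these invented numbers the edge copy wins because its lower access delay and fresher data outweigh the cloud copy's higher quality; shifting weight toward `quality` would reverse the choice, which is the kind of requirement-dependent decision the abstract refers to.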



Acknowledgements

This research was carried out within the framework of the DITAS project. The DITAS project receives funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement RIA 731945.

Author information

Authors and Affiliations

Dipartimento di Elettronica, Informazione e Bioingegneria (DEIB), Politecnico di Milano, Piazza Leonardo da Vinci 32, 20133, Milan, Italy
Cinzia Cappiello, Pierluigi Plebani & Monica Vitali


Corresponding author

Correspondence to Monica Vitali.

Editor information

Editors and Affiliations

Debesis Education, Derby, United Kingdom
Zaigham Mahmood


About this chapter

Cite this chapter

Cappiello, C., Plebani, P., Vitali, M. (2018). A Data Utility Model for Data-Intensive Applications in Fog Computing Environments. In: Mahmood, Z. (eds) Fog Computing. Springer, Cham. https://doi.org/10.1007/978-3-319-94890-4_9


DOI: https://doi.org/10.1007/978-3-319-94890-4_9
Published: 13 July 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-94889-8
Online ISBN: 978-3-319-94890-4
eBook Packages: Computer Science, Computer Science (R0)
