Abstract
Sensors, smart devices , and wearables have been widely adopted in recent years, bringing to the production of a vast amount of data which can be shared among several applications as input for their analysis. Data-intensive applications can benefit from these data but only if data are reliable and timely, and if they fit the requirements of the application. Designing data-intensive applications requires a trade-off between the value obtained by the analysis of the data, which is affected by their quality and volume, and the performance of the analysis that can be affected by delays in accessing the data and availability of the data source . In this chapter, we present a Data Utility model to assess the fitness of a data source with respect to its usage in a data-intensive application running in a Fog Computing environment. In this context, data are provided using a Data-as-a-Service (DaaS) approach, and both data storage and data processing can be placed in a cloud resource as well as in an edge device. The placement of a resource affects the quality of the service and the data quality as well. On this basis, the Data Utility model provides a support for making decisions on the deployment of data-intensive applications according to the impact of the task location, and on the selection of proper data sources as input for the application according to the application requirements, taking into consideration that both tasks and data can be moved from the edge to the cloud, and vice versa, to improve the efficiency and the effectiveness of applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Borthakur D (2008) HDFS architecture guide. Hadoop Apache Proj
Pokorny J (2013) NoSQL databases: a step to database scalability in web environment. Int J Web Inform Syst 9(1):69–82 (Mar 29)
Chodorow K (2013) MongoDB: the definitive guide: powerful and scalable data storage. O’Reilly Media, Inc
Cassandra A (2018) http://cassandra.apache.org (last accessed 26 Jan 2018)
Bhandarkar M (2010) MapReduce programming with apache Hadoop. In: IEEE international symposium on Parallel and distributed processing (IPDPS), 2010 19 Apr 2010, pp 1–1
AWS Lambda (2018) https://aws.amazon.com/lambda/ (last accessed 26 Jan 2018)
Shi W, Dustdar S (2016) The promise of edge computing. Computer 49(5):78–81
Bonomi F, Milito R, Zhu J, Addepalli S (2012) Fog computing and its role in the internet of things. In Proceedings of the first edition of the MCC workshop on 
mobile cloud computing. MCC ‘12, pp 13–16
OpenFog Consortium Architecture Working Group, OpenFog Architecture Overview (February 2016). http://www.openfogconsortium.org/ra

Plebani P, Garcia-Perez D, Anderson M, Bermbach D, Cappiello C, Kat RI, Pallas F, Pernici B, Tai S, Vitali M (2017) Information logistics and fog computing: The DITAS approach. In Proceedings of the forum and doctoral consortium papers presented at the 29th international conference on advanced information systems engineering, CAISE 2017, vol-1848. Essen, Germany. CEUR, pp 129–136
Cappiello C, Pernici B, Plebani P, Vitali M (2017) Utility-driven data management for data-intensive applications in fog environments. In: International conference on conceptual modeling. Springer, Cham, pp 216–226
Kock N (2007) Encyclopedia of E-collaboration. Imprint of IGI Publishing, Hershey, PA, Information Science Reference
Syed MR, Syed SN (2008) Handbook of research on modern systems analysis and design technologies and applications. Imprint of IGI Publishing, Hershey, PA, Information Science Reference
Hundepool A, Domingo-Ferrer J, Franconi L, Giessing S, Nordholt ES, Spicer K, de Wolf PP (2012) Statistical disclosure control. Wiley
Weiss GM, Zadrozny B, Saar-Tsechansky M (2008) Guest editorial: special issue on utility-based data mining. Data Min Knowl Discov 17(2):129–135
Lin YC, Wu CW, TsengVS (2015) Mining high utility itemsets in big data. Springer International Publishing, Cham, pp 
649–661
Ives B, Olson MH, Baroudi JJ (1983) The measurement of user information satisfaction. Commun ACM 26(10):785–793
Wang RY, Strong DM (1996) Beyond accuracy: what data quality means to data consumers. J of Manage Inf Syst 12(4):5–33
Ho TTN, Pernici B (2015) A data-value-driven adaptation framework for energy 
efficiency for data intensive applications in clouds. In: 2015 IEEE conference on technologies for sustainability (SusTech), pp 47–52
Moody D, Walsh P (1999) Measuring the value of information: an asset valuation 
approach. In: European conference on information systems
Even A, Shankaranarayanan G, Berger PD (2010) Inequality in the utility of customer data: implications for data management and usage. J Database Mark Customer Strategy Manage 17(1):19–35
Gharib M, Giorgini P, Mylopoulos J (2016) Analysis of information quality requirements in business processes, revisited. Requirements Eng 1–23
D’Andria F, Field D, Kopaneli A, Kousiouris G, Garcia-Perez D, Pernici B, Plebani P (2015) Data movement in the internet of things domain. In Proceedings of European conference on service oriented and cloud computing, ESOCC 2015. pp 243–252
Gomez A, Merseguer J, Di Nitto E, Tamburri DA (2016) Towards a UML profile for 
data intensive applications. In: Proceedings of the 1st international workshop on quality-aware DevOps. 
Saarbrücken, Germany, pp 18–23
Nalchigar S, Yu E, Ramani R (2016) A conceptual modeling framework for business analytics. Springer International Publishing, Cham, pp. 35–49
Distributed Management Task Force Inc. Common Information Model (DMTF-CIM). https://www.dmtf.org/standards/cim
Cleve A, Brogneaux AF, Hainaut JL (2010) A conceptual approach to database applications evolution. Springer Berlin Heidelberg
Batini C, Scannapieco M (2016) Data and information quality-dimensions. Principles and Techniques, Data-Centric Systems and Applications, Springer
Garijo D, Alper P, Belhajjame K, Corcho O, Gil Y, Goble CA (2014) Common motifs in scientific workflows: an empirical analysis. Future Generation Comput Syst 36:338–351
Acknowledgements
This research has been developed in the framework of the DITAS project. DITAS project receives funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement RIA 731945.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Cappiello, C., Plebani, P., Vitali, M. (2018). A Data Utility Model for Data-Intensive Applications in Fog Computing Environments. In: Mahmood, Z. (eds) Fog Computing. Springer, Cham. https://doi.org/10.1007/978-3-319-94890-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-94890-4_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-94889-8
Online ISBN: 978-3-319-94890-4
eBook Packages: Computer ScienceComputer Science (R0)