ABSTRACT
Developments in big data have led to an increase in data analytics projects conducted by organizations. Such projects aim to create value by improving decision making or enhancing business processes. However, many data analytics projects still fail to deliver the expected value. The use of process models or methodologies is recommended to increase the success rate of these projects. Nevertheless, organizations are hardly using them because they are considered too rigid and hard to implement. The existing methodologies often do not fit the specific project characteristics. Therefore, this research suggests grouping different project characteristics to identify the most appropriate project methodology for a specific type of project. More specifically, this research provides a structured description that helps to determine what type of project methodology works for different types of data analytics projects. The results of six different case studies show that continuous projects would benefit from an iterative methodology.
- V. Grover, R. H. L. Chiang, T. Liang, and D. Zhang, "Creating Strategic Business Value from Big Data Analytics: A Research Framework," J. Manag. Inf. Syst., vol. 35, no. 2, pp. 388--423, 2018.Google ScholarCross Ref
- B. H. Wixom, B. Yen, and M. Relich, "Maximizing Value from Business Analytics," MISQ Exec., vol. 12, no. 2, pp. 111--123, 2013.Google Scholar
- G. Mariscal, Ó. Marbán, and C. Fernández, "A survey of data mining and knowledge discovery process models and methodologies," Knowl. Eng. Rev., vol. 25, no. 2, pp. 137--166, 2010.Google ScholarDigital Library
- J. S. Saltz and I. Shamshurin, "Big data team process methodologies: A literature review and the identification of key factors for a project's success," in Proceedings - 2016 IEEE International Conference on Big Data, 2016, pp. 2872--2879.Google Scholar
- J. Baijens and R. W. Helms, "Developments in Knowledge Discovery Processes and Methodologies: Anything New ?," in Twenty-fifth Americas Conference on Information Systems, 2019, pp. 1 -10.Google Scholar
- D. Larson and V. Chang, "A review and future direction of agile, business intelligence, analytics and data science," Int. J. Inf. Manage., vol. 36, no. 5, pp. 700--710, 2016.Google ScholarDigital Library
- C. Dremel, M. M. Herterich, J. Wulf, J.-C. Waizmann, and W. Brenner, "How Audi AG established big data analytics in its digital transformation," MIS Q. Exec., vol. 16, no. 2, pp. 81--100, 2017.Google Scholar
- F. K. Y. Chan and J. Y. L. Thong, "Acceptance of agile methodologies: A critical review and conceptual framework," Decis. Support Syst., vol. 46, no. 4, pp. 803--814, 2009.Google ScholarDigital Library
- J. Gao, A. Koronios, and S. Selle, "Towards a process view on critical success factors in Big Data analytics projects," 2015 Am. Conf. Inf. Syst. AMCIS 2015, pp. 1--14, 2015.Google Scholar
- H.-M. Chen, R. Schütz, R. Kazman, and F. Matthes, "How Lufthansa Capitalized on Big Data for Business Model Renovation," MIS Q. Exec., vol. 16, no. 1, pp. 299--320, 2017.Google Scholar
- J. S. Saltz, D. Wild, N. Hotz, and K. Stirling, "Exploring Project Management Methodologies Used Within Data Science Teams," in Twenty-fourth Americas Conference on Information Systems, New Orleans, 2018, 2018, pp. 1--5.Google Scholar
- J. Saltz, I. Shamshurin, and C. Connors, "Predicting data science sociotechnical execution challenges by categorizing data science projects," J. Assoc. Inf. Sci. Technol., vol. 68, no. 12, pp. 2720--2728, 2017.Google ScholarDigital Library
- Y. Li, M. A. Thomas, and K. M. Osei-Bryson, "A snail shell process model for knowledge discovery via data analytics," Decis. Support Syst., vol. 91, pp. 1--12, Nov. 2016.Google ScholarDigital Library
- J. Gao, A. Koronios, and S. Selle, "Towards a process view on critical success factors in Big Data analytics projects," in 2015 Americas Conference on Information Systems, AMCIS 2015, 2015.Google Scholar
- U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth, "From Data Mining to Knowledge Discovery in Databases," AI Mag., vol. 17, no. 3, p. 37, 1996.Google ScholarDigital Library
- P. Chapman et al., "Crisp-Dm 1.0," 2000.Google Scholar
- C. Schmidt and W. N. Sun, "Synthesizing Agile and Knowledge Discovery: Case Study Results," J. Comput. Inf. Syst., vol. 58, no. 2, pp. 142--150, 2018.Google ScholarCross Ref
- S. Ahangama and D. C. C. Poo, "Designing a Process Model for Health Analytic Projects," in PACIS 2015 Proceedings. 3., 2015.Google Scholar
- S. Sharma, K.-M. Osei-Bryson, and G. M. Kasper, "Evaluation of an integrated Knowledge Discovery and Data Mining process model," Expert Syst. Appl., vol. 39, no. 13, pp. 11335--11348, 2012.Google ScholarDigital Library
- O. Marbán, J. Segovia, E. Menasalvas, and C. Fernández-Baizán, "Toward data mining engineering: A software engineering approach," Inf. Syst., vol. 34, no. 1, pp. 87--107, 2009.Google ScholarDigital Library
- S. Angee, "Towards an Improved ASUM-DM Process Methodology for Cross-Disciplinary Multi-organization Big Data & Analytics Projects," in International Conference on Knowledge Management in Organizations, 2018, vol. 877, pp. 613--624.Google ScholarCross Ref
- S. Ahangama and D. C. C. Poo, "Unified Structured Process for Health Analytics," Int. J. Medical, Heal. Biomed. Bioeng. Pharm. Eng., vol. 8, no. 11, pp. 768--776, 2014.Google Scholar
- G. S. do Nascimento and A. A. de Oliveira, "An Agile Knowledge Discovery in Databases Software Process," in The Second International Conference on Advances in Information Mining and Management compliance, 2012, pp. 343--351.Google Scholar
- J. Baijens, R. Helms, and D. Iren, "Applying Scrum in Data Science Projects," in IEEE 22nd Conference on Business Informatics (CBI), 2020, pp. 30--38.Google Scholar
- J. S. Saltz, I. Shamshurin, and K. Crowston, "Comparing Data Science Project Management Methodologies via a Controlled Experiment," in Proceedings of the 50th Hawaii International Conference on System Sciences, 2017, pp. 1013 -1022.Google Scholar
- L. Williams, "Agile Software Development Methodologies and Practices," Adv. Comput., vol. 80, pp. 1--44, 2010.Google ScholarCross Ref
- N. W. Grady, J. A. Payne, and H. Parker, "Agile big data analytics: AnalyticsOps for data science," in Proceedings 2017 IEEE International Conference on Big Data, 2017, pp. 2331--2339.Google Scholar
- J. S. Saltz and A. Sutherland, "SKI: An Agile Framework for Data Science," in 2019 IEEE International Conference on Big Data (Big Data), 2019, pp. 3468--3476.Google Scholar
- J. S. Saltz, R. Heckman, and I. Shamshurin, "Exploring How Different Project Management Methodologies Impact Data Science Students," in Twenty-Fifth European Conference on Information Systems (ECIS), Guimarães, Portugal, 2017, pp. 2939--2948.Google Scholar
- F. Martínez-Plumed et al., "CRISP-DM Twenty Years Later: From Data Mining Processes to Data Science Trajectories," in IEEE Transactions on Knowledge and Data Engineering, 2019, pp. 1--14.Google Scholar
- M. Das, R. Cui, D. R. Campbell, G. Agrawal, and R. Ramnath, "Towards methods for systematic research on big data," Proc. - 2015 IEEE Int. Conf. Big Data, IEEE Big Data 2015, pp. 2072--2081, 2015.Google Scholar
- S. Viaene and A. Van den Bunder, "The secrets to managing business analytics projects," MIT Sloan Manag. Rev., vol. 53, no. 1, pp. 65--69, 2011.Google Scholar
- F. Provost and T. Fawcett, "Data Science for Business," Book, 2013.Google ScholarDigital Library
- M. H. Jensen, P. A. Nielsen, and J. S. Persson, "Managing Big Data Analytics Projects: The Challenges of Realizing Value," Proc. 27th Eur. Conf. Inf. Syst., no. May, pp. 0--15, 2019.Google Scholar
- W. Ayele, "Adapting CRISP-DM for Idea Mining," Int. J. Adv. Comput. Sci. Appl., vol. 11, no. 6, pp. 20--32, 2020.Google Scholar
- D. J. Power, C. Heavin, J. McDermott, and M. Daly, "Defining business analytics: an empirical approach," J. Bus. Anal., vol. 1, no. 1, pp. 40--53, 2018.Google ScholarCross Ref
- P. Darke, G. Shanks, and M. Broadbent, "Successfully completing case study research: combining rigour, relevance and pragmatism," Inf. Syst. J., vol. 8, no. 4, pp. 273--289, 1998.Google ScholarCross Ref
- R. K. Yin, "Robert K. Yin. (2014). Case Study Research Design and Methods (5th ed.). Thousand Oaks, CA: Sage. 282 pages." Can. J. Progr. Eval., no. March 2016, pp. 1--5, 2018.Google Scholar
- R. k Yin, Case study research and applications: Design and methods. SAGE Publications, 2017.Google Scholar
- J. Dul and T. Hak, Case Study Methodology in Business Research. 2008.Google Scholar
- M. Saunders, P. Lewis, and A. Thornhill, Research Methods for Business Students. Pearson Education LTD, 2009.Google Scholar
- A. L. Strauss, Qualitative Analysis for Social Scientists. Cambridge university press, 1987.Google ScholarCross Ref
- T. J. Burant, C. Gray, E. Ndaw, V. McKinney-Keys, and G. Allen, "The Rhythms of a Teacher Research Group," Multicult. Perspect., vol. 9, no. 1, pp. 10--18, 2007.Google ScholarCross Ref
- J. Saldaña, The coding manual for qualitative researchers. Sage, 2015.Google Scholar
Index Terms
- Data Analytics Project Methodologies: Which One to Choose?
Recommendations
Preserving Privacy in Data Analytics
IWSPA '19: Proceedings of the ACM International Workshop on Security and Privacy AnalyticsData Analytics is becoming an essential business tool for many data intensive companies and organizations. However, the increased use of such methods comes with the threat of data disclosure. Privacy-preserving methods have been developed with varying ...
WSDM'15 Workshop Summary / Scalable Data Analytics: Theory and Applications
WSDM '15: Proceedings of the Eighth ACM International Conference on Web Search and Data MiningThe SDA workshop at WSDM 2015 is the fifth International Workshop on Scalable Data Analytics, following the previous four workshops of SDA respectively held at IEEE Big Data 2013, PAKDD 2014, IEEE Big Data 2014, and IEEE ICDM 2014. This series of ...
Perspectives, Motivations and Implications Of Big Data Analytics
ICARCSET '15: Proceedings of the 2015 International Conference on Advanced Research in Computer Science Engineering & Technology (ICARCSET 2015)As today there is an enormous volume of data, examining these large sets contains structure and unstructured data of different types and sizes; big data analytics is used. Data Analytics allows the user to analyze the unusable data to make a faster and ...
Comments