skip to main content
research-article

Accuracy of aggregate data in distributed project settings: Model, analysis and implications

Published:29 May 2013Publication History
Skip Abstract Section

Abstract

We examine the management of data accuracy in inter-organizational data exchanges using the context of distributed software projects. Organizations typically manage projects by outsourcing portions of the project to partners. Managing a portfolio of such projects requires sharing data regarding the status of work-in-progress residing with the partners and estimates of these projects' completion times. Portfolio managers use these data to assign projects to be outsourced to partners. These data are rarely accurate. Unless these data are filtered, inaccuracies can lead to myopic and expensive sourcing decisions. We develop a model that uses project-status data to identify an optimal assignment of projects to be outsourced. This model permits corruption of project-status data. We use this model to compute the costs of using perfect versus inaccurate project-status data and show that the costs of deviation from optimal are sizable when the inaccuracy in the data is significant. We further propose a filter to correct inaccurate project-status data and generate an estimate of true progress. With this filter, depending on the relative magnitudes of errors, we show that accuracy of project-status data can be improved and the associated economic benefit is significant. We illustrate the improvement in accuracy and associated economic benefit by instantiating the model and the filter. We further elaborate on how the model parameters may be estimated and used in practice.

Skip Supplemental Material Section

Supplemental Material

References

  1. Anderson, E. G. and Joglekar, N. R. 2005. A hierarchical product development planning framework. Prod. Oper. Manag. 14, 2.Google ScholarGoogle Scholar
  2. Ballou, D. P., Wang, R. Y., Pazer, H., and Tayi, G. K. 1998. Modeling information manufacturing systems to determine information product quality. Manag. Sci. 44, 4, 462--484. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ballou, D. P. and Pazer, H. L. 1995. Designing information systems to optimize the accuracy-timeliness tradeoff. Inf. Syst. Res. 6, 1, 51--72.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Banker, R. D. and Kaufmann, R. J. 2004. The evolution of research on information systems: A fiftieth-year survey of the literature in management science. Manag. Sci. 50, 3, 281--289. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Bertsekas, D. 2001. Dynamic Programming and Optimal Control 2nd Ed. Athena Scientific, Belmont, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Braha, D. 2001. Data Mining for Design and Manufacturing: Methods and Applications. Kluwer Academic. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Browning, T. R., Devst, J. J., Eppinger, S. D., and Whitney, D. E. 2002. Adding value in product development by creating information and reducing risk. IEEE Trans. Engin. Manag. 49, 4, 443--458.Google ScholarGoogle ScholarCross RefCross Ref
  8. Caballero, I., Vizcaíno, A., and Piattini, M. 2009. Optimal data quality in project management for global software developments. In Proceedings of the 4th International Conference on Cooperation and Promotion of Information Resources in Science and Technology. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Cai, Y. and Shankaranarayanan, G. 2007. Managing data quality in inter-organizational data networks. Int. J. Inf. Qual. 1, 3, 254--271.Google ScholarGoogle ScholarCross RefCross Ref
  10. Carrascosa, M., Eppinger, S. D., and Whitney, D. E. 1998. Using the design structure matrix to estimate time to market in a product development process. In Proceedings of the ASME Design Theory and Methodology Conference. 98--6013.Google ScholarGoogle Scholar
  11. Chen, Y. and Bharadwaj, A. 2009. An empirical analysis of contract structures in IT outsourcing. Inf. Syst. Res. 20, 4, 484--506. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Chiang, R. I. and Mookerjee, V. 2004. A fault threshold policy to manage software development projects. Inf. Syst. Res. 15, 1, 3--21. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Cusumano, M. A. and Selby, R. W. 1995. Microsoft Secrets: How the World's Most Powerful Software Company Creates Technology, Shapes Markets, and Manages People. Free Press, New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Dehoratius, N. 2004. In pursuit of information quality. Cutter IT J. 17, 9, 2--4.Google ScholarGoogle Scholar
  15. Dominguez, A. 2006. Project management in noisy environments. In Proceedings of the POM Annual Meeting.Google ScholarGoogle Scholar
  16. Even, A. and Shankaranarayanan, G. 2007. Utility driven assessment of data quality. Datab. Adv. Inf. Syst. 38, 2, 75--93. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Fisher, C. W., Chengalur-Smith, I., and Ballou, D. P. 2003. The impact of experience and time on the use of data quality information in decision-making. Inf. Syst. Res. 14, 2, 170--188. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Ford, D. and Sterman, J. 2003. The liar's club: Impacts of concealment in concurrent development projects. Concurr. Engin. Res. Appl. 11.Google ScholarGoogle Scholar
  19. Gaynor, M. and Shankaranarayanan, G. 2008. Implications of sensors and sensor-networks for data quality management. Int. J. Inf. Qual. 2, 1, 75--93.Google ScholarGoogle ScholarCross RefCross Ref
  20. Gomes, P. J. and Joglekar, N. R. 2008 Linking modularity with problem solving and coordination efforts. Manag. Decis. Econ. 29, 5, 443--457.Google ScholarGoogle ScholarCross RefCross Ref
  21. Graves, S. 1986. A tactical planning model for a job shop. Oper. Res. 34, 522--533. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Hernadez, M. A. and Stolfo, S. J. 1998. Real-world data is dirty: Data cleansing and the merge/purge problem. J. Data Mining Knowl. Discov. 1, 2. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Hirschheim, R., Heinzl, A., and Diebern, J. 2002. Information Systems Outsourcing. Springer.Google ScholarGoogle Scholar
  24. Holt, C. C., Modigliani, F., Muth, J. F., and Simon, H. A. 1960. Planning Production, Inventories, and Work Force. Prentice-Hall, Englewood Cliffs, NJ.Google ScholarGoogle Scholar
  25. Jones, M. C. and Price, R. L. 2004. Organizational knowledge sharing in erp implementation: Lessons from industry. J. Organ. End User Comput. 16, 1, 21--41.Google ScholarGoogle ScholarCross RefCross Ref
  26. Jorgensen, M. 2007. Forecasting of software development work effort: Evidence on expert judgment and formal models. Int. J. Forecast. 23, 3, 449--462.Google ScholarGoogle ScholarCross RefCross Ref
  27. Lee, Y. W., Pipino, L. L., Funk, J. D., and Wang, R. Y. 2006. Journey to Data Quality. MIT Press, Cambridge, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Madnick, S., Wang, R. Y., and Lee, Y. W. 2009. Overview and framework for data and information quality research. ACM J. Data Inf. Data Qual. 1, 1--22. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Makridakis, S. and Winkler, R. 1983. Averages of forecasts: Some empirical results. Manag. Sci. 29, 987--996.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Mandel, M. and Engardio, P. 2007. The real cost of off-shoring. BusinessWeek, June 18.Google ScholarGoogle Scholar
  31. Mcgrath, M. 2001. How to boost r&d productivity by 50%%? Insights Mag. www.prtm.comGoogle ScholarGoogle Scholar
  32. Meredith, J. R. and Mantel, S. J. 2005. Project Management: A Managerial Approach. Wiley, New York.Google ScholarGoogle Scholar
  33. Mihm, J., Loch, C. H., and Huchzermeier, A. 2003. Problem-solving oscillations in complex projects. Manag. Sci. 49, 6, 733--750. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Mookerjee, V., Manino, M., and Gilson, R. 1995. Improving the performance stability of inductive expert systems under input noise. Inf. Syst. Res. 6, 328--356.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Morrice, D. J., Anderson, E. G., and Bharadwaj, S. 2004. A simulation study to assess the efficacy of linear control theory models for the coordination of a two-stage customized service supply chain. In Proceedings of the Winter Simulation Conference. R. G. Ingalls, M. D. Rossetti, J. S. Smith, and B. A. Peters, Eds. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Oppenheim, A.,Willsky, A. S., and Young, I. T. 1983. Signals and Systems. Prentice-Hall, Englewood Cliffs, NJ.Google ScholarGoogle Scholar
  37. Parker, G. G. and Anderson, E. G. 2003. From buyer to integrator: The transformation of the supply chain manager in the vertically disintegrating firm. Prod. Oper. Manag. 11, 1, 75--91.Google ScholarGoogle ScholarCross RefCross Ref
  38. Pmi-Pmbok. 2008. A Guide to the Project Management Body of Knowledge 4th Ed. Project Management Institute Communications.Google ScholarGoogle Scholar
  39. Pipino, L. L., Lee, Y. W., and Wang, R. Y. 2002. Data quality assessment. Comm. ACM, 45, 4, 211--218. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Redman, T. C. 1996. Data Quality for the Information Age. Artech House, Boston, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Sahoo, N., Krishnan, R., Duncan, G., and Callan, J. 2011. The halo effect in multi-component ratings and its implications for recommender systems: The case of yahoo! movies. Inf. Syst. Res. 23, 1, 231--246. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Sethi, S. P. and Thompson, G. L. 2000. Optimal Control Theory: Applications to Management Science and Economics. Kluwer Academic.Google ScholarGoogle Scholar
  43. Shankaranarayanan, G., Ziad, M., and Wang, R. Y. 2003. Managing data quality in dynamic decision environment: An information product approach. J. Datab. Manag. 14, 14--32.Google ScholarGoogle ScholarCross RefCross Ref
  44. Smith, R. and Eppinger, S. D. 1997. Identifying controlling features of engineering design iteration. Manag. Sci. 43, 3, 276--293. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Stengel, R. F. 1994. Optimal Control and Estimation. Dover, New York.Google ScholarGoogle Scholar
  46. Sterman, J. D. 1989. Modeling managerial behavior: Misperceptions of feedback in a dynamic decision making experiment. Manag. Sci. 35, 321--339. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Tirole, J. 1993. The Theory of Industrial Organization. MIT Press, Cambridge, MA.Google ScholarGoogle Scholar
  48. Trigg, D. W. and Leach, A. G. 1967. Exponential smoothing with an adaptive response rate. Oper. Res. Quart. 18, 1, 53--59.Google ScholarGoogle ScholarCross RefCross Ref
  49. Upton, D. M. and Staats, B. R. 2006. Lean at wipro technologies. Case study 9-607-032, Harvard Business School.Google ScholarGoogle Scholar
  50. Ulrich, K. T. and Eppinger, S. D. 2000. Product Design and Development. McGraw-Hill, New York.Google ScholarGoogle Scholar
  51. Wang, R. Y. 1998. A product perspective on total quality management. Comm. ACM 41, 2, 58--65. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Wang, R. Y. and Strong, D. M. 1996. Beyond accuracy: What data quality means to data consumers. J. Manag. Inf. Syst. 12, 4, 5--34. Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Yassine, A., Joglekar, N. R., Braha, D., Eppinger, S. D., and Whitney, D. 2003. Information hiding in product development: The design churn effect. Res. Engin. Des. 14, 145--161.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Accuracy of aggregate data in distributed project settings: Model, analysis and implications

      Recommendations

      Reviews

      Bhavanandan L. Rathinasamy

      Distributed software project development is ubiquitous nowadays with the advent of high-speed Internet. Almost all big information technology firms outsource at least a portion of their projects to partner firms, to make use of their human resources and to reduce costs. It is vital for the planning team of the principal firm that outsources projects to have access to up-to-date information about the so-called project portfolio, the status of the projects carried out by the partner firms, and the current project workload. Project portfolios are managed through data sharing between the principal firm and the partner firms. Hence, the accuracy of such data is vital in making predictions about the project assignments and completion times. Often, this data is inaccurate for various reasons. In this paper, Joglekar et al. examine the reasons for inaccuracy in interorganizational data exchange, and propose a model from control theory to identify project assignments for the partner firms that are optimal despite inaccurate data. They use the Kalman-Bucy filter to remove inaccuracy in the data, and demonstrate the economic benefit in doing so. The mathematical analysis is described eloquently, with all the details necessary to enable readers to follow the main results. Extensive numerical analyses are presented to illustrate the efficiency of the proposed control theory-based model and Kalman-Bucy filter. Online Computing Reviews Service

      Access critical reviews of Computing literature here

      Become a reviewer for Computing Reviews.

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image Journal of Data and Information Quality
        Journal of Data and Information Quality  Volume 4, Issue 3
        May 2013
        68 pages
        ISSN:1936-1955
        EISSN:1936-1963
        DOI:10.1145/2458517
        Issue’s Table of Contents

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 29 May 2013
        • Accepted: 1 January 2013
        • Revised: 1 June 2012
        • Received: 1 March 2012
        Published in jdiq Volume 4, Issue 3

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader