Abstract
The chapter argues that accessibility and flexibility are the two principles and practices that can bring big data projects the closest to a data factory ideal. The chapter elaborates on the necessity of these two principles, offering a reasoned explanation for their value in context. Using two big data social scientific research projects as a springboard for conversation the chapter highlights both the advantages and the practical limits within which accessibility and flexibility operate. The authors avoid both utopian and dystopian tropes about big data approaches. In addition, they offer a critical feminist discussion of big data collaboration. Of particular interest are also the manner in which specific characteristics of big data projects, especially volume and velocity, may affect multidisciplinary collaborations.
Notes
- 1.
The designs of social media and their affordances for data scientists also impact scholars’ ability to work with big social data, for instance, APIs and terms of service change, affecting what data is available and under what conditions. Tools modify the core functions and impact the behaviors users are able to engage in – e.g., Twitter is removing usernames and media attachments from the 140-character limit; Facebook does not treat all crisis and find your friend functionality the same way. The legal milieu about rights to be forgotten differs between the USA and European Union. This obviously is not an exhaustive list but rather an illustrative one that makes clear that the changing technological landscape impacts the research we can do.
- 2.
This statement is not meant to conflate the epistemological differences or conflicts within the social sciences but rather to point to the broad stroke differences between the logics in computational or machine-based sciences and those found in the sciences that are people centered.
References
accessible, adj. (n.d.). OED online. Oxford University Press. Retrieved from http://www.oed.com/view/Entry/1034.
Aggarwal, C. C., & Yu, P. S. (2008a). A general survey of privacy-preserving data mining models and algorithms. In C. C. Aggarwal & P. S. Yu (Eds.), Privacy-preserving data mining (pp. 11–52). New York: Springer. Retrieved from http://link.springer.com/chapter/10.1007/978-0-387-70992-5_2.
Aggarwal, C. C., & Yu, P. S. (Eds.). (2008b). Privacy-preserving data mining: Models and algorithms. New York: Springer.
Anderson, D. J. (2003). Agile management for software engineering: Applying the theory of constraints for business results. Upper Saddle River: Prentice Hall Professional.
Anderson, C. (2008). The end of theory: The data deluge makes the scientific method obsolete. Retrieved June 13, 2016, from http://www.wired.com/2008/06/pb-theory/.
Andrejevic, M. (2014). Big data, big questions| the big data divide. International Journal of Communication, 8(0), 17.
Bamman, D., Eisenstein, J., & Schnoebelen, T. (2014). Gender identity and lexical variation in social media. Journal of SocioLinguistics, 18(2), 135–160.
Bietz, M. J., & Birnholtz, J. P. (2003). Data at work: Supporting sharing in science and engineering (pp. 339–348). New York: ACM. Retrieved from http://doi.acm.org/10.1145/958160.958215.
Bird, S. (2012). Feminist methods of research. Iowa State University. Retrieved from http://www.slideshare.net/miryammastrella/presentation-on-feminist-methods-of-research.
Bleiker, R. (2015). Pluralist methods for visual global politics. Millennium – Journal of International Studies, 43(3), 872–890. https://doi.org/10.1177/0305829815583084.
Boyd, D. (2014). It’s complicated: The social lives of networked teens. New Haven: Yale University Press.
Boyd, D., & Crawford, K. (2012). Critical questions for big data. Information, Communication & Society, 15(5), 662–679. https://doi.org/10.1080/1369118X.2012.678878.
Bozeman, B., & Corley, E. (2004). Scientists’ collaboration strategies: Implications for scientific and technical human capital. Research Policy, 33(4), 599–616. https://doi.org/10.1016/j.respol.2004.01.008.
Caprioli, M. (2004). Feminist IR theory and quantitative methodology: A critical analysis1. International Studies Review, 6(2), 253–269.
Caprioli, M. (2009). Making choices. Politics & Gender, 5(03), 426–431.
Charmaz, K. (2006). Constructing grounded theory: A practical guide through qualitative analysis (2nd ed.). Los Angeles: Sage Publications.
Coleman, G., & Rippin, A. (2000). Putting feminist theory to work: Collaboration as a means towards organizational change. Organization, 7(4), 573–587. https://doi.org/10.1177/ 135050840074004.
Corley, E. A., Boardman, P. C., & Bozeman, B. (2006). Design and the management of multi-institutional research collaborations: Theoretical implications from two case studies. Research Policy, 35(7), 975–993. https://doi.org/10.1016/j.respol.2006.05.003.
Crawford, K., Gray, M., & Miltner, K. (Eds.). (2014). Big data|critiquing big data: Politics, ethics, epistemology. International Journal of Communication, 8, 1663–1672.
Cukier, K. N., & Mayer-Schoenberger, V. (2013). The rise of big data: How it’s changing the way we think about the world. Foreign Affairs. Retrieved from https://www.foreignaffairs.com/articles/2013-04-03/rise-big-data.
Cummings, J. N., & Kiesler, S. (2003). KDI initiative: Multidisciplinary scientific collaborations. Washington, DC: National Science Foundation. Retrieved from http://netvis.fuqua.duke.edu/papers/NSF_KDI_report.pdf.
Das, T. K., & Teng, B.-S. (1998). Between trust and control: Developing confidence in partner cooperation in alliances. Academy of Management Review, 23(3), 491–512. https://doi.org/10.5465/AMR.1998.926623.
Dreyfus, H. L. (1991). Being-in-the-world: A commentary on Heidegger’s being and time, division I. Cambridge, MA: MIT Press.
Glaser, B., & Strauss, A. (1967). Discovery of grounded theory: Strategies for qualitative research. New Brunswick: Aldine.
González-Bailón, S. (2013). Social science in the era of big data. Policy & Internet, 5(2), 147–160. https://doi.org/10.1002/1944-2866.POI328.
Hajian, S., Domingo-Ferrer, J., Monreale, A., Pedreschi, D., & Giannotti, F. (2014). Discrimination- and privacy-aware patterns. Data Mining and Knowledge Discovery, 29(6), 1733–1782. https://doi.org/10.1007/s10618-014-0393-7.
Heidegger, M., & Stambaugh, J. (1996). Being and time: A translation of Sein und Zeit. Albany: State University of New York Press.
Hemphill, L., Roback, A. J. (2014). Tweet acts: How constituents lobby congress via twitter. In Proceedings of the 17th ACM Conference on Computer Supported Cooperative Work & Social Computing (pp. 1200–1210). New York: ACM. https://doi.org/10.1145/2531602.2531735.
Hemphill, L., Otterbacher, J., Shapiro, M. (2013). What’s congress doing on twitter? In Proceedings of the 2013 Conference on Computer Supported Cooperative Work (pp. 877–886). New York: ACM. https://doi.org/10.1145/2441776.2441876.
Hemphill, L., Erickson, I., Ribes, D., Mergel, I. (2014). Feminism and social media research. In Proceedings of the Companion Publication of the 17th ACM Conference on Computer Supported Cooperative Work & Social Computing (pp. 319–322). New York: ACM. https://doi.org/10.1145/2556420.2558864.
Hogan, B. (2010). The presentation of self in the age of social media: Distinguishing performances and exhibitions online. Bulletin of Science, Technology & Society, 0270467610385893. 30 377–386. https://doi.org/10.1177/0270467610385893.
Iwarsson, S., & Stahl, A. (2003). Accessibility, usability and universal design – positioning and definition of concepts describing person-environment relationships. Disability and Rehabilitation, 25(2), 57.
Jackson, P. T. (2010). The conduct of inquiry in international relations: Philosophy of science and its implications for the study of world politics. London: Routledge.
Jackson, S. T. (2016). Marketing militarism in the digital age: Arms production, YouTube, and selling “national security.” In Hamilton & Shepherd, Understanding popular culture and world politics in the digital age. Oxon: Routledge.
King, G. (2011). Ensuring the data-rich future of the social sciences. Science (New York, N.Y.), 331(6018), 719–721. https://doi.org/10.1126/science.1197872.
Kitchin, R. (2014). Big data, new epistemologies and paradigm shifts. Big Data & Society, 1(1), 2053951714528481. https://doi.org/10.1177/2053951714528481.
Koschmann, T., Kuutti, K., & Hickman, L. (1998). The concept of breakdown in Heidegger, Leont’ev, and Dewey and its implications for education. Mind, Culture, and Activity, 5(1), 25–41. https://doi.org/10.1207/s15327884mca0501_3.
Making data – Big data and beyond. (2013). First Monday, 18(10).
Markham, A. (2016). OKCupid data release fiasco: It’s time to rethink ethics education. Retrieved May 18, 2016, from https://medium.com/@amarkham/okcupid-data-release-fiasco-ba0388348cd#.3irk2qca8.
Marwick, A. E., & Boyd, D. (2014). Networked privacy: How teenagers negotiate context in social media. New Media & Society, 16(7), 1051–1067. https://doi.org/10.1177/1461444814543995.
McCarthy, D. R. (2013). Technology and “the International” or: How I learned to stop worrying and love determinism. Millennium – Journal of International Studies, 0305829813484636. https://doi.org/10.1177/0305829813484636.
Metcalf, J., & Crawford, K. (2016). Where are human subjects in big data research? The emerging ethics divide. Big Data & Society. Retrieved from http://papers.ssrn.com/abstract=2779647.
Morien, R. (2005). Agile management and the Toyota way for software project management. In Industrial informatics, 2005. INDIN’05. 2005 3rd IEEE International Conference on (pp. 516–522). IEEE. Retrieved from http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1560430.
Mountz, A., Bonds, A., Mansfield, B., Loyd, J., Hyndman, J., Walton-Roberts, M., … Curran, W. (2015). For slow scholarship: A feminist politics of resistance through collective action in the neoliberal university. ACME: An International E-Journal for Critical Geographies, 14(4), 1235–1259.
Olson, G. M., & Olson, J. S. (2000). Distance matters. Human Computer Interaction, 15(2 & 3), 139–178.
Olson, G. M., Zimmerman, A., & Bos, N. (2008). Scientific collaboration on the internet. Cambridge: MIT Press.
Ribes, D., & Jackson, S. L. (2013). Data bite man: The work of sustaining a long-term study. In L. Gitelman (Ed.), “Raw data” is an oxymoron (pp. 147–166). Cambridge, MA: The MIT Press.
Sánchez, D., & Batet, M. (2016). C-sanitized: A privacy model for document redaction and sanitization. Journal of the Association for Information Science and Technology, 67(1), 148–163. https://doi.org/10.1002/asi.23363.
Sarantakos, S. (2012). Social research (4th ed.). South Melbourne: Palgrave Macmillan.
Shapiro, M. A., & Hemphill, L. (in press). Agenda building & indexing: Does the U.S. congress direct New York Times content through Twitter? Policy & Internet.
Siegel, E. (2013). Predictive analytics: The power to predict who will click, buy, lie, or die. Hoboken: Wiley.
Xu, L., Jiang, C., Chen, Y., Wang, J., & Ren, Y. (2016). A framework for categorizing and applying privacy-preservation techniques in big data mining. Computer, 49(2), 54–62. https://doi.org/10.1109/MC.2016.43.
Zimmer, M. (2016). OkCupid study reveals the perils of big-data science. Retrieved May 18, 2016, from https://www.wired.com/2016/05/okcupid-study-reveals-perils-big-data-science/.
Zwitter, A. (2014). Big data ethics. Big Data & Society, 1(2). https://doi.org/10.1177/ 2053951714559253.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
Hemphill, L., Jackson, S.T. (2017). Accessibility and Flexibility: Two Organizing Principles for Big Data Collaboration. In: Matei, S., Jullien, N., Goggins, S. (eds) Big Data Factories. Computational Social Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-59186-5_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-59186-5_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59185-8
Online ISBN: 978-3-319-59186-5
eBook Packages: Computer ScienceComputer Science (R0)