Sociology’s Stake in Data Science
DOI:
https://doi.org/10.6092/issn.1971-8853/13434Keywords:
data science, Dewey, computational social science, digital transformation, reflexivityAbstract
Data scientists gave sociologists pause when they started disturbing social life and research. This article considers three instances where data science made inroads into the sociology jurisdiction. Instead of calling for a defense, they reveal opportunities for sociological research in the digital age. These opportunities build on the data-analytic thinking that undergirds the discipline's more salient structures and conventions. They recall old sociological intuitions and pragmatist theory that conceptualize the research process in a way that leaves room for novel observations. From this perspective, data science can help integrate sociology around new problems and shared principles and enlarge it by introducing its ideas to different audiences.
References
Abbott, A. (1988). The System of Professions: An Essay on the Division of Expert Labor. Chicago, IL: University of Chicago Press.
Abbott, A. (1998). The Causal Devolution. Sociological Methods & Research, 27(2), 148–181. https://doi.org/10.1177%2F0049124198027002002
Abbott, A. (2001). Chaos of Disciplines. Chicago, IL: University of Chicago Press.
Abbott, A. (2004). Methods of Discovery Heuristics for the Social Sciences. Manhattan, NY: WW Norton & Company.
Auspurg, K., & Brüderl, J. (2021). Has the Credibility of the Social Sciences been Credibly Destroyed? Reanalyzing the “Many Analysts, One Data Set” Project. Socius, 7, 1–14. https://doi.org/10.1177%2F23780231211024421
Avnoon, N. (2021). Data Scientists’ Identity Work: Omnivorous Symbolic Boundaries in Skills Acquisition. Work, Employment and Society, 35(2), 332–349. https://doi.org/10.1177%2F0950017020977306
Bail, C. (2021). Breaking the Social Media Prism: How to Make Our Platforms Less Polarizing. Princeton, NJ: Princeton University Press.
Baćak, V., & Kennedy, E.H. (2019). Principled machine learning using the super learner: An application to predicting prison violence. Sociological Methods & Research, 48(3), 698–721. https://doi.org/10.1177%2F0049124117747301
Barabási, A-L., & Albert, R. (1999). Emergence of Scaling in Random Networks. Science, 286(5439), 509–512. https://doi.org/10.1126/science.286.5439.509
Barbosa, N.M., Sun, E., Antin, J., & Parigi, P. (2020, April). Designing for Trust: A Behavioral Framework for Sharing Economy Platforms. In Y. Huang, I. King, T-Y Liu, & M van Steen (Eds.), Proceedings of The Web Conference 2020 (pp. 2133–2143). New York, NY: ACM. https://doi.org/10.1145/3366423.3380279
Battle-Baptiste, W., & Rusert, B. (Eds.). (2018). WEB Du Bois's Data Portraits: Visualizing Black America. San Francisco, CA: Chronicle Books.
Ben-David, J. (1971). The Scientist's Role in Society: A Comparative Study. Hoboken, NJ: Prentice-Hall.
Black, T. R. (1999). Doing Quantitative Research in the Social Sciences: An Integrated Approach to Research Design, Measurement and Statistics. London: Sage.
Bol, T., de Vaan, M., & van de Rijt, A. (2018). The Matthew effect in science funding. Proceedings of the National Academy of Sciences of the United States of America, 115(19), 4887–4890. https://doi.org/10.1073/pnas.1719557115
Bonacich, P. (1972). Factoring and Weighting Approaches to Status Scores and Clique Identification. Journal of Mathematical Sociology, 2(1), 113–120. https://doi.org/10.1080/0022250X.1972.9989806
Bonacich, P. (1987). Power and Centrality: A Family of Measures. American Journal of Sociology, 92(5), 1170–1182. http://dx.doi.org/10.1086/228631
Börner, K., Scrivner, O., Gallant, M., Ma, S., Liu, X., Chewning, K., Wu, L., & Evans, J. A. (2018). Skill discrepancies between research, education, and jobs reveal the critical need to supply soft skills for the data economy. Proceedings of the National Academy of Sciences of the United States of America, 115(50), 12630–12637. https://doi.org/10.1073/pnas.1804247115
Brandt, P. (2016). The Emergence of the Data Science Profession [Doctoral dissertation, Columbia University]. https://doi.org/10.7916/D8BK1CKJ
Brandt, P., & Timmermans, S. (2021). Abductive Logic of Inquiry for Quantitative Research in the Digital Age. Sociological Science, 8, 191–210. http://dx.doi.org/10.15195/v8.a10
Bravo, G., Farjam, M., Moreno, F.G., Birukou, A., & Squazzoni, F. (2018). Hidden Connections: Network Effects on Editorial Decisions in Four Computer Science Journals. Journal of Informetrics, 12(1), 101–112. https://doi.org/10.1016/j.joi.2017.12.002
Breiman, L. (2001). Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author). Statistical Science, 16(3), 199–231. https://doi.org/10.1214/ss/1009213726
Brin, S., & Page, L. (1998). The Anatomy of a Large-scale Hypertextual Web Search Engine. Computer Networks and ISDN Systems, 30(1–7), 107–117. https://doi.org/10.1016/S0169-7552(98)00110-X
Burt, R.S. (1987). Social Contagion and Innovation: Cohesion versus Structural Equivalence. American Journal of Sociology, 92(6), 1287–1335. https://doi.org/10.1086/228667
Castells, M. (1996). The Rise of the Network Society: The Information Age: Economy, Society, and Culture (Vol. 1). Hoboken, NJ: Wiley.
Christin, A. (2020). Metrics at Work: Journalism and the Contested Meaning of Algorithms. Princeton, NJ: Princeton University Press.
Davenport, T.H., & Patil, D.J. (2012). Data Scientist: The Sexiest Job Of the 21st Century. Harvard Business Review, 90(10), 70–76.
Dewey, J. (1916). Democracy and Education: An Introduction to the Philosophy of Education. New York, NY: Macmillan
Dewey, J. (1922). Human Nature and Conduct. New York, NY: The Modern Library.
Dewey, J. (1929). The Quest for Certainty: A Study of the Relation of Knowledge and Action. New York, NY: Minton, Balch.
Dewey, J. (1938). Logic: The Theory of Inquiry. New York, NY: Henry Holt and Co.
Dewey, J. (1939). Theory of Valuation. Chicago, IL: University of Chicago Press.
Donoho, D. (2017). 50 Years of Data Science. Journal of Computational and Graphical Statistics, 26(7), 745–766. https://doi.org/10.1080/10618600.2017.1384734
Dorschel, R., & Brandt, P. (2021). Professionalization via Ambiguity. The Discursive Construction of Data Scientists in Higher Education and the Labor Market. Zeitschrift für Soziologie, 50(3-4), 193–210. https://doi.org/10.1515/zfsoz-2021-0014
Durkheim, É. (1893). De la division du travail social. Étude sur l’organisation des sociétés supérieures. Paris: Félix Alcan.
Durkheim, É. (1897). Le suicide. Étude de sociologie. Paris: Félix Alcan.
Eubanks, V. (2018). Automating Inequality: How High-tech Tools Profile, Police, and Punish the Poor. New York, NY: St. Martin's Press.
Evans, J., & Foster, J.G. (2019). Computation and the Sociological Imagination. Contexts, 18(4), 10–15. https://doi.org/10.1177/1536504219883850
Geertz, C. (1973). Thick Description: Toward an Interpretive Theory of Culture. In The Interpretation of Cultures (pp. 310–323). New York, NY: Basic Books.
Goldberg, A. (2015). In Defense of Forensic Social Science. Big Data & Society, 2(2), pp. 1–3. https://doi.org/10.1177/2053951715601145
González-Bailón, S. (2017). Decoding the Social World: Data Science and the Unintended Consequences of Communication. Cambridge, MA: MIT Press.
Gouldner, A.W. (1970). The Coming Crisis of Western Sociology. Portsmouth, NH: Heinemann.
Gray, P.S., Williamson, J.B., Karp, D.A., & Dalphin, J.R. (2007). The research imagination: An introduction to qualitative and quantitative methods. Cambridge: Cambridge University Press.
Grbovic, M. (2017). Search ranking and personalization at Airbnb. Proceedings of the Eleventh ACM Conference on Recommender Systems, 339–340. https://doi.org/10.1145/3109859.3109920
Hammerbacher, J. (2009). Information platforms and the rise of the data scientist. In T. Segaran & J. Hammerbacher (Eds.), Beautiful Data: The Stories Behind Elegant Data Solutions (pp. 73–84). Sebastopol, CA: O'Reilly Media.
Healy, K. (2018). Data Visualization: A Practical Introduction. Princeton, NJ: Princeton University Press.
Hedström, P., & Swedberg, R. (1998). Social mechanisms: An introductory essay. In P. Hedström & R. Swedberg (Eds.) Social Mechanisms: An Analytical Approach to Social Theory (pp. 1–31). Cambridge: Cambridge University Press.
Heider, F. (1958). The Psychology of Interpersonal Relations. London: Psychology Press.
House, J.S. (2019). The Culminating crisis of American Sociology and its Role in Social Science and Public Policy: An Autobiographical, Multimethod, Reflexive Perspective. Annual Review of Sociology, 45, 1–26. https://doi.org/10.1146/annurev-soc-073117-041052
Ibarra, H. (1999). Provisional selves: Experimenting with Image and Identity in Professional Adaptation. Administrative Science Quarterly, 44(4), 764–791. https://doi.org/10.2307/2667055
Jerolmack, C., & Khan, S. (2014). Talk is Cheap: Ethnography and the Attitudinal Fallacy. Sociological Methods & Research, 43(2), 178–209. https://doi.org/10.1177/0049124114523396
Jhaver, S., Karpfen, Y., & Antin, J. (2018). Algorithmic Anxiety and Coping Strategies of Airbnb Hosts. Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 1–12. https://doi.org/10.1145/3173574.3173995
Joas, H. (1992). Die Kreativität des Handelns. Frankfurt am Main: Suhrkamp.
Katz, E., & Katz, R. (2016). Revisiting the Origin of the Administrative versus Critical Research Debate. Journal of Information Policy, 6(1), 4–12. https://doi.org/10.5325/jinfopoli.6.2016.0004
Kleinberg, J.M. (1999). Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5), 604–632. https://doi.org/10.1145/324133.324140
Krause, M. (2021). On Sociological Reflexivity. Sociological Theory, 39(1), 3–18. https://doi.org/10.1177/0735275121995213
Krippner, G., Granovetter, M., Block, F., Biggart, N., Beamish, T., Hsing, Y., Hart, G., Arrighi,G., Mendell, M., Hall, J., Burawoy, M., Vogel, S., & O’Riain, S. (2004). Polanyi symposium: a conversation on embeddedness. Socio-Economic Review, 2(1), 109–135. https://doi.org/10.1093/soceco/2.1.109
Kuhn, T.S. (1970). The Structure of Scientific Revolutions (2nd ed.). Chicago, IL: University of Chicago Press.
Latour, B. (1987). Science in Action: How to Follow Scientists and Engineers Through Society. Cambridge, MA: Harvard University Press.
Lazarsfeld, P.F., & Oberschall, A.R. (1965). Max Weber and Empirical Social Research. American Sociological Review, 30(2), 185–199. https://doi.org/10.2307/2091563
Lazer, D., Hargittai, E., Freelon, D., González-Bailón, S., Munger, K., Ognyanova, K., & Radford, J. (2021). Meaningful Measures of Human Society in the Twenty-first Century. Nature, 595(7866), 189–196. https://doi.org/10.1038/s41586-021-03660-7
Leahey, E. (2005). Alphas and Asterisks: The Development of Statistical Significance Testing Standards in Sociology. Social Forces, 84(1), 1–24. https://doi.org/10.1353/sof.2005.0108
Leskovec, J., & Krevl, A. (2014). SNAP Datasets. Stanford University. https://snap.stanford.edu/data/
Lohr, S. (2015). Data-ism: The Revolution Transforming Decision Making, Consumer Behavior, and Almost Everything Else. New York, NY: Harper Collins.
Marres, N. (2017). Digital sociology: The Reinvention of Social Research. Hoboken, NJ: Wiley.
Martin, J.L. (2017). Thinking Through Methods: A Social Science Primer. Chicago, IL: University of Chicago Press.
McCormick, T.H., Lee, H., Cesare, N., Shojaie, A., & Spiro, E.S. (2017). Using Twitter for Demographic and Social Science Research: Tools for Data Collection and Processing. Sociological Methods & Research, 46(3), 390–421. https://doi.org/10.1177/0049124115605339
McFarland, D.A., Lewis, K., & Goldberg, A. (2016). Sociology in the Era of Big Data: The Ascent of Forensic Social Science. The American Sociologist, 47(1), 12–35. https://doi.org/10.1007/s12108-015-9291-8
McMahan, P., & McFarland, D.A. (2021). Creative Destruction: The Structural Consequences of Scientific Curation. American Sociological Review, 86(2), 341–376. https://doi.org/10.1177/0003122421996323
Mead, R. (2019). The Airbnb Invasion of Barcelona. The New Yorker, 22 April. https://www.newyorker.com/magazine/2019/04/29/the-airbnb-invasion-of-barcelona
Merton, R.K. (1968). The Matthew Effect in Science. Science, 159(3810), 56–63. https://doi.org/10.1126/science.159.3810.56
Mohr, J.W., & Rawlings, C. (2012). Four Ways to Measure Culture: Social Science, Hermeneutics, and the Cultural Turn. In J.C. Alexanders, R.N. Jacobs, & P. Smith (Eds.), The Oxford Handbook of Cultural Sociology (pp. 70–113). Oxford: Oxford University Press.
Mützel, S., & Kressin, L. (2020). From Simmel to Relational Sociology. In S. Abrutyn & O. Lizardo (Eds.), The Handbook of Classical Sociological Theory (pp. 217–238). New York, NJ: Springer.
Mützel, S., Saner, P., & Unternährer, M. (2018). Schöne Daten! Konstruktion und Verarbeitung von digitalen Daten. In D. Houben, & B. Prietl (Eds.), Datengesellschaft (pp. 111–132). Berlin: Verlag.
Nelson, L.K. (2021, August). Early Career Faculty Spotlight. ASA Methodology Section Newsletter, 8.
Nelson, L.K. (2020). Computational Grounded Theory: A Methodological Framework. Sociological Methods & Research, 49(1), 3–42. https://doi.org/10.1177/0049124117729703
Nelson, L.K. (2022). Laura Nelson. GitHub. https://github.com/lknelson
Noble, S. (2018). Algorithms of Oppression: How Search Engines Reinforce Racism. New York, NY: New York University Press.
NORC. (2021) Get the Data. NORC at the University of Chicago. https://gss.norc.org/get-the-data
O'Neil, C. (2016). Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy. New York, NY: Crown Books.
Ollion, E., & Abbott, A. (2016). French Connections: The Reception of French Sociologists in the USA (1970-2012). Archives européennes de sociologie, 57(2), 331–372. https://doi.org/10.1017/S0003975616000126
Pajo, B. (2017). Introduction to Research Methods: A Hands-on Approach. London: Sage.
R Core Team. (2021). R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/
Ribes, D. (2019). STS, Meet Data Science, Once Again. Science, Technology, & Human Values, 44(3), 514–539. https://doi.org/10.1177%2F0162243918798899
Romero, M. (2020). Sociology Engaged in Social Justice. American Sociological Review, 85(1), 1–30. https://doi.org/10.1177%2F0003122419893677
Salganik, M.J. (2018). Bit by Bit: Social Research in the Digital Age. Princeton, NJ: Princeton University Press.
Salganik, M. J., Lundberg, I., Kindel, A. T., Ahearn, C.E., Al-Ghoneim, K., Almaatouq, A., Altschul, D.M., Brand, J.E., Carnegie, N.B., Compton, R.J., Datta, D., Davidson, T., Filippova, A., Gilroy, C., Goode, B.J., Jahani, E., Kashyap, R., Kirchner, A., McKay, S., Morgan, A.C., …, & McLanahan, S. (2020). Measuring the Predictability of Life Outcomes with a Scientific Mass Collaboration. Proceedings of the National Academy of Sciences of the United States of America, 117(15), 8398–8403. https://doi.org/10.1073/pnas.1915006117
Schutt, R., & O'Neil, C. (2013). Doing Data Science. Sebastopol, CA: O'Reilly Media, Inc.
Shwed, U., & Bearman, P.S. (2010). The Temporal Structure of Scientific Consensus Formation. American Sociological Review, 75(6), 817–840. https://doi.org/10.1177/0003122410388488
SIENA. (2022). Data sets for use with Siena. Oxford University. https://www.stats.ox.ac.uk/~snijders/siena/siena_datasets.htm
Silberzahn, R., Uhlmann, E.L., Martin, D.P., Anselmi, P., Aust, F., Awtrey, E., Bahník, Š., Bai, F., Bannard, C., Bonnier, E., Carlsson, R., Cheung, F., Christensen, G., Clay, R., Craig, M.A., Dalla Rosa, A., Dam, L., Evans, M.H., Flores Cervantes, I., Fong, N., …, & Nosek, B.A. (2018). Many Analysts, One Data Set: Making Transparent How Variations in Analytic Choices Affect Results. Advances in Methods and Practices in Psychological Science, 1(3), 337–356. https://doi.org/10.1177/2515245917747646
Simmel, G. (1908). Soziologie. Untersuchungen über die Formen der Vergesellschaftung. Berlin: Duncker & Humblot.
Squazzoni, F., Bravo, G., Farjam, M., Marusic, A., Mehmani, B., Willis, M., Birukou, A., Dondio, P., & Grimaldo, F. (2021). Peer Review and Gender Bias: A Study on 145 Scholarly Journals. Science Advances, 7(2). https://doi.org/10.1126/sciadv.abd0299
Stark, D. (2009). The Sense of Dissonance: Accounts of Worth in Economic Life. Princeton, NJ: Princeton University Press.
Stinchcombe, A.L. (1982). Should Sociologists Forget Their Mothers and Fathers? American Sociologist, 17(1), 2–11. https://www.jstor.org/stable/27702490
Turco, C.J., & Zuckerman, E.W. (2017). Verstehen for Sociology: Comment on Watts. American Journal of Sociology, 122(4), 1272–1291. https://doi.org/10.1086/690762
Twitter, Inc. (2022). Twitter API Academic Research Access. Twitter. https://developer.twitter.com/en/products/twitter-api/academic-research
Vedres, B. (2022). Balazs Vedres. CEU. http://www.personal.ceu.hu/staff/Balazs_Vedres/
Vedres, B., & Stark, D. (2010). Structural Folds: Generative Disruption in Overlapping Groups. American Journal of Sociology, 115(4), 1150–1190. https://doi.org/10.1086/649497
Wagner-Pacifici, R., Mohr, J.W., & Breiger, R.L. (2015). Ontologies, Methodologies, and New uses of Big Data in the Social and Cultural Sciences. Big Data & Society, 2(2), 1–11. https://doi.org/10.1177/2053951715613810
Waight, H. (2021). Recovering John Dewey’s Lost Vision for Social Science in Contemporary American Sociology. The American Sociologist, 52, 420–448. https://doi.org/10.1007/s12108-021-09482-4
Watts, D.J. (2014). Common Sense and Sociological Explanations. American Journal of Sociology, 120(2), 313–351. https://doi.org/10.1086/678271
Weber, M. (2004). Science as a Vocation. In D.S. Owen, T.B. Strong, & R. Livingstone (Eds.), The Vocation Lectures (R. Livingsone, Trans.) (pp. 1–31). Indianapolis, IN: Hackett. (Original work published 1919)
Wellman, B. (1997). An Electronic Group is Virtually a Social Network. In S. Jiesler (Ed.), Culture of the Internet (pp. 179–205). Mahwah, NJ: Lawrence Erlbaum.
White, H.C. (2001). Interview with Harrison White: 4-16-01 by Alair MacLean and Andy Olds. Theory@Madison. https://www.ssc.wisc.edu/theoryatmadison/papers/ivwWhite.pdf
White, H.C., Boorman, S.A., & Breiger, R.L. (1976). Social Structure from Multiple Networks. I. Blockmodels of Roles and Positions. American Journal of Sociology, 81(4), 730–780. https://www.jstor.org/stable/2777596
Whitford, J. (2022). Disambiguating Dewey; or Why Pragmatist Action Theory Neither Needs Nor Asks Paradigmatic Privilege. In N. Gross, I.A. Reed, & C. Winship (Eds.), The New Pragmatist Sociology: Inquiry, Agency, and Democracy. New York, NY: Columbia University Press.
Winship, C., & Morgan, S.L. (1999). The Estimation of Causal Effects from Observational Data. Annual Review of Sociology, 25(1), 659–706. https://doi.org/10.1146/annurev.soc.25.1.659
Published
How to Cite
Issue
Section
License
Copyright (c) 2022 Philipp Brandt
This work is licensed under a Creative Commons Attribution 4.0 International License.