Beginner’s guide on Quantitative Methods in Social Work Research

Updated on Jan 28, 2022


Welcome to graduate school! And welcome to the world of social work research. Social work is fundamentally involved in multi-level systems, and social work research is also interdisciplinary for this reason. This is a good thing. However, you may get lost on which statistical methodologies to learn primarily for that reason, and at least I did!  I had questions like these: In what order and how can I learn quantitative methods suitable for my research questions? What types of research questions could be answered by which methodologies? What are the differences between the terminology and methodology used in different disciplines? What software should I learn to use? I hope someone who is lost as I was will find some help in this article.  

Choosing your methods: “What do I need to study out of the whole new world of quantitative methods?”

Learning Basic statistics (descriptive statistics, bivariate analysis, regression analysis)

The first step is to learn basic statistics (descriptive statistics, bivariate analysis, regression analysis), starting from the type of variables. It is essential to tell continuous, categorical, and dichotomous variables apart and convert from each type of variable to another type of variable. For example, age could be a continuous variable. It can be converted into a categorical variable for age groups (1=less than 18, 2= 18~30, 3= 31~50, 4=51 and over) or dichotomous variable (0=less than 18, 1= 18 and over), depending on the research questions.

The type of variables decides the type of analyses that you need to choose. So then, you need to learn how to learn bivariate analyses (t-test, chi-square test, correlation analysis, ANOVA, …). The next step is to learn regression analysis. UCLA IDRE guides on “Choosing the Correct Statistical Test” which explains what fundamental kind of primary analysis is appropriate for the independent variables/dependent variables in the research questions. So, you first need to understand the type of the variables (interval, ordinal, continuous, binary, categorical..).

After learning these basic statistics, you can choose to develop your methodological toolkits by choosing your methods depending on your research interests. For example, understanding logistic regression is the most basic step in applying machine learning for the classification task.

Resources for introduction to statistics

MOOC 

Book

Choosing your methods: History of the development of methods in social sciences
Jones, L. V., & Thissen, D. (2006). 1 A History and Overview of PsychometricsHandbook of statistics26, 1-27.
  • Psychologists and economists have independently developed statistical methods for social science (Goldberger, 1971). However, they sometimes overlap each other (e.g., some psychologists apply causal inference methods to psychometrics, and some economists apply psychological predictors/outcomes to econometrics, especially behavioral economists).
  • Sometimes, statistical terms widely accepted in the field vary by field, but statistical logic is the same. For example, epidemiologists call the interaction effect an effect modification. Sociologists would probably refer to a survival analysis by event history analysis.
  • Researchers in public health and epidemiology are more likely to use econometrics or biostatistics (e.g., survival analysis) since the “causality” factor is crucial to their findings. If you are interested in the effectiveness of interventions, causal inference and experimental research design to prove causation before/after intervention are essential.
  • Social work or education scholars are more likely to use psychometrics since they are interested in subjective concepts considering populations of interest. However, there are exceptions: social welfare policy scholars, educational policy scholars, and interventionists tend to use econometrics more than psychometrics in their research. In recent years, both lines of statistical methods have been developed by integrating computational approaches (e.g., introducing SEM for machine learning).
Psychometrics
  • If you are interested in subjective variables (e.g., mental health, stigma, well-being…)
  • Utilize psychometric measurement by surveying or collecting a primary dataset 
  • Structural equation modeling (SEM) is a combination of psychometric measurement (measurement model part) and a series of regression (structural model part)
Econometrics
What’s next? Other advanced methods!

There is a line with advanced methods that social work scholars could use; Longitudinal analysis (a.k.a. panel analysis in econometrics), Multilevel modeling, Social Network Analysis, Geospatial analysis, Computational analysis (e.g., Natural Language processing, machine learning …). The methods could be combined and referred to as such, for example, multilevel SEM, longitudinal SEM, and causal inference with spatial data …

Longitudinal Modeling

You may be interested in the longitudinal change that occurs in an individual. For example, you may be wondering how Adverse Childhood Experiences (ACEs) affect life in their later life. To do this, you need to study longitudinal analysis. The methodology you will use may vary depending on whether you are doing psychometric-based or econometric-based research. For example, psychometrics may use longitudinal structural equations (e.g., latent growth models), but econometrics may use the methods for panel analysis (e.g., Generalized Estimating Equations-accounting for non-normality).

Multilevel Modeling
Ref: Why We Need Multilevel Analysis from Multilevel Modeling | Coursera.

You may be interested in multi-level systems; in social work, Bronfenbrenner’s ecological systems theory is the primary approach that understanding the issues of populations. In many social science research questions, variables are clustered in the upper level of the system. For example, you may wonder about the effect of a particular neighborhood, school, or country-level variable on an individual (e.g., neighborhood effect). It could be answered better using multilevel modeling (also known as hierarchical linear modeling).

Geospatial Analysis & Geographic Information System

Some social work scholars interested in community-level also utilize Geographic Information systems to visualize neighborhood-level characteristics (e.g., community asset mapping).

Reference

Social Network Analysis

Social workers care about social relationships and social connections. The Social Network Analysis (SNA) technique is particularly useful if you want to understand how different actors and systems are connected. Duke Network Analysis Center is famous for social networks and health workshops, and fortunately, the workshop materials are now open to all interested in learning SNA.

Reference

Data Science / Computational Social Science

If you are interested in using big data in particular (e.g., electronic health records or administrative data in child welfare or justice system) or utilizing text as a quantitative form of data (e.g., social media posts), computational methods or data science can be the methods you should learn. Natural Language processing is the technique that transforms text data into a computable format.

Others

Statistical software 

Since replicability and collaboration are getting increasingly important in social scientists’ research (lots of journals encourage the authors to provide the data and code in submission), I recommend you start to practice coding instead of using SPSS or AMOS as soon as possible. In social sciences, Stata is most widely used. Stata free webinars can be found here: https://www.stata.com/training/webinar/

  • Coding experience is required in the following order: SPSS (requires no coding) > Stata (requires coding for replicability, but it can be done without coding) > R (requires a lot of practice in getting accustomed to the package)
  • For structural equation modeling: AMOS (requires no coding) > Stata (requires coding for replicability, but it can be done without coding, but AMOS and Mplus provide more options for SEM analysis) > Mplus (requires coding but not extremely difficult) > R (requires lots of coding)

Recently, R has been gaining popularity since it is free and can be expanded by user-written packages! Once you get comfortable with writing the code (e.g., Stata), the speed of learning how to code with another software (e.g., R) can be accelerated. It is your choice to start coding from R or Python instead of Stata. Still, I think you can start from any syntax-based program (which I mean by requires coding) since familiarizing yourself with syntax writing is the asset that makes you learn another software with less anxiety and frustration.

Update: ChatGPT even provides the functionality that converts the code from one language (e.g., STATA) to another one (e.g., R). If you are comfortable with any language, it will get easier for you to do it in another language.

https://twitter.com/matt_motta/status/1614697014620454914?s=20&t=X2GidYmxWq9DW7H1q2TdHw

Workshops/Resources

Outside of school, advanced statistical methods can be learned from these websites (some of them are too expensive to pay out of your pocket): 

If you are interested in cross-cultural research and structural equation modeling, I have some recommendations on books.

  • February 2, 2022