[Stata] Multinomial Logistic Regression: mlogit, mlogtest

Multinomial logistic regression is a method for modeling categorical outcomes with more than two levels. It allows us to estimate the probability of each outcome as a function of some predictor variables, and to test hypotheses about the effects of these variables.

In this blog post, I will use the nhanes2 dataset from Stata, which contains data on health and nutrition of a sample of US adults. I will use the variable region as the dependent variable, which has four categories: Northeast, Midwest, South, and West. I will use the variables age, race, sex, and rural as the predictors.

Step 1: Use the mlogit command to regress your multicategory dependent variable on your predictors

The mlogit command in Stata fits a multinomial logistic regression model, also known as a polytomous logit model. The syntax is:

Stata

mlogit depvar indepvars, baseoutcome(#)

where depvar is the categorical outcome variable, indepvars are the predictor variables, and options are some additional options for the model. One of the options is rrr, which tells Stata to report the coefficients as relative risk ratios, instead of log odds. Another option is baseoutcome(#), which specifies the value of depvar that will be the base or reference category. The default is to choose the most frequent category.

Without rrr option, the coefficients represent the log odds of being in the outcome category relative to the reference category, for a one-unit increase in the predictor variable, holding all other variables constant. Let’s interpret the specific results for the Northeast (NE) region with the log odds. However, please note that we never interpret coefficients/log odds in papers. So, we need to stick with interpreting ORs.

Age: The coefficient of -0.0008355 for age suggests that as age increases by one year, the log odds of the outcome slightly decrease, but this is not statistically significant (p = 0.629), indicating that age may not have a meaningful impact on the outcome in this model.
Race:
- Black: With a coefficient of -1.869474, being Black significantly decreases the log odds of the outcome compared to the reference race group, with a very significant p-value (p < 0.001). This indicates a strong negative association between being Black and the likelihood of the outcome.
- Other: A coefficient of -0.7457402 suggests that being in the “Other” race category decreases the log odds of the outcome compared to the reference race group, with this effect being statistically significant (p = 0.047).
Sex (Female): The coefficient of -0.1072987 for females indicates that being female slightly decreases the log odds of the outcome compared to males (the likely reference group), though this result is marginally significant (p = 0.071).
Rural (Rural): A coefficient of -1.12706 for rural residents suggests that living in a rural area significantly decreases the log odds of the outcome compared to non-rural residents, with a very significant p-value (p < 0.001).

To get the relative risk ratio (RRR), the command is:

Stata

mlogit region age i.race i.sex i.rural, rrr

This tells Stata to use the region variable as the dependent variable, and to include age as a continuous predictor, and race, sex, and rural as categorical predictors. The i. prefix before the categorical variables indicates that they are factor variables, and Stata will create dummy variables for each level. The rrr option tells Stata to report the relative risk ratios.

The output shows the relative risk ratios for each outcome category, compared to the base category, which is Northeast by default. The relative risk ratio is the ratio of the probability of choosing a certain category over the probability of choosing the base category, for a one-unit increase in the predictor variable, holding other variables constant.

The output also shows the standard errors, z-statistics, p-values, and 95% confidence intervals for each relative risk ratio. These are used to test the significance of the effects of the predictor variables.
The output also shows the log likelihood, the Wald chi-square statistic, the p-value, and the pseudo R-squared for the overall model fit. The log likelihood is the value of the log likelihood function at the estimated coefficients.
- The Wald chi-square statistic is a test of the joint significance of all the coefficients in the model, excluding the intercepts.
- The p-value is the probability of obtaining a Wald chi-square statistic as large or larger than the observed one, under the null hypothesis that all the coefficients are zero.
- The pseudo R-squared is a measure of how well the model fits the data, compared to a model with no predictors. It ranges from 0 to 1, with higher values indicating better fit.

Interpretation of RRR

Age: The RRR of 0.9991648 for age suggests that as age increases by one year, the risk of the outcome slightly decreases (RRR < 1), but this is not statistically significant (p = 0.629), indicating that age may not have a meaningful impact on the outcome in this model.
Race:
- Black: With an RRR of 0.1542048, being Black significantly decreases the risk of the outcome by about 84.6% compared to the reference race group (RRR < 1, p < 0.000), holding other variables constant.
- Other: An RRR of 0.474383 suggests that being in the “Other” race category decreases the risk of the outcome by about 52.6% compared to the reference race group, and this is statistically significant (p = 0.047).
Sex (Female): The RRR of 0.8982573 for females indicates that being female decreases the risk of the outcome by about 10.2% compared to males (the likely reference group), though this result is marginally significant (p = 0.071).
Rural (Rural): An RRR of 0.3239843 for rural residents suggests that living in a rural area decreases the risk of the outcome by about 67.6% compared to non-rural residents, with high statistical significance (p < 0.000).

Interpretation of Relative Risk Ratio versus Odds Ratio

💡 TL;DR: In multinomial logistic regression (mlogit), the Relative Risk Ratio (RRR) is mathematically identical to the Odds Ratio (OR) from binary logistic regression. Both are calculated as exp(β). The difference lies not in the math—but in terminological convention.

Stata’s mlogit, rrr simply outputs exp(β_k). It’s the same formula you would get from logit with , or.

In other words, RRR = OR = exp(coefficient) in both binary and multinomial settings. There’s no computational/mathematical difference.

Concept	RRR	OR
Math	`exp(β)`	`exp(β)`
Model type	Multinomial logit	Binary logit
Interpretation	Risk of one category vs base	Odds of event vs non-event
Terminology	Clarity in multi-category comparisons	Standard binary outcome framing
Interpretation	“How much more likely is outcome k vs. base category?”	“How much more likely is event A vs. not-A?”

🔸 OR emphasizes odds (ratio of probabilities to their complements)
🔸 RRR emphasizes relative probability between outcome categories

Thus, RRR sounds more intuitive when comparing multinomial logistic regressions, even if it’s technically an odds ratio between the outcome and the base. You can clearly see if you go to Step #2.

Step 2. Computing odds ratios: `listcoef` command

Even though the mlogit command in Stata does not support or option in terms of output, we can compute factor change in odds for unit increase in variable using user-created listcoef command.

Stata

listcoef, help

b = raw coefficient
z = z-score for test of b=0
P>|z| = p-value for z-test
e^b = exp(b) = factor change in odds for unit increase in X (odds ratio)
e^bStdX = exp(b*SD of X) = change in odds for SD increase in X

Step 3. Computing Marginal Effects: `mchange` command

Then, you can use the margins or mchange command to present marginal effects of your predictors of interest. The mchange command calculates the marginal change in the predicted probability of the outcome variable for a change in one or more explanatory variables, holding other variables constant.

Stata

mchange varname

You can see the average predictions of marginal effects of all categories in the multinomial logistic regression. The mchange command in mlogit does not depend on the reference group. Unlike multinomial logistic regression coefficients, which compare categories relative to a reference group, mchange reports marginal effects that show the absolute change in probability for each category when an independent variable (e.g., age) changes.

This is why changing the reference group does not affect the mchange output—because marginal effects describe how the probability of each outcome changes independently, rather than comparing them to a baseline category.

For a 1 unit increase in age:
- The probability of choosing “Don’t know” increases by 0.003 (0.3%) (p < .001).
- The probability of choosing “App” decreases by 0.001 (0.1%) (p < .001).
- The probability of choosing “Website” does not significantly change (p = 0.867).
- The probability of choosing “Both App & Website” decreases by 0.002 (0.2%) (p < .001).
Pr(y|base): These are the base probabilities for each outcome category without considering the change in age. They represent the model’s predictions for the average individual in the dataset. Average predictions are essentially the predicted probabilities of each outcome category when the predictor variables are set to their average values (or baseline levels in categorical cases). These predictions give us a baseline scenario against which we can compare the effects of changes in predictor variables.
- Don’t know: 37.4%
- App: 10.9%
- Website: 32.4%
- Both app and website: 19.4%

Step 4. Model Testing: `mlogtest` command

In multinomial logistic regression, the Wald test and the Likelihood Ratio (LR) test are two primary statistical methods used to assess the significant contributions to the model of each independent variables.

Wald Test

Likelihood Ratio (LR) Test

– Approach: Evaluates whether an estimated coefficient differs significantly from zero by considering its standard error.
– Limitations: In small sample sizes or when the coefficient is large, the standard error may be inflated, potentially leading to unreliable results. (stats.oarc.ucla.edu)

– Approach: Compares the goodness-of-fit between two nested models: one with and one without the predictor in question. The null hypothesis (

H_0

) states that all coefficients associated with a given variable are equal to zero (i.e., the variable has no effect on the dependent variable).
– Advantages: Generally considered more robust and powerful, especially in smaller samples. bookdown.org

To conduct LR and Wald tests followed by mlogit command in Stata, you need to use the mlogtest command to conduct LR and Wald tests of key predictors of interest.

Stata

search mlogtest // need to install the package first 
mlogtest // waldtest
mlogtest, lr // lrtest

According to the output, all three variables (srage, srsex, ah33new) have statistically significant effects on the dependent variable (ins6tp_m), as their p-values are all below 0.05. This suggests that removing these variables from the model would lead to a significant loss of explanatory power.

⭐ Error message: “factor-variable and time-series operators not allowed“

The error message indicates that factor variables (e.g., categorical variables specified with i.varname) are not allowed. It occurs because mlogtest does not support factor variables. To resolve this issue, you need to manually create dummy variables for categorical predictors (i.e., dummy coding; see this post for more details). before running mlogit. You can do this using the tab command with the gen() option.

Stata

mlogit region age i.race sex // i. operator is not compatible with mlogtest

tab race, gen(race)
mlogit region age race2 race3 sex // omit reference group after dummy-coding

Testing IIA Assumption

IIA is a critical assumption when using multinomial logit models. It states that the odds of preferring one choice over another do not depend on the presence or absence of other “irrelevant” alternatives. In simpler terms, the relative preferences between options remain consistent, regardless of other choices available.

Suppose we’re studying people’s preferences for living in different regions: Northeast (NE), Midwest (MW), South (S), and West (W). Each person selects one region to live in. The IIA assumption implies that if someone prefers the Midwest (MW) over the Northeast (NE), their preference should remain the same even if we introduce a new option (say, the South or West).
If IIA is violated, it means that the introduction of new alternatives affects people’s preferences. For instance, if adding the South (S) as an option suddenly makes more people prefer the Midwest (MW) over the Northeast (NE), then IIA is violated.

We can test IIA assumptions using mlogtest command. Before running it, we need to first set a random seed by using set seed command. Using the same seed number, you can test the IIA assumption using the mlogtest, hausman and mlogtest, smhsiao commands.

Stata

set seed 153456
mlogtest, hausman
mlogtest, smhsiao
mlogtest, iia // you can run all iia assumption test at once

According to the results in Small-Hsiao Tests of IIA Assumption, all regions (NE, MW, S, W) have p-values well above 0.05, indicating no evidence against the IIA assumption. This means the odds of choosing one outcome over another are independent of the presence of other alternatives.

Regarding Hausman Tests of IIA Assumption results, the negative chi-square values for NE and S indicate that the model does not meet asymptotic assumptions for these cases. For MW and W, the chi-square values are not significant (p-values of 1.000 and 0.988, respectively), suggesting no evidence against the IIA assumption.

If your P > Chi2 are significant (p < .05), you are violating IIA assumption.
- However, depending on your sample size, there is a debate on the utility/relevance of the IIA assumption test. You can see this post and cite it
  - Allison, P. (2012). How relevant is the independence of irrelevant alternatives?. Statistical Horizons.
- You can also consider another model, such as the “mixed logit” model (MXL), that relaxes IIA assumptions. The user-created mixlogit command allows you to implement it. Please see this study as an example and this article for more information on the mixed logit model.
If your P > Chi2 are NOT significant (p > .05), you are NOT violating the assumption and can move forward with your multinomial logit.
- Some of your test statistics are negative and it is also an evidence that IIA assumption has not been violated – Hausman and McFadden (1984, p. 1226)

Tip. Troubleshooting regarding mlogtest command

It seems mlogtest, hausman and mlogtest, smhsiao command does not work with if condition in the mlogit command. In other words, if you run mlogit with if condition, it will return the invalid syntax error. I guess this is an error in the package, but you need to make sure not to use if condition to use mlogtest, hausman or mlogtest, smhsiao. You can drop the sample before running these command.

Further, if smhsiao test returns the error such as “basecategory not found,” you can solve the problem by creating another variable with egen= group(varname) command.

Stata

egen outcome= group(depvar)
mlogit outcome independentvars
mlogtest, smhsiao

Please find this statalist post regarding this error.

Step 5. Model Fit Statistics: `fitstat` command

We can use the fitstat command to examine the overall model fit.

Stata

fitstat

Log-likelihoods and Chi-square provide a basis for comparing models, with the chi-square test indicating the model is significantly better than an intercept-only model.

R-squared values (McFadden, Cox-Snell/ML, etc.) offer insight into the model’s explanatory power, which appears to be relatively low (McFadden’s R2 is 0.034 – 3.4%), suggesting that the model explains only a small portion of the variance in the outcome.

Information Criteria (AIC, BIC) help in model selection across different models, where lower values generally indicate a better model fit considering the complexity of the model. It is not feasible to interpret the model fit statistics with only one model.

Reference

Multinomial Logistic Regression | Stata Data Analysis Examples (ucla.edu)

Mlogit1.pdf (nd.edu)

Mlogit2.pdf (nd.edu)

multinom_st.pdf (washington.edu)

A Hands-on Tutorial – Logit, Ordered Logit, and Multinomial Logit Models in Stata – Research Guides at Princeton University

Interpreting multinomial logistic regression in Stata – BAILEY DEBARMORE

February 27, 2024

Rodrigo says:

April 3, 2024 at 12:03 PM

Thanks for your post.

Exist some alternative test when violates IIA Assumption?
gologit2 could be an alternative?

Thanks

- Nari says:
  
  April 21, 2024 at 2:07 PM
  
  You can either cite the study on the excuse of violation – such as the non-relevance of IIA assumption in small sample size. Alternatively, you can consider using mixlogitfor the mixed logit model. I updated the post for more information on it! gologit2 is an alternative for the case when you violate the assumption for ologit.

[Stata] Multinomial Logistic Regression: mlogit, mlogtest

Step 1: Use the mlogit command to regress your multicategory dependent variable on your predictors

Interpretation of Relative Risk Ratio versus Odds Ratio

Step 2. Computing odds ratios: `listcoef` command

Step 3. Computing Marginal Effects: `mchange` command

Step 4. Model Testing: `mlogtest` command

Testing IIA Assumption

Tip. Troubleshooting regarding mlogtest command

Step 5. Model Fit Statistics: `fitstat` command

Reference

Related Posts

2 Responses

Leave a ReplyCancel reply

Translate this page into:

Categories

[Stata] Multinomial Logistic Regression: mlogit, mlogtest

Step 1: Use the mlogit command to regress your multicategory dependent variable on your predictors

Interpretation of Relative Risk Ratio versus Odds Ratio

Step 2. Computing odds ratios: listcoef command

Step 3. Computing Marginal Effects: mchange command

Step 4. Model Testing: mlogtest command

Testing IIA Assumption

Tip. Troubleshooting regarding mlogtest command

Step 5. Model Fit Statistics: fitstat command

Reference

Share this:

Related Posts

2 Responses

Leave a ReplyCancel reply

Translate this page into:

Categories

Step 2. Computing odds ratios: `listcoef` command

Step 3. Computing Marginal Effects: `mchange` command

Step 4. Model Testing: `mlogtest` command

Step 5. Model Fit Statistics: `fitstat` command