Medical researchers often want to find out how medically relevant outcomes are related to other factors. To do so, they carry out analyses and fit models that are based on assumptions about the nature of the research data. This article describes three methods which may be used when one commonly made assumption is not met. The methods are demonstrated on a real dataset in which the outcome is an index of harmful use of alcohol, with higher scores indicating a higher incidence of harmful behaviours. The frequency distribution of the outcome, alc_harm, is shown in Box 1.
The full article is accessible to AMA members and paid subscribers. Login to read more or purchase a subscription now.
Please note: institutional and Research4Life access to the MJA is now provided through Wiley Online Library.
- 1. Lewis-Beck C, Lewis-Beck M. Applied regression: an introduction. 2nd ed. Thousand Oaks, Calif: Sage, 2016.
- 2. Hardy M. Regression with dummy variables. Newbury Park, Calif: Sage, 1993.
- 3. Gibbons J. Nonparametric statistics: an introduction. Thousand Oaks, Calif: Sage, 1992.
- 4. Glass E, Peckham P, Sanders J. Consequences of failure to meet assumptions underlying the fixed effects analyses of variance and covariance. Rev Educ Res 1972; 42: 237-288.
- 5. Mooney C, Duval R. Bootstrapping: a nonparametric approach to statistical inference. Newbury Park, Calif: Sage; 1993.
- 6. Craig JC, Williams GJ, Jones M, et al. The accuracy of clinical symptoms and signs for the diagnosis of serious bacterial infection in young febrile children: prospective cohort study of 15 781 febrile illnesses. BMJ 2010; 340: c1594.
- 7. Hardin J, Hilbe J. Generalized linear models and extensions. 3rd ed. College Station, Tex: Stata Press, 2012.
- 8. Long J, Freese J. Regression models for categorical dependent variables using Stata. 3rd ed. College Station, Tex: Stata Press, 2014.
I thank Dr Lesley Inglis for her careful editing and suggestions, Professor Michael Jones for his advice and encouragement, and Babucarr Sowe for allowing me to use his data.
No relevant disclosures.