Multi-level zero-inflated Poisson regression modelling of correlated count data with excess zeros

  • Lee, Andy H.
  • Wang, Kui
  • Scott, Jane A.
  • Yau, Kelvin K.W.
  • McLachlan, Geoffrey J.
Statistical Methods in Medical Research 15(1):p 47-61, February 2006. | DOI: 10.1191/0962280206sm429oa

Count data with excess zeros relative to a Poisson distribution are common in many biomedical applications. A popular approach to the analysis of such data is to use a zero-inflated Poisson (ZIP) regression model. Often, because of the hierarchical study design or the data collection procedure, zero-inflation and lack of independence may occur simultaneously, which render the standard ZIP model inadequate. To account for the preponderance of zero counts and the inherent correlation of observations, a class of multi-level ZIP regression model with random effects is presented. Model fitting is facilitated using an expectation-maximization algorithm, whereas variance components are estimated via residual maximum likelihood estimating equations. A score test for zero-inflation is also presented. The multi-level ZIP model is then generalized to cope with a more complex correlation structure. Application to the analysis of correlated count data from a longitudinal infant feeding study illustrates the usefulness of the approach.

Copyright ©2006Sage Publications