Regression models for count data with excess zeros: A comparison using survey data

Adhin, Bhaskar and Hemanth Kumar, AK and Kannan, T and Dasan, B (2023) Regression models for count data with excess zeros: A comparison using survey data. Regression models for count data with excess zeros: A comparison using survey data, 19(1) (1-13).

[thumbnail of 202340.pdf] Text
202340.pdf - Published Version

Download (500kB)

Abstract

Presence of excess zeros and the distributions are major concern in modeling count data. Zero inflated and hurdle models are regression techniques which can handle zero inflated count data. This study compares various count regression models for survey data observed with excess zeros. The data for the study is obtained from a survey conducted to assess the harms attributable to drinkers among children. Poisson, negative binomial and their zero inflated and hurdle versions were compared by fitting them to two count response variables, number of physical and number of psychological harms. The models were compared using fit indices, residual analysis and predicted values. The robustness of the models were also compared using simulated data sets. Results indicated that the Poisson regression was less robust to deviations from the distributional assumptions. The negative binomial regression and hurdle regression model were found to be suitable to model the number of physical and number of psychological harms respectively. The results showed that excess zeros in count data does not imply zero inflation. The zero inflated or hurdle models are suitable for zero inflated data. The selection between the zero inflated and hurdle models should be based on the assumed cause of zeros.

Affiliation: ICMR-National Institute for Research in Tuberculosis
Item Type: Article
URI: http://eprints.nirt.res.in/id/eprint/2212

Actions (login required)

View Item View Item