ON THE ZERO AND k-INFLATED NEGATIVE BINOMIAL DISTRIBUTION WITH APPLICATIONS
Keywords:
inflated count data, overdispersion, ZkINB distribution, excessive counts, negative binomial distribution, count distributionsDOI:
https://doi.org/10.17654/0972361723037Abstract
In the literature, there are a significant number of studies on mixtures and compound probability distributions used for count data with inflated frequencies. This study extended some existing zero-inflated distributions, by considering the flexibility of peaks in the data with excessive counts other than zeros and handled an overdispersion in the data. Moreover, this study formulated a proposed zero- and k-inflated negative binomial (ZkINB) distribution which is a mixture of a multinomial logistic and negative binomial distribution. The multinomial logistic component captures the occurrence of excessive counts, at zero and at k > 0, while the negative binomial component captures the counts that are assumed to follow a negative binomial distribution. The probability mass function (pmf) and the moment generating function (mgf ) of the distribution were derived in order to compute some vital structural properties of the formulated distribution, such as the mean and the variance. Examples showed that the formulated ZkINB seems to capture better distributions as compared with other existing distributions for inflated count data.
Received: August 30, 2022
Revised: March 3, 2023
Accepted: May 18, 2023
References
M. A. Abdel-Aty and A. E. Radwan, Modeling traffic accident occurrence and involvement, Accident Analysis & Prevention 32(5) (2000), 633-642. doi:10.1016/s0001-4575(99)00094-9.
A. Agresti, Categorical Data Analysis, 2nd ed., John Wiley & Sons, Inc., Publication, Hoboken, New Jersey, USA, 2002. DOI:10.1002/0471249688.
M. Arora, Extended Poisson models for count data with inflated frequencies, Department of Mathematics and Statistics, Old Dominion University, Norfolk, Virginia, United States of America, 2018. DOI: 10.25777/nz1e-d763.
M. Arora and N. R. Chaganty, EM estimation for zero- and k-inflated Poisson regression model, Computation 9(9) (2021), 94. https://doi.org/10.3390/computation9090094.
M. Arora, N. Rao Chaganty and K. F. Sellers, A flexible regression model for zero- and k-inflated count data, J. Stat. Comput. Simul. 91(9) (2021), 1815-1845. DOI: 10.1080/00949655.2021.1872077.
G. Baetschmann and R. Winkelmann, Modeling zero-inflated count data when exposure varies: with an application to tumor counts, Biom. J. 55(5) (2013), 679-686. doi:10.1002/bimj.201200021.
S. C. Barry and A. H. Welsh, Generalized additive modelling and zero inflated count data, Ecological Modelling 157(2-3) (2002), 179-188. doi:10.1016/s0304-3800(02)00194-1.
A. C. Cameron and P. K. Trivedi, Regression analysis of count data, 2nd ed., Econometric Society Monograph No. 53, Cambridge University Press, 2013.
G. A. Dagne, Hierarchical Bayesian analysis of correlated zero-inflated count data, Biom. J. 46(6) (2004), 653-663. doi:10.1002/bimj.200310077.
W. Gardner, E. P. Mulvey and E. C. Shaw, Regression analyses of counts and rates: Poisson, overdispersed Poisson, and negative binomial models, Psychological Bulletin 118(3) (1995), 392-404. doi: 10.1037/0033-2909.118.3.392.
W. H. Greene, Accounting for excess zeros and sample selection in Poisson and negative binomial regression models, Technical Report No. EC-94-10, Department of Economics, Stern School of Business, New York University, 1994. SRRN: https://ssrn.com/abstract=1293115.
D. B. Hall, Zero-inflated Poisson and binomial regression with random effects: a case study, Biometrics 56(4) (2000), 1030-1039. doi: 10.1111/j.0006-341x.2000.01030.x.
J. M. Hilbe, Modeling Count Data, Cambridge University Press, New York, New York, USA, 2014. https://doi.org/10.1017/CBO9781139236065.
M. Kong, S. Xu, S. M. Levy and S. Datta, GEE type inference for clustered zero-inflated negative binomial regression with application to dental caries, Comput. Statist. Data Anal. 85 (2015), 54-66. doi:10.1016/j.csda.2014.11.014.
T. H. Lin and M. H. Tsai, Modeling health survey data with excessive zero and K responses, Stat. Med. 32(9) (2012), 1572-1583. doi:10.1002/sim.5650.
B. J. Park and D. Lord, Adjustment for maximum likelihood estimate of negative binomial dispersion parameter, Transportation Research Record: Journal of the Transportation Research Board 2061(1) (2008), 9-19. doi:10.3141/2061-02.
A. Moghimbeigi, M. R. Eshraghian, K. Mohammad and B. Mcardle, Multilevel zero-inflated negative binomial regression modeling for over-dispersed count data with extra zeros, J. Appl. Stat. 35(10) (2008), 1193-1202. doi:10.1080/02664760802273203.
J. A. Nelder and R. W. M. Wedderburn, Generalized linear models, Journal of the Royal Statistical Society, Series A (General) 135(3) (1972), 370. doi:10.2307/2344614.
C. Odhiambo, S. Kibika and E. Okango, The zero inflated negative binomial - Shanker distribution and its application to HIV exposed infant data, International Journal of Probability and Statistics 9(1) (2020), 7-13. DOI: 10.5923/j.ijps.20200901.02.
F. B. Oppong, E. C. Mbukam and A. A. Agyapong, Statistical models for analyzing count data, International Journal of Scientific and Engineering Research 8(2) (2017), 454-460.
K. M. Sakthivel, C. S. Rajitha and K. B. Alshad, Zero-inflated negative binomial-Sushila distribution and its application, Int. J. Pure Appl. Math. 117(13) (2017), 117-126.
K. Sellers and G. Shmueli, Data dispersion: now you see it… now you don’t, Comm. Statist. Theory Methods 42 (2013), 3134-3147.
P. F. Thall and S. C. Vail, Some covariance models for longitudinal count data with over-dispersion, Biometrics 46 (1990), 657-671.
D. Yamrubboon, A. Thongteeraparp, W. Bodhisuwan and K. Jampachaisri, Zero inflated negative binomial-Sushila distribution and its application, AIP Conf. Proc. (2017). doi:10.1063/1.5012263.
K. K. W. Yau, K. Wang and A. H. Lee, Zero-inflated negative binomial mixed regression modeling of over-dispersed count data with extra zeros, Biom. J. 45(4) (2003), 437-452. doi:10.1002/bimj.200390024.
A. F. Zuur, E. N. Ieno, N. J. Walker, A. A. Saveliev and G. M. Smith, Zero-truncated and zero-inflated models for count data, Mixed Effects Models and Extensions in Ecology with R, 2009, pp. 261-293. doi:10.1007/978-0-387-87458-6_11.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Pushpa Publishing House, Prayagraj, India

This work is licensed under a Creative Commons Attribution 4.0 International License.
____________________________
Attribution: Credit Pushpa Publishing House as the original publisher, including title and author(s) if applicable.
No Derivatives: Modifying or creating derivative works not allowed without written permission.
Contact Pushpa Publishing House for more info or permissions.
Journal Impact Factor: 