PREDICTION OF THE DETERMINANTS OF THE NUMBER OF ANTENATAL CARE VISITS IN NFHS IV SURVEY OF INDIA: MODELING EXCESS ZERO OF COUNT DATA
Keywords:
ANC visits, zero-inflated, hurdle, NFHS IVDOI:
https://doi.org/10.17654/0973514323010Abstract
Poisson and negative binomial regression models are most widely used to model count data. Count data models are used to predict the factors that affect the number of antenatal care visits (ANC) received by pregnant women throughout their ninth and tenth months of pregnancy. It is imperative to take into account the presence of excess zero ANC visits which could be either structural or sampling zeros. Simulated datasets were produced using negative binomial distributions with a range of dispersion values (20, 40, and 120) and different zero percentages (22, 40, 60, and 70 percent). The purpose of this study was to quantify the bias and poor fit introduced by the fitting Least Square Regression (LSR), Poisson regression (PR), negative binomial regression (NBR), zero-inflated Poisson regression (ZIPR), zero-inflated negative binomial regression (ZINBR), hurdle Poisson regression (HPR) and hurdle negative binomial regression (HNBR) models. To assess zero-inflated practices and look into the relationship between the number of antenatal visits and the socio-demographic profile of women in their ninth and tenth months of pregnancy, the maternal health data from the National Family Health Survey (NFHS IV) study were subsequently examined. The relative quality of the regression models was assessed using Vuong tests and Akaike information criterion (AIC) values. The zero-inflated negative binomial regression (ZINBR) model performed better than the other models, according to the simulation study’s findings and with the application of ANC visits, which indicated that they had decreased AIC values under all zero inflation and over-dispersion scenarios.
Received: February 24, 2023
Revised: March 17, 2023
Accepted: April 8, 2023
References
M. A. Beydoun et al., Antioxidant status and its association with elevated depressive symptoms among US adults: National Health and Nutrition Examination Surveys 2005-06, British Journal of Nutrition 109 (2013), 1714-1729. Advance online publication. doi: 10.1017/S0007114512003467.
Cindy Xin Feng, A comparison of zero-inflated and hurdle models for modeling zero-inflated count data, Journal of Statistical Distributions and Applications 8(8) (2021), 1-19.
C. E. Rose, On the use of zero-inflated and hurdle models for modeling vaccine adverse event count data, Journal of Biopharmaceutical Statistics 16 (2006), 463-481. DOI: 10.1080/10543400600719384.
Diane Lambert, Zero-inflated Poisson regression, with an application to defects in manufacturing Technometrics 34(1) (1992), 1-14.
Daniel Biftu Bekalo, Zero-inflated models for count data: an application to number of antenatal care service visits, Annals of Data Science 8 (2021), 683-708.
Bruce A. Desmarais and J. J. Harden, Testing for zero-inflation in count models: bias correction for the Vuong test, The Stata Journal 13(4) (2013), 810-835.
M. Genius and E. Strazzera, A note about model selection and tests for non-nested contingent valuation models, Econom. Lett. 74(3) (2002), 363-370.
G. Nanjundan and Sadiq Pasha, A note on the characterization of zero-inflated Poisson mode, Open Journal of Statistics 5 (2015), 140-142. http://www.scirp.org/journal/ojs; http://dx.doi.org/10.4236/ojs.2015.52017.
John Mullahy, Specification and testing of some modified count data models, J. Econometrics 33(3) (1986), 341-365.
John Haslett, A. C. Parnell, J. Hinde and R. de A. Moral, Modelling excess zeros in count data: a new perspective on modelling approaches, International Statistical Review 90 (2022), 216-236. https://doi.org/10.1111/insr.12479.
Kakoli Rani Bhowmik, Sumonkanti Das and Md. Atiqul Islam, Modelling the number of antenatal care visits in Bangladesh to determine the risk factors for reduced antenatal care attendance, PLOS ONE 15(1) (2020), 1-19. https://doi.org/10.1371/journal.pone.0228215.
Lili Puspita Rahayua, K. Sadik and I. Indahwati, Overdispersion study of Poisson and zero-inflated Poisson regression for some characteristics of the data on lamda, International Journal of Advances in Intelligent Informatics 2(3) (2016), 140-148. DOI: http://dx.doi.org/10.26555/ijain.v2i3.73 W: http://ijain.org; E: info@ijain.org.
J. R. Mahalik et al., Changes in health risk behaviors for males and females from early adolescence through early adulthood, Health Psychology 32 (2013), 685-694. doi: 10.1037/a0031658.
Y. Min and A. Agresti, Random effect models for repeated measures of zero inflated count data, Stat. Model. 5 (2005), 1-19.
Mei-Chen Hu, M. Pavlicova and E. V. Nunes, Zero-inflated and hurdle models of count data with extra zeros: examples from an HIV-risk reduction intervention trial, Am. J. Drug Alcohol Abuse 37(5) (2011), 367-375. doi: 10.3109/00952990.2011.597280.
N. Ismail, Handling overdispersion with negative binomial and generalized Poisson regression models, Casualty Actuarial Society Forum (2007), 103-158.
Oyindamola B. Yusuf et al., On the performance of the Poisson, negative binomial and generalized Poisson regression models in the prediction of antenatal care visits in Nigeria, American Journal of Mathematics and Statistics 5(3) (2015), 128-136. DOI: 10.5923/j.ajms.20150503.04.
R Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, 2013.
Remi Mrume Sakia, Application of the power series probability distributions for the analysis of zero-inflated insect count data, Open Access Library Journal 5 (2018), e4735. DOI: 10.4236/oalib.1104735.
S. Yang, L. I. Harlow, G. Puggioni and C. A. Redding, A comparison of different methods of zero-inflated data analysis and an application in health surveys, Journal of Modern Applied Statistical Methods 16(1) (2017), 518-543. doi: 10.22237/jmasm/1493598600.
Sujan Rudra and Soma Chowdhury Biswas, Models for analyzing over-dispersed hurdle negative binomial regression model: application to manufactured cigarette use, Journal of Reliability and Statistical Studies 12(2) (2019), 51-60.
D. I. Warton, Many zeros does not mean zero-inflation: comparing the goodness-of-fit of parametric models to multivariate abundance data, Environmetrics 16 (2005), 275-289.
Gary King, Event count models for international relations: generalizations and applications, International Studies Quarterly 33(2) (1989), 123-147.
John M. Williamson, Hung-Mo Lin, Robert H. Lyles and Allen W. Hightower, Power calculations for ZIP and ZINB models, Journal of Data Science 5 (2007), 519-534.
Ting Hsiang Lin and Min-Hsiao Tsai, Modeling health survey data with excessive zero and K responses, Stat. Med. 32 (2013), 1572-1583. https://doi.org/10.1002/sim.5650.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 JP Journal of Biostatistics

This work is licensed under a Creative Commons Attribution 4.0 International License.
_________________________
Attribution: Credit Pushpa Publishing House as the original publisher, including title and author(s) if applicable.
Non-Commercial Use: For non-commercial purposes only. No commercial activities without explicit permission.
No Derivatives: Modifying or creating derivative works not allowed without written permission.
Contact Puspha Publishing House for more info or permissions.
Journal Impact Factor: 


Google h-index: 10