×

The r-hypergeometric distribution: characterization, mathematical methods, simulations, and applications in sciences and engineering. (English) Zbl 1529.60024

Summary: In this article, we introduce a novel univariate discrete distribution called the r-hypergeometric model. This distribution has the sampling characteristics with replacement of the binomial distribution and no-order sampling of the hypergeometric distribution. Mathematical expressions are obtained for computing probabilities, the mode, and moments of the new distribution. Two simulation algorithms are proposed using the acceptance-rejection and inverse-transform methods and computational features to generate values of an r-hypergeometric distributed random variable. Python and R codes are implemented to perform computational experiments. Applications of mathematical methods based on the new distribution in sciences and engineering employing simulated and real-world data sets are provided. A comparison with existing distributions is also included. In addition to the mathematical results, some findings obtained from our study are related to a better computational performance of the inverse-transform method. Also, we identify applications of our model that are not covered by the traditional count distributions. In addition, we establish distinct probabilities for the same event under the binomial, hypergeometric, and r-hypergeometric distributions, with their means also being distinct, but their variances, skewness, and kurtosis converge to the same value.
{© 2022 John Wiley & Sons, Ltd.}

MSC:

60E05 Probability distributions: general theory
62E10 Characterization and structure theory of statistical distributions
94A20 Sampling theory in information and communication theory

Software:

R; Python
Full Text: DOI

References:

[1] JohnsonNL, KempAW, KotzS. Univariate Discrete Distributions. Wiley; 2005. · Zbl 1092.62010
[2] MarquesFJ, LootsMT, BekkerA. Series representations for densities functions of a family of distributions—application to sums of independent random variables. Math Methods Appl Sci. 2019;42:5718‐5735. · Zbl 1436.62058
[3] TapiaA, GiampaoliV, DiazMP, LeivaV. Sensitivity analysis of longitudinal count responses: a local influence approach and application to medical data. J Appl Stat. 2019;46:1021‐1042. · Zbl 1516.62623
[4] HalcoussisD, PhillipsGM, ArzumanyanG. Sample Size Illustrations, Confidence Intervals, and Margins of Error: Hypergeometric Distribution Tables for Attorneys an Experts: 99 Percent Confidence Level. California, US: Walnut Oak Press; 2018.
[5] WrightT. Exact Confidence Bounds When Sampling From Small Finite Universes: An Easy Reference Based on the Hypergeometric Distribution. New York, US: Springer; 2012.
[6] SrivastavaHM, AgarwalR, JainS. Integral transform and fractional derivative formulas involving the extended generalized hypergeometric functions and probability distributions. Math Methods Appl Sci. 2017;40:255‐273. · Zbl 1359.26009
[7] SarndalCE. Some properties of the negative hypergeometric distribution and its limit distributions. Metrika. 1968;13:170‐190. · Zbl 0177.47501
[8] SibuyaM. Generalized hypergeometric, digamma and trigamma distributions. Ann Inst Stat Math. 1979;31:373‐390. · Zbl 0448.62008
[9] SchusterEF, SypeWR. On the negative hypergeometric distribution. Int J Math Educ Sci Technol. 1987;18:453‐459. · Zbl 0624.62024
[10] BowmanKO, KastenbaumMA, ShentonLR. The negative hypergeometric distribution and estimation by moments. Commun Stat: Simul Comput. 1992;21:301‐332. · Zbl 0850.62255
[11] BerryKJ, MielkePW. Exact confidence limits for population proportions based on the negative hypergeometric probability distribution. Percept Mot Skills. 1996;83:1216‐1218.
[12] Ruíz‐EspejoM, SinghHP, SaxenaS. On inverse sampling without replacement. Stat Pap. 2008;49:133‐137. · Zbl 1152.62006
[13] AntalE, TilleY. Simple random sampling with over‐replacement. J Stat Plan Infer. 2011;141:597‐601. · Zbl 1197.62007
[14] SchusterEF, SypeWR. Presenting the negative hypergeometric distribution to the introductory statistics courses. Int J Math Educ Sci Technol. 1993;24:523‐526.
[15] OllerosJ, RamosH. The hypergeometric distribution as a Poisson binomial. TEST. 1991;6:35‐43. · Zbl 0741.60009
[16] PlachkyD. Relationships between the negative binomial and negative hypergeometric distributions, with applications to testing and estimation. Am J Math Manag Sci. 2003;23:1‐6.
[17] HuDP, CuiYQ, YinAH. Improved negative binomial approximation for negative hypergeometric distribution. Appl Mech Mater. 2013;427:2549‐2553.
[18] BhatiD, AhmedIS. On uniform‐negative binomial distribution including Gauss hypergeometric function and its application in count regression modeling. Commun Stat: Theory Methods. 2019;50:3106‐3122. · Zbl 07530970
[19] KucukogluI, SimsekB, SimsekY. An approach to negative hypergeometric distribution by generating function for special numbers and polynomials. Turk J Math. 2019;43:2337‐2353. · Zbl 1439.05011
[20] DodgeHF, RomigHG. Sample Inspection Tables: Single and Double Sampling. India: Wiley; 2014.
[21] SchillingE. Average run length and the OC curve of sampling plans. Qual Eng. 2005;17:399‐404.
[22] SchillingEG, NeubauerDV. Acceptance Sampling in Quality Control. FL, US: C; 2017. · Zbl 1411.62015
[23] SirkenMG. The Hansen‐Hurwitz estimator revisited: PPS sampling without replacement. In: Proceedings of the Annual Meeting of the American Statistical Association: Survey Methods Section; 2001; New York, US.
[24] AzorinF, Sanchez‐CrespoJL. Methods and Application of Sampling. Madrid, Spain: Alianza Universidad Textos; 1986.
[25] MaitiT. Horvitz‐Thompson estimator. International Encyclopedia of Statistical Science. Heidelberg, Germany: Springer, Berlin; 2011.
[26] LeivaV, MarchantC, RuggeriF, SauloH. A criterion for environmental assessment using Birnbaum‐Saunders attribute control charts. Environmetrics. 2015;26:463‐476. · Zbl 1525.62167
[27] RojasF, LeivaV, HuertaM, Martin‐BarreiroC. Lot‐size models with uncertain demand considering its skewness/kurtosis and stochastic programming applied to hospital pharmacy with sensor‐related COVID‐19 data. Sensors. 2021;21:5198.
[28] CabezasX, Martin‐BarreiroC, GarciaS, DelgadoE, LeivaV. A two‐stage location problem with order solved using a Lagrangian algorithm and stochastic programming for a potential use in COVID‐19 vaccination based on sensor‐related data. Sensors. 2021;21:5352.
[29] RodriguezM, LeivaV, HuertaM, LilloM, TapiaA, RuggeriF. An asymmetric area model‐based approach for small area estimation applied to survey data. REVSTAT: Stat J. 2021;19:399‐420. · Zbl 1483.62030
[30] LeaoJ, LeivaV, SauloH, TomazellaV. Incorporation of frailties into a cure rate regression model and its diagnostics and application to melanoma data. Stat Med. 2018;37:4421‐4440.
[31] Figueroa‐ZúñigaJI, BayesCL, LeivaV, LiuS. Robust beta regression modeling with errors‐in‐variables: A Bayesian approach and numerical applications. Stat Pap. 2022;63:919‐942. · Zbl 07553170
[32] BalakrishnanN, GuptaR, KunduD, LeivaV, SanhuezaA. On some mixture models based on the Birnbaum‐Saunders distribution and associated inference. J Stat Plan Infer. 2011;141:2175‐2190. · Zbl 1214.62014
[33] AzevedoC, LeivaV, AthaydeE, BalakrishnanN. Shape and change point analyses of the Birnbaum‐Saunders‐t hazard rate and associated estimation. Comput Stat Data Anal. 2012;56:3887‐3897. · Zbl 1255.62312
[34] CostaE, Santos‐NetoM, LeivaV. Optimal sample size for the Birnbaum‐Saunders distribution under decision theory with symmetric and asymmetric loss functions. Symmetry. 2021;13:926.
[35] CastroM, PaletiR, BhatCR. A latent variable representation of count data models to accommodate spatial and temporal dependence: Application to predicting crash frequency at intersections. Transp Res B. 2012;46:253‐272.
[36] Garcia‐PapaniF, LeivaV, Uribe‐OpazoMA, AykroydRG. Birnbaum‐Saunders spatial regression models: diagnostics and application to chemical data. Chemom Intell Lab Syst. 2018;177:114‐128.
[37] LeivaV, RojasE, GaleaM, SanhuezaA. Diagnostics in Birnbaum‐Saunders accelerated life models with an application to fatigue data. Appl Stoch Model Bus Ind. 2014;30:115‐131. · Zbl 07879451
[38] SauloH, LeaoJ, LeivaV, AykroydRG. Birnbaum‐Saunders autoregressive conditional duration models applied to high‐frequency financial data. Stat Pap. 2019;60:1605‐1629. · Zbl 1432.62315
[39] HuertaM, LeivaV, LiuS, RodriguezM, VillegasD. On a partial least squares regression model for asymmetric data with a chemical application in mining. Chemom Intell Lab Syst. 2019;190:55‐68.
[40] MarchantC, LeivaV, CysneirosFJA. A multivariate log‐linear model for Birnbaum‐Saunders distributions. IEEE Trans Reliab. 2016;65:816‐827.
[41] Díaz‐GarcíaJA. Influence diagnostics for multivariate elliptic regression linear models. Commun Stat: Theory Methods. 2003;32:625‐641. · Zbl 1029.62067
[42] Ramirez‐FigueroaJA, Martin‐BarreiroC, Nieto‐LibreroAB, LeivaV, Galindo‐VillardónMP. A new principal component analysis by particle swarm optimization with an environmental application for data science. Stoch Env Res Risk A. 2021;35:1969‐1984.
[43] Martin‐BarreiroC, Ramirez‐FigueroaJA, CabezasX, LeivaV, Galindo‐VillardónMP. Disjoint and functional principal component analysis for infected cases and deaths due to COVID‐19 in South American countries with sensor‐related data. Sensors. 2021;21:4094.
[44] Martin‐BarreiroC, Ramirez‐FigueroaJA, CabezasX, LeivaV, Martin‐CasadoA, Galindo‐VillardónMP. A new algorithm for computing disjoint orthogonal components in the parallel factor analysis model with simulations and applications to real‐world data. Mathematics. 2021;9:2058.
[45] ChuW. Products of two
[( {}_1{\phi}_1 \]\) series and implications. Math Methods Appl Sci. 2022. doi:10.1002/mma.855
[46] ChattamvelliR, ShanmugamS. Discrete Distributions in Engineering and the Applied Sciences. California, US: Morgan and Claypool Publishers San Rafael; 2020. · Zbl 1450.62001
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.