Far East Journal of Theoretical Statistics

The Far East Journal of Theoretical Statistics publishes original research papers and survey articles in the field of theoretical statistics, covering topics such as Bayesian analysis, multivariate analysis, and stochastic processes.

PREDICTIVE PERFORMANCE ANALYSIS OF PEAKS-OVER-THRESHOLD APPROACH WITH OUT-OF-BAG ESTIMATE

Authors

  • Gueï Cyrille Okou
  • Kolé Keita
  • Amadou Kamagaté

Keywords:

out-of-bag estimate, Newton-Raphson method, maximum likelihood estimator, extreme value theory, generalized Pareto distribution

DOI:

https://doi.org/10.17654/0972086325017

Abstract

In this paper, we propose an integrated approach that combines graphical and analytical methods to select the optimal threshold and to estimate the scale and shape parameters. For each threshold in the range of admissible thresholds derived from the graphical Hill and mean-excess methods, we estimate the scale and shape parameters of the generalized Pareto distribution (GPD) using a procedure that combines out-of-bag estimation (OOBE) with a modified Newton-Raphson numerical optimization algorithm. This numerical method uses a penalized log-likelihood function. Numerical results for sample sizes from 100 to 2000 show a reduction in the estimation bias. In terms of accuracy, the proposed method is more efficient than the non-parametric approaches based on the Hill and Smooth Hill estimators, as shown by numerical experiments and by an application to real river data from the Lobo and Yzeron stations. These results show the good predictive performance of the OOBE method combined with a modified Newton-Raphson numerical optimization algorithm.
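The estimation step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the ridge penalty on the shape parameter, the finite-difference derivatives, the step-halving rule, and the function names `gpd_nll`, `fit_gpd`, `oob_gpd`, and `simulate_gpd` are all assumptions made for illustration, since the abstract specifies neither the penalty nor the modification to Newton-Raphson. Each bootstrap replicate fits the GPD to the in-bag excesses and scores the fit on the excesses left out of the bag.

```python
import math
import random


def gpd_nll(excesses, sigma, xi, penalty=0.0):
    """Negative GPD log-likelihood plus an assumed ridge term penalty * xi**2."""
    if sigma <= 0.0:
        return math.inf
    total = 0.0
    for y in excesses:
        z = 1.0 + xi * y / sigma
        if z <= 0.0:  # y lies outside the GPD support
            return math.inf
        if abs(xi) < 1e-8:  # exponential limit as xi -> 0
            total += math.log(sigma) + y / sigma
        else:
            total += math.log(sigma) + (1.0 / xi + 1.0) * math.log(z)
    return total + penalty * xi * xi


def fit_gpd(excesses, penalty=1.0, max_iter=200, tol=1e-8, h=1e-4):
    """Damped Newton-Raphson on (log sigma, xi) with finite-difference derivatives."""
    def f(a, xi):
        if not (-50.0 < a < 50.0 and -5.0 < xi < 5.0):
            return math.inf  # also rejects NaN trial points
        return gpd_nll(excesses, math.exp(a), xi, penalty)

    # Start from sigma = mean excess and a small positive shape.
    a, xi = math.log(sum(excesses) / len(excesses)), 0.1
    for _ in range(max_iter):
        f0 = f(a, xi)
        fap, fam = f(a + h, xi), f(a - h, xi)
        fxp, fxm = f(a, xi + h), f(a, xi - h)
        ga, gx = (fap - fam) / (2 * h), (fxp - fxm) / (2 * h)
        haa = (fap - 2 * f0 + fam) / h**2
        hxx = (fxp - 2 * f0 + fxm) / h**2
        hax = (f(a + h, xi + h) - f(a + h, xi - h)
               - f(a - h, xi + h) + f(a - h, xi - h)) / (4 * h**2)
        det = haa * hxx - hax * hax
        if haa > 0.0 and det > 0.0:  # Newton direction: -H^{-1} g
            da = -(hxx * ga - hax * gx) / det
            dx = -(haa * gx - hax * ga) / det
        else:  # Hessian not positive definite: use steepest descent
            da, dx = -ga, -gx
        step = 1.0  # halve the step until the objective decreases
        while step > 1e-10 and not f(a + step * da, xi + step * dx) < f0:
            step /= 2.0
        if step <= 1e-10:
            break
        a, xi = a + step * da, xi + step * dx
        if abs(step * da) + abs(step * dx) < tol:
            break
    return math.exp(a), xi


def oob_gpd(excesses, n_boot=25, penalty=1.0, seed=0):
    """Average the in-bag fits and their mean NLL on out-of-bag excesses."""
    rng = random.Random(seed)
    n = len(excesses)
    sigmas, xis, scores = [], [], []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]  # bootstrap resample
        in_bag = set(idx)
        oob = [excesses[i] for i in range(n) if i not in in_bag]
        sigma, xi = fit_gpd([excesses[i] for i in idx], penalty)
        sigmas.append(sigma)
        xis.append(xi)
        if oob:
            score = gpd_nll(oob, sigma, xi) / len(oob)  # unpenalized
            if math.isfinite(score):
                scores.append(score)

    def mean(values):
        return sum(values) / len(values) if values else math.nan

    return mean(sigmas), mean(xis), mean(scores)


def simulate_gpd(n, sigma, xi, seed=0):
    """Inverse-transform sampling from a GPD with xi != 0."""
    rng = random.Random(seed)
    return [sigma / xi * ((1.0 - rng.random()) ** (-xi) - 1.0) for _ in range(n)]


if __name__ == "__main__":
    data = simulate_gpd(600, sigma=2.0, xi=0.2, seed=1)
    for u in (0.0, 1.0, 2.0):  # hypothetical candidate thresholds
        exc = [x - u for x in data if x > u]
        sigma, xi, score = oob_gpd(exc, n_boot=10)
        print(f"u={u:.1f} n={len(exc)} sigma={sigma:.3f} xi={xi:.3f} oob_nll={score:.3f}")
```

Optimizing over log sigma keeps the scale positive without an explicit constraint. The candidate thresholds in the usage loop are placeholders; in the approach described above they would come from the admissible range read off the Hill and mean-excess plots.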

Received: July 28, 2025
Accepted: September 4, 2025

Published

2025-10-16

Section

Articles

How to Cite

PREDICTIVE PERFORMANCE ANALYSIS OF PEAKS-OVER-THRESHOLD APPROACH WITH OUT-OF-BAG ESTIMATE. (2025). Far East Journal of Theoretical Statistics, 69(3), 347-381. https://doi.org/10.17654/0972086325017