PREDICTIVE PERFORMANCE ANALYSIS OF PEAKS-OVER-THRESHOLD APPROACH WITHOUT-OF-BAG ESTIMATE
Keywords:
out-of-bag estimate, Newton-Raphson method, maximum likelihood estimator, extreme values theory, generalized Pareto distributionDOI:
https://doi.org/10.17654/0972086325017Abstract
In this paper, we propose an integrated approach combining graphical and analytical methods to select the optimal threshold and estimate the scale and shape parameters. For each threshold in the range of admissible thresholds derived from the graphical Hill and mean-excess methods, we estimate the scale and shape parameters of the generalized Pareto distribution (GPD) using a procedure combining the out-of-bag estimation (OOBE) with a modified Newton-Raphson numerical optimization algorithm. This numerical method uses a penalized log likelihood function. The numerical results for sample size 100 at 2000 show the reduction in the estimation bias. The proposed method is efficient than that of non-parametric approaches used by Hill and Smooth Hill estimators in terms of accuracy, as proven by numerical experiments and application to real river database obtained from the Lobo and Yzeron stations. These results show the good predictive performance of the OOBE method combined with a modified Newton-Raphson numerical optimization algorithm.
Received: July 28, 2025
Accepted: September 4, 2025
References
[1] S. El Adlouni and T. B. M. J. Ouarda, Comparaison des méthodes d’estimation des paramètres du modèle gev non-stationnaire, Revue des Sciences de l’Eau 21(1) (2008), 35-50.
[2] W. H. Armstrong, M. J. Collinsand and N. P. Snyder, Trends in flood discharge and peaks over threshold per water year on climate sensitive rivers in the northeastern United States, American Geophysical Union, San Francisco, 2011.
[3] F. Badaoui, A. Amar, G. C. Okou, A. Zoglat and S. EL Adlouni, Reliable estimation of capital requirement for banking sector using peak over threshold approach, IJAMAS 56(4) (2014), 05014010.
[4] A. A. Balkema and L. de Haan, Statistical inference using extreme order statistics, Ann. Probab. 3(1) (1975), 119-131.
[5] H. M. Barakat, E. M. Nigm and H. A. Alaswed, The Hill estimators under power normalization, Applied Mathematical Modelling 45 (2017), 813-822.
[6] J. Beirlant, J. L. Teugels and P. Vynckier, Practical Analysis of Extreme Values, Leuven University Press, Louvain, 1996.
[7] S. G. Coles, An Introduction to Statistical Modeling of Extreme Values, Springer-Verlag, Berlin, 2001.
[8] J. Danielsson, L. de Haan and C. G. de Vries, Using a bootstrap method to choose the sample fraction in tail index estimation, J. Multivar. Anal. 76(2) (2001), 226-248.
[9] A. C. Davison and R. L. Smith, Models for exceedances over high thresholds, R. Stat. Soc. Ser. B. 52 (1990), 393-442.
[10] A. C. Davison and J. Stander, Modeling excesses over high thresholds with an application, Statistical Extremes and Applications, J. Tiago de DE Oliveira, ed., Netherlands, 1984.
[11] R. A. Deidda, Multiple threshold method for fitting the generalized Pareto distribution to rainfall time series, HESS 14(12) (2010), 792-804.
[12] H. Drees, L. de Haan and S. Resnick, How to make a Hill plot, Ann. Statist. 28 (2000), 254-274.
[13] D. J. Dupuis, Exceedances over high thresholds: a guide to threshold selection, Extremes 1 (1999), 251-261.
[14] P. Embrechts, C. Klppelberg and T. Mikosch, Modelling Extremal Events for Finance and Insurance, Springer-Verlag, Berlin, 1997.
[15] R. A. Fisher and L. H. C. Tippett, Limiting Forms of the Frequency Distribution of the Largest or Smallest Member of a Sample, Cambridge University Press, Cambridge, 1928.
[16] B. V. Gnedenko, Sur la distribution limite du terme maximum d’une serie aleatoire, Ann. Math. 44(3) (1943), 423-453.
[17] S. D. Grimshaw, Computing maximum likelihood estimates for the generalized Pareto distribution, Technometrics 35(2) (1993), 185-191.
[18] A. Guillou and P. Willems, Application de la théorie des valeurs extrêmes en hydrologie, RSA 54(2) (2006), 5-31.
[19] P. Hall, Using the bootstrap to estimate mean squared error and select smoothing parameter in nonparametric problems, J. Multivar. Anal. 32(2) (1990), 177-203.
[20] B. M. Hill, A simple general approach to inference about the tail of a distribution, Annals of Statistics 13 (1975), 331-341.
[21] J. R. M. Hosking and J. R. Wallis, Parameter and quantile estimation for the generalized Pareto distribution, Technometrics 29 (1987), 67-90.
[22] A. Langousis, A. Mamalakis, M. Puliga and R. Deidda, Threshold detection for the generalized Pareto distribution: review of representative methods and application to the NOAA NCDC daily rainfall database, Water. Resour. Res. 52 (2016), 2659-2681.
[23] O.-A. Lymperi and E. A. Varouchakis, Modeling extreme precipitation data in a mining area, Math. Geosci. 56 (2024), 1405-1437.
[24] G. Matthys and J. Beirlant, Estimating the extreme value index and high quantiles with exponential regression models, Stat. Sin. 13 (2003), 853-880.
[25] R. Mínguez, Automatic threshold selection for generalized Pareto and Pareto-Poisson distributions in rainfall analysis: a case study using the NOAA NCDC daily rainfall database, Atmosphere 16(1) (2025), 61.
[26] M. Nguyen, A. E. D. Veraart, B. Taisne, C. T. Tan and D. A. Lallemant, Dynamic extreme value model with application to volcanic eruption forecasting, Math. Geosci. 56 (2024), 841-865.
[27] G. C. Okou, Estimation du risque financier par l’approche de Peaks Over Threshold (POT) et de la théorie des copules, Université Mohammed V-Agdal, Faculté des Sciences, 2014.
[28] J. Pickands, Residual life time at great age, Ann. Math. 2(5) (1975), 119-131.
[29] B. Raggad, Gestion des risques: théorie et application au marché pétrolier, Institut Supérieur de Gestion de Tunis, Université de Tunis, 2007.
[30] S. Resnick and C. Starica, Smoothing the Hill estimator, Advances in Applied Probability 29 (1997), 271-293.
[31] R. L. Smith, Maximum likelihood estimation in a class of nonregular cases, Biometrika 72 (1975), 67-90.
[32] P. Thompson, Y. Caiand, D. Reeve and J. Stander, Automated threshold selection methods for extreme wave analysis, Coast. Eng. 56(10) (2009), 1013-1021.
[33] E. Towler, D. Llewellyn, A. Prein and E. Gilleland, Extreme-value analysis for the characterization of extremes in water resources: a generalized workflow and case study on New Mexico monsoon precipitation, Weather Clim. Extrem. 29 (2020), 100260.
[34] Z. Xiangxian and Ge A. Wenlei, New method to choose the threshold in the POT model, First International Conference on Information Science and Engineering, Nanjing, China, 2009, pp. 750-753.
[35] A. Zoglat, S. EL Adlouni, E. Ezzahid, F. Badaoui, A. Amar and G. C. Okou, Statistical methods to expect extreme values: Application of POT approach to CAC40 return index, IJSE 10(1) (2013), 1-13.
[36] A. Zoglat, S. EL Adlouni, F. Badaoui, A. Amar and G. C. Okou, Managing hydrological risks with extreme modeling: application of peaks over threshold model to the Loukkos watershed, Morocco, J. Hydrol. Eng. 19(9) (2014), 05014010.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Far East Journal of Theoretical Statistics, PUSHPA PUBLISHING HOUSE, PRAYAGRAJ, INDIA

This work is licensed under a Creative Commons Attribution 4.0 International License.
____________________________
Attribution: Credit Pusha Publishing House as the original publisher, including title and author(s) if applicable.
Non-Commercial Use: For non-commercial purposes only. No commercial activities without explicit permission.
No Derivatives: Modifying or creating derivative works not allowed without written permission.
Contact Pusha Publishing House for more info or permissions.

