Advances and Applications in Statistics

The Advances and Applications in Statistics is an internationally recognized journal indexed in the Emerging Sources Citation Index (ESCI). It provides a platform for original research papers and survey articles in all areas of statistics, both computational and experimental in nature.

Submit Article

DISCOVERING PATTERNS AND DEVIATIONS IN DATA: COMPARISON OF ANOMALY DETECTION PROCEDURE IN REGRESSION

Authors

  • Syahirah Suboh
  • Izzatdin Abdul Aziz

Keywords:

anomaly detection, regression

DOI:

https://doi.org/10.17654/0972361724063

Abstract

Finding interesting or unusual patterns plays an important role in a lot of fields, including regression, business intelligence, data mining, and decision making. In many domain applications, anomaly detection and regression are both important, with the aim of identifying patterns or trends that can pose challenges in their respective areas. As a result, empirical analysis was done to find out how well leverage, DFFITS, and standardized residual performed in identifying abnormalities.

Findings. Mostly, for discovering patterns of data in cases regarding simple and multiple linear regression, if the data is flagged as high leverage, then there is a possibility that it may highly influence the estimated regression. However, in both the cases, there are hardly any side effects from the performance of $R^2$. It emphasizes that there is still a strong relationship between independent and dependent variables. Besides, if the data is flagged as influential or has a high standardized residual, it is found that the model adequacy has increased $R^{2} = 0.4324$ to $R^{2} = 0.6877$ and $R^{2} = 0.7056$ to $R^{2} = 0.7532$.

Received: May 20, 2024
Revised: July 4, 2024
Accepted: July 19, 2024

References

M. A. Hayes and M. A. M. Capretz, Contextual anomaly detection framework for big sensor data, J. Big Data 2(1) (2015), 2.

L. Basora, X. Olive and T. Dubot, Recent advances in anomaly detection methods applied to aviation, Aerospace 6(11) (2019), 117.

V. Yepmo, G. Smits, O. Pivert and V. Yepmo Tchaghe, Anomaly explanation : a review anomaly explanation: a review, Data and Knowledge Engineering, 2022.

[Online]. Available: https://hal.archives-ouvertes.fr/hal-03449887.

S. Thudumu, P. Branch, J. Jin and J. J. Singh, A comprehensive survey of anomaly detection techniques for high dimensional big data, J. Big Data 7(1) (2020), 1-30.

A. Zimek and P. Filzmoser, There and back again: outlier detection between statistical reasoning and data mining algorithms, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, Wiley-Blackwell, Vol. 8, 2018. doi: 10.1002/widm.1280.

V. Chandola, A. Banerjee and V. Kumar, Anomaly detection: a survey, ACM Computing Surveys (CSUR) 41(3) (2009), 1-58.

V. S. L’vov, A. Pomyalov and I. Procaccia, Outliers, extreme events, and multiscaling, Phys. Rev. E 63(5) (2001), 56118.

R. D. Cook, Detection of influential observation in linear, Technometrics 19(1) (1977), 15-18.

S. Raj and S. Kannan, Detection of outliers in regression model for medical data, International Journal of Medical Research and Health Sciences 6(7) (2017), 50-56. [Online]. Available: www.ijmrhs.com.

Gh. Babaee, F. Amani, A. Biglarian and M. Keshavarz, Detection of outliers methods in medical studies, Tehran University of Medical Sciences Journal 65(7) (2007), 24-27. Accessed: February 28, 2024.

[Online]. Available: https://tumj.tums.ac.ir/article-1-753-en.html.

Y. Yang, J. Yu, C. Wang and J. Wen, Risk assessment of crowd-gathering in urban open public spaces supported by spatio-temporal big data, Sustainability (Switzerland) 14(10) (2022), 6175. doi: 10.3390/su14106175.

D. C. Montgomery, Statistical Quality Control, Wiley Global Education, 2012.

H. N. Ukponmwan and F. B. Ajibade, Evaluation of techniques for univariate normality test using Monte Carlo simulation, American Journal of Theoretical and Applied Statistics 6(5-1) (2017), 51-61. doi: 10.11648/j.ajtas.s.2017060501.18.

M. Rafferty, P. Brogan, J. Hastings, D. Laverty, X. Liu and R. Khan, Local Anomaly Detection by Application of Regression Analysis on PMU Data, 2018. [Online]. Available: http://go.qub.ac.uk/oa-feedback.

W. Hao et al., Anomaly event detection in security surveillance using two-stream based model, Security and Communication Networks, 2020.

doi: 10.1155/2020/8876056.

G. M. Oyeyemi, A. Bukoye and I. Akeyede, Comparison of outlier detection procedures in multiple linear regressions, American Journal of Mathematics and Statistics 5(1) (2015), 37-41. doi: 10.5923/j.ajms.20150501.06.

S. Weisberg, Applied Linear Regression, John Wiley & Sons, Vol. 528, 2005.

Published

07-08-2024

Issue

Section

Articles

How to Cite

DISCOVERING PATTERNS AND DEVIATIONS IN DATA: COMPARISON OF ANOMALY DETECTION PROCEDURE IN REGRESSION. (2024). Advances and Applications in Statistics , 91(9), 1195-1215. https://doi.org/10.17654/0972361724063

Similar Articles

1-10 of 69

You may also start an advanced similarity search for this article.