DISCOVERING PATTERNS AND DEVIATIONS IN DATA: COMPARISON OF ANOMALY DETECTION PROCEDURE IN REGRESSION
Keywords:
anomaly detection, regressionDOI:
https://doi.org/10.17654/0972361724063Abstract
Finding interesting or unusual patterns plays an important role in a lot of fields, including regression, business intelligence, data mining, and decision making. In many domain applications, anomaly detection and regression are both important, with the aim of identifying patterns or trends that can pose challenges in their respective areas. As a result, empirical analysis was done to find out how well leverage, DFFITS, and standardized residual performed in identifying abnormalities.
Findings. Mostly, for discovering patterns of data in cases regarding simple and multiple linear regression, if the data is flagged as high leverage, then there is a possibility that it may highly influence the estimated regression. However, in both the cases, there are hardly any side effects from the performance of $R^2$. It emphasizes that there is still a strong relationship between independent and dependent variables. Besides, if the data is flagged as influential or has a high standardized residual, it is found that the model adequacy has increased $R^{2} = 0.4324$ to $R^{2} = 0.6877$ and $R^{2} = 0.7056$ to $R^{2} = 0.7532$.
Received: May 20, 2024
Revised: July 4, 2024
Accepted: July 19, 2024
References
M. A. Hayes and M. A. M. Capretz, Contextual anomaly detection framework for big sensor data, J. Big Data 2(1) (2015), 2.
L. Basora, X. Olive and T. Dubot, Recent advances in anomaly detection methods applied to aviation, Aerospace 6(11) (2019), 117.
V. Yepmo, G. Smits, O. Pivert and V. Yepmo Tchaghe, Anomaly explanation : a review anomaly explanation: a review, Data and Knowledge Engineering, 2022.
[Online]. Available: https://hal.archives-ouvertes.fr/hal-03449887.
S. Thudumu, P. Branch, J. Jin and J. J. Singh, A comprehensive survey of anomaly detection techniques for high dimensional big data, J. Big Data 7(1) (2020), 1-30.
A. Zimek and P. Filzmoser, There and back again: outlier detection between statistical reasoning and data mining algorithms, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, Wiley-Blackwell, Vol. 8, 2018. doi: 10.1002/widm.1280.
V. Chandola, A. Banerjee and V. Kumar, Anomaly detection: a survey, ACM Computing Surveys (CSUR) 41(3) (2009), 1-58.
V. S. L’vov, A. Pomyalov and I. Procaccia, Outliers, extreme events, and multiscaling, Phys. Rev. E 63(5) (2001), 56118.
R. D. Cook, Detection of influential observation in linear, Technometrics 19(1) (1977), 15-18.
S. Raj and S. Kannan, Detection of outliers in regression model for medical data, International Journal of Medical Research and Health Sciences 6(7) (2017), 50-56. [Online]. Available: www.ijmrhs.com.
Gh. Babaee, F. Amani, A. Biglarian and M. Keshavarz, Detection of outliers methods in medical studies, Tehran University of Medical Sciences Journal 65(7) (2007), 24-27. Accessed: February 28, 2024.
[Online]. Available: https://tumj.tums.ac.ir/article-1-753-en.html.
Y. Yang, J. Yu, C. Wang and J. Wen, Risk assessment of crowd-gathering in urban open public spaces supported by spatio-temporal big data, Sustainability (Switzerland) 14(10) (2022), 6175. doi: 10.3390/su14106175.
D. C. Montgomery, Statistical Quality Control, Wiley Global Education, 2012.
H. N. Ukponmwan and F. B. Ajibade, Evaluation of techniques for univariate normality test using Monte Carlo simulation, American Journal of Theoretical and Applied Statistics 6(5-1) (2017), 51-61. doi: 10.11648/j.ajtas.s.2017060501.18.
M. Rafferty, P. Brogan, J. Hastings, D. Laverty, X. Liu and R. Khan, Local Anomaly Detection by Application of Regression Analysis on PMU Data, 2018. [Online]. Available: http://go.qub.ac.uk/oa-feedback.
W. Hao et al., Anomaly event detection in security surveillance using two-stream based model, Security and Communication Networks, 2020.
doi: 10.1155/2020/8876056.
G. M. Oyeyemi, A. Bukoye and I. Akeyede, Comparison of outlier detection procedures in multiple linear regressions, American Journal of Mathematics and Statistics 5(1) (2015), 37-41. doi: 10.5923/j.ajms.20150501.06.
S. Weisberg, Applied Linear Regression, John Wiley & Sons, Vol. 528, 2005.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Pushpa Publishing House, Prayagraj, India

This work is licensed under a Creative Commons Attribution 4.0 International License.
____________________________
Attribution: Credit Pushpa Publishing House as the original publisher, including title and author(s) if applicable.
No Derivatives: Modifying or creating derivative works not allowed without written permission.
Contact Pushpa Publishing House for more info or permissions.
Journal Impact Factor: 