school

UM E-Theses Collection (澳門大學電子學位論文庫)

check Full Text
Title

Outlier detection with data stream mining approach in high-dimenional time series data

English Abstract

OUTLIER DETECTION WITH DATA STREAM MINING APPROACH IN HIGHDIMENSIONAL TIME SERIES DATA By WANG DAN TONG Thesis Supervisor: Dr. Simon Fong Department of Computer and Information Science Master of Science in E-commerce Technology Outlier detection has applications in fraud identification in the industries. It can also serve as a preprocessing technique that cleans noise from the training data for the subsequent model induction. As a preprocessing technique, it is necessary to distinguish whether the outliers are superfluous noises which warrant removal, or they are interesting anomalies which should be learnt as well. In this thesis, the author presents a novel algorithm that uses probabilistic and density-based model combined with incrementally optimized very fast decision tree (ioVFDT) for embracing outliers and for generalizing recognition powers. The outlier removal mechanism is empowered by logics that keep counts of outlier occurrence. Additional classification rules are formulated for outliers that consistently map data to correct target classes. Otherwise they would be removed. The proposed algorithm is tested with nine different scenarios time series datasets for verifying its efficacy.

Issue date

2017.

Author

Wang, Dan Tong

Faculty
Faculty of Science and Technology
Department
Department of Computer and Information Science
Degree

M.Sc.

Subject

Outliers (Statistics)

Data mining

Supervisor

Fong, Chi Chiu

Files In This Item

Full-text (Intranet only)

Location
1/F Zone C
Library URL
991005784989706306