Faculty Achievement View More Faculty Achievements

Dr. Tariq Mahmood and Usman Ali

Published Works
IBA faculty and PhD scholar co-author a paper on a novel framework for concept drift detection using autoencoders for classification problems in data streams

Professor, Department of Computer Science, School of Mathematics and Computer Science (SMCS), Dr. Tariq Mahmood; and PhD scholar and Lecturer SMCS, Usman Ali co-authored a paper titled ‘A novel framework for concept drift detection using autoencoders for classification problems in data streams’, published in the International Journal of Machine Learning and Cybernetics (HEC-W and an impact factor of 3.1).

Abstract

In streaming data environments, data characteristics and probability distributions are likely to change over time, causing a phenomenon called concept drift, which poses challenges for machine learning models to predict accurately. In such non-stationary environments, there is a need to detect concept drift and update the model to maintain an acceptable predictive performance. Existing approaches to drift detection have inherent problems like requirements of truth labels in supervised detection methods and high false positive rate in case of unsupervised drift detection. In this paper, we propose a semi-supervised Autoencoder based Drift Detection Method (AEDDM) aimed at detecting drift without the need of truth labels, yet with a high confidence that the detected drift is real. In a binary classification setting, AEDDM uses two autoencoders in a layered architecture, trained on labelled data and uses a thresholding mechanism based on reconstruction error to signal the presence of drift. The proposed method has been evaluated on four synthetic and four real world datasets with different drifting scenarios. In case of real-world datasets, the induced and detected drifts have been evaluated from classifier’s performance viewpoint using seven mostly used batch classifiers as well as from adaptation perspective in an online learning environment using Hoeffding Tree classifier. The results show that AEDDM affectively detects the distributional changes in data which are most likely to impact the classifier’s performance (real drift) while ignoring the virtual drift thus considerably reducing the false alarms with an ability to adapt in terms of classification performance.

The paper can be accessed here.