Threaded ensembles of autoencoders for stream learning

Abstract

Anomaly detection in streaming data is an important problem in numerous application domains. Most existing model-based approaches to stream learning are based on decision trees due to their fast construction speed. This paper introduces streaming autoencoder (SA), a fast and novel anomaly detection algorithm based on ensembles of neural networks for evolving data streams. It is a one-class learner, which only requires data from the positive class for training and is accurate even when anomalous training data are rare. It features an ensemble of threaded autoencoders with continuous learning capacity. Furthermore, the SA uses a 2-step detection mechanism to ensure that real anomalies are detected with low false-positive rates. The method is highly efficient because it processes data streams in parallel with multithreads and alternating buffers. Our analysis shows that SA has a linear runtime and requires constant memory space. Empirical comparisons to the state-of-the-art methods on multiple benchmark data sets demonstrate that the proposed method detects anomalies efficiently with fewer false alarms.

Publication
Computational Intelligence