Almuammar, Manal and Fasli, Maria (2019) Learning Patterns from Imbalanced Evolving Data Streams. In: 2018 IEEE International Conference on Big Data (Big Data), 2018-12-10 - 2018-12-13, Seattle, WA, USA.
Almuammar, Manal and Fasli, Maria (2019) Learning Patterns from Imbalanced Evolving Data Streams. In: 2018 IEEE International Conference on Big Data (Big Data), 2018-12-10 - 2018-12-13, Seattle, WA, USA.
Almuammar, Manal and Fasli, Maria (2019) Learning Patterns from Imbalanced Evolving Data Streams. In: 2018 IEEE International Conference on Big Data (Big Data), 2018-12-10 - 2018-12-13, Seattle, WA, USA.
Abstract
Learning patterns from evolving data streams is challenging due to the characteristics of such streams: being continuous, unbounded and high speed data of non-stationary nature, which must be processed on the fly, using minimal computational resources. An additional challenge is imposed by the imbalanced data streams in many real-world applications, this difficulty becomes more prominent in multi-class learning tasks. This paper investigates the multi-class imbalance problem in non-stationary streams and develops a method to exploit realtime stream data and capture the dynamic of patterns from heterogeneous streams. In particular, we seek to extend concept drift adaptation techniques into imbalanced classes’ scenarios, and accordingly, we use an adaptive learner to classify multiple streams over a sequence of titled time windows. We include examples of the falsely classified instances in the training set, then we propose using a dynamic support threshold to discover the frequent patterns in these streams. We conduct an experiment on the car parking lots environment of a typical University with three simulated streams from sensors, smart pay stations and a mobile application. The result indicates the efficiency of applying adaptive learner approaches and modifying the training set to cope with the concept drift in multi-class imbalance scenarios, it also shows the merit of using a dynamic threshold to detect the rare patterns from evolving streams.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | Published proceedings: 2018 IEEE International Conference on Big Data (Big Data) |
Uncontrolled Keywords: | data stream; pattern discovery; imbalanced classes; evolving stream; rare pattern |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 19 Mar 2019 13:05 |
Last Modified: | 06 Nov 2024 06:21 |
URI: | http://repository.essex.ac.uk/id/eprint/24230 |
Available files
Filename: fasli_big_data_02.pdf