A survey on semi-supervised learning for delayed partially labelled data streams

HM Gomes, M Grzenda, R Mello, J Read… - ACM Computing …, 2022 - dl.acm.org
Unlabelled data appear in many domains and are particularly relevant to streaming
applications, where even though data is abundant, labelled data is rare. To address the …

Insomnia: Towards concept-drift robustness in network intrusion detection

G Andresini, F Pendlebury, F Pierazzi… - Proceedings of the 14th …, 2021 - dl.acm.org
Despite decades of research in network traffic analysis and incredible advances in artificial
intelligence, network intrusion detection systems based on machine learning (ML) have yet …

Kappa updated ensemble for drifting data stream mining

A Cano, B Krawczyk - Machine Learning, 2020 - Springer
Learning from data streams in the presence of concept drift is among the biggest challenges
of contemporary machine learning. Algorithms designed for such scenarios must take into …

Improving the performance of bagging ensembles for data streams through mini-batching

G Cassales, H Gomes, A Bifet, B Pfahringer… - Information Sciences, 2021 - Elsevier
Often, machine learning applications have to cope with dynamic environments where data
are collected in the form of continuous data streams with potentially infinite length and …

An overview of complex data stream ensemble classification

X Zhang, M Han, H Wu, M Li… - Journal of Intelligent & …, 2021 - content.iospress.com
With the rapid development of information technology, data streams in various fields are
showing the characteristics of rapid arrival, complex structure and timely processing …

Distributed and parallel ensemble classification for big data based on Kullback-Leibler random sample partition

C Wei, J Zhang, T Valiullin, W Cao, Q Wang… - … and Architectures for …, 2020 - Springer
In this article, we use a Kullback-Leibler random sample partition data model to generate a
set of disjoint data blocks, where each block is a good representation of the entire data set …

Hoeffding trees with nmin adaptation

E García-Martín, N Lavesson, H Grahn… - 2018 IEEE 5th …, 2018 - ieeexplore.ieee.org
Machine learning software accounts for a significant amount of energy consumed in data
centers. These algorithms are usually optimized towards predictive performance, ie …

Improving parallel performance of ensemble learners for streaming data through data locality with mini-batching

G Cassales, H Gomes, A Bifet… - 2020 IEEE 22nd …, 2020 - ieeexplore.ieee.org
Machine Learning techniques have been employed in virtually all domains in the past few
years. New applications demand the ability to cope with dynamic environments like data …

Energy modeling of Hoeffding tree ensembles

E García-Martín, A Bifet… - Intelligent Data Analysis, 2021 - content.iospress.com
Energy consumption reduction has been an increasing trend in machine learning over the
past few years due to its socio-ecological importance. In new challenging areas such as …

Energy-aware very fast decision tree

E García-Martín, N Lavesson, H Grahn… - International Journal of …, 2021 - Springer
Recently machine learning researchers are designing algorithms that can run in embedded
and mobile devices, which introduces additional constraints compared to traditional …