A systematic review and replicability study of bert4rec for sequential recommendation

A Petrov, C Macdonald - Proceedings of the 16th ACM Conference on …, 2022 - dl.acm.org
BERT4Rec is an effective model for sequential recommendation based on the Transformer
architecture. In the original publication, BERT4Rec claimed superiority over other available …

Temporal quality degradation in AI models

D Vela, A Sharp, R Zhang, T Nguyen, A Hoang… - Scientific Reports, 2022 - nature.com
As AI models continue to advance into many real-life applications, their ability to maintain
reliable quality over time becomes increasingly important. The principal challenge in this …

Uncertainty reduction for model adaptation in semantic segmentation

F Fleuret - Proceedings of the IEEE/CVF Conference on …, 2021 - openaccess.thecvf.com
Abstract Traditional methods for Unsupervised Domain Adaptation (UDA) targeting semantic
segmentation exploit information common to the source and target domains, using both …

BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performance

RT McCoy, J Min, T Linzen - arXiv preprint arXiv:1911.02969, 2019 - arxiv.org
If the same neural network architecture is trained multiple times on the same dataset, will it
make similar linguistic generalizations across runs? To study this question, we fine-tuned …

Randomness in neural network training: Characterizing the impact of tooling

D Zhuang, X Zhang, S Song… - Proceedings of Machine …, 2022 - proceedings.mlsys.org
The quest for determinism in machine learning has disproportionately focused on
characterizing the impact of noise introduced by algorithmic design choices. In this work, we …

[HTML][HTML] Understanding the performance of knowledge graph embeddings in drug discovery

S Bonner, IP Barrett, C Ye, R Swiers, O Engkvist… - Artificial Intelligence in …, 2022 - Elsevier
Abstract Knowledge Graphs (KG) and associated Knowledge Graph Embedding (KGE)
models have recently begun to be explored in the context of drug discovery and have the …

Cross-dataset transfer learning for motor imagery signal classification via multi-task learning and pre-training

Y Xie, K Wang, J Meng, J Yue, L Meng… - Journal of Neural …, 2023 - iopscience.iop.org
Objective. Deep learning (DL) models have been proven to be effective in decoding motor
imagery (MI) signals in Electroencephalogram (EEG) data. However, DL models' success …

Leveraging the meta-embedding for text classification in a resource-constrained language

MR Hossain, MM Hoque, N Siddique - Engineering Applications of Artificial …, 2023 - Elsevier
This paper proposes an intelligent text classification framework for a resource-constrained
language like Bengali, which is considered a challenging task due to the lack of standard …

Elo uncovered: Robustness and best practices in language model evaluation

M Boubdir, E Kim, B Ermis, S Hooker… - arXiv preprint arXiv …, 2023 - arxiv.org
In Natural Language Processing (NLP), the Elo rating system, originally designed for
ranking players in dynamic games such as chess, is increasingly being used to evaluate …

21 000 birds in 4.5 h: efficient large‐scale seabird detection with machine learning

B Kellenberger, T Veen, E Folmer… - Remote Sensing in …, 2021 - Wiley Online Library
We address the task of automatically detecting and counting seabirds in unmanned aerial
vehicle (UAV) imagery using deep convolutional neural networks (CNNs). Our study area …