TempoWiC: An evaluation benchmark for detecting meaning shift in social media

Coannotating: Uncertainty-guided work allocation between human and large language models for data annotation

M Li, T Shi, C Ziems, MY Kan, NF Chen, Z Liu… - arXiv preprint arXiv …, 2023 - arxiv.org

Annotated data plays a critical role in Natural Language Processing (NLP) in training
models and evaluating their performance. Given recent developments in Large Language …

被引用次数：27 相关文章所有 7 个版本

[PDF] arxiv.org

A survey on contextualised semantic shift detection

S Montanelli, F Periti - arXiv preprint arXiv:2304.01666, 2023 - arxiv.org

Semantic Shift Detection (SSD) is the task of identifying, interpreting, and assessing the
possible change over time in the meanings of a target word. Traditionally, SSD has been …

被引用次数：24 相关文章所有 3 个版本

[PDF] acm.org

Lexical Semantic Change through Large Language Models: a Survey

F Periti, S Montanelli - ACM Computing Surveys, 2024 - dl.acm.org

Lexical Semantic Change (LSC) is the task of identifying, interpreting, and assessing the
possible change over time in the meanings of a target word. Traditionally, LSC has been …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

A systematic comparison of contextualized word embeddings for lexical semantic change

F Periti, N Tahmasebi - arXiv preprint arXiv:2402.12011, 2024 - arxiv.org

Contextualized embeddings are the preferred tool for modeling Lexical Semantic Change
(LSC). Current evaluations typically focus on a specific task known as Graded Change …

被引用次数：9 相关文章所有 6 个版本

[PDF] arxiv.org

(Chat) GPT v BERT: Dawn of Justice for Semantic Change Detection

F Periti, H Dubossarsky, N Tahmasebi - arXiv preprint arXiv:2401.14040, 2024 - arxiv.org

In the universe of Natural Language Processing, Transformer-based language models like
BERT and (Chat) GPT have emerged as lexical superheroes with great power to solve open …

被引用次数：7 相关文章所有 8 个版本

[PDF] arxiv.org

Tweet insights: a visualization platform to extract temporal insights from twitter

D Loureiro, K Rezaee, T Riahi, F Barbieri… - arXiv preprint arXiv …, 2023 - arxiv.org

This paper introduces a large collection of time series data derived from Twitter,
postprocessed using word embedding techniques, as well as specialized fine-tuned …

被引用次数：7 相关文章所有 2 个版本

[PDF] arxiv.org

SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research

D Antypas, A Ushio, F Barbieri, L Neves… - arXiv preprint arXiv …, 2023 - arxiv.org

Despite its relevance, the maturity of NLP for social media pales in comparison with general-
purpose models, metrics and benchmarks. This fragmented landscape makes it hard for the …

被引用次数：4 相关文章所有 4 个版本

[PDF] arxiv.org

The LSCD Benchmark: a Testbed for Diachronic Word Meaning Tasks

D Schlechtweg, SM Virk, N Arefyev - arXiv preprint arXiv:2404.00176, 2024 - arxiv.org

Lexical Semantic Change Detection (LSCD) is a complex, lemma-level task, which is usually
operationalized based on two subsequently applied usage-level tasks: First, Word-in …

被引用次数：2 相关文章所有 2 个版本

[PDF] ceur-ws.org

[PDF][PDF] The Time-Embedding Travelers at WiC-ITA.

F Periti, H Dubossarsky - EVALITA, 2023 - ceur-ws.org

The WiC-ITA shared task aims to determine whether a word appearing in two distinct
sentences carries the same meaning. The task consists of two subtasks: binary classification …

被引用次数：4 相关文章所有 2 个版本

[HTML] nature.com Full View

[HTML][HTML] A Dataset for Evaluating Contextualized Representation of Biomedical Concepts in Language Models

H Rouhizadeh, I Nikishina, A Yazdani, A Bornet… - Scientific Data, 2024 - nature.com

Due to the complexity of the biomedical domain, the ability to capture semantically
meaningful representations of terms in context is a long-standing challenge. Despite …