What does it mean for a language model to preserve privacy?

H Brown, K Lee, F Mireshghallah, R Shokri… - Proceedings of the 2022 …, 2022 - dl.acm.org
Natural language reflects our private lives and identities, making its privacy concerns as
broad as those of real life. Language models lack the ability to understand the context and …

Beyond rating scales: With targeted evaluation, language models are poised for psychological assessment

ONE Kjell, K Kjell, HA Schwartz - Psychiatry Research, 2023 - Elsevier
In this narrative review, we survey recent empirical evaluations of AI-based language
assessments and present a case for the technology of large language models to be poised …

The text anonymization benchmark (TAB): A dedicated corpus and evaluation framework for text anonymization

I Pilán, P Lison, L Øvrelid, A Papadopoulou… - Computational …, 2022 - direct.mit.edu
We present a novel benchmark and associated evaluation metrics for assessing the
performance of text anonymization methods. Text anonymization, defined as the task of …

Identifying and mitigating privacy risks stemming from language models: A survey

V Smith, AS Shamsabadi, C Ashurst… - arXiv preprint arXiv …, 2023 - arxiv.org
Rapid advancements in language models (LMs) have led to their adoption across many
sectors. Alongside the potential benefits, such models present a range of risks, including …

[PDF][PDF] " It'sa Fair Game", or Is It? Examining How Users Navigate Disclosure Risks and Benefits When Using LLM-Based Conversational Agents

Z Zhang, M Jia, HP Lee, B Yao, S Das, A Lerner, T Li - 2024 - adalerner.com
The widespread use of Large Language Model (LLM)-based conversational agents (CAs),
especially in high-stakes domains, raises many privacy concerns. Building ethical LLM …

Preserving privacy through dememorization: An unlearning technique for mitigating memorization risks in language models

A Kassem, O Mahmoud, S Saad - Proceedings of the 2023 …, 2023 - aclanthology.org
Large language models (LLMs) are trained on vast amounts of data, including
sensitive information that poses a risk to personal privacy if exposed. LLMs have shown the …

Man vs the machine in the struggle for effective text anonymisation in the age of large language models

C Patsakis, N Lykousas - Scientific Reports, 2023 - nature.com
The collection and use of personal data are becoming more common in today's data-driven
culture. While there are many advantages to this, including better decision-making and …

Grandma Karl is 27 years old – research agenda for pseudonymization of research data

E Volodina, S Dobnik… - 2023 IEEE Ninth …, 2023 - ieeexplore.ieee.org
Accessibility of research data is critical for advances in many research fields, but textual data
often cannot be shared due to the personal and sensitive information which it contains, e.g. …

Learning to unlearn: Instance-wise unlearning for pre-trained classifiers

S Cha, S Cho, D Hwang, H Lee, T Moon… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Since the recent advent of regulations for data protection (e.g., the General Data Protection
Regulation), there has been increasing demand for deleting information learned from …

On text-based personality computing: Challenges and future directions

Q Fang, A Giachanou, A Bagheri… - Findings of the …, 2023 - aclanthology.org
Text-based personality computing (TPC) has attracted much research interest in NLP. In this
paper, we describe 15 challenges that we consider deserving of the attention of the NLP …