A review on fairness in machine learning
An increasing number of decisions regarding the daily lives of human beings are being
controlled by artificial intelligence and machine learning (ML) algorithms in spheres ranging …
controlled by artificial intelligence and machine learning (ML) algorithms in spheres ranging …
Trustworthy artificial intelligence: a review
Artificial intelligence (AI) and algorithmic decision making are having a profound impact on
our daily lives. These systems are vastly used in different high-stakes applications like …
our daily lives. These systems are vastly used in different high-stakes applications like …
[PDF][PDF] DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models.
Abstract Generative Pre-trained Transformer (GPT) models have exhibited exciting progress
in their capabilities, capturing the interest of practitioners and the public alike. Yet, while the …
in their capabilities, capturing the interest of practitioners and the public alike. Yet, while the …
Holistic evaluation of language models
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …
technologies, but their capabilities, limitations, and risks are not well understood. We present …
The capacity for moral self-correction in large language models
We test the hypothesis that language models trained with reinforcement learning from
human feedback (RLHF) have the capability to" morally self-correct"--to avoid producing …
human feedback (RLHF) have the capability to" morally self-correct"--to avoid producing …
Auditing large language models: a three-layered approach
Large language models (LLMs) represent a major advance in artificial intelligence (AI)
research. However, the widespread use of LLMs is also coupled with significant ethical and …
research. However, the widespread use of LLMs is also coupled with significant ethical and …
A survey on the fairness of recommender systems
Recommender systems are an essential tool to relieve the information overload challenge
and play an important role in people's daily lives. Since recommendations involve …
and play an important role in people's daily lives. Since recommendations involve …
[图书][B] Towards a standard for identifying and managing bias in artificial intelligence
R Schwartz, R Schwartz, A Vassilev, K Greene… - 2022 - dwt.com
As individuals and communities interact in and with an environment that is increasingly
virtual, they are often vulnerable to the commodification of their digital footprint. Concepts …
virtual, they are often vulnerable to the commodification of their digital footprint. Concepts …
A holistic approach to undesired content detection in the real world
We present a holistic approach to building a robust and useful natural language
classification system for real-world content moderation. The success of such a system relies …
classification system for real-world content moderation. The success of such a system relies …
Toward causal representation learning
The two fields of machine learning and graphical causality arose and are developed
separately. However, there is, now, cross-pollination and increasing interest in both fields to …
separately. However, there is, now, cross-pollination and increasing interest in both fields to …