AboutMe: Using self-descriptions in webpages to document the effects of english pretraining data filters
Large language models'(LLMs) abilities are drawn from their pretraining data, and model
development begins with data curation. However, decisions around what data is retained or …
development begins with data curation. However, decisions around what data is retained or …
On the evaluation of machine-generated reports
Large Language Models (LLMs) have enabled new ways to satisfy information needs.
Although great strides have been made in applying them to settings like document ranking …
Although great strides have been made in applying them to settings like document ranking …
Famus: Frames across multiple sources
Understanding event descriptions is a central aspect of language processing, but current
approaches focus overwhelmingly on single sentences or documents. Aggregating …
approaches focus overwhelmingly on single sentences or documents. Aggregating …
Report on The Search Futures Workshop at ECIR 2024
The First Search Futures Workshop, in conjunction with the Fourty-sixth European
Conference on Information Retrieval (ECIR) 2024, looked into the future of search to ask …
Conference on Information Retrieval (ECIR) 2024, looked into the future of search to ask …
Uncovering Differences in Persuasive Language in Russian versus English Wikipedia
B Li, A Panasyuk, C Callison-Burch - arXiv preprint arXiv:2409.19148, 2024 - arxiv.org
We study how differences in persuasive language across Wikipedia articles, written in either
English and Russian, can uncover each culture's distinct perspective on different subjects …
English and Russian, can uncover each culture's distinct perspective on different subjects …
Uncovering deep-rooted cultural differences (UNCOVER)
A Panasyuk, B Li… - Disruptive Technologies in …, 2024 - spiedigitallibrary.org
This study delves into the interconnected realms of Debates, Fake News, and Propaganda,
with an emphasis on discerning prominent ideological underpinnings distinguishing …
with an emphasis on discerning prominent ideological underpinnings distinguishing …