关注
Rongwu Xu
Rongwu Xu
其他姓名许 融武
在 mails.tsinghua.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation
R Xu, BS Lin, S Yang, T Zhang, W Shi, T Zhang, Z Fang, W Xu, H Qiu
ACL 2024, 2023
192023
Knowledge Conflicts for LLMs: A Survey
R Xu, Z Qi, C Wang, H Wang, Y Zhang, W Xu
arXiv preprint arXiv:2403.08319, 2024
162024
How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States
Z Zhou, H Yu, X Zhang, R Xu, F Huang, Y Li
arXiv preprint arXiv:2406.05644, 2024
52024
MISO: legacy-compatible privacy-preserving single sign-on using trusted execution environments
R Xu, S Yang, F Zhang, Z Fang
2023 IEEE 8th European Symposium on Security and Privacy (EuroS&P), 352-372, 2023
52023
LSync: A universal event-synchronizing solution for live streaming
Y Xu, F Dang, R Xu, X Chen, Y Liu
IEEE INFOCOM 2022-IEEE Conference on Computer Communications, 2188-2197, 2022
52022
Liferec: A mobile app for lifelog recording and ubiquitous recommendation
J Li, H Zhang, Z He, R Xu, P Wu, M Zhang, Y Liu, S Ma
Proceedings of the 2022 Conference on Human Information Interaction and …, 2022
32022
Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias
R Xu, Z Zhou, T Zhang, Z Qi, S Yao, K Xu, W Xu, H Qiu
arXiv preprint arXiv:2407.15366, 2024
12024
Preemptive Answer" Attacks" on Chain-of-Thought Reasoning
R Xu, Z Qi, W Xu
ACL 2024 Findings, 2024
12024
DebateQA: Evaluating Question Answering on Debatable Knowledge
R Xu, X Qi, Z Qi, W Xu, Z Guo
arXiv preprint arXiv:2408.01419, 2024
2024
Course-Correction: Safety Alignment Using Synthetic Preferences
R Xu, Y Cai, Z Zhou, R Gu, H Weng, Y Liu, T Zhang, W Xu, H Qiu
arXiv preprint arXiv:2407.16637, 2024
2024
MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
Z Zeng, Y Liu, Y Wan, J Li, P Chen, J Dai, Y Yao, R Xu, Z Qi, W Zhao, ...
arXiv preprint arXiv:2406.13975, 2024
2024
: A Universal Timeline-Synchronizing Solution for Live Streaming
F Dang, Y Xu, R Xu, X Chen, Y Liu
IEEE/ACM Transactions on Networking, 2024
2024
Exploring Chinese Humor Generation: A Study on Two-Part Allegorical Sayings
R Xu
IJCNN 2024, 2024
2024
Tempo: Confidentiality Preservation in Cloud-Based Neural Network Training
R Xu, Z Fang
IJCNN 2024, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–14