Active example selection for in-context learning Y Zhang, S Feng, C Tan arXiv preprint arXiv:2211.04486, 2022 | 107 | 2022 |
Conversations gone alright: Quantifying and predicting prosocial outcomes in online conversations J Bao, J Wu, Y Zhang, E Chandrasekharan, D Jurgens Proceedings of the Web Conference 2021, 1134-1145, 2021 | 35 | 2021 |
Effective Prompt Extraction from Language Models Y Zhang, N Carlini, D Ippolito arXiv preprint arXiv:2307.06865, 2023 | 27* | 2023 |
Selective explanations: Leveraging human input to align explainable ai V Lai, Y Zhang, C Chen, QV Liao, C Tan Proceedings of the ACM on Human-Computer Interaction 7 (CSCW2), 1-35, 2023 | 20 | 2023 |
FLamE: Few-shot learning from natural language explanations Y Zhou, Y Zhang, C Tan arXiv preprint arXiv:2306.08042, 2023 | 6 | 2023 |
Learning to Ignore Adversarial Attacks Y Zhang, Y Zhou, S Carton, C Tan arXiv preprint arXiv:2205.11551, 2022 | 3 | 2022 |
BiasX: "Thinking slow" in toxic content moderation with explanations of implied social biases Y Zhang, S Nanduri, L Jiang, T Wu, M Sap arXiv preprint arXiv:2305.13589, 2023 | 2 | 2023 |
Building a Flexible Knowledge Graph to Capture Real-World Events. L Burdick, O Ignat, Y Zhang, R Mihalcea, M Wang, S Wilson, Y Wei, ... TAC, 2019 | 1 | 2019 |
Forcing Diffuse Distributions out of Language Models Y Zhang, A Schwarzschild, N Carlini, Z Kolter, D Ippolito arXiv preprint arXiv:2404.10859, 2024 | | 2024 |