Fundamental limitations of alignment in large language models | Y Wolf, N Wies, O Avnery, Y Levine, A Shashua | arXiv preprint arXiv:2304.11082, 2023 | 130 | 2023 |
Direct observation of vortices in an electron fluid | A Aharon-Steinberg, T Völkl, A Kaplan, AK Pariari, I Roy, T Holder, Y Wolf, ... | Nature 607 (7917), 74-80, 2022 | 78 | 2022 |
Unusual spin polarization in the chirality-induced spin selectivity | Y Wolf, Y Liu, J Xiao, N Park, B Yan | ACS Nano 16 (11), 18601-18607, 2022 | 46 | 2022 |
Para-hydrodynamics from weak surface scattering in ultraclean thin flakes | Y Wolf, A Aharon-Steinberg, B Yan, T Holder | Nature Communications 14 (1), 2334, 2023 | 6 | 2023 |
Tradeoffs Between Alignment and Helpfulness in Language Models | Y Wolf, N Wies, D Shteyman, B Rothberg, Y Levine, A Shashua | arXiv preprint arXiv:2401.16332, 2024 | | 2024 |