Training language models to follow instructions with human feedback L Ouyang, J Wu, X Jiang, D Almeida, C Wainwright, P Mishkin, C Zhang, ... Advances in neural information processing systems 35, 27730-27744, 2022 | 7181 | 2022 |
GPT-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 1870 | 2023 |
Learning to summarize with human feedback N Stiennon, L Ouyang, J Wu, D Ziegler, R Lowe, C Voss, A Radford, ... Advances in Neural Information Processing Systems 33, 3008-3021, 2020 | 1207 | 2020 |
WebGPT: Browser-assisted question-answering with human feedback R Nakano, J Hilton, S Balaji, J Wu, L Ouyang, C Kim, C Hesse, S Jain, ... arXiv preprint arXiv:2112.09332, 2021 | 760 | 2021 |
Improving image generation with better captions J Betker, G Goh, L Jing, T Brooks, J Wang, L Li, L Ouyang, J Zhuang, ... Computer Science. https://cdn.openai.com/papers/dall-e-3.pdf 2 (3), 8, 2023 | 279 | 2023 |
Recursively summarizing books with human feedback J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano arXiv preprint arXiv:2109.10862, 2021 | 203 | 2021 |
Training language models to follow instructions with human feedback, 2022 L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ... URL https://arxiv.org/abs/2203.02155 13, 1, 2022 | 175 | 2022 |
Self-critiquing models for assisting human evaluators W Saunders, C Yeh, J Wu, S Bills, L Ouyang, J Ward, J Leike arXiv preprint arXiv:2206.05802, 2022 | 131 | 2022 |
Training language models to follow instructions with human feedback. arXiv L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ... arXiv preprint arXiv:2203.02155, 2022 | 80 | 2022 |
Training language models to follow instructions with human feedback. arXiv 2022 L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ... arXiv preprint arXiv:2203.02155 10, 2022 | 37 | 2022 |
Practical optimal experiment design with probabilistic programs L Ouyang, MH Tessler, D Ly, N Goodman arXiv preprint arXiv:1608.05046, 2016 | 22 | 2016 |
Semantic coherence facilitates distributional learning L Ouyang, L Boroditsky, MC Frank Cognitive science 41, 855-884, 2017 | 19 | 2017 |
Learning to summarize from human feedback, 2020 N Stiennon, L Ouyang, J Wu, DM Ziegler, R Lowe, C Voss, A Radford, ... URL https://arxiv.org/abs, 2009 | 10 | 2009 |
Fabular: Regression formulas as probabilistic programming J Borgström, AD Gordon, L Ouyang, C Russo, A Ścibior, M Szymczak Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of …, 2016 | 8 | 2016 |
webppl-oed: A practical optimal experiment design system. L Ouyang, MH Tessler, D Ly, ND Goodman CogSci, 2018 | 7 | 2018 |
Recursively summarizing books with human feedback, 2021 J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano URL https://arxiv.org/abs/2109.10862, 0 | 7 | |
Semantic coherence facilitates distributional learning of word meanings L Ouyang, L Boroditsky, M Frank Proceedings of the Annual Meeting of the Cognitive Science Society 34 (34), 2012 | 3 | 2012 |
Bayesian inference of regular expressions from human-generated example strings L Ouyang arXiv preprint arXiv:1805.08427, 2018 | 2 | 2018 |
Pedagogical learning L Ouyang, MC Frank arXiv preprint arXiv:1711.09401, 2017 | 1 | 2017 |
The Effect of Learning on Learning L Ouyang Stanford University, 2015 | | 2015 |