Guilherme Penedo 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	1067	1067
h 指数	8	8
i10 指数	7	7

0

760

380

190

570

2022202320243 310 752

Guilherme Penedo

Guilherme Penedo

ML Research Engineer at 🤗 HuggingFace

在 huggingface.co 的电子邮件经过验证


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
The RefinedWeb dataset for Falcon LLM: outperforming curated corpora with web data, and web data only G Penedo, Q Malartic, D Hesslow, R Cojocaru, A Cappelli, H Alobeidli, ... arXiv preprint arXiv:2306.01116, 2023	546	2023
Falcon-40B: an open large language model with state-of-the-art performance E Almazrouei, H Alobeidli, A Alshamsi, A Cappelli, R Cojocaru, M Debbah, ...	199	2023
The falcon series of open language models E Almazrouei, H Alobeidli, A Alshamsi, A Cappelli, R Cojocaru, M Debbah, ... arXiv preprint arXiv:2311.16867, 2023	197	2023
The falcon series of language models: Towards open frontier models E Almazrouei, H Alobeidli, A Alshamsi, A Cappelli, R Cojocaru, ... Hugging Face repository, 2023	31	2023
The RefinedWeb dataset for Falcon LLM: Outperforming curated corpora with web data only G Penedo, Q Malartic, D Hesslow, R Cojocaru, H Alobeidli, A Cappelli, ... Advances in Neural Information Processing Systems 36, 79155-79172, 2023	30	2023
The refinedweb dataset for falcon llm: Outperforming curated corpora with web data, and web data only. arXiv 2023 G Penedo, Q Malartic, D Hesslow, R Cojocaru, A Cappelli, H Alobeidli, ... arXiv preprint arXiv:2306.01116, 0	24
Falcon-40B: an open large language model with state-of-the-art performance. 2023 E Almazrouei, H Alobeidli, A Alshamsi, A Cappelli, R Cojocaru, M Debbah, ... URL https://falconllm. tii. ae, 2023	10	2023
The refinedweb dataset for falcon llm: Outperforming curated corpora with web data only G Penedo, Q Malartic, D Hesslow, R Cojocaru, H Alobeidli, A Cappelli, ... Advances in Neural Information Processing Systems 36, 2024	8	2024
The Falcon Series of Open Language Models.(2023) E Almazrouei, H Alobeidli, A Alshamsi, A Cappelli, R Cojocaru, M Debbah, ... arXiv preprint arXiv:2311.16867, 2023	8	2023
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only.” arXiv G Penedo, Q Malartic, D Hesslow, R Cojocaru, A Cappelli, H Alobeidli, ... arXiv preprint arXiv:2306.01116, 2023	7	2023
AlGhafa Evaluation Benchmark for Arabic Language Models E Almazrouei, R Cojocaru, M Baldo, Q Malartic, H Alobeidli, D Mazzotta, ... Proceedings of ArabicNLP 2023, 244-275, 2023	6	2023
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale G Penedo, H Kydlíček, A Lozhkov, M Mitchell, C Raffel, L Von Werra, ... arXiv preprint arXiv:2406.17557, 2024	1	2024
Artery in Microgravity (AIM): Assembly, integration, and testing for a student payload for the ISS L García Mozos, D Saroya, Y Roelvink, N Santos D'Amore, S Gabetti, ... 4th Symposium on Space Educational Activities, 2022		2022

系统目前无法执行此操作，请稍后再试。

文章 1–13

共建清朗的网络空间,如遇有害信息,请举报。
本站数据皆整合自互联网公开资源索引,方便科研学术方面查询,并不存储相关数据资源;如对此有异议,请联系我们解决.
© 2023 学术资源搜索 @联系我们 | 申请短期会员 | 数据源提交