K2: A foundation language model for geoscience knowledge understanding and utilization

C Deng, T Zhang, Z He, Q Chen, Y Shi, Y Xu… - Proceedings of the 17th …, 2024 - dl.acm.org
Large language models (LLMs) have achieved great success in general domains of natural
language processing. In this paper, we bring LLMs to the realm of geoscience with the …

AceMap: Knowledge Discovery through Academic Graph

X Wang, L Fu, X Gan, Y Wen, G Zheng, J Ding… - arXiv preprint arXiv …, 2024 - arxiv.org
The exponential growth of scientific literature requires effective management and extraction
of valuable insights. While existing scientific search engines excel at delivering search …

Construction and performance evaluation of a multi-view web page classification dataset

孙辰星, 刘伟, 卢彬, 梁诗宇, 诸云强… - Journal of Nanjing University (Natural Science …, 2024 - jns.nju.edu.cn
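Web page classification is an important task in Internet data mining and plays a key role in
areas such as information retrieval, recommender systems, and knowledge discovery. However, existing public web page datasets lack multi-view information and are hard to apply to web pages with complex features …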

AutoFAIR: Automatic Data FAIRification via Machine Reading

T Ma, W Liu, B Lu, X Gan, Y Zhu, L Fu… - arXiv preprint arXiv …, 2024 - arxiv.org
The explosive growth of data fuels data-driven research, facilitating progress across diverse
domains. The FAIR principles emerge as a guiding standard, aiming to enhance the …