作者
Naoki Shibata, Yuya Kajikawa, Ichiro Sakata
发表日期
2012/1
期刊
Journal of the American society for information science and technology
卷号
63
期号
1
页码范围
78-85
出版商
Wiley Subscription Services, Inc., A Wiley Company
简介
In this article, we build models to predict the existence of citations among papers by formulating link prediction for 5 large‐scale datasets of citation networks. The supervised machine‐learning model is applied with 11 features. As a result, our learner performs very well, with the F1 values of between 0.74 and 0.82. Three features in particular, link‐based Jaccard coefficient difference in betweenness centrality, and cosine similarity of term frequency–inverse document frequency vectors, largely affect the predictions of citations. The results also indicate that different models are required for different types of research areas—research fields with a single issue or research fields with multiple issues. In the case of research fields with multiple issues, there are barriers among research fields because our results indicate that papers tend to be cited in each research field locally. Therefore, one must consider the typology of …
引用总数
2012201320142015201620172018201920202021202220232024111119101251071217177
学术搜索中的文章
N Shibata, Y Kajikawa, I Sakata - Journal of the American society for information science …, 2012