作者
Wei Emma Zhang, Quan Z Sheng, Jey Han Lau, Ermyas Abebe
发表日期
2017/4/3
图书
Proceedings of the 26th International Conference on World Wide Web
页码范围
1221-1229
简介
Programming community-based question-answering (PCQA) websites such as Stack Overflow enable programmers to find working solutions to their questions. Despite detailed posting guidelines, duplicate questions that have been answered are frequently created. To tackle this problem, Stack Overflow provides a mechanism for reputable users to manually mark duplicate questions. This is a laborious effort, and leads to many duplicate questions remain undetected. Existing duplicate detection methodologies from traditional community based question-answering (CQA) websites are difficult to be adopted directly to PCQA, as PCQA posts often contain source code which is linguistically very different from natural languages. In this paper, we propose a methodology designed for the PCQA domain to detect duplicate questions. We model the detection as a classification problem over question pairs. To extract …
引用总数
2017201820192020202120222023381513111214
学术搜索中的文章
WE Zhang, QZ Sheng, JH Lau, E Abebe - Proceedings of the 26th International Conference on …, 2017