Interactive document clustering revisited: a visual analytics approach

E Sherkat, S Nourashrafeddin, EE Milios… - Proceedings of the 23rd …, 2018 - dl.acm.org
Proceedings of the 23rd International Conference on Intelligent User Interfaces, 2018dl.acm.org
Document clustering is an efficient way to get insight into large text collections. Due to the
personalized nature of document clustering, even the best fully automatic algorithms cannot
create clusters that accurately reflect the user» s perspectives. To incorporate the user» s
perspective in the clustering process and, at the same time, effectively visualize document
collections to enhance user's sense-making of data, we propose a novel visual analytics
system for interactive document clustering. We built our system on top of clustering …
Document clustering is an efficient way to get insight into large text collections. Due to the personalized nature of document clustering, even the best fully automatic algorithms cannot create clusters that accurately reflect the user»s perspectives. To incorporate the user»s perspective in the clustering process and, at the same time, effectively visualize document collections to enhance user's sense-making of data, we propose a novel visual analytics system for interactive document clustering. We built our system on top of clustering algorithms that can adapt to user's feedback. First, the initial clustering is created based on the user-defined number of clusters and the selected clustering algorithm. Second, the clustering result is visualized to the user. A collection of coordinated visualization modules and document projection is designed to guide the user towards a better insight into the document collection and clusters. The user changes clusters and key-terms iteratively as a feedback to the clustering algorithm until the result is satisfactory. In key-term based interaction, the user assigns a set of key-terms to each target cluster to guide the clustering algorithm. A set of quantitative experiments, a use case, and a user study have been conducted to show the advantages of the approach for document analytics based on clustering.
ACM Digital Library
以上显示的是最相近的搜索结果。 查看全部搜索结果