Short text clustering algorithms, application and challenges: A survey
The number of online documents has rapidly grown, and with the expansion of the Web,
document analysis, or text analysis, has become an essential task for preparing, storing …
document analysis, or text analysis, has become an essential task for preparing, storing …
Eliciting people's first-order concerns: Text analysis of open-ended survey questions
B Ferrario, S Stantcheva - AEA Papers and Proceedings, 2022 - aeaweb.org
We illustrate the design and use of open-ended survey questions to elicit people's first-order
concerns on policies. Closed-ended questions are the backbone of surveys but may prime …
concerns on policies. Closed-ended questions are the backbone of surveys but may prime …
A dirichlet multinomial mixture model-based approach for short text clustering
Short text clustering has become an increasingly important task with the popularity of social
media like Twitter, Google+, and Facebook. It is a challenging problem due to its sparse …
media like Twitter, Google+, and Facebook. It is a challenging problem due to its sparse …
Data clustering: 50 years beyond K-means
AK Jain - Pattern recognition letters, 2010 - Elsevier
Organizing data into sensible groupings is one of the most fundamental modes of
understanding and learning. As an example, a common scheme of scientific classification …
understanding and learning. As an example, a common scheme of scientific classification …
Semantics-aware content-based recommender systems
Content-based recommender systems (CBRSs) rely on item and user descriptions (content)
to build item representations and user profiles that can be effectively exploited to suggest …
to build item representations and user profiles that can be effectively exploited to suggest …
Tagme: on-the-fly annotation of short text fragments (by wikipedia entities)
P Ferragina, U Scaiella - Proceedings of the 19th ACM international …, 2010 - dl.acm.org
We designed and implemented TAGME, a system that is able to efficiently and judiciously
augment a plain-text with pertinent hyperlinks to Wikipedia pages. The specialty of TAGME …
augment a plain-text with pertinent hyperlinks to Wikipedia pages. The specialty of TAGME …
Short text classification in twitter to improve information filtering
In microblogging services such as Twitter, the users may become overwhelmed by the raw
data. One solution to this problem is the classification of short text messages. As short texts …
data. One solution to this problem is the classification of short text messages. As short texts …
Self-taught convolutional neural networks for short text clustering
J Xu, B Xu, P Wang, S Zheng, G Tian, J Zhao - Neural Networks, 2017 - Elsevier
Short text clustering is a challenging problem due to its sparseness of text representation.
Here we propose a flexible Self-Taught Convolutional neural network framework for Short …
Here we propose a flexible Self-Taught Convolutional neural network framework for Short …
Learning to link with wikipedia
This paper describes how to automatically cross-reference documents with Wikipedia: the
largest knowledge base ever known. It explains how machine learning can be used to …
largest knowledge base ever known. It explains how machine learning can be used to …
Learning to classify short and sparse text & web with hidden topics from large-scale data collections
This paper presents a general framework for building classifiers that deal with short and
sparse text & Web segments by making the most of hidden topics discovered from large …
sparse text & Web segments by making the most of hidden topics discovered from large …