Topic modeling using latent Dirichlet allocation: A survey

U Chauhan, A Shah - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
We are not able to deal with a mammoth text corpus without summarizing them into a
relatively small subset. A computational tool is extremely needed to understand such a …

Pre-training is a hot topic: Contextualized document embeddings improve topic coherence

F Bianchi, S Terragni, D Hovy - arXiv preprint arXiv:2004.03974, 2020 - arxiv.org
Topic models extract groups of words from documents, whose interpretation as a topic
hopefully allows for a better understanding of the data. However, the resulting word groups …

Neural topic model via optimal transport

H Zhao, D Phung, V Huynh, T Le, W Buntine - arXiv preprint arXiv …, 2020 - arxiv.org
Recently, Neural Topic Models (NTMs) inspired by variational autoencoders have obtained
increasingly research interest due to their promising results on text analysis. However, it is …

Scholastic: Graphical human-AI collaboration for inductive and interpretive text analysis

MH Hong, LA Marsh, JL Feuston, J Ruppert… - Proceedings of the 35th …, 2022 - dl.acm.org
Interpretive scholars generate knowledge from text corpora by manually sampling
documents, applying codes, and refining and collating codes into categories until …

MetaLDA: A topic model that efficiently incorporates meta information

H Zhao, L Du, W Buntine, G Liu - 2017 IEEE international …, 2017 - ieeexplore.ieee.org
Besides the text content, documents and their associated words usually come with rich sets
of meta information, such as categories of documents and semantic/syntactic features of …

Hierarchical Deep Document Model

Y Yang, JP Lalor, A Abbasi… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Topic modeling is a commonly used text analysis tool for discovering latent topics in a text
corpus. However, while topics in a text corpus often exhibit a hierarchical structure (eg …

A knowledge graph enhanced topic modeling approach for herb recommendation

X Wang, Y Zhang, X Wang, J Chen - … 2019, Chiang Mai, Thailand, April 22 …, 2019 - Springer
Abstract Traditional Chinese Medicine (TCM) plays an important role in Chinese society and
is an increasingly popular therapy around the world. A data-driven herb recommendation …

Presyndromic surveillance for improved detection of emerging public health threats

M Nobles, R Lall, RW Mathes, DB Neill - Science Advances, 2022 - science.org
Existing public health surveillance systems that rely on predefined symptom categories, or
syndromes, are effective at monitoring known illnesses, but there is a critical need for …

A word embeddings informed focused topic model

H Zhao, L Du, W Buntine - Asian conference on machine …, 2017 - proceedings.mlr.press
In natural language processing and related fields, it has been shown that the word
embeddings can successfully capture both the semantic and syntactic features of words …

Bert-based latent semantic analysis (Bert-LSA): a case study on geospatial data technology and application trend analysis

Q Cheng, Y Zhu, J Song, H Zeng, S Wang, K Sun… - Applied Sciences, 2021 - mdpi.com
Geospatial data is an indispensable data resource for research and applications in many
fields. The technologies and applications related to geospatial data are constantly …