Topic modeling using latent Dirichlet allocation: A survey
We are not able to deal with a mammoth text corpus without summarizing them into a
relatively small subset. A computational tool is extremely needed to understand such a …
relatively small subset. A computational tool is extremely needed to understand such a …
MetaLDA: A topic model that efficiently incorporates meta information
Besides the text content, documents and their associated words usually come with rich sets
of meta information, such as categories of documents and semantic/syntactic features of …
of meta information, such as categories of documents and semantic/syntactic features of …
Discovering online shopping preference structures in large and frequently changing store assortments
The authors develop an attribute-based mixed-membership model of consumers' preference
for stockkeeping units in store assortments. The model represents the underlying “topics of …
for stockkeeping units in store assortments. The model represents the underlying “topics of …
Homogeneity-based transmissive process to model true and false news in social networks
An overwhelming number of true and false news stories are posted and shared in social
networks, and users diffuse the stories based on multiple factors. Diffusion of news stories …
networks, and users diffuse the stories based on multiple factors. Diffusion of news stories …
Modeling topics in DFA-based lemmatized Gujarati text
U Chauhan, S Shah, D Shiroya, D Solanki, Z Patel… - Sensors, 2023 - mdpi.com
Topic modeling is a machine learning algorithm based on statistics that follows
unsupervised machine learning techniques for mapping a high-dimensional corpus to a low …
unsupervised machine learning techniques for mapping a high-dimensional corpus to a low …
Leveraging external information in topic modelling
Besides the text content, documents usually come with rich sets of meta-information, such as
categories of documents and semantic/syntactic features of words, like those encoded in …
categories of documents and semantic/syntactic features of words, like those encoded in …
Is p-value 0.05 enough? A study on the statistical evaluation of classifiers
NM Neumann, A Plastino, JAP Junior… - The Knowledge …, 2020 - cambridge.org
Statistical significance analysis, based on hypothesis tests, is a common approach for
comparing classifiers. However, many studies oversimplify this analysis by simply checking …
comparing classifiers. However, many studies oversimplify this analysis by simply checking …
Covariate-dependent hierarchical Dirichlet process
H Zhang, S Wade, N Bochkina - arXiv preprint arXiv:2407.02676, 2024 - arxiv.org
The intricacies inherent in contemporary real datasets demand more advanced statistical
models to effectively address complex challenges. In this article we delve into problems …
models to effectively address complex challenges. In this article we delve into problems …
A Bayesian nonparametric topic model for microbiome data using subject attributes
T Okui - IPSJ Transactions on Bioinformatics, 2020 - jstage.jst.go.jp
Microbiome data have been obtained relatively easily in recent years, and currently, various
methods for analyzing microbiome data are being proposed. Latent Dirichlet allocation …
methods for analyzing microbiome data are being proposed. Latent Dirichlet allocation …
Dynamic content based ranking
S Virtanen, M Girolami - International Conference on …, 2020 - proceedings.mlr.press
We introduce a novel state space model for a set of sequentially time-stamped partial
rankings of items and textual descriptions for the items. Based on the data, the model infers …
rankings of items and textual descriptions for the items. Based on the data, the model infers …