Topic modeling in embedding spaces

AB Dieng, FJR Ruiz, DM Blei - Transactions of the Association for …, 2020 - direct.mit.edu
Topic modeling analyzes documents to learn meaningful patterns of words. However,
existing topic models fail to learn interpretable topics when working with large and heavy …

Gromov-wasserstein learning for graph matching and node embedding

H Xu, D Luo, H Zha, LC Duke - International conference on …, 2019 - proceedings.mlr.press
A novel Gromov-Wasserstein learning framework is proposed to jointly match (align) graphs
and learn embedding vectors for the associated graph nodes. Using Gromov-Wasserstein …

Projection‐based techniques for high‐dimensional optimal transport problems

J Zhang, P Ma, W Zhong, C Meng - Wiley Interdisciplinary …, 2023 - Wiley Online Library
Optimal transport (OT) methods seek a transformation map (or plan) between two probability
measures, such that the transformation has the minimum transportation cost. Such a …

The dynamic embedded topic model

AB Dieng, FJR Ruiz, DM Blei - arXiv preprint arXiv:1907.05545, 2019 - arxiv.org
Topic modeling analyzes documents to learn meaningful patterns of words. For documents
collected in sequence, dynamic topic models capture how these patterns vary over time. We …

Topic-guided variational autoencoders for text generation

W Wang, Z Gan, H Xu, R Zhang, G Wang… - arXiv preprint arXiv …, 2019 - arxiv.org
We propose a topic-guided variational autoencoder (TGVAE) model for text generation.
Distinct from existing variational autoencoder (VAE) based approaches, which assume a …

Ensembling classical machine learning and deep learning approaches for morbidity identification from clinical notes

V Kumar, DR Recupero, D Riboni, R Helaoui - IEEE Access, 2020 - ieeexplore.ieee.org
The past decade has seen an explosion of the amount of digital information generated
within the healthcare domain. Digital data exist in the form of images, video, speech …

Explainable legal case matching via inverse optimal transport-based rationale extraction

W Yu, Z Sun, J Xu, Z Dong, X Chen, H Xu… - Proceedings of the 45th …, 2022 - dl.acm.org
As an essential operation of legal retrieval, legal case matching plays a central role in
intelligent legal systems. This task has a high demand on the explainability of matching …

Evaluating progress on machine learning for longitudinal electronic healthcare data

D Bellamy, L Celi, AL Beam - arXiv preprint arXiv:2010.01149, 2020 - arxiv.org
The Large Scale Visual Recognition Challenge based on the well-known Imagenet dataset
catalyzed an intense flurry of progress in computer vision. Benchmark tasks have propelled …

Hierarchical optimal transport for document representation

M Yurochkin, S Claici, E Chien… - Advances in neural …, 2019 - proceedings.neurips.cc
The ability to measure similarity between documents enables intelligent summarization and
analysis of large corpora. Past distances between documents suffer from either an inability …

Discriminative topic mining via category-name guided text embedding

Y Meng, J Huang, G Wang, Z Wang, C Zhang… - Proceedings of The …, 2020 - dl.acm.org
Mining a set of meaningful and distinctive topics automatically from massive text corpora has
broad applications. Existing topic models, however, typically work in a purely unsupervised …