Topic modeling in embedding spaces
Topic modeling analyzes documents to learn meaningful patterns of words. However,
existing topic models fail to learn interpretable topics when working with large and heavy …
existing topic models fail to learn interpretable topics when working with large and heavy …
Gromov-wasserstein learning for graph matching and node embedding
A novel Gromov-Wasserstein learning framework is proposed to jointly match (align) graphs
and learn embedding vectors for the associated graph nodes. Using Gromov-Wasserstein …
and learn embedding vectors for the associated graph nodes. Using Gromov-Wasserstein …
Projection‐based techniques for high‐dimensional optimal transport problems
Optimal transport (OT) methods seek a transformation map (or plan) between two probability
measures, such that the transformation has the minimum transportation cost. Such a …
measures, such that the transformation has the minimum transportation cost. Such a …
The dynamic embedded topic model
Topic modeling analyzes documents to learn meaningful patterns of words. For documents
collected in sequence, dynamic topic models capture how these patterns vary over time. We …
collected in sequence, dynamic topic models capture how these patterns vary over time. We …
Topic-guided variational autoencoders for text generation
We propose a topic-guided variational autoencoder (TGVAE) model for text generation.
Distinct from existing variational autoencoder (VAE) based approaches, which assume a …
Distinct from existing variational autoencoder (VAE) based approaches, which assume a …
Ensembling classical machine learning and deep learning approaches for morbidity identification from clinical notes
The past decade has seen an explosion of the amount of digital information generated
within the healthcare domain. Digital data exist in the form of images, video, speech …
within the healthcare domain. Digital data exist in the form of images, video, speech …
Explainable legal case matching via inverse optimal transport-based rationale extraction
As an essential operation of legal retrieval, legal case matching plays a central role in
intelligent legal systems. This task has a high demand on the explainability of matching …
intelligent legal systems. This task has a high demand on the explainability of matching …
Evaluating progress on machine learning for longitudinal electronic healthcare data
The Large Scale Visual Recognition Challenge based on the well-known Imagenet dataset
catalyzed an intense flurry of progress in computer vision. Benchmark tasks have propelled …
catalyzed an intense flurry of progress in computer vision. Benchmark tasks have propelled …
Hierarchical optimal transport for document representation
The ability to measure similarity between documents enables intelligent summarization and
analysis of large corpora. Past distances between documents suffer from either an inability …
analysis of large corpora. Past distances between documents suffer from either an inability …
Discriminative topic mining via category-name guided text embedding
Mining a set of meaningful and distinctive topics automatically from massive text corpora has
broad applications. Existing topic models, however, typically work in a purely unsupervised …
broad applications. Existing topic models, however, typically work in a purely unsupervised …