Orchestrating the development lifecycle of machine learning-based IoT applications: A taxonomy and survey
Machine Learning (ML) and Internet of Things (IoT) are complementary advances: ML
techniques unlock the potential of IoT with intelligence, and IoT applications increasingly …
techniques unlock the potential of IoT with intelligence, and IoT applications increasingly …
Dense text-to-image generation with attention modulation
Existing text-to-image diffusion models struggle to synthesize realistic images given dense
captions, where each text prompt provides a detailed description for a specific image region …
captions, where each text prompt provides a detailed description for a specific image region …
Rethinking spatial dimensions of vision transformers
Abstract Vision Transformer (ViT) extends the application range of transformers from
language processing to computer vision tasks as being an alternative architecture against …
language processing to computer vision tasks as being an alternative architecture against …
Swad: Domain generalization by seeking flat minima
Abstract Domain generalization (DG) methods aim to achieve generalizability to an unseen
target domain by using only training data from the source domains. Although a variety of DG …
target domain by using only training data from the source domains. Although a variety of DG …
Rainbow memory: Continual learning with a memory of diverse samples
Continual learning is a realistic learning scenario for AI models. Prevalent scenario of
continual learning, however, assumes disjoint sets of classes as tasks and is less realistic …
continual learning, however, assumes disjoint sets of classes as tasks and is less realistic …
Stargan v2: Diverse image synthesis for multiple domains
A good image-to-image translation model should learn a mapping between different visual
domains while satisfying the following properties: 1) diversity of generated images and 2) …
domains while satisfying the following properties: 1) diversity of generated images and 2) …
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
R Yamamoto, E Song, JM Kim - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
We propose Parallel WaveGAN, a distillation-free, fast, and small-footprint waveform
generation method using a generative adversarial network. In the proposed method, a non …
generation method using a generative adversarial network. In the proposed method, a non …
The majority can help the minority: Context-rich minority oversampling for long-tailed classification
The problem of class imbalanced data is that the generalization performance of the classifier
deteriorates due to the lack of data from minority classes. In this paper, we propose a novel …
deteriorates due to the lack of data from minority classes. In this paper, we propose a novel …
Cutmix: Regularization strategy to train strong classifiers with localizable features
Regional dropout strategies have been proposed to enhance performance of convolutional
neural network classifiers. They have proved to be effective for guiding the model to attend …
neural network classifiers. They have proved to be effective for guiding the model to attend …
Character region awareness for text detection
Scene text detection methods based on neural networks have emerged recently and have
shown promising results. Previous methods trained with rigid word-level bounding boxes …
shown promising results. Previous methods trained with rigid word-level bounding boxes …