Orchestrating the development lifecycle of machine learning-based IoT applications: A taxonomy and survey

B Qian, J Su, Z Wen, DN Jha, Y Li, Y Guan… - ACM Computing …, 2020 - dl.acm.org
Machine Learning (ML) and Internet of Things (IoT) are complementary advances: ML
techniques unlock the potential of IoT with intelligence, and IoT applications increasingly …

Dense text-to-image generation with attention modulation

Y Kim, J Lee, JH Kim, JW Ha… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Existing text-to-image diffusion models struggle to synthesize realistic images given dense
captions, where each text prompt provides a detailed description for a specific image region …

Rethinking spatial dimensions of vision transformers

B Heo, S Yun, D Han, S Chun… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract Vision Transformer (ViT) extends the application range of transformers from
language processing to computer vision tasks as being an alternative architecture against …

Swad: Domain generalization by seeking flat minima

J Cha, S Chun, K Lee, HC Cho… - Advances in Neural …, 2021 - proceedings.neurips.cc
Abstract Domain generalization (DG) methods aim to achieve generalizability to an unseen
target domain by using only training data from the source domains. Although a variety of DG …

Rainbow memory: Continual learning with a memory of diverse samples

J Bang, H Kim, YJ Yoo, JW Ha… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Continual learning is a realistic learning scenario for AI models. Prevalent scenario of
continual learning, however, assumes disjoint sets of classes as tasks and is less realistic …

Stargan v2: Diverse image synthesis for multiple domains

Y Choi, Y Uh, J Yoo, JW Ha - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
A good image-to-image translation model should learn a mapping between different visual
domains while satisfying the following properties: 1) diversity of generated images and 2) …

Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram

R Yamamoto, E Song, JM Kim - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
We propose Parallel WaveGAN, a distillation-free, fast, and small-footprint waveform
generation method using a generative adversarial network. In the proposed method, a non …

The majority can help the minority: Context-rich minority oversampling for long-tailed classification

S Park, Y Hong, B Heo, S Yun… - Proceedings of the …, 2022 - openaccess.thecvf.com
The problem of class imbalanced data is that the generalization performance of the classifier
deteriorates due to the lack of data from minority classes. In this paper, we propose a novel …

Cutmix: Regularization strategy to train strong classifiers with localizable features

S Yun, D Han, SJ Oh, S Chun… - Proceedings of the …, 2019 - openaccess.thecvf.com
Regional dropout strategies have been proposed to enhance performance of convolutional
neural network classifiers. They have proved to be effective for guiding the model to attend …

Character region awareness for text detection

Y Baek, B Lee, D Han, S Yun… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Scene text detection methods based on neural networks have emerged recently and have
shown promising results. Previous methods trained with rigid word-level bounding boxes …