[HTML][HTML] Neural architecture search: A contemporary literature review for computer vision applications

M Poyser, TP Breckon - Pattern Recognition, 2024 - Elsevier
Abstract Deep Neural Networks have received considerable attention in recent years. As the
complexity of network architecture increases in relation to the task complexity, it becomes …

Vitas: Vision transformer architecture search

X Su, S You, J Xie, M Zheng, F Wang, C Qian… - … on Computer Vision, 2022 - Springer
Vision transformers (ViTs) inherited the success of NLP but their structures have not been
sufficiently investigated and optimized for visual tasks. One of the simplest solutions is to …

Quantum circuit architecture search for variational quantum algorithms

Y Du, T Huang, S You, MH Hsieh, D Tao - npj Quantum Information, 2022 - nature.com
Variational quantum algorithms (VQAs) are expected to be a path to quantum advantages
on noisy intermediate-scale quantum devices. However, both empirical and theoretical …

Agree to disagree: Adaptive ensemble knowledge distillation in gradient space

S Du, S You, X Li, J Wu, F Wang… - advances in neural …, 2020 - proceedings.neurips.cc
Distilling knowledge from an ensemble of teacher models is expected to have a more
promising performance than that from a single one. Current methods mainly adopt a vanilla …

Nas-ood: Neural architecture search for out-of-distribution generalization

H Bai, F Zhou, L Hong, N Ye… - Proceedings of the …, 2021 - openaccess.thecvf.com
Recent advances on Out-of-Distribution (OoD) generalization reveal the robustness of deep
learning models against distribution shifts. However, existing works focus on OoD …

Prioritized architecture sampling with monto-carlo tree search

X Su, T Huang, Y Li, S You, F Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
One-shot neural architecture search (NAS) methods significantly reduce the search cost by
considering the whole search space as one network, which only needs to be trained once …

Towards improving the consistency, efficiency, and flexibility of differentiable neural architecture search

Y Yang, S You, H Li, F Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Most differentiable neural architecture search methods construct a super-net for search and
derive a target-net as its sub-graph for evaluation. There exists a significant gap between the …

Gradient descent effects on differential neural architecture search: A survey

S Santra, JW Hsieh, CF Lin - IEEE Access, 2021 - ieeexplore.ieee.org
Gradient Descent, an effective way to search for the local minimum of a function, can
minimize training and validation loss of neural architectures and also be incited in an …

K-shot nas: Learnable weight-sharing for nas with k-shot supernets

X Su, S You, M Zheng, F Wang… - International …, 2021 - proceedings.mlr.press
In one-shot weight sharing for NAS, the weights of each operation (at each layer) are
supposed to be identical for all architectures (paths) in the supernet. However, this rules out …

Zarts: On zero-order optimization for neural architecture search

X Wang, W Guo, J Su, X Yang… - Advances in Neural …, 2022 - proceedings.neurips.cc
Differentiable architecture search (DARTS) has been a popular one-shot paradigm for NAS
due to its high efficiency. It introduces trainable architecture parameters to represent the …