Sustainable ai: Environmental implications, challenges and opportunities

CJ Wu, R Raghavendra, U Gupta… - Proceedings of …, 2022 - proceedings.mlsys.org
This paper explores the environmental impact of the super-linear growth trends for AI from a
holistic perspective, spanning Data, Algorithms, and System Hardware. We characterize the …

Avoiding overfitting: A survey on regularization methods for convolutional neural networks

CFGD Santos, JP Papa - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
Several image processing tasks, such as image classification and object detection, have
been significantly improved using Convolutional Neural Networks (CNN). Like ResNet and …

Deep model reassembly

X Yang, D Zhou, S Liu, J Ye… - Advances in neural …, 2022 - proceedings.neurips.cc
In this paper, we explore a novel knowledge-transfer task, termed as Deep Model
Reassembly (DeRy), for general-purpose model reuse. Given a collection of heterogeneous …

Machine learning for microcontroller-class hardware: A review

SS Saha, SS Sandha, M Srivastava - IEEE Sensors Journal, 2022 - ieeexplore.ieee.org
The advancements in machine learning (ML) opened a new opportunity to bring intelligence
to the low-end Internet-of-Things (IoT) nodes, such as microcontrollers. Conventional ML …

Zero-cost proxies for lightweight nas

MS Abdelfattah, A Mehrotra, Ł Dudziak… - arXiv preprint arXiv …, 2021 - arxiv.org
Neural Architecture Search (NAS) is quickly becoming the standard methodology to design
neural network models. However, NAS is typically compute-intensive because multiple …

Neural architecture search on imagenet in four gpu hours: A theoretically inspired perspective

W Chen, X Gong, Z Wang - arXiv preprint arXiv:2102.11535, 2021 - arxiv.org
Neural Architecture Search (NAS) has been explosively studied to automate the discovery of
top-performer neural networks. Current works require heavy training of supernet or intensive …

Neural architecture search: Insights from 1000 papers

C White, M Safari, R Sukthanker, B Ru, T Elsken… - arXiv preprint arXiv …, 2023 - arxiv.org
In the past decade, advances in deep learning have resulted in breakthroughs in a variety of
areas, including computer vision, natural language understanding, speech recognition, and …

Neural architecture search for spiking neural networks

Y Kim, Y Li, H Park, Y Venkatesha, P Panda - European conference on …, 2022 - Springer
Abstract Spiking Neural Networks (SNNs) have gained huge attention as a potential energy-
efficient alternative to conventional Artificial Neural Networks (ANNs) due to their inherent …

[HTML][HTML] A tutorial on automatic hyperparameter tuning of deep spectral modelling for regression and classification tasks

D Passos, P Mishra - Chemometrics and Intelligent Laboratory Systems, 2022 - Elsevier
Deep spectral modelling for regression and classification is gaining popularity in the
chemometrics domain. A major topic in the deep learning (DL) modelling of spectral data is …

Diswot: Student architecture search for distillation without training

P Dong, L Li, Z Wei - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Abstract Knowledge distillation (KD) is an effective training strategy to improve the
lightweight student models under the guidance of cumbersome teachers. However, the large …