Scale mlperf-0.6 models on google tpu-v3 pods

D Narayanan, M Shoeybi, J Casper… - Proceedings of the …, 2021 - dl.acm.org

Large language models have led to state-of-the-art accuracies across several tasks.
However, training these models efficiently is challenging because: a) GPU memory capacity …

被引用次数：695 相关文章所有 11 个版本

[PDF] neurips.cc

Improving robustness using generated data

S Gowal, SA Rebuffi, O Wiles… - Advances in …, 2021 - proceedings.neurips.cc

Recent work argues that robust training requires substantially larger datasets than those
required for standard classification. On CIFAR-10 and CIFAR-100, this translates into a …

被引用次数：310 相关文章所有 7 个版本

[PDF] mlr.press

Learning to simulate complex physics with graph networks

A Sanchez-Gonzalez, J Godwin… - International …, 2020 - proceedings.mlr.press

Here we present a machine learning framework and model implementation that can learn to
simulate a wide variety of challenging physical domains, involving fluids, rigid solids, and …

被引用次数：1271 相关文章所有 8 个版本

[PDF] thecvf.com

Retinatrack: Online single stage joint detection and tracking

Z Lu, V Rathod, R Votel… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com

Traditionally multi-object tracking and object detection are performed using separate
systems with most prior works focusing exclusively on one of these aspects over the other …

被引用次数：263 相关文章所有 7 个版本

[PDF] ieee.org

Networking systems of AI: On the convergence of computing and communications

L Song, X Hu, G Zhang, P Spachos… - IEEE Internet of …, 2022 - ieeexplore.ieee.org

Artificial intelligence (AI) and 5G system have been two hot technical areas that are
changing the world. On the deep convergence of computing and communication, networking …

被引用次数：72 相关文章所有 2 个版本

[PDF] thecvf.com

Context r-cnn: Long term temporal context for per-camera object detection

S Beery, G Wu, V Rathod, R Votel… - Proceedings of the …, 2020 - openaccess.thecvf.com

In static monitoring cameras, useful contextual information can stretch far beyond the few
seconds typical video understanding models might see: subjects may exhibit similar …

被引用次数：151 相关文章所有 11 个版本

[PDF] arxiv.org

Improving 3d object detection through progressive population based augmentation

S Cheng, Z Leng, ED Cubuk, B Zoph, C Bai… - Computer Vision–ECCV …, 2020 - Springer

Data augmentation has been widely adopted for object detection in 3D point clouds.
However, all previous related efforts have focused on manually designing specific data …

被引用次数：96 相关文章所有 7 个版本

[PDF] arxiv.org

Lightweight Deep Learning for Resource-Constrained Environments: A Survey

HI Liu, M Galindo, H Xie, LK Wong, HH Shuai… - ACM Computing …, 2024 - dl.acm.org

Over the past decade, the dominance of deep learning has prevailed across various
domains of artificial intelligence, including natural language processing, computer vision …

被引用次数：15 相关文章所有 3 个版本

[PDF] arxiv.org

A large batch optimizer reality check: Traditional, generic optimizers suffice across batch sizes

Z Nado, JM Gilmer, CJ Shallue, R Anil… - arXiv preprint arXiv …, 2021 - arxiv.org

Recently the LARS and LAMB optimizers have been proposed for training neural networks
faster using large batch sizes. LARS and LAMB add layer-wise normalization to the update …

被引用次数：43 相关文章所有 4 个版本

[PDF] arxiv.org

Kaisa: an adaptive second-order optimizer framework for deep neural networks

JG Pauloski, Q Huang, L Huang… - Proceedings of the …, 2021 - dl.acm.org

Kronecker-factored Approximate Curvature (K-FAC) has recently been shown to converge
faster in deep neural network (DNN) training than stochastic gradient descent (SGD); …

被引用次数：30 相关文章所有 10 个版本