Norm matters: efficient and accurate normalization schemes in deep networks- 学术资源搜索

Norm matters: efficient and accurate normalization schemes in deep networks

E Hoffer, R Banner, I Golan… - Advances in Neural …, 2018 - proceedings.neurips.cc

Advances in Neural Information Processing Systems, 2018•proceedings.neurips.cc

Abstract

Over the past few years, Batch-Normalization has been commonly used in deep networks, allowing faster training and high performance for a wide variety of applications. However, the reasons behind its merits remained unanswered, with several shortcomings that hindered its use for certain tasks. In this work, we present a novel view on the purpose and function of normalization methods and weight-decay, as tools to decouple weights' norm from the underlying optimized objective. This property highlights the connection between practices such as normalization, weight decay and learning-rate adjustments. We suggest several alternatives to the widely used batch-norm, using normalization in and spaces that can substantially improve numerical stability in low-precision implementations as well as provide computational and memory benefits. We demonstrate that such methods enable the first batch-norm alternative to work for half-precision implementations. Finally, we suggest a modification to weight-normalization, which improves its performance on large-scale tasks.

proceedings.neurips.cc

展开收起

被引用次数：189 相关文章所有 6 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果

Google学术搜索按钮

安装不用了

example.edu/paper.pdf

搜索

获取 PDF 文件

引用

References