Generative pretraining from pixels
Inspired by progress in unsupervised representation learning for natural language, we
examine whether similar models can learn useful representations for images. We train a …
examine whether similar models can learn useful representations for images. We train a …
Analysis of {Large-Scale}{Multi-Tenant}{GPU} clusters for {DNN} training workloads
With widespread advances in machine learning, a number of large enterprises are
beginning to incorporate machine learning models across a number of products. These …
beginning to incorporate machine learning models across a number of products. These …
To understand deep learning we need to understand kernel learning
Generalization performance of classifiers in deep learning has recently become a subject of
intense study. Deep models, which are typically heavily over-parametrized, tend to fit the …
intense study. Deep models, which are typically heavily over-parametrized, tend to fit the …
Zico: Efficient {GPU} memory sharing for concurrent {DNN} training
GPUs are the workhorse in modern server infrastructure fueling advances in a number of
compute-intensive workloads such as deep neural network (DNN) training. Several recent …
compute-intensive workloads such as deep neural network (DNN) training. Several recent …
Shallow neural network with kernel approximation for prediction problems in highly demanding data networks
M Lopez-Martin, B Carro… - Expert Systems with …, 2019 - Elsevier
Intrusion detection and network traffic classification are two of the main research
applications of machine learning to highly demanding data networks eg IoT/sensors …
applications of machine learning to highly demanding data networks eg IoT/sensors …
[PDF][PDF] Multi-tenant GPU clusters for deep learning workloads: Analysis and implications
With widespread advances in machine learning, a number of large enterprises are
beginning to incorporate machine learning models across a number of products. These …
beginning to incorporate machine learning models across a number of products. These …
Diving into the shallows: a computational perspective on large-scale shallow learning
Remarkable recent success of deep neural networks has not been easy to analyze
theoretically. It has been particularly hard to disentangle relative significance of architecture …
theoretically. It has been particularly hard to disentangle relative significance of architecture …
Gaussian quadrature for kernel features
Kernel methods have recently attracted resurgent interest, showing performance competitive
with deep neural networks in tasks such as speech recognition. The random Fourier features …
with deep neural networks in tasks such as speech recognition. The random Fourier features …
Quantum kitchen sinks: An algorithm for machine learning on near-term quantum computers
Noisy intermediate-scale quantum computing devices are an exciting platform for the
exploration of the power of near-term quantum applications. Performing nontrivial tasks in …
exploration of the power of near-term quantum applications. Performing nontrivial tasks in …
Low-precision random Fourier features for memory-constrained kernel approximation
We investigate how to train kernel approximation methods that generalize well under a
memory budget. Building on recent theoretical work, we define a measure of kernel …
memory budget. Building on recent theoretical work, we define a measure of kernel …