Distributed machine learning for wireless communication networks: Techniques, architectures, and applications

S Hu, X Chen, W Ni, E Hossain… - … Surveys & Tutorials, 2021 - ieeexplore.ieee.org
Distributed machine learning (DML) techniques, such as federated learning, partitioned
learning, and distributed reinforcement learning, have been increasingly applied to wireless …

Custom hardware architectures for deep learning on portable devices: a review

KS Zaman, MBI Reaz, SHM Ali… - … on Neural Networks …, 2021 - ieeexplore.ieee.org
The staggering innovations and emergence of numerous deep learning (DL) applications
have forced researchers to reconsider hardware architecture to accommodate fast and …

Programmable coarse grained and sparse matrix compute hardware with advanced scheduling

E Nurvitadhi, B Vembu, NCG Von Borries… - US Patent …, 2019 - Google Patents
One embodiment provides for a compute apparatus to per form machine learning
operations, the compute apparatus comprising a decode unit to decode a single instruction …

[HTML][HTML] 基于深度学习卷积神经网络的地震数据随机噪声去除

韩卫雪, 周亚同, 池越 - 石油物探, 2018 - xml-data.org
为了有效去除地震数据随机噪声, 提出了一种基于卷积神经网络(CNN) 的地震数据随机噪声去除
算法. 算法的关键在于构建一个适用于地震数据去噪的CNN, 包含输入层, 卷积层, 激活层 …

Performance modeling and evaluation of distributed deep learning frameworks on gpus

S Shi, Q Wang, X Chu - 2018 IEEE 16th Intl Conf on …, 2018 - ieeexplore.ieee.org
Deep learning frameworks have been widely deployed on GPU servers for deep learning
applications in both academia and industry. In training deep neural networks (DNNs), there …

An architecture-level analysis on deep learning models for low-impact computations

H Li, Z Wang, X Yue, W Wang, H Tomiyama… - Artificial Intelligence …, 2023 - Springer
Deep neural networks (DNNs) have made significant achievements in a wide variety of
domains. For the deep learning tasks, multiple excellent hardware platforms provide efficient …

[HTML][HTML] Effect of neural network structure in accelerating performance and accuracy of a convolutional neural network with GPU/TPU for image analytics

A Ravikumar, H Sriraman, PMS Saketh… - PeerJ Computer …, 2022 - peerj.com
Background In deep learning the most significant breakthrough in the field of image
recognition, object detection language processing was done by Convolutional Neural …

Performance analysis of deep neural networks using computer vision

N Sindhwani, R Anand, S Meivel… - EAI Endorsed …, 2021 - publications.eai.eu
INTRODUCTION: In recent years, deep learning techniques have been made to outperform
the earlier state-of-the-art machine learning techniques in many areas, with one of the most …

Specialized fixed function hardware for efficient convolution

R Barik, E Ould-Ahmed-Vall, X Chen… - US Patent …, 2020 - Google Patents
One embodiment provides for a compute apparatus to perform machine learning operations,
the apparatus comprising a decode unit to decode a single instruction into a decoded …

Benchmarking deep learning frameworks: Design considerations, metrics and beyond

L Liu, Y Wu, W Wei, W Cao, S Sahin… - 2018 IEEE 38th …, 2018 - ieeexplore.ieee.org
With increasing number of open-source deep learning (DL) software tools made available,
benchmarking DL software frameworks and systems is in high demand. This paper presents …