An Efficient Hybrid Deep Learning Accelerator for Compact and Heterogeneous CNNs

F Qararyah, MW Azhar, P Trancoso - ACM Transactions on Architecture …, 2024 - dl.acm.org
Resource-efficient Convolutional Neural Networks (CNNs) are gaining more attention.
These CNNs have relatively low computational and memory requirements. A common …

Fibha: fixed budget hybrid CNN accelerator

F Qararyah, MW Azhar… - 2022 IEEE 34th …, 2022 - ieeexplore.ieee.org
Seeking the “sweet spot” in the accuracy-efficiency trade-off is increasing the heterogeneity
of state-of-the-art Convolutional Neural Networks (CNNs). Such CNN models exhibit …

Fast prototyping next-generation accelerators for new ml models using mase: Ml accelerator system exploration

J Cheng, C Zhang, Z Yu… - arXiv preprint arXiv …, 2023 - arxiv.org
Machine learning (ML) accelerators have been studied and used extensively to compute ML
models with high performance and low power. However, designing such accelerators …

Fadec: FPGA-based acceleration of video depth estimation by hw/sw co-design

N Hashimoto… - … Conference on Field …, 2022 - ieeexplore.ieee.org
3D reconstruction from videos has become increasingly popular for various applications,
including navigation for autonomous driving of robots and drones, augmented reality (AR) …

A high performance reconfigurable hardware architecture for lightweight convolutional neural network

F An, L Wang, X Zhou - Electronics, 2023 - mdpi.com
Since the lightweight convolutional neural network EfficientNet was proposed by Google in
2019, the series of models have quickly become very popular due to their superior …

FACCU: Enable fast accumulation for high-speed DSP systems

M Wang, X Cheng, D Zou… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
A number of fast accumulation (FACCU) designs are proposed for one of the essential
components in DSP systems, the accumulator, to largely boost the DSP's maximum …

ReAFM: A reconfigurable nonlinear activation function module for neural networks

X Wu, S Liang, M Wang, Z Wang - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Deep neural networks (DNNs) with various nonlinear activation functions (NAFs) have
achieved unprecedented successes, sparking interest in efficient DNN hardware …

Bottleneck-Stationary Compact Model Accelerator With Reduced Requirement on Memory Bandwidth for Edge Applications

HG Mun, S Moon, B Kim, KJ Lee… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
State-of-the-art compact models such as MobileNets and EfficientNets are structured using a
linear bottleneck and inverted residuals. Hardware architecture using a single dataflow …

A reconfigurable convolutional neural networks accelerator based on fpga

Y Tang, H Ren, Z Zhang - … on Communications and Networking in China, 2022 - Springer
With the development of lightweight convolutional neural networks (CNNs), these newly
proposed networks are more powerful than previous conventional models [,] and can be well …

[PDF][PDF] An Efficient Hybrid Deep Learning Accelerator for Compact and Heterogeneous CNNs

F Mohammad Qararyah, M Azhar… - 2024 - research.chalmers.se
During the early years following the AlexNet breakthrough [24], researchers have mainly
focused on designing Convolutional Neural Networks (CNNs) with higher accuracy [17, 47 …