A survey of algorithmic and hardware optimization techniques for vision convolutional neural networks on FPGAs

A Sateesan, S Sinha, S KG, AP Vinod - Neural Processing Letters, 2021 - Springer
In today's world, the applications of convolutional neural networks (CNN) are limitless and
are employed in numerous fields. The CNNs get wider and deeper to achieve near-human …

Weightless neural networks for efficient edge inference

Z Susskind, A Arora, IDS Miranda, LAQ Villon… - Proceedings of the …, 2022 - dl.acm.org
Weightless neural networks (WNNs) are a class of machine learning model which use table
lookups to perform inference, rather than the multiply-accumulate operations typical of deep …

Logicwisard: Memoryless synthesis of weightless neural networks

IDS Miranda, A Arora, Z Susskind… - 2022 IEEE 33rd …, 2022 - ieeexplore.ieee.org
Weightless neural networks (WNNs) are an alternative pattern recognition technique where
RAM nodes function as neurons. As both training and inference require mostly table …

Energy efficient fixed-point inference system of convolutional neural network

CY Lo, CW Sham - 2020 IEEE 63rd International Midwest …, 2020 - ieeexplore.ieee.org
In this paper, we implemented a high accuracy, high throughput, low bit-width, and low
power consumption convolutional neural network (CNN) inference system for handwritten …

[HTML][HTML] Maintaining Symmetry between Convolutional Neural Network Accuracy and Performance on an Edge TPU with a Focus on Transfer Learning Adjustments

C DeLozier, J Blanco, R Rakvic, J Shey - Symmetry, 2024 - mdpi.com
Transfer learning has proven to be a valuable technique for deploying machine learning
models on edge devices and embedded systems. By leveraging pre-trained models and fine …

Studying the Impact of TensorFlow and PyTorch Bindings on Machine Learning Software Quality

H Li, GK Rajbahadur, CP Bezemer - ACM Transactions on Software …, 2024 - dl.acm.org
Bindings for machine learning frameworks (such as TensorFlow and PyTorch) allow
developers to integrate a framework's functionality using a programming language different …

Resource-Efficient Spectrum-Based Traffic Classification On Constrained Devices

D Góez, EA Beyazıt, LA Fletscher… - IEEE Open Journal …, 2024 - ieeexplore.ieee.org
Traffic Classification (TC) systems are designed to identify the applications generating
network traffic. Recent advancements in TC leverage Deep Learning (DL) techniques …

FPGA-based high-performance CNN accelerator architecture with high DSP utilization and efficient scheduling mode

Q Yin, Y Li, H Huang, H Li, Q Zhang… - … Conference on High …, 2020 - ieeexplore.ieee.org
Due to the great increase of the on-chip block memory for the latest field programmable gate
array (FPGA), the highly efficient utilization of the on-chip DSP Slice has become the …

Real‐time small bowel visualization quality assessment in wireless capsule endoscopy images using different lightweight embeddable models

V Sadeghi, A Mehridehnavi… - … Journal of Imaging …, 2024 - Wiley Online Library
Wireless capsule endoscopy (WCE) captures huge number of images, but only a fraction are
medically relevant. We propose automated real‐time small bowel visualization quality …

EPA: The effective pipeline architecture for CNN accelerator with high performance and computing efficiency based on FPGA

J Zhang, Q Yin, W Hu, Y Li, H Li, N Ye… - Concurrency and …, 2023 - Wiley Online Library
Thanks to the great developments of the latest Field Programmable Gate Array (FPGA), the
performance bottleneck of Deep Learning hardware accelerators has been converted to …