Improving performance and energy consumption in embedded systems via binary acceleration: A survey

N Paulino, JC Ferreira, JMP Cardoso - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
The breakdown of Dennard scaling has resulted in a decade-long stall of the maximum
operating clock frequencies of processors. To mitigate this issue, computing shifted to multi …

A soft multi-core architecture for edge detection and data analysis of microarray images

G Kornaros - Journal of Systems Architecture, 2010 - Elsevier
As configurable processing advances, elements from the traditional approaches of both
hardware and software development can be combined by incorporating customized …

Transparent trace-based binary acceleration for reconfigurable HW/SW systems

J Bispo, N Paulino, JMP Cardoso… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org
This paper presents a novel approach to accelerate program execution by mapping
repetitive traces of executed instructions, called Megablocks, to a runtime reconfigurable …

Transparent acceleration of program execution using reconfigurable hardware

N Paulino, JC Ferreira, J Bispo… - … Design, Automation & …, 2015 - ieeexplore.ieee.org
The acceleration of applications, running on a general purpose processor (GPP), by
mapping parts of their execution to reconfigurable hardware is an approach which does not …

Transparent Runtime Migration of Loop‐Based Traces of Processor Instructions to Reconfigurable Processing Units

J Bispo, N Paulino, JMP Cardoso… - International Journal of …, 2013 - Wiley Online Library
The ability to map instructions running in a microprocessor to a reconfigurable processing
unit (RPU), acting as a coprocessor, enables the runtime acceleration of applications and …

From instruction traces to specialized reconfigurable arrays

J Bispo, N Paulino, JMP Cardoso… - … Computing and FPGAs, 2011 - ieeexplore.ieee.org
This paper presents an offline tool-chain which automatically extracts loops (Mega blocks)
from Micro Blaze instruction traces and creates a tailored Reconfigurable Processing Unit …

Improving performance and energy efficiency of embedded processors via post-fabrication instruction set customization

H Noori, F Mehdipour, K Inoue, K Murakami - The Journal of …, 2012 - Springer
Encapsulating critical computation subgraphs as application-specific instruction set
extensions is an effective technique to enhance the performance and energy efficiency of …

Application-Specific Customizable Embedded Systems

G Kornaros - Multi-Core Embedded Systems, 2018 - taylorfrancis.com
Review Questions............................. 63Bibliography................................ 63Embedded system
development seeks ever more efficient processors and new automation methodologies to …

A parallel systematic resampling algorithm for high-speed particle filters in embedded systems

Q Gan, JMP Langlois, Y Savaria - Circuits, Systems, and Signal …, 2014 - Springer
In this paper, we propose a parallel systematic resampling (PSR) algorithm for particle filters,
which is a new form of systematic resampling (SR). The PSR algorithm makes iterations …

A reconfigurable architecture for binary acceleration of loops with memory accesses

N Paulino, JC Ferreira, JMP Cardoso - ACM Transactions on …, 2014 - dl.acm.org
This article presents a reconfigurable hardware/software architecture for binary acceleration
of embedded applications. A Reconfigurable Processing Unit (RPU) is used as a …