Search Page | SpringerLink

Transactions on High-Performance Embedded Architectures and Compilers V

Transactions on HiPEAC aims at the timely dissemination of research contributions in computer architecture and compilation methods...

Cristina Silvano, Koen Bertels, ... Per Stenström in Lecture Notes in Computer Science

Book 2019

Building application-specific operating systems: a profile-guided approach

Although operating system optimization has been studied extensively, previous work mainly focuses on solving performance problems. In the cloud era,...

Pengfei Yuan, Yao Guo, ... Hong Mei in Science China Information Sciences

Article 13 August 2018

An Energy-Efficient Computing Approach by Filling the Connectome Gap

This paper presents an energy-efficient neuromorphic computing approach by filling the connectome gap between algorithm, brain, and VLSI. The gap...

Yasunao Katayama, Toshiyuki Yamane, Daiju Nakano in Unconventional Computation and Natural Computation

Conference paper 2014

Scalable-Grain Pipeline Parallelization Method for Multi-core Systems

How to parallelize the great amount of legacy sequential programs is the most difficult challenge faced by multi-core designers. The existing...

Peng Liu, Chunming Huang, ... Mei Yang in Network and Parallel Computing

Conference paper 2013

Transactions on High-Performance Embedded Architectures and Compilers III

Transactions on HiPEAC aims at the timely dissemination of research contributions in computer architecture and compilation methods for...

Per Stenström in Lecture Notes in Computer Science

Book 2011

Transactions on High-Performance Embedded Architectures and Compilers IV

Transactions on HiPEAC aims at the timely dissemination of research contributions in computer architecture and compilation methods for...

Per Stenström in Lecture Notes in Computer Science

Book 2011

Extracting More Parallelism: the 3D-Wave

If higher performance is required, a parallel application developer might have to extract more parallelism than initially employed in the...

Ben Juurlink, Mauricio Alvarez-Mesa, ... Alex Ramirez in Scalable Parallel Programming Applied to H.264/AVC Decoding

Chapter 2012

Feedback-Based Global Instruction Scheduling for GPGPU Applications

In the face of the memory wall even in high bandwidth systems such as GPUs, an efficient handling of memory accesses and memory-related instructions...

Constantin Timm, Markus Görlich, ... Heinrich Müller in Computational Science and Its Applications – ICCSA 2012

Conference paper 2012

Exploiting Parallelism: the 2D-Wave

In the previous chapter we have analyzed various parallelization approaches for H.264 decoding and concluded that in order to scale to a large number...

Ben Juurlink, Mauricio Alvarez-Mesa, ... Alex Ramirez in Scalable Parallel Programming Applied to H.264/AVC Decoding

Chapter 2012

Eighth MEDEA Workshop

It is our pleasure to welcome you to this special section of Transactions on High-Performance Embedded Architectures and Compilers (HiPEAC),...

Sandro Bartolini, Pierfrancesco Foglia, Cosimo Antonia Prete in Transactions on High-Performance Embedded Architectures and Compilers III

Chapter 2011

Cache Efficiency and Scalability on Multi-core Architectures

Two electrical engineering applications from industry partners dealing with sparse matrices were analyzed regarding cache efficiency and scalability...

Thomas Müller, Carsten Trinitis, Jasmin Smajic in Parallel Computing Technologies

Conference paper 2011

Resource-Aware Compiler Prefetching for Fine-Grained Many-Cores

Super-scalar, out-of-order processors that can have tens of read and write requests in the execution window place significant demands on Memory Level...

George C. Caragea, Alexandros Tzannes, ... Uzi Vishkin in International Journal of Parallel Programming

Article 01 March 2011

A Systematic Design Space Exploration Approach to Customising Multi-Processor Architectures: Exemplified Using Graphics Processors

A systematic approach to customising Homogeneous Multi-Processor (HoMP) architectures is described. The approach involves a novel design space...

Ben Cope, Peter Y. K. Cheung, ... Lee Howes in Transactions on High-Performance Embedded Architectures and Compilers IV

Chapter 2011

Transactions on High-Performance Embedded Architectures and Compilers II

Per Stenström in Lecture Notes in Computer Science

Book 2009

MorphoSys reconfigurable hardware for cryptography: the twofish case

This paper presents the mapping and performance analysis of the Twofish algorithm on MorphoSys. MorphoSys is a reconfigurable architecture that can...

Sohaib Majzoub, Hassan Diab in The Journal of Supercomputing

Article 12 March 2010

OTAWA: An Open Toolbox for Adaptive WCET Analysis

The analysis of worst-case execution times has become mandatory in the design of hard real-time systems: it is absolutely necessary to know an upper...

Clément Ballabriga, Hugues Cassé, ... Pascal Sainrat in Software Technologies for Embedded and Ubiquitous Systems

Conference paper 2010

Improved Scalability by Using Hardware-Aware Thread Affinities

The complexity of an efficient thread management steadily rises with the number of processor cores and heterogeneities in the design of system...

Sven Mallach, Carsten Gutwenger in Facing the Multicore-Challenge

Chapter 2010

A Seamless Virtualization Approach for Transparent Dynamical Function Mapping Targeting Heterogeneous and Reconfigurable Systems

Future systems are not only heading towards increased parallelism, but also embrace heterogeneity and reconfigurability. We therefore present an...

Rainer Buchty, David Kramer, ... Wolfgang Karl in Reconfigurable Computing: Architectures, Tools and Applications

Conference paper 2009

A Hardware Task Scheduler for Embedded Video Processing

Modern embedded Systems-on-a-Chip deploy multiple programmable cores to meet increasing performance requirements of video, graphics, and modem...

Ghiath Al-Kadi, Andrei Sergeevich Terechko in High Performance Embedded Architectures and Compilers

Conference paper 2009

Parallel H.264 Decoding on an Embedded Multicore Processor

In previous work the 3D-Wave parallelization strategy was proposed to increase the parallel scalability of H.264 video decoding. This strategy is...

Arnaldo Azevedo, Cor Meenderinck, ... Alex Ramirez in High Performance Embedded Architectures and Compilers

Conference paper 2009

Search

Filters

Search Results

Search

Navigation