We are improving our search experience.

As we work to add all features, to check which content you have full access to, or for advanced search, go back to the old search.

Search

Please fill in this field.

Search Results

Showing 1-20 of 23 results
  1. Transactions on High-Performance Embedded Architectures and Compilers V

    Transactions on HiPEAC aims at the timely dissemination of research contributions in computer architecture and compilation methods...
    Cristina Silvano, Koen Bertels, ... Per Stenström in Lecture Notes in Computer Science
    Book 2019
  2. Building application-specific operating systems: a profile-guided approach

    Although operating system optimization has been studied extensively, previous work mainly focuses on solving performance problems. In the cloud era,...

    Pengfei Yuan, Yao Guo, ... Hong Mei in Science China Information Sciences
    Article 13 August 2018
  3. An Energy-Efficient Computing Approach by Filling the Connectome Gap

    This paper presents an energy-efficient neuromorphic computing approach by filling the connectome gap between algorithm, brain, and VLSI. The gap...
    Yasunao Katayama, Toshiyuki Yamane, Daiju Nakano in Unconventional Computation and Natural Computation
    Conference paper 2014
  4. Scalable-Grain Pipeline Parallelization Method for Multi-core Systems

    How to parallelize the great amount of legacy sequential programs is the most difficult challenge faced by multi-core designers. The existing...
    Peng Liu, Chunming Huang, ... Mei Yang in Network and Parallel Computing
    Conference paper 2013
  5. Transactions on High-Performance Embedded Architectures and Compilers III

    Transactions on HiPEAC aims at the timely dissemination of research contributions in computer architecture and compilation methods for...
    Book 2011
  6. Transactions on High-Performance Embedded Architectures and Compilers IV

    Transactions on HiPEAC aims at the timely dissemination of research contributions in computer architecture and compilation methods for...
    Book 2011
  7. Extracting More Parallelism: the 3D-Wave

    If higher performance is required, a parallel application developer might have to extract more parallelism than initially employed in the...
    Ben Juurlink, Mauricio Alvarez-Mesa, ... Alex Ramirez in Scalable Parallel Programming Applied to H.264/AVC Decoding
    Chapter 2012
  8. Feedback-Based Global Instruction Scheduling for GPGPU Applications

    In the face of the memory wall even in high bandwidth systems such as GPUs, an efficient handling of memory accesses and memory-related instructions...
    Constantin Timm, Markus Görlich, ... Heinrich Müller in Computational Science and Its Applications – ICCSA 2012
    Conference paper 2012
  9. Exploiting Parallelism: the 2D-Wave

    In the previous chapter we have analyzed various parallelization approaches for H.264 decoding and concluded that in order to scale to a large number...
    Ben Juurlink, Mauricio Alvarez-Mesa, ... Alex Ramirez in Scalable Parallel Programming Applied to H.264/AVC Decoding
    Chapter 2012
  10. Eighth MEDEA Workshop

    It is our pleasure to welcome you to this special section of Transactions on High-Performance Embedded Architectures and Compilers (HiPEAC),...
    Sandro Bartolini, Pierfrancesco Foglia, Cosimo Antonia Prete in Transactions on High-Performance Embedded Architectures and Compilers III
    Chapter 2011
  11. Cache Efficiency and Scalability on Multi-core Architectures

    Two electrical engineering applications from industry partners dealing with sparse matrices were analyzed regarding cache efficiency and scalability...
    Thomas Müller, Carsten Trinitis, Jasmin Smajic in Parallel Computing Technologies
    Conference paper 2011
  12. Resource-Aware Compiler Prefetching for Fine-Grained Many-Cores

    Super-scalar, out-of-order processors that can have tens of read and write requests in the execution window place significant demands on Memory Level...

    George C. Caragea, Alexandros Tzannes, ... Uzi Vishkin in International Journal of Parallel Programming
    Article 01 March 2011
  13. A Systematic Design Space Exploration Approach to Customising Multi-Processor Architectures: Exemplified Using Graphics Processors

    A systematic approach to customising Homogeneous Multi-Processor (HoMP) architectures is described. The approach involves a novel design space...
    Ben Cope, Peter Y. K. Cheung, ... Lee Howes in Transactions on High-Performance Embedded Architectures and Compilers IV
    Chapter 2011
  14. MorphoSys reconfigurable hardware for cryptography: the twofish case

    This paper presents the mapping and performance analysis of the Twofish algorithm on MorphoSys. MorphoSys is a reconfigurable architecture that can...

    Sohaib Majzoub, Hassan Diab in The Journal of Supercomputing
    Article 12 March 2010
  15. OTAWA: An Open Toolbox for Adaptive WCET Analysis

    The analysis of worst-case execution times has become mandatory in the design of hard real-time systems: it is absolutely necessary to know an upper...
    Clément Ballabriga, Hugues Cassé, ... Pascal Sainrat in Software Technologies for Embedded and Ubiquitous Systems
    Conference paper 2010
  16. Improved Scalability by Using Hardware-Aware Thread Affinities

    The complexity of an efficient thread management steadily rises with the number of processor cores and heterogeneities in the design of system...
    Sven Mallach, Carsten Gutwenger in Facing the Multicore-Challenge
    Chapter 2010
  17. A Seamless Virtualization Approach for Transparent Dynamical Function Mapping Targeting Heterogeneous and Reconfigurable Systems

    Future systems are not only heading towards increased parallelism, but also embrace heterogeneity and reconfigurability. We therefore present an...
    Rainer Buchty, David Kramer, ... Wolfgang Karl in Reconfigurable Computing: Architectures, Tools and Applications
    Conference paper 2009
  18. A Hardware Task Scheduler for Embedded Video Processing

    Modern embedded Systems-on-a-Chip deploy multiple programmable cores to meet increasing performance requirements of video, graphics, and modem...
    Ghiath Al-Kadi, Andrei Sergeevich Terechko in High Performance Embedded Architectures and Compilers
    Conference paper 2009
  19. Parallel H.264 Decoding on an Embedded Multicore Processor

    In previous work the 3D-Wave parallelization strategy was proposed to increase the parallel scalability of H.264 video decoding. This strategy is...
    Arnaldo Azevedo, Cor Meenderinck, ... Alex Ramirez in High Performance Embedded Architectures and Compilers
    Conference paper 2009