AYGUADE PARRA, EDUARD
Deadline-Based MapReduce Workload Management. IEEE Transactions on Network and Service Management PP, 1-14 (2013).
Assessing the impact of network compression on Molecular Dynamics and Finite Element Methods. 14th International Conference on High-Performance Computing and Communications (HPCC-2012) (2012).
Autonomic Placement of Mixed Batch and Transactional Workloads. IEEE Transactions on Parallel and Distributed Systems 23, 219-231 (2012).
BSArc: Blacksmith Streaming Architecture for HPC Accelerators. ACM International Conference on Computing Frontiers (2012).
Counter-Based Power Modeling Methods: Top-Down vs. Bottom-Up. The Computer Journal (2012).doi:10.1093/comjnl/bxs116
DMA++: On the Fly Data Realignment for On-Chip Memories. Computers, IEEE Transactions on 61, 237 -250 (2012).
DMA-circular: an enhanced high level programmable DMA controller for optimized management of on-chip local memories. Proceedings of the 9th conference on Computing Frontiers 113–122 (2012).doi:10.1145/2212908.2212925
Energy accounting for shared virtualized environments under DVFS using PMC-based power models. Future Generation Computer Systems 28, 457 - 468 (2012).
Hardware-software coherence protocol for the coexistence of caches and local memories. Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis 89:1–89:11 (2012).at <http://dl.acm.org/citation.cfm?id=2388996.2389117>
Hybrid Parallel Programming with MPI/StarSs. Applications, Tools and Techniques on the Road to Exascale Computing (2012).
Hybrid/Heterogeneous Programming with OmpSs and its Software/Hardware Implications. Programming Multi-Core and Many-Core Computing Systems (Wiley Series on Parallel and Distributed Computing) (2012).at <http://www.par.univie.ac.at/~pllana/manycore_book/>
On the Instrumentation of OpenMP and OmpSs Tasking Constructs. Euro-Par 2012: Parallel Processing Workshops. Lecture Notes in Computer Science 7640, 414-428 (2012).
Integrating Dataflow Abstractions into the Shared Memory Model. Computer Architecture and High Performance Computing (SBAC-PAD), 2012 IEEE 24th International Symposium on 243–251 (2012).
Introducing Speculative Optimizations in Task Dataflow with Language Extensions and Runtime Support. 2nd Workshop on Data-Flow Execution Models for Extreme Scale Computing (DFM 2012), In conjunction with 21st International Conference on Parallel Architectures and Compilation Techniques (PACT-2012) (2012).
A methodology for the evaluation of high response time on E-commerce users and sales. Information Systems Frontiers (2012).doi:10.1007/s10796-012-9387-4
OmpSs-OpenCL Programming Model for Heterogeneous Systems. 25th International Workshop on Languages and Compilers for Parallel Computing (LCPC2012) (2012).
PPMC : A Programmable Pattern based Memory Controller. Applied Reconfigurable Computing (ARC) 1–12 (2012).
PPMC : Hardware Scheduling and Memory Management Support for Multi Accelerators. 22nd International Conference on Field Programmable Logic and Applications (FPL-2012 (2012).
Productive Programming of GPU Clusters with OmpSs. 26th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2012) 557-568 (2012).doi:http://doi.ieeecomputersociety.org/10.1109/IPDPS.2012.58
Supporting Stateful Tasks in a Dataflow Graph. Proceedings of the 21st international conference on Parallel architectures and compilation techniques 435–436 (2012).
A Systematic Methodology to Generate Decomposable and Responsive Power Models for CMPs. Computers, IEEE Transactions on PP, 1 (2012).
Transactional Access to Shared Memory in StarSs, a Task Based Programming Model. 18th International Conference, Euro-Par 2012 514–525 (2012).
The Abstract Streaming Machine: Compile-Time Performance Modelling of Stream Programs on Heterogeneous Multiprocessors. Transactions on HiPEAC 5, (2011).
Assessing Accelerator-based HPC Reverse Time Migration. Transactions on Parallel and Distributed Systems, Special Issue on Accelerators 22(1), 147-162 (2011).
A CellBE-based HPC Application for the Analysis of Vulnerabilities in Cryptographic Hash Functions. (2011).at <http://dx.doi.org/10.1109/HPCC.2010.113>
Design space exploration for aggressive core replication schemes in CMPs. Proceedings of the 20th international symposium on High performance distributed computing 269–270 (2011).doi:http://doi.acm.org/10.1145/1996130.1996169
Energy accounting for shared virtualized environments under DVFS using PMC-based power models. Future Generation Computer Systems 28, 457 - 468 (2011).
Implementation of a Reverse Time Migration Kernel using the HCE High Level Synthesis Tool. Field Programmable Technology 1–8 (2011).
Integrating dataflow abstractions into transactional memory. Systems for Future Multi-Core Architectures (SFMA'11) (2011).
Integrating Dataflow Abstractions into Transactional Memory. First Workshop on Systems for Future Multi-Core Architectures (SFMA'11) 1–6 (2011).
Mercurium: Design Decisions for a S2S Compiler. Cetus Users and Compiler Infastructure Workshop in conjunction with PACT 2011 (2011).
Non-intrusive Estimation of QoS Degradation Impact on E-commerce User Satisfaction. Network Computing and Applications (NCA), 2011 10th IEEE International Symposium on 179–186 (2011).
OmpSs: A PROPOSAL FOR PROGRAMMING HETEROGENEOUS MULTI-CORE ARCHITECTURES. Parallel Processing Letters 21, 173-193 (2011).
Optimizing the Exploitation of Multicore Processors and GPUs with OpenMP and OpenCL. Lecture Notes in Computer Science 6548/2011, 215-229 (2011).
Poster: programming clusters of GPUs with OMPSs. Proceedings of the international conference on Supercomputing 378–378 (2011).doi:http://doi.acm.org/10.1145/1995896.1995961
Productive Cluster Programming with OmpSs. Euro-Par 2011 Parallel Processing 6852, 555-566 (2011).
Reconfigurable Memory Controller with Programmable Pattern Support. 5th HiPEAC Workshop on Reconfigurable Computing (WRC 2011) 55–64 (2011).
Resource-aware Adaptive Scheduling for MapReduce Clusters. ACM/IFIP/USENIX 12th International Middleware Conference (Middleware 2011) (2011).
TARCAD: A template architecture for reconfigurable accelerator designs. IEEE Symposium on Application Specific Processors (SASP) 8-15 (2011).
A Template System for the Efficient Compilation of Domain Abstractions onto Reconfigurable Computers. 5th HiPEAC Workshop on Reconfigurable Computing (WRC 2011) 65–74 (2011).
Accurate Energy Accounting for Shared Virtualized Environments using PMC-based Power Modeling Techniques. (2010).
ACOTES Project: Advanced Compiler Technologies for Embedded Streaming Harm Munk. International Journal of Parallel Programming 39, 397-450 (2010).
Analysis of Task Offloading for Accelerators. (2010).at <http://www.springerlink.com/content/978-3-642-11514-1>
Automatic Prefetch and Modulo Scheduling Transformations for the Cell BE Architecture. (2010).at <http://doi.ieeecomputersociety.org/10.1109/TPDS.2009.97>
BSC contributions in Energy-aware Resource Management for Large Scale Distributed Systems. 1st Year Workshop of the COST Action IC0804 on Energy Efficiency in Large Scale Distributed Systems 76-79 (2010).
Buffer sizing for self-timed stream programs on heterogeneous distributed memory multiprocessors. International conference on High-Performance Embedded Architectures and Compilers (HiPEAC) 2010 96-110 (2010).