Publications
Export 156 results:
Sort by: Author Title [ Type
] Year Filters: Author is Jesús Labarta [Clear All Filters]
Making the Best of Temporal Locality: Just-in-Time Renaming and Lazy Write-Back on the Cell/B.E. International Journal of High Performance Computing Applications 25, (2011).
A high-productivity task-based programming model for clusters. Concurr. Comput. : Pract. Exper. 24, 2421–2448 (2012).
CellSs: Making it easier to Program the Cell Broadband Engine processor. (2007).at <http://researchweb.watson.ibm.com/journal/rd/515/perez.html>
Automatic Grid workflow based on imperative programming languages. 18 , 1169-1186 (2006).
WAS Control Center: An Autonomic Performance-Triggered Tracing Environment for WebSphere. (2005).at <http://www.bsc.es/media/388.pdf>
Unveiling Internal Evolution of Parallel Application Computation Phases. ICPP'2011: International Conference on Parallel Processing (ICPP) 155-164 (2011).
Tuning Dynamic Web Applications using Fine-Grain Analysis. 13th Euromicro Conference on Parallel, Distributed and Network-based Processing (PDP'05) 84-91 (2005).doi:http://dx.doi.org/10.1109/EMPDP.2005.44
Trace Spectral Analysis toward Dynamic Levels of Detail. 17th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2011, Tainan, Taiwan 332 - 339 (2011).
Task Superscalar: An Out-of-Order Task Pipeline. IEEE/ACM Intl. Symp. on Microarchitecture (MICRO-43) 89-100 (2010).at <http://dx.doi.org/10.1109/MICRO.2010.13>
A Study of Speculative Distributed Scheduling on the Cell/B.E. proceedings of the 25th IEEE International Parallel & Distributed Processing Symposium (2011).
Programmable and Scalable Reductions on Clusters. Proceedings of 27th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS) (2013).
Productive Programming of GPU Clusters with OmpSs. 26th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2012) 557-568 (2012).doi:http://doi.ieeecomputersociety.org/10.1109/IPDPS.2012.58
Predicting MPI buffer addresses. (2004).at <http://springerlink.metapress.com/content/7y3ljkr3jd9k/>
Poster: programming clusters of GPUs with OMPSs. Proceedings of the international conference on Supercomputing 378–378 (2011).doi:http://doi.acm.org/10.1145/1995896.1995961
A Portable Implementation of the Integral Histogram in StarSs. Proceedings of the SC11 conference (2011).
Performance Impact of Unaligned memory Operations in SIMD Extensions for Video CODEC Applications. IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2007) 62-71 (2007).
Performance Data Extrapolation in Parallel Codes. ICPADS '10: Proceedings of the 16th International Conference on Parallel and Distributed Systems (2010).
Parallel implementation of the integral histogram. Proceedings of the 13th international conference on Advanced concepts for intelligent vision systems 586–598 (2011).at <http://dl.acm.org/citation.cfm?id=2034246.2034306>


