Publications

Export 155 results:
Author Title Type [ Year(Asc)]
Filters: Author is Jesús Labarta  [Clear All Filters]
2014
Servat, H., Llort, G., González, J., Giménez, J. & Labarta, J. Identifying code phases using piece-wise linear regressions. 28th IEEE International Parallel & Distributed Processing Symposium (IPDPS) 941-951 (2014).
2012
González, J., Huck, K., Giménez, J. & Labarta, J. Automatic Refinement of Parallel Applications Structure Detection. LSPP '12: Proceedings of the 2012 Workshop on Large-Scale Parallel Processing (2012).
Tejedor, E. et al. A high-productivity task-based programming model for clusters. Concurr. Comput. : Pract. Exper. 24, 2421–2448 (2012).
Ayguadé, E. et al. Programming Multi-Core and Many-Core Computing Systems (Wiley Series on Parallel and Distributed Computing) (John Wiley & Sons, Inc., 2012). at <http://www.par.univie.ac.at/~pllana/manycore_book/>
Bueno-Hedo, J. et al. Productive Programming of GPU Clusters with OmpSs. 26th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2012) 557-568 (2012). doi:http://doi.ieeecomputersociety.org/10.1109/IPDPS.2012.58
2011
Tejedor, E. et al. ClusterSs: a Task-based Programming Model for Clusters. Proceedings of the 20th International ACM Symposium on High Performance Distributed Computing, San Jose, California, USA 267–268 (2011). doi:http://doi.acm.org/10.1145/1996130.1996168
Casas, M., Servat, H., Badia, R. M. & Labarta, J. Extracting the optimal sampling frequency of applications using spectral analysis. Concurrency and Computation: Practice and Experience n/a–n/a (2011). doi:10.1002/cpe.1819
Servat, H., Llort, G., Giménez, J., Huck, K. & Labarta, J. Folding: detailed analysis with coarse sampling. Tools for High Performance Computing 2011. Proceedings of the 5th International Workshop on Parallel Tools for High Performance Computing (2011).
Bellens, P., Pérez, J. M., Badia, R. M. & Labarta, J. Making the Best of Temporal Locality: Just-in-Time Renaming and Lazy Write-Back on the Cell/B.E. International Journal of High Performance Computing Applications 25, (2011).
Duran, A. et al. OmpSs: A PROPOSAL FOR PROGRAMMING HETEROGENEOUS MULTI-CORE ARCHITECTURES. Parallel Processing Letters 21, 173-193 (2011).
Ferrer, R. et al. Optimizing the Exploitation of Multicore Processors and GPUs with OpenMP and OpenCL. Lecture Notes in Computer Science 6548/2011, 215-229 (2011).
Bellens, P., Palaniappan, K., Badia, R. M., Seetharaman, G. & Labarta, J. Parallel implementation of the integral histogram. Proceedings of the 13th international conference on Advanced concepts for intelligent vision systems 586–598 (2011). at <http://dl.acm.org/citation.cfm?id=2034246.2034306>
Bellens, P., Palaniappan, K., Badia, R. M., Seetharaman, G. & Labarta, J. A Portable Implementation of the Integral Histogram in StarSs. Proceedings of the SC11 conference (2011).
Bueno, J. et al. Poster: programming clusters of GPUs with OMPSs. Proceedings of the international conference on Supercomputing 378–378 (2011). doi:http://doi.acm.org/10.1145/1995896.1995961
Bueno, J. et al. Productive Cluster Programming with OmpSs. Euro-Par 2011 Parallel Processing 6852, 555-566 (2011).
González, J. et al. Simulating Whole Supercomputer Applications. IEEE Micro 31, 32-45 (2011).
Subotic, V., Labarta, J. & Valero, M. Simulation environment to study overlapping of communication and computation. (2011).
Bellens, P., Pérez, J. M., Badia, R. M. & Labarta, J. A Study of Speculative Distributed Scheduling on the Cell/B.E. proceedings of the 25th IEEE International Parallel & Distributed Processing Symposium (2011).
Llort, G. et al. Trace Spectral Analysis toward Dynamic Levels of Detail. 17th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2011, Tainan, Taiwan 332 - 339 (2011).
Labarta, J. (in Perfromance TUning of Cientific Applications. CRC Press. ISBN 978-1-4398-1569-4, 2011).
Servat, H., Llort, G., Giménez, J., Huck, K. & Labarta, J. Unveiling Internal Evolution of Parallel Application Computation Phases. ICPP'2011: International Conference on Parallel Processing (ICPP) 155-164 (2011).
2010
Casas, M., Badia, R. M. & Labarta, J. Automatic Phase Detection and Structure Extraction of MPI Applications. (2010).
Torres, J. et al. BSC contributions in Energy-aware Resource Management for Large Scale Distributed Systems. 1st Year Workshop of the COST Action IC0804 on Energy Efficiency in Large Scale Distributed Systems 76-79 (2010).
Etinski, M., Corbalán, J., Labarta, J. & Valero, M. BSLD Threshold Driven Power Management Policy for HPC Centers. (2010).
Huck, K. & Labarta, J. Detailed load balance analysis of large scale parallel applications. (2010).
Marjanovic, V., Labarta, J., Ayguadé, E. & Valero, M. Effective Communication and Computation Overlap with Hybrid MPI/SMPSs. (2010).
Ayguadé, E. et al. Extending OpenMP to Survive the Heterogeneous Multi-core Era. (2010).
Giménez, J. et al. Guided Performance Analysis Combining Profile and Trace Tools. (2010).
Pérez, J. M., Badia, R. M. & Labarta, J. Handling task dependencies under strided and aliased references. (2010).
Llort, G., González, J., Servat, H., Giménez, J. & Labarta, J. On-line Detection of Large-scale Parallel Application's Structure. 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS'2010) (2010).
Etinski, M., Corbalán, J., Labarta, J. & Valero, M. Optimizing Job Performance Under a Given Power Constraint In HPC Centers. (2010).
Badia, R. M. et al. Optimizing the Exploitation of Multicore Processors and GPUs with OpenMP and OpenCL. (2010).
Marjanovic, V., Labarta, J., Ayguadé, E. & Valero, M. Overlapping Communication and Computation by Using a Hybrid MPI/SMPSs Approach. (2010).
González, J., Giménez, J. & Labarta, J. Performance Data Extrapolation in Parallel Codes. ICPADS '10: Proceedings of the 16th International Conference on Parallel and Distributed Systems (2010).
Labarta, J. StarSs: A Programming Model for the Multicore Era. (2010).
Etsion, Y. et al. Task Superscalar: An Out-of-Order Task Pipeline. IEEE/ACM Intl. Symp. on Microarchitecture (MICRO-43) 89-100 (2010). at <http://dx.doi.org/10.1109/MICRO.2010.13>
Etsion, Y. et al. Task Superscalar: Using Processors as Functional Units. USENIX Workshop on Hot Topics In Parallelism (HotPar) (2010).
Etinski, M., Corbalán, J., Labarta, J. & Valero, M. Utilization Driven Power-Aware Job Scheduling. (2010).
2009
González, J., Giménez, J. & Labarta, J. Automatic Detection of Parallel Applications Computation Phases. IPDPS '09: Proceedings of the 23rd IEEE International Parallel and Distributed Processing Symposium (2009).
González, J., Giménez, J. & Labarta, J. Automatic evaluation of the computation structure of parallel applications. PDCAT'09: Proceedings of the 10th International Conference on Parallel and Distributed Computing, Applications and Technologies (2009).
Bellens, P. et al. CellSs: Scheduling Techniques to Better Exploit Memory Hierarchy. (2009).
Etsion, Y., Ramirez, A., Badia, R. M. & Labarta, J. Cores as Functional Units: A Task-Based, Out-of-Order, Dataflow Pipeline. Advanced Computer Architecture and Compilation for Embedded Systems (ACACES) (2009).
Servat, H., Llort, G., Giménez, J. & Labarta, J. Detailed performance analysis using coarse grain sampling. (2009).
Bellens, P., Pérez, J. M., Badia, R. M. & Labarta, J. Exploiting Locality on the Cell/B.E. Through Bypassing. (2009).
Ayguadé, E. et al. An Extension of the StarSs Programming Model for Platforms with Multiple GPUs. (2009).
Sirvent, R., Badia, R. M. & Labarta, J. Graph-based task replication for workflow applications. (2009).
Planas, J., Badia, R. M., Ayguadé, E. & Labarta, J. Hierarchical task based programming with StarSs. (2009).
Badia, R. M., Pérez, J. M., Ayguadé, E. & Labarta, J. Impact of the memory hierarchy on shared memory architectures in multicore programming models. (2009).
Bellens, P., Pérez, J. M., Badia, R. M. & Labarta, J. Just-in-time renaming and Lazy Write-back on the Cell/BE. (2009).
Badia, R. M. et al. Parallelizing Dense and Banded Linear Algebra Libraries Using SMPSs. (2009).

Pages