Publications

Filters: Author is Xavier Martorell
Book Chapter
E. Ayguadé, Badia, R. M., Bellens, P., Bueno-Hedo, J., Duran, A., Etsion, Y., Farreras, M., Ferrer, R., Labarta, J., Marjanovic, V., Martinell, L., Martorell, X., Pérez, J. M., Planas, J., Ramirez, A., Teruel, X., Tsalouchidou, I., and Valero, M., Hybrid/Heterogeneous Programming with OmpSs and its Software/Hardware Implications, in Programming Multi-Core and Many-Core Computing Systems, Wiley Series on Parallel and Distributed Computing, John Wiley & Sons, Inc., 2012.
Conference Paper
S. Florentino, Mateo, S., Beltran, V., Bosque, J. L., Martorell, X., and Ayguadé, E., Leveraging OmpSs to Exploit Hardware Accelerators, in International Symposium on Computer Architecture and High Performance Computing, Paris, France, 2014, pp. 112–119.
A. Fernández, Beltran, V., Martorell, X., Badia, R. M., Ayguadé, E., and Labarta, J., Task-Based Programming with OmpSs and Its Application, in Workshop on Software for Exascale Computing (SPPEXA), Porto, Portugal, 2014, pp. 602–613.
J. Ciesko, Mateo, S., Teruel, X., Beltran, V., Martorell, X., Badia, R. M., Ayguadé, E., and Labarta, J., Task-Parallel Reductions in OpenMP and OmpSs, in 10th International Workshop on OpenMP, IWOMP 2014, Salvador de Bahia, Brazil, 2014, pp. 1–15.
M. Milovanovic, Ferrer, R., Unsal, O., Cristal, A., Martorell, X., Ayguadé, E., Labarta, J., and Valero, M., Transactional Memory and OpenMP, in International Workshop on OpenMP (IWOMP-2007), Beijing, China, 2007, pp. 37–53.
International Conferences
Y. N. Patt, Foglia, P., Duesterwald, E., Faraboschi, P., and Martorell, X., 5th Int. Conf. on High Performance Embedded Architectures and Compilers (HiPEAC 2010). Pisa, Italy, 2010.
D. Oro, Fernandez, C., Segura, C., Martorell, X., and Hernando, J., Accelerating Boosting-based Face Detection on GPUs, Proc. of the 41st International Conference on Parallel Processing. Conference Publishing Services (CPS), USA, 2012.
R. Bertran, González, M., Becerra, Y., Carrera, D., Beltran, V., Martorell, X., Torres, J., and Ayguadé, E., Accurate Energy Accounting for Shared Virtualized Environments using PMC-based Power Modeling Techniques. 11th ACM/IEEE International Conference on Grid Computing (Grid 2010), 2010.
R. Ferrer, Beltran, V., González, M., Martorell, X., and Ayguadé, E., Analysis of Task Offloading for Accelerators. Proc. of the 5th International Conference, HiPEAC 2010, Pisa, Italy, January 25-27, 2010.
G. Utrera, Gil, M., and Martorell, X., Analyzing the impact of programming models for efficient communication overlap in high-speed networks, International Conference on High Performance Computing & Simulation, HPCS 2014, Bologna, Italy, 21-25 July, 2014. IEEE, pp. 218–225, 2014.
J. Guitart, Martorell, X., Torres, J., and Ayguadé, E., Application/Kernel Cooperation Towards the Efficient Execution of Shared-memory Parallel Java Codes, 17th IEEE International Parallel and Distributed Processing Symposium (IPDPS'03). Nice, France, 2003.
M. Alvanos, Tiotto, E., Farreras, M., and Martorell, X., Automatic Communication Coalescing for Irregular Computations in UPC Language, Proc. of the 2012 CASCON conference. IBM Toronto Lab, Toronto, Canada, 2012.
C. González, Fernández, M., Jiménez, D., Álvarez, C., and Martorell, X., Automatic generation of application-specific hardware accelerators on OpenSPARC, International Symposium on Code Generation and Optimization (CGO 2011). 2011.
A. Duran, González, M., Corbalán, J., Martorell, X., Ayguadé, E., Labarta, J., and Silvera, R., Automatic Thread Distribution for Nested Parallelism in OpenMP. 19th ACM International Conference on Supercomputing, 2005.
A. Duran, Teruel, X., Ferrer, R., Martorell, X., and Ayguadé, E., Barcelona OpenMP Tasks Suite: A Set of Benchmarks Targeting the Exploitation of Task Parallelism in OpenMP. Proc. of the 2009 International Conference on Parallel Processing (ICPP 2009), 2009.
F. Cabarcas, Rico, A., Ródenas, D., Martorell, X., Ramirez, A., and Ayguadé, E., CellSim: A Cell Processor Simulation Infrastructure, 2007 Advanced Computer Architecture and Compilation for Embedded Systems (ACACES-07). pp. 279-282, 2007.
S. Royuela, Duran, A., and Martorell, X., Compiler automatic discovery of OmpSs task dependencies, Proceedings of the workshop on Languages and Compilers for Parallel Computing. 2012.
E. Sandes, Miranda, G., de Melo, A. C. M. Alve, Martorell, X., and Ayguadé, E., CUDAlign 3.0: Parallel Biological Sequence Comparison in Large GPU Clusters, 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. IEEE, Chicago, IL, United States, pp. 160–169, 2014.
R. Bertran, González, M., Martorell, X., Navarro, N., and Ayguadé, E., Decomposable and Responsive Power Models for Multicore Processors using Performance Counters. Proc. of the 24th ACM Int. Conf. on Supercomputing, pp. 147-158, 2010.
L. Álvarez, Bertran, R., González, M., Martorell, X., Navarro, N., and Ayguadé, E., Design space exploration for aggressive core replication schemes in CMPs, Proceedings of the 20th international symposium on High performance distributed computing. ACM, New York, NY, USA, pp. 269–270, 2011.
N. Vujic, González, M., Ayguadé, E., Martorell, X., Ramirez, A., and Cabarcas, F., DMA++: On the Fly Data Realignment for On-Chip Memories, 16th IEEE International Symposium on High-Performance Computer Architecture. Bangalore (India), 2010.
N. Vujic, Álvarez, L., González, M., Martorell, X., and Ayguadé, E., DMA-circular: an enhanced high level programmable DMA controller for optimized management of on-chip local memories, Proceedings of the 9th conference on Computing Frontiers. ACM, New York, NY, USA, pp. 113–122, 2012.
J. Guitart, Martorell, X., Torres, J., and Ayguadé, E., Efficient Execution of Parallel Java Applications, 3rd Annual Workshop on Java for High Performance Computing. Sorrento, Italy, pp. 31-35, 2001.
E. de Sandes, Miranda, G., Melo, A., Martorell, X., and Ayguadé, E., Fine-grain parallel megabase sequence comparison with multiple heterogeneous GPUs, 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. Orlando, United States, pp. 383–384, 2014.
L. Álvarez, Vilanova, L., González, M., Martorell, X., Navarro, N., and Ayguadé, E., Hardware-software coherence protocol for the coexistence of caches and local memories, Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society Press, Los Alamitos, CA, USA, pp. 89:1–89:11, 2012.
A. Filgueras, Gil, E., Álvarez, C., Jiménez, D., Martorell, X., Langer, J., and Noguera, J., Heterogeneous tasking on SMP/FPGA SoCs: the case of OmpSs and the Zynq, 21st IFIP/IEEE International Conference on Very Large Scale Integration (VLSI-SoC). IEEE, Istanbul, Turkey, pp. 290–291, 2013.
M. González, Vujic, N., Martorell, X., Ayguadé, E., Eichenberger, A. E., Chen, T., Sura, Z., Zhang, T., O'Brien, K., and O'Brien, K., Hybrid Access-Specific Software Cache Techniques for the Cell BE Architecture. Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques (PACT'08), 2008.
J. Bueno-Hedo, Badia, R. M., Martorell, X., Ayguadé, E., and Labarta, J., Implementing OmpSs Support for Regions of Data in Architectures with Multiple Address Spaces, 27th International Conference on Supercomputing (ICS). ACM, Eugene, OR, United States, pp. 359–368, 2013.
M. Alvanos, Tiotto, E., Farreras, M., and Martorell, X., Improving communication in PGAS environments: Data prefetching and aggregation in UPC, 20th Annual International Conference hosted by the Centre for Advanced Studies & Research (CASCON 2011). ACM Digital library, Toronto, ON, Canada, 2011.
M. Alvanos, Farreras, M., Tiotto, E., Amaral, J. N., and Martorell, X., Improving Communication in PGAS Environments: Static and Dynamic Coalescing in UPC, 27th International Conference on Supercomputing (ICS). ACM, Eugene, OR, United States, pp. 129–138, 2013.
M. Alvanos, Gabriel, T., Farreras, M., Tiotto, E., Amaral, J. N., and Martorell, X., Improving Performance of All-to-all Communication Through Loop Scheduling in PGAS Environments, 27th International Conference on Supercomputing (ICS). ACM, Eugene, OR, United States, pp. 457–458, 2013.
P. Dadvand, Rossi, R., Gil, M., Martorell, X., Juanpere, E., Idelsohn, S. R., and Oñate, E., Migration of a Generic Multi-Physics Framework to HPC Environments, 23rd International Conference on Parallel Computational Fluid Dynamics. 2011.
J. Balart, González, M., Martorell, X., Ayguadé, E., Sura, Z., Chen, T., Zhang, T., O'Brien, K., and O'Brien, K., A Novel Asynchronous Software Cache Implementation for the Cell-BE Processor. Proceedings of the 20th International Workshop on Languages and Compilers for Parallel Computing, 2007.
A. Filgueras, Gil, E., Jiménez-González, D., Alvarez, C., Martorell, X., Langer, J., Noguera, J., and Vissers, K., OmpSs@Zynq All-Programmable SoC Ecosystem, 22nd ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. ACM/SIGDA, USA, pp. 137-146, 2014.
D. Cabrera, Martorell, X., Gaydadjiev, G. N., Ayguadé, E., and Jiménez-González, D., OpenMP Extensions for FPGA Accelerators. Proc. of the 9th Int. Conf. on Systems, Architectures, Modeling, and Simulation (SAMOS '09), pp. 17-24, 2009.
X. Teruel, Barton, C., Duran, A., Martorell, X., Ayguadé, E., Unnikrishnan, P., Zhang, G., and Silvera, R., OpenMP Tasking Analysis for Programmers. Proc. of the CAS Conference 2009, 2009.
X. Teruel, Unnikrishnan, P., Martorell, X., Ayguadé, E., Silvera, R., Zhang, G., and Tiotto, E., OpenMP Tasks in IBM XL Compilers. Proceedings of CASCON 2008, 2008.
G. Almasi, Archer, C., Erway, C. C., Heidelberger, P., Martorell, X., Moreira, J., Steinmacher-Burow, B. D., and Zheng, Y., Optimization of MPI Collective Communication on Blue Gene/L Systems. 19th ACM International Conference on Supercomputing, 2005.
D. Ródenas, Martorell, X., Ayguadé, E., Labarta, J., Almasi, G., Cascaval, C., Castaños, J., and Moreira, J., Optimizing NANOS OpenMP for the IBM Cyclops Multithreaded Architecture. 19th International Parallel and Distributed Processing Symposium (IPDPS '05), 2005.
D. Jiménez-González, Martorell, X., and Ramirez, A., Performance Analysis of Cell Broadband Engine for High Memory Bandwidth Applications, IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS-2007). pp. 210-219, 2007.
J. Bueno, Duran, A., Martorell, X., Ayguadé, E., Badia, R. M., and Labarta, J., Poster: programming clusters of GPUs with OMPSs, Proceedings of the international conference on Supercomputing. ACM, New York, NY, USA, pp. 378–378, 2011.
J. Bueno, Martinell, L., Duran, A., Farreras, M., Martorell, X., Badia, R. M., Ayguadé, E., and Labarta, J., Productive Cluster Programming with OmpSs, Euro-Par 2011 Parallel Processing, vol. 6852. Springer Berlin / Heidelberg, pp. 555-566, 2011.
J. Bueno-Hedo, Planas, J., Duran, A., Badia, R. M., Martorell, X., Ayguadé, E., and Labarta, J., Productive Programming of GPU Clusters with OmpSs, 26th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2012). IEEE Computer Society, pp. 557-568, 2012.
E. Ayguadé, Badia, R. M., Cabrera, D., Duran, A., González, M., Igual, F., Jiménez, D. A., Labarta, J., Martorell, X., Mayo, R., Pérez, J. M., and Quintana-Ortí, E. S., A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures. Proceedings of the International Workshop on OpenMP (IWOMP 2009), 2009.
M. Alvanos, Amaral, J. N., Tiotto, E., Farreras, M., and Martorell, X., Reducing Compiler-Inserted Instrumentation in Unified-Parallel-C Code Generation, International Symposium on Computer Architecture and High Performance Computing. IEEE Press, Paris, France, pp. 270–277, 2014.
J. Bueno, Martorell, X., Costa, J., Cortes, T., Ayguadé, E., Zhang, G., Barton, C., and Silvera, R., Reducing Data Access Latency in SDSM Systems using Runtime Optimizations, 19th Annual International Conference hosted by the Centre for Advanced Studies & Research (CASCON 2010). ACM Digital library, Toronto, ON, Canada, pp. 160-173, 2010.
J. Costa, Cortes, T., Martorell, X., Ayguadé, E., and Labarta, J., Running OpenMP Applications Efficiently on an Everything-shared SDSM. 18th International Parallel and Distributed Processing Symposium (IPDPS-2004), 2004.
C. Ciobanu, Martorell, X., Kuzmanov, G. K., Ramirez, A., and Gaydadjiev, G. N., Scalability Evaluation of a Polymorphic Register File: A CG Case Study, Architecture of Computing Systems - ARCS 2011. Springer-Verlag Berlin Heidelberg, pp. 13-25, 2011.
M. Álvarez, Ramirez, A., Martorell, X., Ayguadé, E., and Valero, M., Scalability of Macroblock-level Parallelism for H.264 Decoding, Advanced Computer Architecture and Compilation for Embedded Systems. ACACES 2008, Poster. pp. 59-62, 2008.
X. Teruel, Martorell, X., Duran, A., Ferrer, R., and Ayguadé, E., Support for OpenMP Tasks in Nanos v4. Proceedings of CASCON 2007, 2007.
