Identifying code phases using piece-wise linear regressions. 28th IEEE International Parallel & Distributed Processing Symposium (IPDPS) 941-951 (2014).
Detailed and simultaneous power and performance analysis. Concurrency and Computation: Practice and Experience (2013). doi:10.1002/cpe.3188
Framework for a Productive Performance Optimization. Parallel Computing Systems and Applications 39, 336-353 (2013).
On the Usefulness of Object Tracking Techniques in Performance Analysis. Proceedings of SC13: International Conference for High Performance Computing, Networking, Storage and Analysis 29:1–29:11 (2013). doi:10.1145/2503210.2503267
On the Instrumentation of OpenMP and OmpSs Tasking Constructs. Euro-Par 2012: Parallel Processing Workshops. Lecture Notes in Computer Science 7640, 414-428 (2012).
Folding: detailed analysis with coarse sampling. Tools for High Performance Computing 2011. Proceedings of the 5th International Workshop on Parallel Tools for High Performance Computing (2011).
Trace Spectral Analysis toward Dynamic Levels of Detail. 17th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2011, Tainan, Taiwan 332 - 339 (2011).
Unveiling Internal Evolution of Parallel Application Computation Phases. ICPP'2011: International Conference on Parallel Processing (ICPP) 155-164 (2011).
On-line Detection of Large-scale Parallel Application's Structure. 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS'2010) (2010).