Publications (52)

2015

  1. Fast-coding robust motion estimation model in a GPU

    Proceedings of SPIE - The International Society for Optical Engineering

  2. On processing extreme data

    Scalable Computing, Vol. 16, Núm. 4, pp. 467-490

2014

  1. Performance evaluation of OpenACC compilers

    Proceedings - 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2014

2013

  1. Programming for GPUs: The directive-based approach

    Proceedings - 2013 8th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, 3PGCIC 2013

  2. A preliminary evaluation of OpenACC implementations

    Journal of Supercomputing

2012

  1. accULL: An user-directed approach to heterogeneous programming

    Proceedings of the 2012 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2012

  2. accULL: An OpenACC implementation with CUDA and OpenCL support

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  3. Optimization strategies in different CUDA architectures using llCoMP

    Microprocessors and Microsystems

  4. Directive-based programming for GPUs: A comparative study

    Proceedings of the 14th IEEE International Conference on High Performance Computing and Communications, HPCC-2012 - 9th IEEE International Conference on Embedded Software and Systems, ICESS-2012

2011

  1. Optimize or wait? Using llc fast-prototyping tool to evaluate CUDA optimizations

    Proceedings - 19th International Euromicro Conference on Parallel, Distributed, and Network-Based Processing, PDP 2011

  2. Case studies in automatic gPGPU code generation with llc

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  3. Automatic code generation for GPUs in llc

    Journal of Supercomputing

2009

  1. Automatic hybrid MPI+OpenMP code generation with llc

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  2. Toward the parallelization of GSL

    Journal of Supercomputing, Vol. 48, Núm. 1, pp. 88-114

  3. IDEWEP: Web service for astronomical parallel image deconvolution

    Journal of Network and Computer Applications, Vol. 32, Núm. 1, pp. 293-313

2007

  1. Parallelizing dense linear algebra operations with task queues in 11c

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  2. Parallelization of a public image restoration algorithm

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  3. Generation of microlensing magnification patterns with high performance computing techniques

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

2006

  1. Applying high performance computing techniques in astrophysics

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  2. Basic skeletons in llc

    Parallel Computing, Vol. 32, Núm. 7-8, pp. 491-506