Programação com GPU/Aceleradores: Introdução à Programação CUDA
Denise Stringhini
2024, course at the Escola de Verão do Laboratório Nacional de Computação Científica, video available at: www.youtube.com/watch?v=PvgTxSlDwQU
[LINK]
Improving Convolution via Cache Hierarchy Tiling and Reduced Packing
Victor Ferrari, Rafael Sousa, Marcio Pereira, João P. L. de Carvalho, José Nelson Amaral, and Guido Araujo
2023, in Proceedings of the International Conference on Parallel Architectures and Compilation Techniques (PACT '22). Association for Computing Machinery, New York, NY, USA, 538–539
[LINK]
Source Matching and Rewriting for MLIR Using String-Based Automata
Vinicius Espindola, Luciano Zago, Hervé Yviquel, and Guido Araujo
2023 ACM Trans. Archit. Code Optim. 20, 2, Article 22 (June 2023), 26 pages
[LINK]
Advancing Direct Convolution Using Convolution Slicing Optimization and ISA Extensions
Victor Ferrari, Rafael Sousa, Marcio Pereira, João P. L. De Carvalho, José Nelson Amaral, José Moreira, and Guido Araujo
2023 ACM Trans. Archit. Code Optim. 20, 4, Article 54 (December 2023), 26 pages
[LINK]
A Self-distributing System Framework for the Computing Continuum
Roberto Rodrigues Filho, Renato S. Dias, João Seródio, Barry Porter, Fábio M. Costa, Edson Borin, Luiz F. Bittencourt
2023 in 32nd International Conference on Computer Communications and Networks (ICCCN 2023). Honolulu, HI, USA, [accepted]
[LINK]
Tensor slicing and optimization for multicore NPUs
Rafael Sousa, Marcio Pereira, Yongin Kwon, Taeho Kim, Namsoon Jung, Chang Soo Kim, Michael Frank, Guido Araujo
2023 Journal of Parallel and Distributed Computing, Volume 175, Pages 66-79, ISSN 0743-7315
[LINK]
The OpenMP Cluster Programming Model
Hervé Yviquel, Marcio Pereira, Emílio Francesquini, Guilherme Valarini, Gustavo Leite, Pedro Rosso, Rodrigo Ceccato, Carla Cusihualpa, Vitoria Dias, Sandro Rigo, Alan Souza, and Guido Araujo
2023 in Workshop Proceedings of the 51st International Conference on Parallel Processing (ICPP Workshops '22). Association for Computing Machinery, New York, NY, USA, Article 17, 1–11
[LINK]
An OpenMP-only Linear Algebra Library for Distributed Architectures
C. Cardoso, H. Yviquel, G. Valarini, G. Leite, R. Ceccato, M. Pereira, A. Souza, G. Araujo
2022 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW), Bordeaux, France, 2022, pp. 17-24
[LINK]
A multiple linear regression approach for understanding the trade-offs in learning HPC job scheduling heuristics
Lucas de Sousa Rosa, Danilo Carastan-Santos, Alfredo Goldman
[LINK]
Evaluating execution time predictions on GPU kernels using an analytical model and machine learning techniques
Marcos Amaris, Raphael Camargo, Daniel Cordeiro, Alfredo Goldman, Denis Trystram
2023 Journal of Parallel and Distributed Computing, 171, 66-78
[LINK]
Ion-Molecule Collision Cross-Section Simulation using Linked-cell and Trajectory Parallelization
Samuel Cajahuaringa, Leandro N. Zanotto, Daniel L. Z. Caetano, Sandro Rigo, Hervé Yviquel, Munir S. Skaf, Guido Araujo
2022 IEEE 34th International Symposium on Computer Architecture and High-Performance Computing (SBAC-PAD), Bordeaux, France, 2022, pp. 150-159
[LINK]
Special Issue on Compiling for Accelerators
G. Araujo and L. Wanner
2022, in IEEE Micro, vol. 42, no. 5, pp. 6-8, 1 Sept.-Oct. 2022
[LINK]
Using Barrier Elision to Improve Transactional Code Generation
Bruno Honorio, Joao P. L. de Carvalho, Catalina Morales, Alexandro Baldassin, Guido Araujo
2022 ACM Trans. Archit. Code Optim. 19, 3, Article 46 (September 2022), 23 pages
[LINK]
Implementing the Broadcast Operation in a Distributed Task-based Runtime
R. Ceccato, H. Yviquel, M. Pereira, A. Souza, and G. Araujo
2022 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW), Bordeaux, France, 2022, pp. 25-32
[LINK]
Parallelizing Git Checkout: a Case Study of I/O Parallelism
Matheus Tavares Bernardino and Alfredo Goldman
2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Bordeaux, France, 2022, pp. 293-304
[LINK]
Compiling Files in Parallel: A Study with GCC
Giuliano Belinassi, Richard Biener, Jan Hubička, Daniel Cordeiro, Alfredo Goldman
2022 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW), Bordeaux, France, 2022, pp. 1-8
[LINK]
Coisas para Fazer Antes de Paralelizar
Alfredo Goldman, Vitor Tessari Terra, Sarita Mazzini Bruschi
2022, in Minicursos do XXIII Simpósio em Sistemas Computacionais de Alto Desempenho, ch. 3
[LINK]
In search of efficient scheduling heuristics from simulations and Machine Learning
Lucas Rosa, Alfredo Goldman
2022, in Anais Estendidos do XXIII Simpósio em Sistemas Computacionais de Alto Desempenho, (pp. 17-24). Porto Alegre: SBC
[LINK]
Technical Debt Prioritization: A Developer's Perspective
Diogo Pina, Carolyn Seaman, Alfredo Goldman
2022 IEEE/ACM International Conference on Technical Debt (TechDebt), Pittsburgh, PA, USA, 2022, pp. 46-55
[LINK]
Coisas para Saber Antes de Fazer o Seu Próprio Benchmarks Game
Alfredo Goldman, Elisa Uhura, Sarita Mazzini Bruschi
2022, in Minicursos da XXII Escola Regional de Alto Desempenho da Região Sul (ERAD-RS 2022), Eds. Arthur Lorenzon, Márcio Castro, Mauricio Pillon. (pp. 85-103). Porto Alegre: SBC
[LINK]
Prefácio e Organização dos Anais da XIII Escola Regional de Alto Desempenho de São Paulo
Hélio Crestana Guardia, Ricardo Menotti, Alfredo Goldman, Flávia Pisani
2022, Anais da XIII Escola Regional de Alto Desempenho de São Paulo. (pp. i-xi). São Carlos: SBC
[LINK]
Accelerating Graph Applications Using Phased Transactional Memory
Catalina Morales, Rafael Murari, Joao P. L. de Carvalho, Bruno Honorio, Alexandro Baldassin and Guido Araujo
2021, in 27th Int. European Conference on Parallel and Distributed Computing (Euro-Par 2021), Sep. 2021
[LINK]
Efficient Tensor Slicing for Multicore NPUs using Memory Burst Modeling
R. Sousa, B. Jung, J. Kwak, M. Frank, and G. Araujo
2021 IEEE 33rd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Belo Horizonte, Brazil, 2021, pp. 84-93
[LINK]