LOCA SERIES/Reading Club: BerryBees: Breadth First Search by Bit-Tensor-Cores

Fecha: 06/May/2025 Time: 16:00

Place:

[HYBRID] Room: 1-3-2, BSC Main Building and Online via Zoom.

Primary tabs

AI4S registration link

Abstract

Breadth First Search (BFS) plays a key role in computational science, networking, and artificial intelligence applications. Although the BFS approach has been extensively studied, particularly in its direction-optimized form, existing implementations still present three main issues: (1) high memory footprint; (2) the under-realized lightweight representations using bitmaps; and (3) the underuse of modern hardware such as Tensor Cores Units (TCUs).

In this paper, we propose BerryBees, an efficient algebraic BFS algorithm that leverages the Matrix Multiply-Accumulate (MMA) instructions of TCUs. The novelty of BerryBees lies in (1) the Binarized Row Slice (BRS) format, which encodes the adjacency matrix by using bitmaps to represent non-empty row segments; and (2) a warp-level algorithm that leverages TCUs for accelerating both SpMV and SpMSpV operations for enhanced BFS performance. The experimental results on three latest NVIDIA GPUs show that BerryBees outperforms five state-of-the-art BFS methods: GAP, Gunrock, Enterprise, GSWITCH, and GraphBLAST, and delivers average speedups of 1.42 ×, 1.97×, 5.05×, 1.24×, and 3.74× (up to 9.99×, 13.66×, 114.07×, 13.97×, and 24.74×), respectively.

Short bio

Yuyao Niu is a third-year PhD student at the Barcelona Supercomputing Center (BSC) and the Polytechnic University of Catalonia (UPC). Her research focuses on high-performance computing, parallel computing, and sparse matrix computation.
 

Speakers

Speaker: Yuyao Niu. Currently on her third-year PhD at the Barcelona Supercomputing Center (BSC) and the Polytechnic University of Catalonia (UPC)
Host: Teresa Cervero. Leading Research Engineer. Computer Sciences - Technical Management HW Engineering, BSC