Europar
Disaggregated Design for GPU-Based Volumetric Data Structures
Example of a disaggregated layout optimized to reduce communication overhead
Volumetric data structures typically prioritize data locality, focusing on efficient memory access patterns. This singular focus can neglect other critical performance factors, such as occupancy, communication, and kernel fusion. We introduce a novel disaggregated design that rebalances trade-offs between locality and these objectives—reducing communication overhead on distributed memory architectures, mitigating register pressure in complex boundary conditions, and enabling kernel fusion.
We provide a thorough analysis of its benefits on a single-node multi-GPU Lattice Boltzmann Method (LBM) solver. Our evaluation spans dense, block-sparse, and multi-resolution discretizations, demonstrating our design’s flexibility and efficiency. Leveraging this approach, we achieve up to a speedup over state-of-the-art solutions.
Download publicationResearch Authors
Ahmed H. Mahmoud
MIT CSAIL
Related Publications
2024
Optimized GPU Implementation of Grid Refinement in Lattice Boltzmann MethodOptimized GPU-accelerated algorithm for implementing grid refinement…
2022
Neon: A Multi-GPU Programming Model for Grid-based ComputationsWe present Neon, a new programming model for grid-based computation…
Get in touch
Something pique your interest? Get in touch if you’d like to learn more about Autodesk Research, our projects, people, and potential collaboration opportunities.
Contact us