Examples

This is a growing collection of practical examples that show how to use Iris for distributed computing and matrix operations. The examples range from basic memory operations to more sophisticated GEMM implementations, and new ones are added as further patterns and optimizations are identified.

Directory Structure

Basic Operations

  • 00_load: Load operations across multiple GPUs

  • 01_store: Store operations across multiple GPUs

  • 02_all_load: Load operations where all GPUs load simultaneously

  • 03_all_store: Store operations where all GPUs store simultaneously

  • 04_atomic_add: Atomic add operations across multiple GPUs

  • 05_atomic_xchg: Atomic exchange operations across multiple GPUs
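
The semantics of the operations listed above can be illustrated with a toy, single-process model. This sketch does not use the Iris API and is not how the examples are implemented: `ToyHeap` and `run_toy_example` are hypothetical names, Python threads stand in for GPUs, and a lock stands in for hardware atomicity. It only shows the difference between a plain load/store, an atomic add (all contributions are kept), and an atomic exchange (the last writer wins, and each caller receives the previous value).

```python
import threading


class ToyHeap:
    """Toy stand-in for one rank's memory. Hypothetical; NOT the Iris API."""

    def __init__(self, size):
        self.data = [0] * size
        self._lock = threading.Lock()  # models atomicity of each operation

    def load(self, idx):
        # Read a value from this rank's memory.
        with self._lock:
            return self.data[idx]

    def store(self, idx, value):
        # Overwrite a value in this rank's memory.
        with self._lock:
            self.data[idx] = value

    def atomic_add(self, idx, value):
        # Add to the stored value and return the previous value.
        with self._lock:
            old = self.data[idx]
            self.data[idx] = old + value
            return old

    def atomic_xchg(self, idx, value):
        # Replace the stored value and return the previous value.
        with self._lock:
            old = self.data[idx]
            self.data[idx] = value
            return old


def run_toy_example(num_ranks=4):
    # One toy heap per simulated rank; rank 0 is the shared target.
    heaps = [ToyHeap(size=2) for _ in range(num_ranks)]
    target = heaps[0]

    def worker(rank):
        target.atomic_add(0, 1)      # every rank increments a shared counter
        target.atomic_xchg(1, rank)  # every rank swaps in its own rank id

    threads = [threading.Thread(target=worker, args=(r,)) for r in range(num_ranks)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

    # A plain "remote" store followed by a load from another rank's heap.
    heaps[1].store(0, 42)
    return target.load(0), target.load(1), heaps[1].load(0)


if __name__ == "__main__":
    counter, last_writer, loaded = run_toy_example(4)
    print(counter, last_writer, loaded)
```

With four simulated ranks the counter always ends at 4 because no increment is lost, while the exchanged slot holds whichever rank id was written last, which depends on thread scheduling.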

Communication Patterns

GEMM Operations

Utilities

  • benchmark: Benchmarking utilities and performance testing tools

  • common: Common utilities and shared code for examples