ATOM Documentation
ATOM (Accelerated Training and Optimization for Models) is AMD’s high-performance LLM serving framework optimized for ROCm platforms.
Getting Started
User Guides
- ATOM Architecture Guide
- ATOM Configuration Guide
- Quick Reference
- 1. Master Configuration (
Config) - 2. Compilation Configuration (
CompilationConfig) - 3. Quantization Configuration (
QuantizationConfig&LayerQuantConfig) - 4. Parallel Configuration (
ParallelConfig) - 5. Speculative Decoding Configuration (
SpeculativeConfig) - 6. Sampling Parameters (
SamplingParams) - 7. CLI Arguments (
EngineArgs) - 8. Environment Variables
- 9. Decision Tree – Choosing a Compilation Level
- Source Files
- ATOM Model Support Guide
- ATOM Model Operations Guide
- ATOM Scheduling & KV Cache Guide
- ATOM Distributed Inference Guide
- ATOM Compilation & CUDA Graphs Guide
- ATOM Serving & Benchmarking Guide
API Reference
Features
High Performance: Optimized kernels for AMD Instinct GPUs
Model Support: Wide range of LLM architectures (Llama, GPT, etc.)
Distributed Serving: Multi-GPU and multi-node deployment
Compilation: CUDAGraph and ROCm optimizations
Benchmarking: Built-in performance measurement tools
Supported GPUs
GPU |
Architecture |
Memory |
Status |
|---|---|---|---|
AMD Instinct MI300X |
CDNA 3 (gfx942) |
192 GB HBM3 |
✅ Fully Supported |
AMD Instinct MI250X |
CDNA 2 (gfx90a) |
128 GB HBM2e |
✅ Fully Supported |
AMD Instinct MI300A |
CDNA 3 (gfx950) |
128 GB HBM3 |
🧪 Experimental |
Quick Links
GitHub: https://github.com/ROCm/ATOM
ROCm Documentation: https://rocm.docs.amd.com
Getting Help
Documentation: https://rocm.github.io/ATOM/
GitHub Issues: https://github.com/ROCm/ATOM/issues
ROCm Community: https://github.com/ROCm/ROCm/discussions