Getting Started
User Guides
Config
CompilationConfig
CompilationLevel
CUDAGraphMode
QuantizationConfig
LayerQuantConfig
QuantType
ParallelConfig
SpeculativeConfig
SamplingParams
EngineArgs
atom/utils/envs.py
envs.py
Qwen3ForCausalLM
Qwen3MoeForCausalLM
LlamaForCausalLM
MixtralForCausalLM
DeepseekV2ForCausalLM
DeepSeekMTP
GptOssForCausalLM
Glm4MoeForCausalLM
num_hidden_layers
Attention
base_attention.py
attention_mha.py
attention_mla.py
attentions/backends.py
FusedMoE
moe.py
topK.py
FusedMoEParallelConfig
fused_moe/mori_prepare_finalize.py
fused_moe/config.py
fused_moe_triton.py
RMSNorm
layernorm.py
LayerNorm
SiluAndMul
activation.py
VocabParallelEmbedding
embed_head.py
ParallelLMHead
RotaryEmbedding
rotary_embedding.py
get_rope()
Sampler
sampler.py
RejectionSampler
rejection_sampler.py
atom/model_ops/
atom/model_ops/attentions/
atom/model_ops/fused_moe/
atom/utils/
allocate
deallocate
API Reference