Getting Started
Kernels & Layers
Python API
LittleKernel
Advanced Topics
Examples
MoE (Mixture of Experts) AllReduce kernel for tensor parallelism.
# Test MoE AllReduce bash scripts/launch.sh python/triton_dist/test/nvidia/test_moe_reduce_ar.py 8192 2048 1536 32 2