MoE ReduceScatter
ReduceScatter kernel for Mixture-of-Experts (MoE) layers under tensor parallelism.
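For readers unfamiliar with the collective, the following is a minimal plain-Python sketch of what a reduce-scatter computes. It is not the kernel in this repository: the function name `reduce_scatter`, the list-of-rows representation, and the choice to scatter along rows are all assumptions made for illustration. It assumes each rank holds a full-size partial result, as is typical under tensor parallelism.

```python
def reduce_scatter(per_rank_inputs):
    """Reference reduce-scatter (hypothetical helper, not the library API).

    per_rank_inputs[r] is the full-size partial result held by rank r,
    given as a list of rows. Entry r of the returned list is the block of
    the element-wise sum that rank r would own afterwards.
    """
    world_size = len(per_rank_inputs)
    num_rows = len(per_rank_inputs[0])
    assert num_rows % world_size == 0, "rows must divide evenly across ranks"
    rows_per_rank = num_rows // world_size

    # Reduce: element-wise sum of the partial results from every rank.
    summed = [
        [sum(vals) for vals in zip(*rows)]
        for rows in zip(*per_rank_inputs)
    ]
    # Scatter: rank r keeps only its contiguous block of rows.
    return [
        summed[r * rows_per_rank:(r + 1) * rows_per_rank]
        for r in range(world_size)
    ]


# Two ranks, each holding a 4x2 partial output.
rank0 = [[1, 1], [2, 2], [3, 3], [4, 4]]
rank1 = [[10, 10], [20, 20], [30, 30], [40, 40]]
shards = reduce_scatter([rank0, rank1])
print(shards[0])  # rows owned by rank 0 after the collective
print(shards[1])  # rows owned by rank 1 after the collective
```

A real implementation would perform the reduction and the communication on the GPUs themselves; this sketch only pins down the expected result.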
API Reference
- create_moe_rs_context(...)
Creates the context for a MoE ReduceScatter operation.
Example Usage
# Test MoE ReduceScatter
bash scripts/launch.sh python/triton_dist/test/nvidia/test_moe_reduce_rs.py 8192 2048 1536 32 2