Pipeline Parallel Block
High-level abstraction for Pipeline Parallelism.
Description
This module provides building blocks for implementing pipeline parallelism in large models.
Example Usage
# Test PP Block
bash scripts/launch.sh python/triton_dist/test/nvidia/test_pp_block.py \
--bsz 8 --seq_len 128 --num_blocks 4 --pp_size 4 --model <model_path>