Megatron

Tensor Parallelism: Splitting One Layer Across Many GPUs

June 19, 2026