Sharded Backwards

On distributed backwards passes. DDP, FSDP, TP, SP, EP, Partial placements, and more.

February 20, 2026 · 22 min

Sharded Einstein Notation

Einstein notation extended with sharding subscripts for reasoning about distributed ML. Discusses: DDP, FSDP, Torch's DTensor, Tensor Parallel, Sequence Parallel, and Ring Attention.

February 15, 2026 · 21 min