nemo-mbridge-perf-moe-optimization-workflow

Workflows

Systematic workflow for MoE training optimization in Megatron Bridge, based on the Megatron-Core MoE paper. Covers the Three Walls framework, parallel folding, recompute strategy, dispatcher choice, and CUDA-graph bring-up.

Install

openclaw skills install nemo-mbridge-perf-moe-optimization-workflow
Loading README...