Support for GRPO with Megatron backend (#592)
Add GRPO support for the Megatron backend and fix a configuration bug that occurs when virtual pipeline is not in use. Calibrated against the FSDP backend.
examples/grpo_trainer/run_qwen2-7b_math.sh
0 → 100644
scripts/format.sh
100644 → 100755
File mode changed from 100644 to 100755