## Summary

This PR renames all `micro_batch_size` parameters to `micro_batch_size_per_gpu`.

**The core logic of setting batch sizes:**

- **All algorithmic parameters** (train batch size, PPO mini batch size) are global (from the perspective of the single controller) and are normalized in each Worker, as the sketch after this summary illustrates.
- **All performance-related parameters** (micro batch size, max token length in dynamic batch size) are local and represent the data size per GPU (i.e., per Worker).

## Main Changes

1. Update the scripts and configs, and delete the normalization for the micro batch size.
2. Fix CI for SFT.
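To make the convention concrete, here is a minimal sketch of the two kinds of parameters. The class and function names are hypothetical, not verl's actual Worker code; it only illustrates why global sizes are divided by the data-parallel world size while `*_per_gpu` sizes are used as-is.

```python
# Minimal sketch of the batch-size convention (hypothetical names,
# not the actual Worker implementation).

from dataclasses import dataclass


@dataclass
class BatchConfig:
    # Algorithmic parameters: global, as seen by the single controller.
    train_batch_size: int
    ppo_mini_batch_size: int
    # Performance parameter: already per GPU, used by each Worker as-is.
    ppo_micro_batch_size_per_gpu: int


def normalize_for_worker(cfg: BatchConfig, world_size: int) -> None:
    """Each Worker divides the global algorithmic sizes by its
    data-parallel world size; per-GPU sizes are left untouched."""
    assert cfg.ppo_mini_batch_size % world_size == 0, (
        "global mini batch size must be divisible by world size"
    )
    cfg.ppo_mini_batch_size //= world_size
    # cfg.ppo_micro_batch_size_per_gpu is NOT normalized: it already
    # describes the per-GPU data size, which is why this PR removes
    # the old micro batch size normalization.


cfg = BatchConfig(train_batch_size=1024,
                  ppo_mini_batch_size=256,
                  ppo_micro_batch_size_per_gpu=4)
normalize_for_worker(cfg, world_size=8)
print(cfg.ppo_mini_batch_size)           # 32 samples per GPU per mini batch
print(cfg.ppo_micro_batch_size_per_gpu)  # still 4, used directly
```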