- 09 Apr, 2025 1 commit
-
-
Yaoyu Zhu committed
-
- 08 Apr, 2025 1 commit
-
-
Yaoyu Zhu committed
-
- 22 Mar, 2025 3 commits
- 21 Mar, 2025 10 commits
-
-
ZhangXiaoyun committed
-
Yaoyu Zhu committed
-
ZhangXiaoyun committed
-
ZhangXiaoyun committed
-
ZhangXiaoyun committed
-
Junrong Lin committed
-
Prevents training hangs by validating `num_key_value_heads % ulysses_sequence_parallel_size == 0` before training.
Yu Feng committed -
## What does this PR do? Add document for using vLLM 0.8 in verl ## Who can review? @eric-haibin-lin
hoshi-hiyouga committed -
HL committed
- 20 Mar, 2025 5 commits
-
-
Adding Openmanus-RL: a llm agent rl tunning repo with verl
Kunlun Zhu committed -
Add `verl` as the `framework` parameter to the SwanLab config table, so more developers can see that this training comes from `verl`.
Ze-Yi LIN committed -
HL committed
-
https://github.com/volcengine/verl/issues/680 Changes: - Move math-verify to the optional dependencies. Now it can be installed via `cd verl && pip install -e .[math]` - Revert using naive verifier for math dataset. Users can switch to math-verify or custom a new `compute_score` function.
Yuyang Ding committed -
Chi Zhang committed
-
- 19 Mar, 2025 3 commits
-
-
ZhangXiaoyun committed
-
ZhangXiaoyun committed
-
We propose a more accurate description of DeepRetrieval. Thanks for your awesome work!
Patrick Jiang committed
-
- 18 Mar, 2025 3 commits
-
-
Yuqian Fu committed
-
Use ray actor instead of task to run main_task - Ray task is retried in system error(oom/segmentfault), which may cause unexpectedly behavior - Actor is more trackable in ray dashboard, e.g logging/stacktrace/profile close #539
Joel committed -
Commit c3420692 Rebase caused error. Try to revert and add an assertion check.
Blue Space committed
-
- 17 Mar, 2025 4 commits
-
-
Chi Zhang committed
-
This PR adds **DeepEnlighten** to the "Awesome Work Using Verl" section. Co-authored-by: yu_wang <yuwang@astri.com> Co-authored-by: Chi Zhang <zhangchi.usc1992@bytedance.com>
yuwang91 committed -
- As titled
Guangming Sheng committed -
#22 . WIP, will add more details tomorrow :) --------- Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>
Junrong Lin committed
-
- 16 Mar, 2025 3 commits
-
-
Chi Zhang committed
-
add MetaSpatial in Awesome Work using EasyR1
PzySeere committed -
Fengqing Jiang committed
-
- 15 Mar, 2025 2 commits
-
-
Guangming Sheng committed
-
## What does this PR do? Use metric_utils to maintain the logic of computing metrics, avoiding too many lines in ppo trainer ## Who can review? @vermouth1992 @PeterSH6
hoshi-hiyouga committed
-
- 14 Mar, 2025 5 commits
-
-
Support for GRPO with Megatron backend and fix a configuration bug when not using virtual pipeline. Calibrated with FSDP backend.
Blue Space committed -
Yuqian Fu committed
-
This PR adds the `lr_warmup_steps` configuration. Note the `num_warmup_steps` is prior to `lr_warmup_steps_ratio`.
Shawn/Yuxuan Tong committed -
BearBiscuit committed
-
## Summary Providing an option in the config to turn off the `torch.compile` used in `dp_actor.py` ## Usage Adding the following line to the driver or cli scripts to turn off `torch.compile`. ```python +actor_rollout_ref.actor.use_torch_compile=False ``` Otherwise, `torch.compile` will be used by default ## Related Issue #354 #245 --------- Signed-off-by: Hongpeng Guo <hpguo@anyscale.com>
Hongpeng Guo committed
-