Commits · main · ZhangXiaoyun / verl

09 Apr, 2025 1 commit
- ignore slurm files · cc30d918
  Yaoyu Zhu committed Apr 09, 2025
  
  cc30d918 Browse Files
08 Apr, 2025 1 commit
- update gitignore · ccaf5c17
  Yaoyu Zhu committed Apr 08, 2025
  
  ccaf5c17 Browse Files
22 Mar, 2025 3 commits
- fix codev reward function and add correct ratio stats for training dataset · e4574048
  Yaoyu Zhu committed Mar 22, 2025
  
  e4574048 Browse Files
- Merge branch 'main' of http://62.234.201.16/ZhangXiaoyun/verl · ff7d9e53
  Yaoyu Zhu committed Mar 22, 2025
  
  ff7d9e53 Browse Files
- fix vllm version in installation and ignore train.slurm · bed22742
  Yaoyu Zhu committed Mar 22, 2025
  
  bed22742 Browse Files
21 Mar, 2025 10 commits
- Merge branch 'main' of http://62.234.201.16/ZhangXiaoyun/verl · 427ae50b
  ZhangXiaoyun committed Mar 22, 2025
  
  427ae50b Browse Files
- vllm version · b8a6bd25
  ZhangXiaoyun committed Mar 22, 2025
  
  b8a6bd25 Browse Files
- update codev reward function · 7e9bfb40
  Yaoyu Zhu committed Mar 21, 2025
  
  7e9bfb40 Browse Files
- gitig · c0598a8f
  ZhangXiaoyun committed Mar 21, 2025
  
  c0598a8f Browse Files
- modify · 98fc5c65
  ZhangXiaoyun committed Mar 21, 2025
  
  98fc5c65 Browse Files
- modify · 8dd09ef8
  ZhangXiaoyun committed Mar 21, 2025
  
  8dd09ef8 Browse Files
- [Bug Fix] Fix SGLang rollout error under multi node (#652) · 612823ae
  Junrong Lin committed Mar 21, 2025
  
  612823ae Browse Files
- [misc] Add Ulysses parallel config precheck (#674) · e67dea67
```
Prevents training hangs by validating `num_key_value_heads %
ulysses_sequence_parallel_size == 0` before training.
```
  Yu Feng committed Mar 21, 2025
  e67dea67 Browse Files
- docs: add vllm 0.8 page (#694) · 0342042e
```
## What does this PR do?

Add document for using vLLM 0.8 in verl

## Who can review?

@eric-haibin-lin
```
  hoshi-hiyouga committed Mar 20, 2025
  0342042e Browse Files
- docs: fix broken news rendering (#691) · b2ad8fd0
  HL committed Mar 21, 2025
  
  b2ad8fd0 Browse Files
20 Mar, 2025 5 commits
- docs: Adding Openmanus-RL to the Awesome work (#688) · 7df1ffc0
```
Adding Openmanus-RL: a llm agent rl tunning repo with verl
```
  Kunlun Zhu committed Mar 20, 2025
  7df1ffc0 Browse Files
- [tracking] swanlab add `verl` config (#663) · 94788851
```
Add `verl` as the `framework` parameter to the SwanLab config table, so
more developers can see that this training comes from `verl`.
```
  Ze-Yi LIN committed Mar 20, 2025
  94788851 Browse Files
- docs: add meetup slides (#681) · 847bf252
  HL committed Mar 20, 2025
  
  847bf252 Browse Files
- Make Math-Verify Optional (#683) · 529a4fe0
```
https://github.com/volcengine/verl/issues/680

Changes:
- Move math-verify to the optional dependencies. Now it can be installed
via `cd verl && pip install -e .[math]`
- Revert using naive verifier for math dataset. Users can switch to
math-verify or custom a new `compute_score` function.
```
  Yuyang Ding committed Mar 20, 2025
  529a4fe0 Browse Files
- [ci] fix ci (#675) · 5367156a
  Chi Zhang committed Mar 20, 2025
  
  5367156a Browse Files
19 Mar, 2025 3 commits
- modify · c59f5a27
  ZhangXiaoyun committed Mar 19, 2025
  
  c59f5a27 Browse Files
- Initial commit · cafa8371
  ZhangXiaoyun committed Mar 19, 2025
  
  cafa8371 Browse Files
- Update the description of DeepRetrieval (#664) · 468adf22
```
We propose a more accurate description of DeepRetrieval.
Thanks for your awesome work!
```
  Patrick Jiang committed Mar 19, 2025
  468adf22 Browse Files
18 Mar, 2025 3 commits
- [misc] fix the wrong url (#657) · c3e530de
  Yuqian Fu committed Mar 18, 2025
  
  c3e530de Browse Files
- misc: change main_task to TaskRunner actor (#648) · c6dc8b73
```
Use ray actor instead of task to run main_task
- Ray task is retried in system error(oom/segmentfault), which may cause
unexpectedly behavior
- Actor is more trackable in ray dashboard, e.g
logging/stacktrace/profile

close #539
```
  Joel committed Mar 18, 2025
  c6dc8b73 Browse Files
- [Bug Fix] Revert the RLHFDataset truncation config (#645) · ff137945
```
Commit c3420692 Rebase caused error. Try to revert and add an assertion
check.
```
  Blue Space committed Mar 18, 2025
  ff137945 Browse Files
17 Mar, 2025 4 commits
- [ci] feat: move dataset.yml to another GPU (#639) · e49fb572
  Chi Zhang committed Mar 18, 2025
  
  e49fb572 Browse Files
- Added DeepEnlighten to Awesome Work Using Verl section (#641) · 87a81365
```
This PR adds **DeepEnlighten** to the "Awesome Work Using Verl" section.

Co-authored-by: yu_wang <yuwang@astri.com>
Co-authored-by: Chi Zhang <zhangchi.usc1992@bytedance.com>
```
  yuwang91 committed Mar 17, 2025
  87a81365 Browse Files
- [doc] update DAPO (#640) · ffd50a49
```
- As titled
```
  Guangming Sheng committed Mar 17, 2025
  ffd50a49 Browse Files
- [rollout] feat: add SGLang as rollout engine to verl (#490) · 333e6d62
```
#22 . WIP, will add more details tomorrow :)

---------

Co-authored-by: zhaochenyang20 <zhaochen20@outlook.com>
```
  Junrong Lin committed Mar 17, 2025
  333e6d62 Browse Files
16 Mar, 2025 3 commits
- fix readme (#624) · 3b18b0eb
  Chi Zhang committed Mar 16, 2025
  
  3b18b0eb Browse Files
- readme: add MetaSpatial project (#617) · 3ec83117
```
add MetaSpatial in Awesome Work using EasyR1
```
  PzySeere committed Mar 15, 2025
  3ec83117 Browse Files
- [fix] fix python env issue in install (#619) · d754a0cb
  Fengqing Jiang committed Mar 15, 2025
  
  d754a0cb Browse Files
15 Mar, 2025 2 commits
- [misc] fix: validation batch repeat before feed into rollout (#614) · cb943be5
  Guangming Sheng committed Mar 15, 2025
  
  cb943be5 Browse Files
- misc: separate metric utils from ppo trainer (#599) · 6133ae92
```
## What does this PR do?

Use metric_utils to maintain the logic of computing metrics, avoiding
too many lines in ppo trainer

## Who can review?

@vermouth1992 @PeterSH6
```
  hoshi-hiyouga committed Mar 15, 2025
  6133ae92 Browse Files
14 Mar, 2025 5 commits

Support for GRPO with Megatron backend (#592) · c3420692

Support for GRPO with Megatron backend and fix a configuration bug when
not using virtual pipeline.

Calibrated with FSDP backend.

committed Mar 14, 2025

c3420692 Browse Files

fix: Add error mechanism for mini-batch/batch size divisibility validation (#559) · 4d27461d
Yuqian Fu committed Mar 14, 2025

4d27461d Browse Files

[config] feat: lr_warmup_steps (#564) · 22657bad

This PR adds the `lr_warmup_steps` configuration.

Note the `num_warmup_steps` is prior to `lr_warmup_steps_ratio`.

committed Mar 14, 2025

22657bad Browse Files

[update] delete useless config params (#591) · 99ea19a3
BearBiscuit committed Mar 14, 2025

99ea19a3 Browse Files

[Config] Providing an option to turn off `torch.compile` in actor (#554) · 54574690

## Summary

Providing an option in the config to turn off the `torch.compile` used
in `dp_actor.py`

## Usage

Adding the following line to the driver or cli scripts to turn off
`torch.compile`.
```python
+actor_rollout_ref.actor.use_torch_compile=False
```
Otherwise, `torch.compile` will be used by default

## Related Issue

#354 #245

---------

Signed-off-by: Hongpeng Guo <hpguo@anyscale.com>

committed Mar 14, 2025

54574690 Browse Files