Add stronger reward verification sandbox (#233) · 5a66ed26
Add stronger verification support, as used in https://github.com/PRIME-RL/PRIME:

- [x] Batched verification
- [x] Python interpreter
- [x] Stronger math verifier
- [x] Continuous score for code test

Re-opening https://github.com/volcengine/verl/pull/207 to trigger automatic workflows.
Zefan Wang committed
Name | Last commit | Last update |
---|---|---|
.. | | |
test_sandbox.py | | |
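
The "continuous score for code test" item in the commit message can be read as replacing an all-or-nothing pass/fail reward with a value between 0 and 1. The sketch below assumes that the score is the fraction of test cases passed; the commit message does not spell this out, and this is not the code from PRIME or verl. The names `continuous_score`, `double_it`, and `cases` are hypothetical and exist only for illustration.

```python
from typing import Callable, Sequence, Tuple


def continuous_score(
    solution: Callable[[str], str],
    cases: Sequence[Tuple[str, str]],
) -> float:
    """Return the fraction of (input, expected_output) cases that pass.

    Assumption: "continuous" means partial credit per passing test case.
    A crashing case counts as a failure instead of aborting the whole run.
    """
    if not cases:
        return 0.0
    passed = 0
    for stdin_text, expected in cases:
        try:
            # Compare outputs after trimming surrounding whitespace.
            if solution(stdin_text).strip() == expected.strip():
                passed += 1
        except Exception:
            # Any runtime error in the candidate solution is a failed case.
            pass
    return passed / len(cases)


# Hypothetical candidate solution: doubles an integer read from its input.
def double_it(stdin_text: str) -> str:
    return str(int(stdin_text) * 2)


cases = [("1", "2"), ("2", "4"), ("3", "7"), ("x", "0")]
score = continuous_score(double_it, cases)
print(score)
```

Here two of the four cases pass (the third expects a wrong value and the fourth raises on non-numeric input), so a binary reward would be 0 while the fractional score still gives partial credit. A real sandbox would additionally run the candidate in an isolated process with a timeout, which this sketch leaves out.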