docs: add deepscaler

607b4935 · HL · GitHub · 95560d7d · 607b4935
Commit 607b4935 (unverified), authored Feb 11, 2025 by HL, committed by GitHub Feb 11, 2025

Showing 1 changed file with 1 addition and 0 deletions

README.md
+1 -0

--- a/README.md
+++ b/README.md
@@ -123,5 +123,6 @@ verl is inspired by the design of Nemo-Aligner, Deepspeed-chat and OpenRLHF. The
 - [TinyZero](https://github.com/Jiayi-Pan/TinyZero): a reproduction of DeepSeek R1 Zero recipe for reasoning tasks
 - [RAGEN](https://github.com/ZihanWang314/ragen): a general-purpose reasoning agent training framework
 - [Logic R1](https://github.com/Unakar/Logic-RL): a reproduced DeepSeek R1 Zero on 2K Tiny Logic Puzzle Dataset.
+- [deepscaler](https://github.com/agentica-project/deepscaler): iterative context scaling with GRPO

 We are HIRING! Send us an [email](mailto:haibin.lin@bytedance.com) if you are interested in internship/FTE opportunities in MLSys/LLM reasoning/multimodal alignment.