Add cognitive behavior paper (#489)

d414c479 · Chi Zhang · GitHub · 686438ca · d414c479
Unverified Commit d414c479 authored Mar 05, 2025 by Chi Zhang Committed by GitHub Mar 05, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 1 additions and 0 deletions

README.md
+1 -0

No files found.
--- a/README.md
+++ b/README.md
@@ -134,6 +134,7 @@ verl is inspired by the design of Nemo-Aligner, Deepspeed-chat and OpenRLHF. The
 - [FIRE](https://arxiv.org/abs/2410.21236): Flaming-hot initiation with regular execution sampling for large language models
 - [ReSearch](https://github.com/Agent-RL/ReSearch): Learning to **Re**ason with **Search** for LLMs via Reinforcement Learning
 - [DeepRetrieval](https://github.com/pat-jj/DeepRetrieval): Let LLMs learn to **search** and **retrieve** desirable docs with RL
+- [cognitive-behaviors](https://github.com/kanishkg/cognitive-behaviors): Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

 ## Contribution Guide
 Contributions from the community are welcome! Please checkout our [roadmap](https://github.com/volcengine/verl/issues/22) and [release plan](https://github.com/volcengine/verl/issues/354).