- 28 Nov, 2024 1 commit
Guangming Sheng committed
-
- 27 Nov, 2024 1 commit
* [misc] add small scale gemma2-2b example for debug
* [misc] fix: add eval() and train() for gradient checkpoint
Guangming Sheng committed
-
- 25 Nov, 2024 1 commit
* Update base.py
* Update base.py
* update

Signed-off-by: Kai-Hsun Chen <kaihsun@anyscale.com>

---------

Signed-off-by: Kai-Hsun Chen <kaihsun@anyscale.com>
Kai-Hsun Chen committed
-
- 22 Nov, 2024 1 commit
* [ci] update some tests for hybrid programming model
* [ci] update detached worker tests
Guangming Sheng committed
-
- 21 Nov, 2024 1 commit
Kai-Hsun Chen committed
-
- 11 Nov, 2024 1 commit
* [doc]fix: typo in fsdp dtensor weight loader extension
* [misc] fix: vllm gpu executor issue when world_size is 1
Peter Sheng committed
-
- 01 Nov, 2024 2 commits
* [misc] update tutorial for opensource version
* fix deleted item
* clear output
Peter Sheng committed
-
* [misc] fix: fix pypi package missing of megatron model and fix torchrun issue
* lint
* fix sft script
* update version
Peter Sheng committed
-
- 31 Oct, 2024 3 commits
* [doc] fix: delete deprecated element in config doc
* update readme to fix url
Peter Sheng committed
-
* [release] update verl doc url and update version and setup
* fix init with only fsdp and update setup for pypi
* update version
Peter Sheng committed
-
shengguangming committed
-