[init] feat: upload first open source version of verl
parents
Showing
.gitignore
0 → 100644
.readthedocs.yaml
0 → 100644
.style.yapf
0 → 100644
LICENSE
0 → 100644
This diff is collapsed.
Click to expand it.
Notice.txt
0 → 100644
README.md
0 → 100644
docs/Makefile
0 → 100644
docs/README.md
0 → 100644
docs/_static/logo.png
0 → 100644
82.7 KB
docs/advance/dpo_extension.rst
0 → 100644
docs/advance/fsdp_extension.rst
0 → 100644
docs/advance/megatron_extension.rst
0 → 100644
docs/advance/placement.rst
0 → 100644
docs/conf.py
0 → 100644
docs/examples/config.rst
0 → 100644
This diff is collapsed.
Click to expand it.
docs/examples/gsm8k_example.rst
0 → 100644
docs/examples/ppo_code_architecture.rst
0 → 100644
docs/index.rst
0 → 100644
docs/preparation/install.rst
0 → 100644
docs/preparation/prepare_data.rst
0 → 100644
docs/preparation/reward_function.rst
0 → 100644
docs/requirements-docs.txt
0 → 100644
docs/workers/fsdp_workers.rst
0 → 100644
docs/workers/megatron_workers.rst
0 → 100644
docs/workers/ray_trainer.rst
0 → 100644
This diff is collapsed.
Click to expand it.
examples/data_preprocess/full_hh_rlhf.py
0 → 100644
examples/data_preprocess/gsm8k.py
0 → 100644
examples/data_preprocess/hellaswag.py
0 → 100644
examples/data_preprocess/math.py
0 → 100644
examples/ppo_trainer/run_deepseek7b_llm.sh
0 → 100644
examples/ppo_trainer/run_qwen2-7b.sh
0 → 100644
examples/ppo_trainer/run_qwen2-7b_rm.sh
0 → 100644
examples/ppo_trainer/run_qwen2.5-32b.sh
0 → 100644
examples/ray/tutorial.ipynb
0 → 100644
This diff is collapsed.
Click to expand it.
examples/sft/gsm8k/run_deepseek_6b7.sh
0 → 100644
examples/sft/gsm8k/run_gemma_2b.sh
0 → 100644
examples/sft/gsm8k/run_gemma_7b.sh
0 → 100644
patches/megatron_v4.patch
0 → 100644
This diff is collapsed.
Click to expand it.
requirements.txt
0 → 100644
| transformers | ||
| hydra-core | ||
| tensordict < 0.3.1 | ||
| numpy | ||
| pytest | ||
| deepspeed | ||
| pybind11 | ||
| codetiming | ||
| yapf | ||
| wandb | ||
| git+https://github.com/NVIDIA/TransformerEngine.git@stable | ||
| # vllm==0.5.4 # vllm is installed in image building to avoid ray conflicts | ||
| \ No newline at end of file |
setup.py
0 → 100644
single_controller/__init__.py
0 → 100644
single_controller/base/__init__.py
0 → 100644
single_controller/base/decorator.py
0 → 100644
This diff is collapsed.
Click to expand it.
single_controller/base/dp.py
0 → 100644
single_controller/base/megatron/__init__.py
0 → 100644
single_controller/base/megatron/worker.py
0 → 100644
single_controller/base/worker.py
0 → 100644
single_controller/base/worker_group.py
0 → 100644
single_controller/ray/__init__.py
0 → 100644
single_controller/ray/base.py
0 → 100644
This diff is collapsed.
Click to expand it.
single_controller/ray/decorator.py
0 → 100644
single_controller/ray/dp.py
0 → 100644
single_controller/ray/megatron.py
0 → 100644
single_controller/version/version
0 → 100644
verl/__init__.py
0 → 100644
verl/models/README.md
0 → 100644
verl/models/__init__.py
0 → 100644
verl/models/llama/megatron/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/models/registry.py
0 → 100644
verl/models/weight_loader_registry.py
0 → 100644
verl/protocol.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/third_party/__init__.py
0 → 100644
verl/third_party/vllm/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/third_party/vllm/vllm_v_0_3_1/config.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/third_party/vllm/vllm_v_0_3_1/llm.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/third_party/vllm/vllm_v_0_3_1/worker.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/third_party/vllm/vllm_v_0_4_2/config.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/third_party/vllm/vllm_v_0_4_2/llm.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/third_party/vllm/vllm_v_0_4_2/worker.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/third_party/vllm/vllm_v_0_5_4/config.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/third_party/vllm/vllm_v_0_5_4/llm.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/third_party/vllm/vllm_v_0_5_4/worker.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/config/evaluation.yaml
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/config/generation.yaml
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/trainer/config/ppo_trainer.yaml
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/config/sft_trainer.yaml
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/fsdp_sft_trainer.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/main_eval.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/main_generation.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/main_ppo.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/actor/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/actor/base.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/actor/dp_actor.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/actor/megatron_actor.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/core_algos.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/critic/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/critic/base.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/critic/dp_critic.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/critic/megatron_critic.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/hybrid_engine/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/hybrid_engine/base.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/hybrid_engine/fsdp_vllm.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/ray_trainer.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/reward_model/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/reward_model/base.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/rollout/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/rollout/base.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/rollout/hf_rollout.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/rollout/naive/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/rollout/tokenizer.py
0 → 100644
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/workers/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/workers/fsdp_workers.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/ppo/workers/megatron_workers.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/trainer/runtime_env.yaml
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/config.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/dataset/README.md
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/dataset/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/dataset/rl_dataset.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/dataset/rm_dataset.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/dataset/sft_dataset.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/debug/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/debug/performance.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/debug/trajectory_tracker.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/distributed.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/fs.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/fsdp_utils.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/hdfs_io.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/import_utils.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/logger/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/logger/aggregate_logger.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/logging_utils.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/megatron/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/megatron/memory.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/megatron/optimizer.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/megatron/optimizer_config.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/megatron/pipeline_parallel.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/megatron/sequence_parallel.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/megatron/tensor_parallel.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/megatron_utils.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/memory_buffer.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/model.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/py_functional.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/ray_utils.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/rendezvous/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/rendezvous/ray_backend.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/reward_score/__init__.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/reward_score/gsm8k.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/reward_score/math.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/torch_dtypes.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/torch_functional.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/utils/tracking.py
0 → 100644
This diff is collapsed.
Click to expand it.
verl/version/version
0 → 100644
This diff is collapsed.
Click to expand it.