[Feature] Assert Single Batch for `val_dataloader` (#424)

This enhances the single-batch strategy for `val_dataloader`, making https://github.com/volcengine/verl/pull/353 more robust: the trainer now asserts that the validation dataloader has exactly one batch, instead of at least one.

Commit 6e4a445f, authored by Shawn/Yuxuan Tong on Feb 28, 2025; committed by GitHub on Feb 28, 2025.

Showing with 3 additions and 2 deletions

verl/trainer/ppo/ray_trainer.py
+3 -2

--- a/verl/trainer/ppo/ray_trainer.py
+++ b/verl/trainer/ppo/ray_trainer.py
@@ -528,10 +528,11 @@ class RayPPOTrainer(object):
            collate_fn=collate_fn)
        assert len(self.train_dataloader) >= 1
-        assert len(self.val_dataloader) >= 1
+        assert len(
+            self.val_dataloader
+        ) == 1, "Validation dataloader must have a single batch, which inference engines will schedule the memory themselves."
        print(f'Size of train dataloader: {len(self.train_dataloader)}')
-        print(f'Size of val dataloader: {len(self.val_dataloader)}')
        # inject total_training_steps to actor/critic optim_config. This is hacky.
        total_training_steps = len(self.train_dataloader) * self.config.trainer.total_epochs
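
The effect of the stricter assertion can be sketched in isolation. This is a minimal illustration, not code from the repository: `num_batches` and `check_single_val_batch` are hypothetical helpers, and `num_batches` assumes the usual map-style `DataLoader` length rule (ceiling of dataset size over batch size, or floor when `drop_last` is set).

```python
import math


def num_batches(dataset_size: int, batch_size: int, drop_last: bool = False) -> int:
    """Hypothetical helper: batch count of a map-style dataloader.

    Assumes the standard rule: floor division when drop_last is set,
    ceiling division otherwise.
    """
    if drop_last:
        return dataset_size // batch_size
    return math.ceil(dataset_size / batch_size)


def check_single_val_batch(dataset_size: int, batch_size: int) -> int:
    """Hypothetical helper mirroring the new `== 1` check on the val dataloader."""
    n = num_batches(dataset_size, batch_size)
    # The old check (n >= 1) would accept any non-empty loader;
    # the new check rejects a validation set split across several batches.
    assert n == 1, f"Validation dataloader must have a single batch, got {n}."
    return n


if __name__ == "__main__":
    # Batch size equal to the validation set size: exactly one batch, passes.
    check_single_val_batch(dataset_size=100, batch_size=100)

    # Smaller batch size: several batches, so the assertion fires.
    try:
        check_single_val_batch(dataset_size=100, batch_size=32)
    except AssertionError as err:
        print(f"rejected: {err}")
```

Under these assumptions, a validation batch size at least as large as the validation set satisfies the check, while any smaller value fails at trainer construction rather than later during validation.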