Pull requests: volcengine/verl
Revert "[trainer] fix: address serialization issues when using async reward function and ray ppo trainer"
#3819 opened Oct 18, 2025 by vermouth1992
[trainer] fix: Add data.seed to config
#3815 opened Oct 18, 2025 by HollowMan6
7 tasks done
[data] feat: filter out malformed data together with long prompts
#3814 opened Oct 18, 2025 by HollowMan6
7 tasks done
[data, trainer] feat: add support for limiting samples from dataset
#3812 opened Oct 18, 2025 by HollowMan6
7 tasks done
Fix: Remove torch.quantile-based percentile metrics to resolve tensor size limit error
#3810 opened Oct 18, 2025 by szrlee
fix: gradient vanish in FusedLinearForPPO
#3765 opened Oct 14, 2025 by sanxing-chen
1 of 7 tasks
build(deps): bump sglang[all] from 0.5.2 to 0.5.3.post1
Labels: dependencies, python
#3749 opened Oct 13, 2025 by dependabot (bot)
[recipe, hardware] feat: Add GRPO with full weight updates for 1.5B models on a single GPU
#3747 opened Oct 13, 2025 by KabakaWilliam
6 tasks done
[model]{bug-fix}: Fix wrong log prob calculations using fused kernels and USP
#3724 opened Oct 10, 2025 by joshyan1
4 of 7 tasks
[trainer] fix: data balance dp tokens logic with sequence parallelism
#3691 opened Oct 7, 2025 by puneeshkhanna
3 of 7 tasks
[data] fix: Remove duplicated bos tokens in RLHFDataset
#3682 opened Oct 5, 2025 by kAIto47802
7 tasks done
[trainer, worker] feat: more flexible and easy-to-use reward model
#3679 opened Oct 4, 2025 by yyDing1
fix Ray runtime env no working dir issue
#3673 opened Oct 4, 2025 by kunling-anyscale
7 tasks
[data] feat: TransferQueue - An asynchronous streaming data management system
#3649 opened Sep 30, 2025 by 0oshowero0
7 tasks done
[recipe] feat: Qwen3-235B-A22B on Ascend NPU
#3628 opened Sep 25, 2025 by johnjunjun7
7 tasks
[recipe] An agent-lightning like RL training pipeline
#3610 opened Sep 25, 2025 by linxxx3
7 tasks done