Pull requests: volcengine/verl

Pull requests list

[trainer] fix: Add data.seed to config
#3815 opened Oct 18, 2025 by HollowMan6
7 tasks done
[data] feat: filter out malformed data together with long prompts
#3814 opened Oct 18, 2025 by HollowMan6
7 tasks done
[data, trainer] feat: add support for limiting samples from dataset
#3812 opened Oct 18, 2025 by HollowMan6
7 tasks done
fix: gradient vanish in FusedLinearForPPO
#3765 opened Oct 14, 2025 by sanxing-chen
1 of 7 tasks
build(deps): bump sglang[all] from 0.5.2 to 0.5.3.post1 (labels: dependencies, python)
#3749 opened Oct 13, 2025 by dependabot bot
[trainer] feat: vlm support for sft engine
#3729 opened Oct 11, 2025 by techkang
Feat/log fit sub reward
#3710 opened Oct 9, 2025 by Arinlzy
7 tasks
Eval protocol integration
#3690 opened Oct 6, 2025 by benjibc Draft
[data] fix: Remove duplicated bos tokens in RLHFDataset
#3682 opened Oct 5, 2025 by kAIto47802
7 tasks done
fix Ray runtime env no working dir issue
#3673 opened Oct 4, 2025 by kunling-anyscale
7 tasks
[BREAKING][misc] feat: Abstract optimizer
#3656 opened Sep 30, 2025 by EduardDurech
[algo] feat: Add SVD-LoRA GRPO
#3637 opened Sep 27, 2025 by alekseymalakhov11
5 of 7 tasks
[recipe] feat: Qwen3-235B-A22B on Ascend NPU
#3628 opened Sep 25, 2025 by johnjunjun7
7 tasks
[recipe] An agent-lightning like RL training pipeline
#3610 opened Sep 25, 2025 by linxxx3
7 tasks done