Replies: 1 comment
- verl currently can't load and continue training from a pre-trained LoRA adapter; with PR #3523, it can.
- I'm implementing a multi-stage reinforcement learning (RL) pipeline for reasoning tasks using GRPO, and I'd like to continue training from an existing LoRA adapter across stages.
Setup: Qwen3
In standard Hugging Face + PEFT workflows, I can load a pre-trained LoRA adapter like this:
However, in my current GRPO trainer config, LoRA is initialized from scratch via these parameters:
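The parameter list was elided; the from-scratch initialization is typically driven by keys like these under the actor's model config (key names follow verl's conventions but should be treated as an assumption, not an exact quote of my config):

```yaml
actor_rollout_ref:
  model:
    lora_rank: 32          # rank of the newly initialized LoRA matrices
    lora_alpha: 16         # LoRA scaling factor
    target_modules: all-linear
```

Note that none of these keys point at an existing adapter checkpoint; they only control how new LoRA weights are created.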
There doesn’t appear to be a config option (e.g., lora_path or pretrained_adapter_name_or_path) to load an existing adapter instead of initializing new LoRA weights.
Question:
How can I configure the GRPO trainer to load and continue training from a pre-trained LoRA adapter? Is this supported, and if so, what’s the correct way to specify the adapter path in the config or code?