Hao Li, Xiaogeng Liu, Hung-Chun Chiu, Dianqi Li, Ning Zhang, Chaowei Xiao.
The official implementation of NeurIPS 2025 paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents".
We provide the evaluation of DRIFT on GPT-4o-mini and GPT-4o, full code (including other models) will be released later, you can reproduce the results following:
pip install "agentdojo==0.1.26"
pip install -r requirements.txtexport OPENAI_API_KEY=your_keypython pipeline_main.py \
--model gpt-4o-mini-2024-07-18 \
--build_constraints --injection_isolation --dynamic_validationpython pipeline_main.py \
--model gpt-4o-mini-2024-07-18 --do_attack \
--attack_type important_instructions \
--build_constraints --injection_isolation --dynamic_validationIf you want to evaluate under adaptive attack, add configure of --adaptive_attack.
If you find this work useful in your research or applications, we appreciate that if you can kindly cite:
@articles{DRIFT,
title={DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents},
author={Hao Li and Xiaogeng Liu and Hung-Chun Chiu and Dianqi Li and Ning Zhang and Chaowei Xiao},
journal = {NeurIPS},
year={2025}
}
