Skip to content

jonas-becker/problem-drift

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Stay Focused: Problem Drift in Multi-Agent Debate

This repository contains the code for the paper "Stay Focused: Problem Drift in Multi-Agent Debate".

This repository is still under construction and subject to change.

DRIFTEval Dataset

The human dataset DRIFTEval is available at DriftEval.json. It includes both the labels and the explanations of the labels for 170 discussion excerpts.

Usage

Install dependencies

conda env create -f environment.yaml

Run experiments

To run the code, you need the MALLM framework which is available here and have it running.

Experiment 1 concerns the investigation of multi-agent debate. Experiment 2 concerns the DRIFTJudge and DRIFTPolicy.

First, you need to download the datasets:

python data/data_download.py

Then, you can run this code with the following commands:

Run experiments:

python batch_mallm.py exp1/exp1_batch.json
python batch_mallm.py exp2/exp2_batch.json

Run evaluations:

python exp1_evaluation.py
python exp2_evaluation.py

Create figures:

python exp1_create_figures.py
python exp2_create_figures.py

Citation:

    comming soon

About

The official repository for the paper "Stay Focused: Problem Drift in Multi-Agent Debate".

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published