This repository contains the code for the paper "Stay Focused: Problem Drift in Multi-Agent Debate".
This repository is still under construction and subject to change.
The human dataset DRIFTEval is available at DriftEval.json. It includes both the labels and the corresponding explanations for 170 discussion excerpts.
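A minimal sketch for inspecting the dataset in Python, assuming DriftEval.json is in the repository root; the exact fields of each entry (e.g. label and explanation keys) are not specified here, so the snippet only loads the file and prints one entry to reveal them:

import json

# Load DRIFTEval; the entry structure (field names) is an assumption and
# should be checked against the printed sample below.
with open("DriftEval.json", encoding="utf-8") as f:
    data = json.load(f)

print(f"Loaded {len(data)} entries")
# Print one entry to see the actual fields before writing any analysis code.
sample = data[0] if isinstance(data, list) else next(iter(data.items()))
print(sample)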
conda env create -f environment.yaml
To run the code, you need the MALLM framework, which is available here, and it must be running.
Experiment 1 investigates multi-agent debate. Experiment 2 concerns DRIFTJudge and DRIFTPolicy.
First, you need to download the datasets:
python data/data_download.py
Then, you can run the code with the following commands:
Run experiments:
python batch_mallm.py exp1/exp1_batch.json
python batch_mallm.py exp2/exp2_batch.json
Run evaluations:
python exp1_evaluation.py
python exp2_evaluation.py
Create figures:
python exp1_create_figures.py
python exp2_create_figures.py
Coming soon.