BugHunter

An automated pipeline that uses LLM agents to solve different types of software issues in Docker containers. The system supports multiple task types including bug fixing, bug location, and targeted fixes with location hints. Now includes comprehensive trajectory recording for detailed analysis of agent interactions.

Features

🤖 Multiple Task Types: Fix bugs, locate bugs, or fix with location hints
🔧 Modular Architecture: Clean separation of concerns with task-specific implementations
🐳 Docker Integration: Automatic container management and command execution
📝 Structured Logging: Comprehensive logging with configurable levels
🎯 Selective Execution: Run specific test instances or task types
📊 Result Tracking: JSON output with task-specific results
🛤️ Trajectory Recording: Detailed recording of agent interactions, commands, and responses
⚙️ Easy Setup: Automated setup and validation with built-in CLI command

Task Types

1. Fix Bug (`fix_bug`)

Purpose: Analyze the problem and provide a complete patch to fix the issue. Output: Complete patch or code changes Usage: Default task type, good for end-to-end bug fixing

2. Locate Bug (`locate_bug`)

Purpose: Identify the specific file and line number where the bug is located. Output: File path and line number of the bug Usage: When you need to find where the bug is without fixing it

3. Fix with Location (`fix_with_location`)

Purpose: Fix a bug when you already know approximately where it is located. Output: Targeted patch for the specific location Usage: When you have hints about bug location for more efficient fixing

Quick Start

Prerequisites

Python 3.8+, tested with 3.12
Docker
OpenAI API key (or other supported LLM provider)

Installation

Clone the repository and navigate to the project directory
Run the automated setup:
```
python main.py setup
```
This will:
- Check Python and Docker installation
- Validate Docker permissions
- Install Python dependencies
- Create environment file from template
- Validate configuration files
- Test Docker functionality

Configure your API key:

# Edit .env with your API keys
nano .env

Usage

# Run with custom config file
python main.py run --config custom_config.yaml

CLI Commands

Setup Command

# Setup with custom config file
python main.py setup --config custom_config.yaml

Runs comprehensive system setup and validation:

✅ Checks Python version compatibility (3.8+)
✅ Verifies Docker installation and permissions
✅ Installs Python dependencies from requirements.txt
✅ Creates .env file from template if needed
✅ Validates configuration files (uses config.yaml by default, or custom file if specified)
✅ Tests Docker functionality with hello-world image

Run Command

python main.py run [options]

Executes the bug-solving pipeline with various options for task types, models, and output configuration.

Evaluate Command

python main.py evaluate <results_file> [options]

Evaluates the correctness of locate_bug results by comparing LLM outputs with gold truth data.

Web Interface

# Start the web interface
python web/run.py

Provides a user-friendly web interface for managing tasks, viewing results, and interacting with the system.

Trajectory Recording

The system now records comprehensive trajectories of all agent interactions, including:

System prompts and initial task setup
Agent responses with extracted thoughts and actions
Command executions with full output and error information
API call statistics including token usage and costs
State tracking of working directory and open files

Trajectory Format

Each trajectory includes:

{
    "environment": "dongqa930/dunst-project_dunst:1215",
    "trajectory": [
        {
            "action": "ls -F\n",
            "observation": "AUTHORS.rst\nCHANGELOG.rst\n...",
            "response": "Let's list out some of the files...",
            "state": "{\"open_file\": \"n/a\", \"working_dir\": \"/path\"}",
            "thought": "Let's explore the repository structure..."
        }
    ],
    "history": [
        {
            "message_type": "system_prompt",
            "role": "system", 
            "content": "You are an expert software engineer...",
            "agent": "primary"
        }
    ],
    "info": {
        "exit_status": "submitted",
        "submission": "diff --git a/file.py...",
        "model_stats": {
            "total_cost": 0.0,
            "instance_cost": 0.0,
            "tokens_sent": 1500,
            "tokens_received": 800,
            "api_calls": 5
        }
    }
}

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
bughunter		bughunter
data		data
web		web
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

BugHunter

Features

Task Types

1. Fix Bug (`fix_bug`)

2. Locate Bug (`locate_bug`)

3. Fix with Location (`fix_with_location`)

Quick Start

Prerequisites

Installation

Usage

CLI Commands

Setup Command

Run Command

Evaluate Command

Web Interface

Trajectory Recording

Trajectory Format

About

Uh oh!

Languages

License

WnRock/BugHunter

Folders and files

Latest commit

History

Repository files navigation

BugHunter

Features

Task Types

1. Fix Bug (fix_bug)

2. Locate Bug (locate_bug)

3. Fix with Location (fix_with_location)

Quick Start

Prerequisites

Installation

Usage

CLI Commands

Setup Command

Run Command

Evaluate Command

Web Interface

Trajectory Recording

Trajectory Format

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages

1. Fix Bug (`fix_bug`)

2. Locate Bug (`locate_bug`)

3. Fix with Location (`fix_with_location`)