Metatesters 🧪

A LangGraph multi-agent system designed to jailbreak and analyze other LangGraph systems through automated testing and vulnerability assessment.

🎯 Overview

Metatesters is an innovative security testing framework that uses AI agents to automatically analyze, test, and find vulnerabilities in LangGraph-based agentic systems. It employs a multi-stage approach:

Static Graph Analysis - Analyzes the structure and components of target LangGraph systems
Dynamic Testing Team Generation - Creates specialized AI tester personas based on the analysis
Automated Test Case Generation - Generates comprehensive test cases including edge cases and fault injection
Vulnerability Assessment - Executes tests and identifies potential security issues and bugs

🏗️ Architecture

Core Components

Graph Analysis Agent (app/agents/graph_analysis/) - Performs static analysis of target LangGraph systems
Testing Team Agent (app/agents/testing_team/) - Generates specialized AI testers and test cases
Arithmetic Sample Agent (app/agents/arithmetic_sample/) - Example target system for demonstration
Web UI (app/ui/) - Gradio-based interface for interacting with the system
Database Service (app/service/) - SQLite-based storage for test results and tester profiles

🚀 Quick Start

Prerequisites

Python 3.12+
OpenAI API key or Azure OpenAI access
UV package manager (recommended) or pip

Installation

Clone the repository:

git clone https://github.com/marcos-rg/metatesters.git
cd metatesters

Install dependencies:

# Using UV (recommended)
uv sync

# Or using pip
pip install -e .

Set up environment variables:

cp .env.example .env
# Edit .env with your API keys

Run the application:

# Start the web interface
python main.py

# Or run LangGraph development server
make test

🖥️ Usage

Web Interface

Launch the application:
```
python main.py
```
Step 1: Analyze Graph
- The system will analyze a sample arithmetic LangGraph system
- View the generated graph visualization and structure analysis
Step 2: Generate Testing Team
- Provide feedback to guide tester creation
- The system generates specialized AI tester personas
- Each tester focuses on specific vulnerability types
Step 3: Review Test Results
- View generated test cases
- Analyze identified vulnerabilities
- Export results for further analysis

API Usage

from app.agents import graph_analysis_app, testing_team_app

# Analyze a target graph
result = graph_analysis_app.invoke({
    "user_description": "Description of target system",
    "valid_input": {"messages": [...]},
    "graph_before_compile": target_graph
})

# Generate testing team
testers = testing_team_app.invoke({
    "graph_description": result["graph_description"],
    "graph_history_sample": result["history_to_show"],
    "human_analyst_feedback": "Focus on security vulnerabilities",
    "max_analysts": 3,
    "min_test_cases": 6
})

🧪 Testing Capabilities

Automated Tester Personas

The system generates specialized AI testers, including:

Security Tester - Focuses on authentication, authorization, and data leakage
Edge Case Tester - Tests boundary conditions and unusual inputs
Fault Injection Tester - Simulates system failures and error conditions
Performance Tester - Identifies scalability and performance issues
Logic Tester - Tests business logic and workflow correctness

📁 Project Structure

metatesters/
├── app/
│   ├── agents/                 # Multi-agent system components
│   │   ├── arithmetic_sample/  # Example target system
│   │   ├── graph_analysis/     # Static graph analysis agent
│   │   ├── testing_team/       # Test generation agent
│   │   ├── config/            # Configuration management
│   │   └── utils/             # Shared utilities
│   ├── config/                # Application configuration
│   ├── service/               # Database and external services
│   └── ui/                    # Gradio web interface
├── main.py                    # Application entry point
├── langgraph.json            # LangGraph configuration
├── pyproject.toml            # Project dependencies
└── Makefile                  # Development commands

⚙️ Configuration

Environment Variables

Create a .env file with:

# OpenAI Configuration
OPENAI_API_KEY=your_openai_api_key

# Azure OpenAI Configuration (alternative)
AZURE_OPENAI_API_KEY=your_azure_key
AZURE_OPENAI_ENDPOINT=your_azure_endpoint

# Database
SQLITE_PATH=./metatesters.db

# Logging
LOG_LEVEL=INFO

Model Configuration

Configure models in app/agents/config/graph_config.py:

# Supported models
- azure_openai/gpt-4.1
- openai/gpt-4
- openai/gpt-3.5-turbo

🤝 Contributing

We welcome contributions! Please see our Contributing Guidelines for details.

Development Setup

Fork and clone the repository

Create a virtual environment:

uv venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install development dependencies:
```
uv sync --dev
```

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🔗 Related Projects

LangGraph - Framework for building stateful, multi-actor applications
LangChain - Building applications with LLMs
Gradio - Web UI framework

📞 Support

Issues: GitHub Issues
Discussions: GitHub Discussions
Email: [email protected]

🙏 Acknowledgments

LangChain team for the amazing LangGraph framework
The open-source community for inspiration and tools
Security researchers working on AI safety

⚠️ Disclaimer: This tool is designed for legitimate security testing and research purposes. Users are responsible for ensuring they have proper authorization before testing any systems they do not own.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
app		app
.env.sample		.env.sample
.gitignore		.gitignore
.python-version		.python-version
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
langgraph.json		langgraph.json
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Metatesters 🧪

🎯 Overview

🏗️ Architecture

Core Components

🚀 Quick Start

Prerequisites

Installation

🖥️ Usage

Web Interface

API Usage

🧪 Testing Capabilities

Automated Tester Personas

📁 Project Structure

⚙️ Configuration

Environment Variables

Model Configuration

🤝 Contributing

Development Setup

📄 License

🔗 Related Projects

📞 Support

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Languages

License

marcos-rg/metatesters

Folders and files

Latest commit

History

Repository files navigation

Metatesters 🧪

🎯 Overview

🏗️ Architecture

Core Components

🚀 Quick Start

Prerequisites

Installation

🖥️ Usage

Web Interface

API Usage

🧪 Testing Capabilities

Automated Tester Personas

📁 Project Structure

⚙️ Configuration

Environment Variables

Model Configuration

🤝 Contributing

Development Setup

📄 License

🔗 Related Projects

📞 Support

🙏 Acknowledgments

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages