ConsumerBench is a comprehensive benchmarking framework that evaluates the runtime performance of user-defined GenAI applications under realistic conditions on end-user devices.
```bash
# Clone the repository
git clone https://github.com/your-org/ConsumerBench.git
cd ConsumerBench

# Set up the environment
conda create -n consumerbench python=3.10
conda activate consumerbench
pip install -r requirements.txt
```

Follow the instructions in `applications/` to set up the individual applications.
Add your own YAML workflow in `configs/`, then run the benchmark:

```bash
python src/scripts/run_consumerbench.py --config <path-to-config>
```
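A workflow config is a plain YAML file; the authoritative schemas are the shipped examples in `configs/`, so copy one of those rather than writing a file from scratch. As a rough sketch, a minimal workflow might look like the following (every key except `device`, which the CPU-only instructions later in this README reference, is an illustrative assumption):

```yaml
# Hypothetical workflow sketch -- mirror a real file from configs/ instead
applications:
  - name: chatbot        # assumed key: which application to benchmark
    device: "gpu"        # "gpu" or "cpu" (see the CPU-only experiments)
```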
The benchmark has been tested on the following hardware:

- Setup 1:
  - CPU: Intel(R) Xeon(R) Gold 6126 CPU @ 2.60GHz
  - GPU: NVIDIA RTX 6000
  - System memory: 32 GB
  - CPU cores: 12
- Setup 2:
  - MacBook Pro M1
  - Unified memory: 32 GB
```
ConsumerBench/
├── src/                   # Source code
├── inference_backends/    # Inference backends
├── models/                # GenAI models
├── applications/          # Applications
├── configs/               # Example user configurations & workflows
└── scripts/               # Result processing and plotting scripts
```
Text-to-text generation for chat and Q&A:
- Local backend mimicking the OpenAI API
- Powered by llama.cpp for efficient CPU-GPU co-execution
- Located in `applications/Chatbot`
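Since the Chatbot backend mimics the OpenAI API, it can be queried like any OpenAI-compatible server. A minimal sketch using only the standard library, assuming the llama.cpp server listens on `localhost:8080` (the host, port, and model name are assumptions; check your backend configuration):

```python
# Build an OpenAI-style chat completion request against the local backend.
import json
import urllib.request

def build_chat_request(prompt, base_url="http://localhost:8080/v1"):
    """Construct (but do not send) an OpenAI-compatible chat request."""
    payload = {
        "model": "local",  # llama.cpp servers generally accept any model name
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 128,
    }
    req = urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    return req, payload

req, payload = build_chat_request("What is ConsumerBench?")
# Sending the request requires the backend to be running:
# body = json.load(urllib.request.urlopen(req))
# print(body["choices"][0]["message"]["content"])
```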
Agent-based reasoning for complex fact gathering:
- Built on the open-deep-research framework
- Served via LiteLLM
- Located in `applications/DeepResearch`
Text-to-image generation optimized for edge devices:
- Uses stable-diffusion-webui in API mode
- Located in `applications/ImageGen`
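In API mode, stable-diffusion-webui exposes a `txt2img` endpoint. A sketch of a request against it, assuming the webui's default port 7860 (the port and payload defaults are assumptions; consult the webui API docs for the full parameter set):

```python
# Construct a txt2img request for stable-diffusion-webui's API mode.
import json
import urllib.request

payload = {
    "prompt": "a photo of a mountain lake at sunrise",
    "steps": 20,      # diffusion steps: fewer is faster, lower quality
    "width": 512,
    "height": 512,
}
req = urllib.request.Request(
    "http://localhost:7860/sdapi/v1/txt2img",
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
# Requires the webui running with its API enabled:
# body = json.load(urllib.request.urlopen(req))
# Generated images come back base64-encoded under body["images"].
```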
Audio-to-text transcription for real-time and offline use:
- Whisper-based backend served over HTTP
- Located in `applications/LiveCaptions`
Run the script:

```bash
./scripts/run_benchmark.sh configs/workflow_imagegen.yml 0
```

This script collects:
- GPU metrics: compute/memory bandwidth, via DCGM
- CPU utilization: via the `stat` utility
- CPU memory bandwidth: via the `pcm-memory` utility
- GPU power: via NVML
- CPU power: via RAPL
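On Linux, CPU utilization statistics of the kind collected above come from the kernel counters in `/proc/stat`. A minimal sketch of deriving utilization from two samples of the aggregate `cpu` line (ConsumerBench's actual collector may compute this differently; the field layout follows the `proc(5)` man page):

```python
def cpu_utilization(sample_before, sample_after):
    """Aggregate CPU utilization between two /proc/stat 'cpu' lines.

    Each sample is the first line of /proc/stat, e.g.
    'cpu  user nice system idle iowait irq softirq steal guest guest_nice'.
    Utilization = 1 - (delta idle time / delta total time).
    """
    def parse(line):
        fields = [int(x) for x in line.split()[1:]]
        idle = fields[3] + fields[4]  # idle + iowait jiffies
        return idle, sum(fields)

    idle0, total0 = parse(sample_before)
    idle1, total1 = parse(sample_after)
    return 1.0 - (idle1 - idle0) / (total1 - total0)

before = "cpu  100 0 100 700 100 0 0 0 0 0"
after  = "cpu  200 0 200 800 100 0 0 0 0 0"
print(round(cpu_utilization(before, after), 2))  # -> 0.67
```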
Results are saved in the `results` directory with timestamps, and PDF plots are generated automatically.
To modify Service Level Objectives (SLOs), edit the corresponding parsing script:
- Chatbot: `scripts/result_processing/parse-results-chatbot-log.py`
- DeepResearch: `scripts/result_processing/parse-results-deepresearch-log.py`
- ImageGen: `scripts/result_processing/parse-results-imagegen-log.py`
- LiveCaptions: `scripts/result_processing/parse-results-whisper-log.py`
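The core calculation behind an SLO check is an attainment ratio: the fraction of requests whose measured latency meets the threshold. A sketch, assuming millisecond latencies (the thresholds and metric names in ConsumerBench's parsing scripts may differ):

```python
def slo_attainment(latencies_ms, slo_ms):
    """Fraction of requests whose latency is within the SLO threshold."""
    if not latencies_ms:
        return 0.0
    return sum(1 for lat in latencies_ms if lat <= slo_ms) / len(latencies_ms)

lat = [120.0, 450.0, 90.0, 600.0]
print(slo_attainment(lat, slo_ms=500.0))  # 3 of 4 requests meet a 500 ms SLO
```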
| Application | Config |
|---|---|
| Chatbot | `configs/workflow_chatbot.yml` |
| LiveCaptions | `configs/workflow_live_captions.yml` |
| ImageGen | `configs/workflow_imagegen.yml` |
CPU-only: change `device` from `"gpu"` to `"cpu"` in the configs.
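Since the exact nesting of the config schema varies per workflow, one robust way to flip an experiment to CPU-only programmatically is to rewrite every `device` key after parsing the YAML into Python dicts and lists (a sketch; edit the shipped configs by hand if you prefer):

```python
def set_device(node, device):
    """Recursively set every 'device' key in a parsed config to `device`."""
    if isinstance(node, dict):
        for key, value in node.items():
            if key == "device":
                node[key] = device
            else:
                set_device(value, device)
    elif isinstance(node, list):
        for item in node:
            set_device(item, device)
    return node

# Hypothetical parsed config; the real schema comes from configs/*.yml.
config = {"applications": [{"name": "chatbot", "device": "gpu"}]}
set_device(config, "cpu")
print(config["applications"][0]["device"])  # -> "cpu"
```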
- Greedy allocation: `configs/workflow_chatbot_imagegen_live_captions.yml`
- GPU partitioning: `configs/workflow_chatbot_imagegen_live_captions_mps.yml`
- Config: `configs/workflow_chatbot_deep_research.yml`
- Edit `example_workflow/llamacpp_server.sh` to add `-c 128000 -nkvo` for Chatbot-KVCache-CPU
- Greedy allocation: `configs/workflow_content_creation.yml`
- GPU partitioning: `configs/workflow_content_creation_mps.yml`