feat: Add multilingual prompt optimizer with LangGraph agent support #16894

ChristopheZhao · 2025-03-15T10:24:19Z

What I'm trying to accomplish

This PR aims to enhance the text-to-image generation workflow by adding intelligent prompt processing capabilities. Specifically, it:

Makes the WebUI more accessible to non-English speakers by automatically detecting and translating prompts
Improves image generation quality through prompt optimization using LangGraph-based agent workflows
Provides a seamless experience that works within the existing txt2img interface

Summary of changes in code

Added scripts/txt2img_prompt_optimizer.py - A new script that:
- Implements a Script class that integrates with the WebUI's txt2img tab
- Uses LangGraph to create an agent-based workflow for prompt processing
- Detects non-English text and translates it to English
- Optimizes prompts to improve generation quality while preserving intent
- Handles API key management through environment variables
- Provides graceful fallbacks when optional dependencies are missing
Updated requirements.txt to include:
- python-dotenv for environment variable management
- langgraph for building the agent workflow
Updated requirements_versions.txt with specific versions:
- Added compatible versions of new dependencies
- Ensured version compatibility with existing dependencies
Updated .gitignore to exclude:
- .env files containing sensitive API keys

Issues fixed

This PR addresses the feature request in Issue #4576, which requested multilingual prompt support but was previously marked as "not planned".

The implementation:

Adds multilingual support through automatic translation of non-English prompts
Goes beyond the original request by also implementing prompt optimization
Integrates seamlessly with the existing txt2img interface without requiring changes to the core pipeline

Screenshots/videos:

Here's a demonstration of how our system handles backend translations and their effectiveness for prompts in various languages. We will use 'a kitten under a pine tree' as a prompt to test the effects across different languages.

Chinese (simplified):

backend

frontend

Japanese;

backend

frontend

French;

backend

frontend

Spanish;

backend

frontend

Vietnamese.

backend

frontend

Kiswahili

backend

frontend

And, of course, English prompts are also automatically optimized.
- backend

frontend

Checklist:

I have read contributing wiki page
I have performed a self-review of my own code
My code follows the style guidelines
My code passes tests

- Add txt2img_prompt_optimizer.py script for automatic prompt translation and optimization - Support non-English prompts with automatic translation to English - Implement prompt optimization using LangGraph workflow - Add python-dotenv and langgraph dependencies - Update requirements.txt and requirements_versions.txt with new dependencies

ChristopheZhao added 2 commits March 15, 2025 08:08

Translate and optimize code to comply with coding styles

654ee7e

ChristopheZhao requested a review from AUTOMATIC1111 as a code owner March 15, 2025 10:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add multilingual prompt optimizer with LangGraph agent support #16894

feat: Add multilingual prompt optimizer with LangGraph agent support #16894

ChristopheZhao commented Mar 15, 2025 •

edited

Loading

feat: Add multilingual prompt optimizer with LangGraph agent support #16894

Are you sure you want to change the base?

feat: Add multilingual prompt optimizer with LangGraph agent support #16894

Conversation

ChristopheZhao commented Mar 15, 2025 • edited Loading

What I'm trying to accomplish

Summary of changes in code

Issues fixed

Screenshots/videos:

Chinese (simplified):

Japanese;

French;

Spanish;

Vietnamese.

Kiswahili

Checklist:

ChristopheZhao commented Mar 15, 2025 •

edited

Loading