DreamCanvas is an AI-powered image generator that allows users to create high-quality, creative images using ComfyUI and integrates with a local Large Language Model (LLM) via Ollama. This project includes a FastAPI backend, a dynamic web interface, and support for user-configurable models and servers.
Before you begin, make sure the following prerequisites are in place:

- Conda Environment: The project uses Conda for environment management. Make sure Conda is installed on your system.
- ComfyUI: ComfyUI should be installed and running. You must have the checkpoint `realvisxlV50Lightning.Ng9I.safetensors` installed in the `checkpoints` folder for the workflow. Alternatively, you can modify `workflow.json` to use any other model/checkpoint.
- Ollama: The Ollama LLM server should be installed and accessible.
- Configuration via `.env`: The project uses a `.env` file for configuring server addresses. Below are sample configuration settings; adjust these values to match your environment.

  ```
  COMFYUI_SERVER_ADDRESS=192.168.1.10:8188
  OLLAMA_SERVER_ADDRESS=192.168.1.10:11436
  ```
Once the prerequisites are in place, set up the project:

- Clone the Repository:

  ```bash
  git clone https://github.com/Teachings/DreamCanvas.git
  cd DreamCanvas
  ```

- Set Up Conda Environment: Create and activate the Conda environment:

  ```bash
  conda create --name dreamcanvas python=3.12
  conda activate dreamcanvas
  ```

- Install Dependencies: Install the project dependencies listed in `requirements.txt`:

  ```bash
  pip install -r requirements.txt
  ```

- Set Up `.env` File: Ensure the `.env` file exists in the project root and contains the correct server addresses for ComfyUI and Ollama:

  ```
  COMFYUI_SERVER_ADDRESS=192.168.1.10:8188
  OLLAMA_SERVER_ADDRESS=192.168.1.10:11436
  ```
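Before starting the server, it can be worth confirming that both backends are reachable from this machine. The quick checks below assume a default ComfyUI install (which serves `/system_stats`) and a default Ollama install (which answers on its root URL); substitute the addresses from your `.env`:

```bash
# ComfyUI: a default install exposes /system_stats on its API port.
curl http://192.168.1.10:8188/system_stats

# Ollama: a default install replies "Ollama is running" on its root URL.
curl http://192.168.1.10:11436/
```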
To run the FastAPI server in your local environment, use the following command:
```bash
uvicorn backend.main:app --reload --host 0.0.0.0 --port 8000
```

This will start the app at http://localhost:8000/.

To ensure that ComfyUI is functioning correctly, you can test the connection using the workflow defined in `workflow.json`; a direct check is sketched below.
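If you want to exercise the workflow against ComfyUI directly (bypassing DreamCanvas), ComfyUI accepts queued jobs on its `/prompt` endpoint. This is only a sketch and assumes `workflow.json` is saved in ComfyUI's API (prompt) format rather than the regular UI export:

```bash
# Queue workflow.json directly on the ComfyUI server from .env.
# Assumes the file is in ComfyUI's API/prompt format.
curl -X POST http://192.168.1.10:8188/prompt \
  -H "Content-Type: application/json" \
  -d "{\"prompt\": $(cat workflow.json)}"
```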
If you prefer to run the application inside a Docker container, the following steps guide you through building and running the containerized application.
Navigate to the project directory and build the Docker image:
```bash
docker build -t dreamcanvas .
```
Once the Docker image is built, run the container:
```bash
docker run -d -p 8000:8000 --env-file .env --name dreamcanvas dreamcanvas
```
This command will:
- Start the container in detached mode (`-d`).
- Map port 8000 of the container to port 8000 on your host.
- Use the `.env` file to set environment variables.

You can now access the application at http://localhost:8000/.
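A few standard Docker commands are useful for managing the container afterwards:

```bash
# Follow the application logs.
docker logs -f dreamcanvas

# Stop and remove the container (e.g., before rebuilding the image).
docker stop dreamcanvas
docker rm dreamcanvas
```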
The web interface provides the following features:

- Positive Prompt: Specifies the elements to include in the generated image (e.g., "4k, highly detailed, hyperrealistic").
- Negative Prompt: Specifies elements to avoid in the image (e.g., "blurry, watermark").
- Ask LLM for a Creative Idea: The user can request a creative prompt suggestion from a locally hosted LLM (Ollama). The generated prompt can be applied to the positive prompt field.
- Preconfigured Prompts: Both positive and negative quick prompts are available via buttons. Clicking a button auto-fills the corresponding input field.
- Custom Prompts: Themed prompts (like Halloween or Christmas) are dynamically loaded from the `quick_prompts.json` file. Adding new themes is as easy as editing this file (see the example after this list).
- Image History: The app caches generated images within the session. Users can navigate through cached images using the Previous and Next buttons.
- Cache Clearing: Cached images are cleared when the browser is refreshed or when the Reset button is clicked.
- Reset Button: Clears all input fields, resets generated images, and clears the image cache.
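For example, a new themed group can be added to `quick_prompts.json` next to the existing ones. The structure below mirrors the format returned by `/quick_prompts/`; the "Winter" theme and its values are invented purely for illustration:

```json
{
  "Positive Quick Prompts": [
    { "label": "4K", "value": "4K" },
    { "label": "Highly Detailed", "value": "highly detailed" }
  ],
  "Winter": [
    { "label": "Snowy Forest", "value": "snowy forest" },
    { "label": "Northern Lights", "value": "northern lights" }
  ]
}
```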
The backend is powered by FastAPI and handles the following operations:
- Generating images using ComfyUI.
- Fetching creative suggestions from the local LLM.
- Serving quick prompts from configuration files.
`POST /generate_images/`

- Description: Generates an AI image using the provided prompts and image settings.
- Request Example:

  ```json
  {
    "positive_prompt": "a beautiful sunset",
    "negative_prompt": "blurry",
    "steps": 25,
    "width": 512,
    "height": 512
  }
  ```

- Response: A binary stream containing the generated image.
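As a quick check from the command line, the endpoint can be called with curl; the output filename and image format are assumptions, since the response is simply a binary stream:

```bash
# Generate an image and save the binary response locally.
# The .png extension is an assumption about the returned format.
curl -X POST http://localhost:8000/generate_images/ \
  -H "Content-Type: application/json" \
  -d '{"positive_prompt": "a beautiful sunset", "negative_prompt": "blurry", "steps": 25, "width": 512, "height": 512}' \
  --output generated.png
```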
`POST /ask_llm/`

- Description: Requests a creative prompt from the local LLM server (Ollama).
- Request Example:

  ```json
  { "positive_prompt": "a beautiful sunset" }
  ```

- Response:

  ```json
  { "assistant_reply": "How about a stunning sunset over the mountains with golden light reflecting on the water?" }
  ```
`GET /quick_prompts/`

- Description: Retrieves quick prompt configurations from the `quick_prompts.json` file for dynamic UI updates.
- Response Example:

  ```json
  {
    "Positive Quick Prompts": [
      { "label": "4K", "value": "4K" },
      { "label": "Highly Detailed", "value": "highly detailed" }
    ],
    "Negative Quick Prompts": [
      { "label": "Blurry", "value": "blurry" },
      { "label": "Watermark", "value": "watermark" }
    ],
    "Halloween": [
      { "label": "Black Background", "value": "black background" },
      { "label": "Witch", "value": "witch" }
    ]
  }
  ```
The frontend is built with HTML, CSS, and JavaScript. It dynamically interacts with the backend for:
- Generating images.
- Fetching creative prompts from the LLM.
- Loading quick prompt configurations from `quick_prompts.json`.
Key components of the interface:

- Image Generation Form:
  - Fields for positive and negative prompts, image steps, width, and height.
  - Quick prompt buttons for easy input.
- LLM Integration:
  - A section that allows users to request and apply creative prompts generated by the LLM.
- Image Display and Navigation:
  - Displays the generated images and includes buttons for navigating through cached images.
- Reset Functionality:
  - A Reset button to clear all input fields and generated image history.
The project is built with the following technologies:

- FastAPI: Web framework for building the backend.
- Uvicorn: ASGI server used to run the FastAPI application.
- Ollama: Locally hosted LLM for generating creative prompts.
- Pillow: Python Imaging Library used to handle image operations.
- Bootstrap: CSS framework for styling the UI.
- JavaScript (Fetch API): Handles asynchronous requests to the backend.
You can test the ComfyUI workflow by running the FastAPI server as described above. Use the `/generate_images/` endpoint to generate images and validate that the workflow is functioning correctly.
To test the LLM integration, use the `/ask_llm/` endpoint to request a creative prompt from the locally hosted Ollama LLM.
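If the LLM integration misbehaves, it can help to query Ollama directly, outside of DreamCanvas. The request below uses Ollama's standard `/api/generate` endpoint; the model name is a placeholder for whichever model you have pulled:

```bash
# Ask the Ollama server from .env for a completion directly.
# Replace "llama3" with a model you have actually pulled (ollama pull llama3).
curl http://192.168.1.10:11436/api/generate \
  -d '{"model": "llama3", "prompt": "Suggest a creative image prompt about a sunset", "stream": false}'
```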
For Docker-based setups, ensure that the `.env` file is correctly set up with the server addresses and run the container as described in the Running with Docker section.
If you need to force kill the server process, you can use the following command:
```bash
sudo kill -9 $(sudo lsof -t -i :8000)
```