This is an old revision of the document!

Stable Diffusion

Stable Diffusion is an AI model for text to image generation. There are various GUIs available to tweak settings, add specific styles like Disney or photorealistic (Lora models) and help adjusting existing images via picture to picture or selection infill.

The guides below focus on the Automatic1111 Stable Diffusion WebUI.

Linux Docker install

This docker requires docker as well as the GPU drivers installed on the host system.

https://github.com/AbdBarho/stable-diffusion-webui-docker

git clone https://github.com/AbdBarho/stable-diffusion-webui-docker
cd stable-diffusion-webui-docker
vi docker-compose.yaml
#adjust name to stable-diffusion if desired
#adjust data and output dir location to /opt/stable-diffusion/data and output respectively if desired
mkdir -p /opt/stable-diffusion/{data,output}

#Build the ai and download the models.
#Note, first start will take 15-60 minutes to download models to data folder as cache depending on internet connection
#size approximately 10GB

docker compose --profile download up --build

# if download errors occur, just repeat the command.

# build the desired interface:
# docker compose --profile [ui] up --build
# where [ui] is one of: invoke | auto | auto-cpu | comfy | comfy-cpu

# use auto-cpu or comfy-cpu for cpu only interface

#docker compose --profile auto-cpu up --build
docker compose --profile auto up --build

# later:
docker compose --profile auto up -d

access the app on:

http://localhost:7860/

Windows install for AMD GPU

Windows AMD AUTOMATIC1111 stable diffusion webui using Microsoft DirectML for GPU acceleration:

Download and run installers for Python 3.10.6 (ticking Add to PATH) and git

Clone the fork as AMD is not officially supported
Paste into command prompt in a directory where it should live

git clone https://github.com/lshqqytiger/stable-diffusion-webui-directml
cd stable-diffusion-webui-directml
git submodule init
git submodule update

venv\Scripts\python.exe -m pip install --upgrade pip
venv\Scripts\pip.exe install torch-directml

Right-click webui-user.bat and adjust the command line args to
```
COMMANDLINE_ARGS=--use-directml --skip-torch-cuda-test
```
If you have only 4-6gb vram, try adding these additional flags to webui-user.bat's command line argslike so:
```
--opt-sub-quad-attention --lowvram --disable-nan-check
```
Double-click webui-user.bat in that directory, this will download ~2.7GB torch and other python modules, followed by the 4GB Stable-Diffusion 1.5 pruned model.
If it looks like it is stuck when installing or running, press enter in the terminal and it should continue.
once done, a browser opens to http://localhost:7860/

Gallery Addon

As an extension for SD-webui:
Open the Extensions tab in SD-webui.

Select the Install from URL option.
Enter https://github.com/zanllp/sd-webui-infinite-image-browsing
Click on the Install button.
Wait for the installation to complete and click on Apply and restart UI.

general advice

Some general Stable Diffusion prompt advice

Models / Settings

LoRA models/styles/tools as well as base models can be searched and downloaded from CivitAI

Realistic Vision + Lora

Description for photorealistic images:

Download model Realistic Vision V6.0 B1 Base Model from https://civitai.com/models/4201/realistic-vision-v11 and save it into the models/stable-diffusion directory
create models/Lora directory
download LORA Style hyperrealism art and save to models/Lora directory
download LORA epi_noiseoffset and save to models/Lora directory

in web interface go to extensions → install from url and paste to install:

https://github.com/mcmonkeyprojects/sd-dynamic-thresholding
https://github.com/opparco/stable-diffusion-webui-composable-lora

then go to installed tab and click apply and quit
restart stable diffusion web ui
go to Lora tab, configure entries (which will create json files, then the entries disappear)

Base settings:

set sampling steps to 20
set sampling method to DPM++ SDE Karras
set width to 768 and height to 512
set CFG scale to 6
set seed to -1
enable Dynamic Thresholding (CFG Scale Fix)
enable all composable Lora