Text-to-Image
File size: 1,096 Bytes
58c041b
 
aaf30a1
 
e3ed172
aaf30a1
58c041b
aaf30a1
1c718bb
e3ed172
1c718bb
 
 
e3ed172
1c718bb
aaf30a1
1c718bb
bb28e2e
36baa94
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
license: creativeml-openrail-m
datasets:
- ChristophSchuhmann/improved_aesthetics_6plus
- drhead/laion_hd_21M_deduped
pipeline_tag: text-to-image
---

Currently released models: 

**ZeroDiffusion-Base v0.9** (zd_base_v0-9 and zd_base_v0-9_ema) - a base model trained on zero terminal SNR over roughly 20 million samples

**ZeroDiffusion-Inpaint v0.9** (zd_inpaint_v0-9 and zd_inpaint_v0-9_ema) - an experimental finetune of the stable-diffusion-inpainting model, initialized from a merge of ZD 0.9

The intention of this model is to provide a training base for other models, and to provide researchers with a clean model base to test zero terminal SNR with.

For this model to work well, you will probably need CFG rescale, which is implemented in this plugin: https://github.com/Seshelle/CFG_Rescale_webui

You must also download the corresponding YAML file and put it in the folder with the model (assuming you are using A1111's webui or similar).  It won't work without it.  It will tell webui to use the model in v-prediction mode.

Trained as part of Google's TPU Research Cloud program.