xinsir committed on
Commit
108cd42
1 Parent(s): c331053

Update README.md

  1. README.md +28 -0
README.md CHANGED
@@ -75,6 +75,7 @@ prompt: 1girl, solo, ball, swimsuit, bikini, mole, beachball, white bikini, brea
75
  ![image7](./000092_scribble_concat.webp)
76
 
77
 
 
78
  ## How to Get Started with the Model
79
 
80
  Use the code below to get started with the model.
@@ -194,3 +195,30 @@ images = pipe(
194
 
195
  images[0].save(f"your image save path, png format is usually better than jpg or webp in terms of image quality but got much bigger")
196
  ```
75
  ![image7](./000092_scribble_concat.webp)
76
 
77
 
78
+
79
  ## How to Get Started with the Model
80
 
81
  Use the code below to get started with the model.
 
195
 
196
  images[0].save(f"your image save path, png format is usually better than jpg or webp in terms of image quality but got much bigger")
197
  ```
198
+
199
+
200
+ ## Evaluation Data
201
+ The test data is randomly sampled from popular anime wallpaper images (pixiv, nijijourney, and so on). The purpose of the project is to let everyone draw an anime illustration.
202
+ We select 100 images, generate a text prompt for each with [waifu-tagger](https://huggingface.co/spaces/SmilingWolf/wd-tagger), and generate 4 images per prompt, for 400 generated images in total. The images
203
+ should be 1024 * 1024 or a resolution in the same bucket to achieve the best performance. We calculate the LAION Aesthetic Score to measure aesthetic quality and PerceptualSimilarity to measure
204
+ control ability, and we find that the perceived image quality is consistent with the metric values. We compare our method with other SOTA Hugging Face models and list the results below. Our
205
+ model has the highest aesthetic score and can generate visually appealing images if you prompt it properly.
206
+
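The "same bucket resolution" requirement above can be illustrated with a small helper. This is a minimal sketch, not the author's preprocessing code: the function name `nearest_bucket`, the 64-pixel step, and the rounding rule are all assumptions chosen for illustration. It keeps the aspect ratio of the input image while targeting roughly the same pixel count as 1024 * 1024.

```python
def nearest_bucket(width, height, target_area=1024 * 1024, step=64):
    """Return a (width, height) pair close to target_area pixels.

    Hypothetical helper: the 64-pixel step and rounding rule are assumptions,
    not taken from the README.
    """
    aspect = width / height
    # Ideal size with the same aspect ratio and exactly target_area pixels.
    ideal_h = (target_area / aspect) ** 0.5
    ideal_w = ideal_h * aspect
    # Round each side to the nearest multiple of `step`, never below one step.
    bucket_w = max(step, round(ideal_w / step) * step)
    bucket_h = max(step, round(ideal_h / step) * step)
    return bucket_w, bucket_h


if __name__ == "__main__":
    print(nearest_bucket(512, 512))
    print(nearest_bucket(1920, 1080))
```

A square input maps to 1024 * 1024, while a 16:9 input maps to a wider, shorter size with a similar pixel count. The control image and the generated image would both be resized to the returned size under this assumption.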
207
+ ## Quantitative Result
208
+ | metric | xinsir/anime-painter | lllyasviel/control_v11p_sd15_scribble |
209
+ |-------|-------|-------|
210
+ | laion_aesthetic | **5.95** | 5.86 |
211
+ | perceptual similarity | **0.5171** | 0.577 |
212
+
213
+ laion_aesthetic (higher is better)
214
+ perceptual similarity (lower is better)
215
+
216
+ Note: The values are calculated from images saved in webp format. When images are saved as png, the aesthetic scores increase by 0.1-0.3, but the relative ranking remains unchanged.
217
+
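Because the two metrics point in opposite directions, it can help to make the direction explicit when reading the table. The following is a minimal sketch: the scores are copied from the Quantitative Result table, while the names `RESULTS`, `HIGHER_IS_BETTER`, and `best_model` are hypothetical and exist only for this illustration.

```python
# Scores copied from the Quantitative Result table in the README.
RESULTS = {
    "laion_aesthetic": {
        "xinsir/anime-painter": 5.95,
        "lllyasviel/control_v11p_sd15_scribble": 5.86,
    },
    "perceptual similarity": {
        "xinsir/anime-painter": 0.5171,
        "lllyasviel/control_v11p_sd15_scribble": 0.577,
    },
}

# Direction of each metric, as stated below the table.
HIGHER_IS_BETTER = {
    "laion_aesthetic": True,
    "perceptual similarity": False,
}


def best_model(metric):
    """Return the model name with the best score for the given metric."""
    scores = RESULTS[metric]
    pick = max if HIGHER_IS_BETTER[metric] else min
    return pick(scores, key=scores.get)


if __name__ == "__main__":
    for metric in RESULTS:
        print(metric, "->", best_model(metric))
```

Both metrics select xinsir/anime-painter, which matches the bold entries in the table.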
218
+ ### Conclusion
219
+
220
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
221
+
222
+ In our evaluation, the model achieved a better aesthetic score on anime images than lllyasviel/control_v11p_sd15_scribble. We wanted to compare against other sdxl-1.0-scribble models but found none. The model also shows better control ability when measured by perceptual similarity, which we attribute to the bigger base model and more complex data augmentation.
223
+ In addition, the model is less likely to generate abnormal images, such as those with distorted human anatomy.
224
+