Stats | 1,202 |
Reviews | (119) |
Published | Jun 8, 2024 |
Usage Tips | Clip Skip: 2 |
Hash | AutoV2 3299DE22AC |
A semi-realistic (2.5D) mixed model based on Pony Diffusion.
This is another open/free merged model, named Pinkie Pie pony mix.
Merged Models
tangbohu-lofi-pony-basemodel (by @tangbohu) to enhance the base model.
yaminabepony (by @KKTT6783) to add a pretty Asian face style.
MIST XL Hyper Character Style Model AiARTiST (by @AiARTiST) to fix details at the "out." block level.
Recipes
Model Mixer (https://github.com/wkpark/sd-webui-model-mixer) is used to mix all models in one step. Recipe details are included in the model checkpoint or in some images, so you can easily reuse or modify this recipe with the model-mixer.
VAE included.
This is a screenshot of sd-webui with the model-mixer extension:
The basic recipe is as follows:
step #1 : base model A + model B x 0.3 = mix_A (text encoder excluded) - DARE merge method (a simplified DARE method is supported by the model-mixer)
- after some trial and error, OUT01 reduced from 0.3 to 0.1
step #2 : block level mix - mix_A + model C = final mix - DARE merge method
- explain: block level merges on MID + OUT00~OUT08.
- SDXL OUT00 ~ OUT02 blocks have a wide effect, especially on the face style.
- NOTE: DARE method uses random pivoting internally, so the results may be slightly different in each merge process.
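As a rough illustration of the DARE idea used in both steps (drop part of the difference between two models at random, then rescale what is kept), here is a minimal pure-Python sketch. It is not the model-mixer's actual code: the function name `dare_merge`, the `drop_rate` parameter, and the flat-list weight representation are assumptions made for this example.

```python
import random


def dare_merge(base, other, alpha, drop_rate=0.5, seed=None):
    """Merge two flat weight lists with a simplified DARE step (illustrative only).

    delta = other - base is randomly dropped with probability `drop_rate`,
    the surviving entries are rescaled by 1 / (1 - drop_rate), and the
    result is added back to `base` scaled by `alpha`.
    """
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    rng = random.Random(seed)
    keep_scale = 1.0 / (1.0 - drop_rate)
    merged = []
    for a, b in zip(base, other):
        delta = b - a
        if rng.random() < drop_rate:
            delta = 0.0          # dropped: this weight keeps the base value
        else:
            delta *= keep_scale  # kept: rescaled to preserve the expected delta
        merged.append(a + alpha * delta)
    return merged


# Step #1 style usage: base model A + model B x 0.3 (toy values, not real weights).
model_a = [0.0, 1.0, 2.0, 3.0]
model_b = [1.0, 1.0, 4.0, 2.0]
mix_a = dare_merge(model_a, model_b, alpha=0.3, seed=42)
```

Because the dropped positions are chosen at random, two runs without a fixed seed generally give slightly different results, which matches the NOTE above.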
Adjust settings: these settings optimize the details and tone of the model. The time_embed.* and out.* weights have been tuned (please see https://github.com/hako-mikan/sd-webui-supermerger?tab=readme-ov-file#adjust)
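The block names used in the recipe and the adjust settings (BASE, MID, OUT00~OUT08, time_embed.*, out.*) refer to groups of checkpoint weights. The sketch below shows one way such labels could be derived from weight key names and turned into per-block merge ratios. The key prefixes (`model.diffusion_model.`, `conditioner.`) are assumptions about the usual SDXL checkpoint layout, and `block_label` and `alpha_for_key` are hypothetical helpers, not part of any extension.

```python
import re

UNET_PREFIX = "model.diffusion_model."  # assumed UNet key prefix in SDXL checkpoints
_BLOCK_RE = re.compile(r"^(input_blocks|output_blocks)\.(\d+)\.")


def block_label(key):
    """Map a checkpoint key to a merge-block label such as 'OUT01' or 'MID'."""
    if key.startswith("conditioner."):      # assumed text-encoder prefix
        return "BASE"
    if not key.startswith(UNET_PREFIX):
        return "OTHER"                      # e.g. VAE weights
    rest = key[len(UNET_PREFIX):]
    match = _BLOCK_RE.match(rest)
    if match:
        prefix = "IN" if match.group(1) == "input_blocks" else "OUT"
        return f"{prefix}{int(match.group(2)):02d}"
    if rest.startswith("middle_block."):
        return "MID"
    if rest.startswith("time_embed."):
        return "time_embed"
    if rest.startswith("out."):
        return "out"
    return "OTHER"


def alpha_for_key(key, block_alphas, default):
    """Look up the merge ratio for a key, falling back to `default`."""
    return block_alphas.get(block_label(key), default)


# Step #1 of the recipe: 0.3 everywhere, text encoder excluded, OUT01 lowered to 0.1.
step1_alphas = {"BASE": 0.0, "OUT01": 0.1}
ratio = alpha_for_key(
    "model.diffusion_model.output_blocks.1.0.in_layers.0.weight",
    step1_alphas,
    default=0.3,
)
```

The ratios for step #2 are not listed on this page, so only step #1 is shown.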
Recommendations
All posted images use μ-DDetailer script extension
recommended sampler: Euler a
recommended image size (width range is about 640~1024 / height range is 640~1344):
1024x1024 (default SDXL)
640x960 / 768x1152 / 800x1200
832x1216 / 896x1152
see also SDXL resolution cheat sheet: https://www.reddit.com/r/StableDiffusion/comments/15c3rf6/sdxl_resolution_cheat_sheet/
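For scripted generation, the size list above can be used as a lookup table. The following is a small sketch that picks the listed size closest to a desired aspect ratio and checks the stated width/height ranges; `closest_size` and `in_recommended_range` are hypothetical helper names for this example.

```python
# Sizes listed above, as (width, height).
RECOMMENDED_SIZES = [
    (1024, 1024),
    (640, 960),
    (768, 1152),
    (800, 1200),
    (832, 1216),
    (896, 1152),
]


def closest_size(aspect_ratio, sizes=RECOMMENDED_SIZES):
    """Return the listed (width, height) whose width/height is nearest to aspect_ratio."""
    return min(sizes, key=lambda wh: abs(wh[0] / wh[1] - aspect_ratio))


def in_recommended_range(width, height):
    """Check the stated ranges: width about 640~1024, height 640~1344."""
    return 640 <= width <= 1024 and 640 <= height <= 1344


square = closest_size(1.0)
portrait = closest_size(0.78)
```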
recommended CFG scale: 3~7
with Dynamic Thresholding (CFG Fix):
- CFG scale 10
- mimic CFG scale 3~7
Clip skip: 2 recommended (Clip skip: 1 also works fine), ENSD: 31337
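If you drive the webui through its API, the recommendations above could be collected into a request body roughly like this. This is a sketch only: the field names are assumed to match the AUTOMATIC1111 `/sdapi/v1/txt2img` endpoint and its option names (`CLIP_stop_at_last_layers`, `eta_noise_seed_delta`), `build_txt2img_payload` is a hypothetical helper, and the Dynamic Thresholding and detailer extensions are left out because their API arguments are not described here.

```python
def build_txt2img_payload(prompt, width=832, height=1216, cfg_scale=5.0, seed=-1):
    """Build a txt2img request body from the recommended settings (no request is sent)."""
    if not 3 <= cfg_scale <= 7:
        # Recommended range without Dynamic Thresholding.
        raise ValueError("cfg_scale should be within 3~7")
    return {
        "prompt": prompt,
        "sampler_name": "Euler a",             # recommended sampler
        "cfg_scale": cfg_scale,
        "width": width,
        "height": height,
        "seed": seed,
        "override_settings": {
            "CLIP_stop_at_last_layers": 2,      # Clip skip: 2
            "eta_noise_seed_delta": 31337,      # ENSD
        },
    }


payload = build_txt2img_payload("your prompt here")
```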
Useful AUTOMATIC1111 webui extensions
The following sd-webui extensions are recommended:
or use the well-known ADetailer extension
civitai extension, which adds several useful features for civitai.
ChangeLog
2024/05/16 - first release out
2024/05/18 - v1.3 released with minor text-encoder fixes. (full rebuild)
specific text-encoder weights were replaced with yaminabepony's BASE
BASE:layers.1.* and BASE:resblocks.5.* had some bugs and were replaced.
more weight-level bugs will be fixed soon☕👀
2024/05/26 - v1.4 released with minor text-encoder fixes. (v1.0 + additional text-encoder fixes)
v1.4 = v1.0 + additional text-encoder fixes with yaminabepony's text-encoder.
BASE:layers.1.* with 1.0 weights (DARE merge)
BASE:resblocks.1.* with 0.2 weights (DARE merge)
BASE:resblocks.5.* with 1.0 weights (DARE merge)
2024/06/08 - v1.5 released with minor text-encoder fixes (+brightness adjusted)
2024/06/08 - v1.6 released with minor text-encoder fixes (v1.5 hot fix)
2024/06/08 - v2.0 released with "OUT08" block level fix. (enhanced details)
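The text-encoder fixes in the v1.4 entry above replace or blend only the weights whose names match a few patterns. Here is a minimal sketch of that kind of selective patching. It uses plain linear interpolation instead of the DARE merge named in the changelog, the key names in the usage example are shortened and illustrative, and `patch_weights` is a hypothetical helper.

```python
def patch_weights(base, donor, rules):
    """Blend donor weights into base for keys matching any (pattern, weight) rule.

    `base` and `donor` map key names to flat lists of floats. A weight of 1.0
    fully replaces the base value; 0.2 moves it 20% toward the donor.
    """
    patched = dict(base)
    for key, value in base.items():
        for pattern, weight in rules:
            if pattern in key and key in donor:
                patched[key] = [
                    (1.0 - weight) * b + weight * d
                    for b, d in zip(value, donor[key])
                ]
                break  # first matching rule wins
    return patched


# Rules taken from the v1.4 changelog entry.
V14_RULES = [("layers.1.", 1.0), ("resblocks.1.", 0.2), ("resblocks.5.", 1.0)]

# Toy tensors with illustrative key names.
base_te = {
    "text_model.encoder.layers.1.mlp.fc1.weight": [0.0, 2.0],
    "transformer.resblocks.1.attn.weight": [1.0, 1.0],
    "transformer.resblocks.2.attn.weight": [5.0],
}
donor_te = {
    "text_model.encoder.layers.1.mlp.fc1.weight": [4.0, 6.0],
    "transformer.resblocks.1.attn.weight": [2.0, 3.0],
    "transformer.resblocks.2.attn.weight": [9.0],
}
fixed_te = patch_weights(base_te, donor_te, V14_RULES)
```

Keys that match no rule, such as the `resblocks.2` entry here, are left untouched.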
TODO
add more realistic skin tone
reduce western-style faces
fix detail levels (v2.x)
Known Bugs
1. (v1.0~) With some prompts it produces ziggling images, for example:
The cause of this error is the text encoder of the original models (in this case, RealDream Pony v2 produces exactly the same error under A1111). This issue was slightly improved in the v1.2 merged model.
The issue was suspected to be caused by specific weights and was resolved by replacing specific CLIP weights.
(This issue is resolved in RealDream Pony v3.)
2. Some prompt words make the generated images look somewhat cartoonish and ugly, e.g. large eyes, grin, ...
License
All models used here have the "Have different permissions when sharing merges" license permission, so I do not add any additional restrictions.
The original Pony-diffusion v4 license states a "Same license restriction", so I do not add any restriction except the same-license restriction. (Please see https://huggingface.co/AstraliteHeart/pony-diffusion-v4 and https://huggingface.co/spaces/CompVis/stable-diffusion-license)
This model permits users to:
✔Use the model without crediting the creator
✔Sell images they generate
✔Run on services that generate images for money
✔Share merges using this model
✔Sell this model or merges using this model
❌Have different permissions when sharing merges
Specifically, the OpenRAIL-M license permits users to own the rights to the images they generate and to use these images for commercial purposes (Stable Diffusion) (Baseten). This openness supports a wide range of applications, from creative projects to commercial services, enabling businesses and individuals to leverage the model's capabilities for various purposes.
Therefore, if you're considering using Stable Diffusion for commercial products or services, the licensing terms do support such use, as long as the guidelines and restrictions outlined in the license are followed. (from chatgpt)
Support me
If you like my work, feel free to buy me a coffee at ko-fi. https://ko-fi.com/mixboy
Discussion
FANTASTIC!!! I love the faces from this checkpoint. However, it can't always generate a face as perfect as the cover, so I'm looking forward to some improvement in the next version.
Hello the model is great and I like it very much. Sorry for the stupid question but this is my first pony model and whenever I use the adetailer the face is distorted or blurred. I have tried yolo8n and yolo9c, same thing with both. Is there a setting I need to change? Thank you
Thank you for your model.
According to my messages you released v1.5. Can you upload it again? And are there versions 1.1 and 1.2?