| Type | |
| Stats | 201,989 45,520,629 | 
| Reviews | (17,967) | 
| Published | Jan 8, 2024 | 
| Base Model | |
| Hash | AutoV2 67AB2FD8EC | 
Pony Diffusion V6 is a versatile SDXL finetune capable of producing stunning SFW and NSFW visuals of various anthro, feral, or humanoids species and their interactions based on simple natural language prompts.
CHECK "ABOUT THIS VERSION" ON THE RIGHT IF YOU ARE NOT ON "V6" FOR IMPORTANT INFORMATION.
Please join our Discord Server to support development of new versions of this model and get access to free SD bot and check out more examples of this model capabilities on our prompt sharing website or follow the author on Twitter.
Important information
Make sure you load this model with clip skip 2 (or -2 in some software), otherwise you will be getting low quality blobs.
This model supports a wide array of styles and aesthetics but provides an opinionated default prompt template that allows generation of high quality samples with no negative prompt and otherwise default settings
score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up, just describe what you want, tag1, tag2(previous Pony Diffusion models used a simpler score_9 quality modifier, the longer version of V6 XL version is a training issue that was too late to correct during training, you can still use score_9 but it has a much weaker effect compared to full string. You can learn more about these tags here).
The model is designed to not need negative prompts in most cases and does not need other quality modifiers like "hd", "masterpiece", etc...
Other special data selection tags include, 'source_pony', 'source_furry', 'source_cartoon' and 'source_anime' and ratings of 'rating_safe', 'rating_questionable' and 'rating_explicit'.
This model is capable of recognizing many popular and obscure characters and series.
If you are looking specifically for pony style, I recommend using one of the two following templates `anthro/feral pony, rest of the prompt` or `source_pony, rest of the prompt`.
This model is trained on combination of natural language prompts and tags and is capable of understanding both, so describing intended result using normal language works in most cases, although you can add some tags after the main prompt to boost them.
Using Euler a with 25 steps and resolution of 1024px is recommended although model generally can do most supported SDXL resolution.
This model will sometimes generate pseudo signatures that are hard to remove even with negative prompts, this is unfortunately a training issue that would be corrected in future models. If that's an issue for you I suggest trying V5.5 or inpainting.
Special thanks
- Iceman for helping to procure necessary training resources 
- Haru for assistance with captioning efforts 
- Cookie for technical expertise in training 
- PSAI Server Subscribers for supporting the project costs 
- PSAI Server Moderators for being vigilant and managing the community 
Technical details
The model has been trained on ~2.6M images aesthetically ranked based on authors personal preferences, with roughly 1:1 ratio between anime/cartoon/furry/pony datasets and 1:1 ratio between safe/questionable/explicit ratings. About 50% of all images has been captioned with high quality detailed captions, which results in very strong natural language capabilities.
All images has been trained with both captions (when available) and tags, artists' names have been removed and source data has been filtered based on our Opt-in/Opt-out program. Any explicit content involving underage characters has been filtered out.
License
This model is licensed under a modified Fair AI Public License 1.0-SD (https://freedevproject.org/faipl-1.0-sd/) license.
The following modifications have been added to Fair AI Public License:
You are not permitted to run inference of this model on websites or applications allowing any form of monetization (paid inference, faster tiers, etc.). This applies to any derivative models or model merges.
If you want to use this model commercially, please reach us at contact@purplesmart.ai.
Explicit permission for commercial inference has been granted to CivitAi and Hugging Face.
Suggested Resources 
Discussion
What are the minumum requirements to run this checkpoint? I use A111 interface. This checkpoint is extremely slow and it takes ages to render 1 single image. My setup is:
RAM: 32GB
CPU: Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz   3.60 GHz
GPU:  Nvidia Geforce RTX 2060 (6GB)
Do you have any advice to make it faster?
is it normal, that generations take for fucking ever? im talking 4 minutes for one picture even though i have a strong pc and other models can do it in under 20 seconds. i using the stable diffusion webui.
Any prompt tips for preventing clothing like aprons, dresses, and skirts from being sucked in and vacuum-sealed to the crotch?
After downloading, I tried to change the model locally and could not. It seems to be loading in the middle of the process. Can someone help me!