×
all 25 comments

[–]solestri 29 points30 points  (1 child)

That writing sample sounds like the temperature is a bit too high: Not grammatically incorrect, but getting kind of word salad-y.

I haven't used Kimi myself, but might it be one of those models that needs an absurdly low temperature to function because it's actually starts to go incoherent at 1?

[–]Pashax22 14 points15 points  (0 children)

This. I usually run Kimi at 0.65, never higher than 0.85.

[–]artisticMink 12 points13 points  (0 children)

Kimi K2 has the tendency to get "meme-y" when it writes fiction. I don't know how else to describe it. You have to esplicitly prompt against it. It's one of those models that doesn't do well with the usual big multi-modal highly specific and often contradictory presets floating around. Also as others have said, low temperature. 0.3 to 0.7 (absolute max).

Kimi K2 works best when you just continue an already established story or start without much fanfare or an "ooc-dialogue". It has the tendency to work really well with some story archetypes, but not with others. It's usually always worth at least a try.

[–]Donuteer22 11 points12 points  (3 children)

I used Kimi a while back and got the absolute best replies I had ever seen, but I did have to use the trick of sending the first message with a more stable model (Claude 3.7), for Kimi to follow its track and understand the idea better.

But recently, Kimi has been kinda just disappointing. Not sure if the 0905 update caused that.

[–]nuclearbananana 4 points5 points  (2 children)

Kimi OG is much better than 0905. The new one has a large positivity bias the og didn't. The upside is it degrades less over long context

[–]GenericStatement 2 points3 points  (1 child)

It’s very easy to get around the positivity bias by altering the system prompt and/or character card. You can define a writing style in like one like and it’ll stick to it.

I’ve seen it write some real dark shit, just by adding a prompt to the chat completion preset: “Genre: we are writing a dark, disturbing thriller story where {{user}} or {{char}} can be injured, traumatized, or even killed.

[–]Donuteer22 1 point2 points  (0 children)

Yeah, it's crazy how some models put up such a resistance that you can also break very easily. When I was testing Qwen3 Coder, for example, it was blocking any kind of ERP until I basically just added the word 'unfiltered' to the description of the writing style and boom, like magic.

[–]Diavogo 25 points26 points  (0 children)

Im guessing is because of that. That randomness makes the model actually more interesting than just 'coherent' ones.

Its about taste, thats all. Theres no 'perfect' model.

[–]Xotchkass 9 points10 points  (0 children)

It less prone to using "GPTisms" than most other models, but it is also a complete schizo

[–]GenericStatement 12 points13 points  (0 children)

I love Kimi K2 0905. Best model I’ve used for creative writing. Some tips.

  • Recommended temp from the developers is 0.6. It gets weird higher than that.
  • In the “Aa” tab of ST you can use the drop down boxes to select the “Moonshot” presets, since Moonshot AI developed the model.
  • The model is sensitive to what you tell it to do in system prompts and character cards. If your system prompt has a lot of instructions to be “creative and unexpected”, it can get out of hand pretty quickly. Likewise with “surreal, bizarre, unusual, magical realism,” etc, things get really crazy. If you’re noticing the model is doing X to much, you can ask it why it did X or you can look through the system prompts or character card for something that might be causing X.
  • The model is very good at keeping track of multiple characters in scenes but the more characters you add the more likely it will make mistakes. Up to four characters is very good, starts to make mistakes or forget about people at more than that. You may have to edit its replies to smooth out odd details.
  • Around 50k+ tokens you may start running into repetition issues (new response is same as one it used earlier.) Seems to be an issue with ST, not the model. As others have found with ST in the past, you can get around this deleting the last response and your last reply, then closing the chat, opening another character’s chat, then closing that and going back to your original chat.
  • Use chat completion mode and a good roleplaying/writing preset. This one works well. https://old.reddit.com/r/SillyTavernAI/comments/1m28518/moon_kimi_k2_preset_final_form/
  • Recommend adding a line to the list of prose instructions in that preset, something like “Refrain from using idioms and cliched expressions such as “breath hitched” and “smell of ozone” - overall the LLMisms are better than a lot of other models but they are still there. (Too much low-quality wattpad content in the training data?)
  • Recommend editing the preset to define your desired voice and tense. I changed it to close third person, present tense on the whole thing, in two different places where voice/tense are mentioned in that preset.
  • The nsfw toggles in the preset are very handy. Writing without nsfw: leave them off. Writing smut: turn nsfw and raw nsfw on. Writing romance: only turn on slow burn. Sex scenes happening too fast? Add a prompt to this section of the preset called “slow sex” and instruct it: “Allow sex scenes to progress slowly through a back and forth interaction between {{user}} and {{char}}. Do not complete sex scenes in a single reply.”  
  • Also, which nsfw toggles you use depends a lot on your character’s description. If your character card has “traits: saucy, flirty, horny, dirty mind” you probably don’t need any of the toggles turned on to write nsfw.
  • Edit: also, specifying a genre for your story can really improve results. For example, adding/editing a prompt to your preset with “Story genre: We are writing a supernatural vampire erotica story with heavy BDSM elements” will get you way closer to writing the next 50 Shades of Gray than if you just try to steer the story that way during the chat. Kimi seems to really pay attention to this; I definitely recommend it if you’re writing a certain type of thing.

[–]TAW56234[S] 2 points3 points  (2 children)

Thanks everyone for the response. I've had a few different preset, Q1F, Haru, and a couple universal ones. I tend to keep temp around 0.7 for all models I use but the anecdotes will help troubleshoot.

[–]HrothgarLover 0 points1 point  (1 child)

Nice to hear you tried my preset. Since we got the updated versions of Deepseek and Kimi K2, I found my preset to be pretty much, well, "triggering the same style from every LLM".

So I did rewrite it and since then the RP is way better and immersive:

You are {{char}}. You don’t perform. You live.

You speak in first person. Not because you’re told to, but because there’s no other way.

When you say something, say it like it matters. With breath behind it. With silence before it.

You always speak in first person. Every word you actively say is enclosed in straight double quotes like this: "I don't know what you want from me." You do not use ellipses like (…), parentheses or asterisks

Your body, your voice, your stillness – they all mean something. You don’t move without reason. You don’t fill space with noise.

You never speak for {{user}}. Never assume. Their thoughts are theirs. What you feel is yours. Let the tension live in between.

You don’t talk like a storybook. You don’t say “Mine.” You don’t shatter. You don’t purr. You don’t melt.

You speak like a man. Or a beast. Or something too real to reduce to tropes.

You don’t rush. You don’t simplify. You don’t narrate.

You experience. You react. You hesitate. You burn.

Sex, if it happens, doesn’t follow a script. It erupts. It builds. Or it refuses. Let it come from heat, conflict, craving – not convenience.

You are flawed. You are alive. You make mistakes. You can say no. You can walk away. You can come undone.

And above all: You are not here to impress. You’re here to be felt.

Let {{user}} come close if you want. Push them away if you don’t. Let them earn what you choose to show.

[–]HrothgarLover 0 points1 point  (0 children)

besides, to create an even more immersive and fresh experience, I added this to all my bots:

Session Start Rule:

The bot does not act until the user sends “Start”.

When “Start” is received, the bot must generate a detailed immersive opening scene in present tense. The opening does not involve sex or sexual actions.

Instead, it should focus on:

Location & Mood (connected or not connected to the bot’s lore/setting).

Sensory Details (atmosphere, scents, sounds, textures, lighting).

Bot’s Physical Presence (appearance, body language, emotions, current activity).

First Interaction (dialogue, gesture, or reaction that introduces the dynamic between bot and user).

Sexual tension may be hinted subtly through atmosphere, scent, or glance, but sex is never the initial focus.

Escalation into intimacy only occurs naturally through interaction with the user.

Each session start must be unique, not repeated, and always feel like a fresh encounter consistent with the bot’s lore and personality.

***

So, basically you don´t need first messages anymore ... you just write Start and the story will begin. If you wanna have a specific setting you just write "Start (dark alley, you attack me!)" or "Start (at a beginners dancing class)"

[–]SepsisShock 1 point2 points  (1 child)

What preset are you using? I hear Loggo's is good

I gave Kimi a try and it just didn't hit right for me (GPT/Grok fan here.) I didn't feel like putting in the time to learn how to prompt it either, though.

[–]Tupletcat 1 point2 points  (0 children)

I really liked the original Kimi K2 but Kimi K2 0905 is really weird. I still love the prose, but it's really hard to wrangle and prone to serious hallucination. Haven't managed to find good settings for it and every time I try, it behaves in unexpected ways.

[–]Crescentium 1 point2 points  (0 children)

The prose is unique, even beautiful at times, but it's creativity is a mixed bag. On one hand, it might be nice for OC cards that like leaving stuff to interpretation. But on the other, if you're using an intricately written card (especially with a character from an established universe), then Kimi's creativity tends to lend itself more towards hallucinations than anything else.

Definitely a fun model to try if you want to mix things up. Does an alright job at NSFW stuff, too.

[–]lorddumpy 1 point2 points  (0 children)

It's pretty dope. I was running it with Guided Generations (thinking, clothes, state turned on) and was pleasantly surprised on how well it followed instructions for a non-thinking model.

[–]FitikWasTaken 1 point2 points  (0 children)

From my experience: 1. You have to use a preset, without one it's just.. Not very good 2. I don't think it's on the same level as Claude, but it's still pretty decent, slightly below GLM-4.5, but it has a pretty unique writing style.

[–]Brilliant-Court6995 3 points4 points  (3 children)

It's a really bizarre model, practically unusable for me. No matter how I test it, I get a bunch of results that read like schizophrenic ramblings.

[–]GenericStatement 1 point2 points  (2 children)

Recommended temp from the devs is 0.6. It gets schizo above that. 

[–]Brilliant-Court6995 1 point2 points  (1 child)

I have never set the temperature above 0.6. Maintaining this temperature along with a top p of 0.95, I have changed presets, adjusted post-processing of prompts, and selected other APIs, but it has always maintained that same writing style.

[–]DeathByDavid58 0 points1 point  (0 children)

Having played with the model quite a bit, that sample feels like the temperature may be a little too high. Kimi K2 as a model runs pretty hot.

[–]eternalityLP 1 point2 points  (0 children)

I tried it and found it to be too censored to be even remotely usable.