I ran the Aider benchmark (225 hard coding problems) on Qwen3.5 35B-A3B and got 26.7% pass@1 and 54.7% pass@2. It took 95 seconds per problem on average.
Running Omnicoder 9B right now. So far it has done 75/225 problems. It's taking 402 seconds per problem, and the success rate so far is 5.3% at pass@1 and 29.3% at pass@2.
I'm not even sure I want to wait for it to finish but it would be interesting to compare it vs vanilla Qwen3.5 9B later.
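For a rough idea of how long the wait is, here is a quick back-of-the-envelope sketch using only the numbers above. It assumes the per-problem averages stay constant for the rest of the run, which may not hold; the variable names are just for illustration.

```python
# Rough wall-clock estimate from the figures quoted in this thread.
# Assumption: the average seconds per problem stays constant for the whole run.
TOTAL_PROBLEMS = 225

done_omnicoder = 75            # problems finished so far
sec_per_problem_omnicoder = 402
sec_per_problem_qwen35b = 95

remaining_hours = (TOTAL_PROBLEMS - done_omnicoder) * sec_per_problem_omnicoder / 3600
full_run_hours_omnicoder = TOTAL_PROBLEMS * sec_per_problem_omnicoder / 3600
full_run_hours_qwen35b = TOTAL_PROBLEMS * sec_per_problem_qwen35b / 3600

print(f"Omnicoder 9B, remaining: {remaining_hours:.2f} h")
print(f"Omnicoder 9B, full run:  {full_run_hours_omnicoder:.2f} h")
print(f"Qwen3.5 35B-A3B, full run: {full_run_hours_qwen35b:.2f} h")
```

So the remaining 150 problems alone would take longer than roughly three full runs of the 35B model at these speeds.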
I'm not sure Claude distill is gonna fix Omnicoder's problems tbh
Btw, 122B got 76% at pass@2.
Which quant?
I think the Aider benchmark is overkill for a model of this size. Btw, pretty good results.
Yeah I just use it to find out which one of my local models is the best. 35B is the best quality vs speed tradeoff. I wanna try 27B Claude distill at Q3 next.
So far my results are: 27B IQ4_XS - 59.6%, 441 seconds per test, 35B Q6 - 54.7%, 95 seconds per test, 27B Q3_K_S - 50.7%, 218 seconds per test.
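To make the quality vs speed tradeoff easier to eyeball, here is a small sketch that turns the three results above into full-run times. It assumes each run covers the same 225 problems as the Aider benchmark mentioned earlier in the thread, and that the percentages are all the same metric (the 35B figure matches the pass@2 number quoted earlier).

```python
# Results as quoted above: (model and quant, score in %, seconds per test).
results = [
    ("27B IQ4_XS", 59.6, 441),
    ("35B Q6", 54.7, 95),
    ("27B Q3_K_S", 50.7, 218),
]

TOTAL_PROBLEMS = 225  # assumption: every run used the full 225-problem set


def full_run_hours(sec_per_test):
    """Hours needed for one full benchmark run at the given average speed."""
    return sec_per_test * TOTAL_PROBLEMS / 3600


fastest = min(results, key=lambda r: r[2])
best_score = max(results, key=lambda r: r[1])

# Print the models from fastest to slowest.
for name, score, sec in sorted(results, key=lambda r: r[2]):
    print(f"{name:12s} {score:5.1f}%  {full_run_hours(sec):5.1f} h per full run")
```

On these numbers the 27B IQ4_XS gains about 5 points over the 35B Q6 but takes more than four times as long per test.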
I want exactly this but for the 27B
Try this script in Google Colab: https://pastebin.com/xEP68vss - it's pretty simple. Just replace the paths to the repositories and files, and pick a quant that works best on your hardware.
In the next cell, insert this script to upload the result to Hugging Face: https://pastebin.com/PwxCbvwK
After that you can download the model in LM Studio.
r = np.clip(a + (t - s), 0,) It's such a primitive merge! Why not use mergekit?
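For anyone wondering what that line does: it looks like a task-vector style merge, where the difference between a fine-tune and its base is added onto another model's weights. Below is a minimal sketch of that idea on toy arrays. The meaning of a, t and s is my assumption from the quoted line (a = model being upgraded, t = fine-tune, s = shared base), the scale parameter is hypothetical, and the clip step is left out because its bounds are not visible in the quote. This is not the actual script.

```python
import numpy as np


def task_vector_merge(a, t, s, scale=1.0):
    """Add the fine-tune delta (t - s) onto the weights a.

    a: weights of the model being upgraded (assumed meaning)
    t: weights of the fine-tuned model (assumed meaning)
    s: weights of the base model that t was trained from (assumed meaning)
    scale: hypothetical knob for how strongly the delta is applied
    """
    delta = t - s
    return a + scale * delta


# Toy tensors standing in for one weight matrix of each model.
a = np.array([0.5, -0.2, 1.0])
s = np.array([0.4, -0.1, 0.9])
t = np.array([0.6, -0.3, 0.9])

r = task_vector_merge(a, t, s)
print(r)
```

In a real merge this would run per tensor over the whole checkpoint, and tools like mergekit offer more elaborate variants of the same idea.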
All these model names get me confused. Can I replace Claude Code with this model?
I don't think so. This is just an experiment in upgrading Qwen 3.5 9B fine-tunes via merging. The goal is to get a fully working agent for programming and roleplay, without censorship, that runs on low-end consumer hardware.
Why is this NSFW?
Because it's an uncensored model :)
Uncensored means it can produce malware
red teaming goes brrrrrrr
Would this also improve non-reasoning mode?
I think so. It improved it a lot on my previous model.
Can someone please share a suggested Modelfile or instructions to get this running quickly in ollama? Thanks!
Interesting, I'll give it a whirl, thanks
Nice👍.
Stop! I just have so much storage space!
Aren’t the Claude models proprietary? Are these distilled SFT models?
I really don't mean this as a criticism, just genuinely curious. What is gained by having an Omnicoder be uncensored/NSFW? Is it to code mischievous things or to have surrounding conversation be spicy? Again, just genuinely curious
Basically, the uncensored/NSFW part removes the refusal layers from the model. You get spicy, direct conversations, and of course the model is more creative without sounding too robotic.
I uploaded the OmniClaw model. Basically, it's just a merge of Omnicoder with this one from empero-ai: https://huggingface.co/empero-ai/Qwen3.5-9B-Claude-Code-GGUF. This thing has been trained on real Claude Code / ChatGPT Codex agentic sessions from the DataClaw dataset collection. Feel free to take a look ^_^.