r/LocalLLaMA

13h ago

NSFW SPOILER

Omnicoder-Claude-4.6-Opus-Uncensored-GGUF

10 hours ago

I ran the Aider benchmark (225 hard coding problems) on Qwen3.5 35B-A3B, got 26.7% pass@1 and 54.7% pass@2. It took 95 seconds per problem on average.

Running Omnicoder 9B right now. So far it has done 75/225 problems. It's taking 402 seconds per problem, and the success rate so far is 5.3% at pass@1 and 29.3% at pass@2.

I'm not even sure I want to wait for it to finish but it would be interesting to compare it vs vanilla Qwen3.5 9B later.

I'm not sure Claude distill is gonna fix Omnicoder's problems tbh
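
For a sense of how long that wait would be, here is a rough back-of-the-envelope estimate. It assumes the 402-second average holds for the remaining problems, which may not be true; the variable names are just for illustration.

```python
# Rough estimate only: assumes the observed 402 s/problem average
# stays the same for the problems that have not run yet.
TOTAL_PROBLEMS = 225
done = 75
seconds_per_problem = 402

remaining = TOTAL_PROBLEMS - done
remaining_hours = remaining * seconds_per_problem / 3600
full_run_hours = TOTAL_PROBLEMS * seconds_per_problem / 3600

print(f"remaining: {remaining} problems, ~{remaining_hours:.1f} h")
print(f"full run:  ~{full_run_hours:.1f} h")
```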

sotona-

8 hours ago

btw, 122b got 76% pass@2

grumd

8 hours ago

Which quant?

EvilEnginer

9 hours ago

I think the Aider benchmark is overkill for a model of this size. Btw, pretty good results.

grumd

9 hours ago

Yeah I just use it to find out which one of my local models is the best. 35B is the best quality vs speed tradeoff. I wanna try 27B Claude distill at Q3 next.

So far my results are: 27B IQ4_XS - 59.6%, 441 seconds per test; 35B Q6 - 54.7%, 95 seconds per test; 27B Q3_K_S - 50.7%, 218 seconds per test.
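
To make the quality vs speed tradeoff easier to eyeball, here is a small sketch that turns those per-test times into full-run hours. The figures are copied from the comment above; the 225-problem run length comes from the Aider benchmark mentioned earlier in the thread, and the helper name `full_run_hours` is made up for this example.

```python
# Figures copied from the comment: (model/quant, pass rate in %, seconds per test).
results = [
    ("27B IQ4_XS", 59.6, 441),
    ("35B Q6", 54.7, 95),
    ("27B Q3_K_S", 50.7, 218),
]
PROBLEMS = 225  # size of the Aider benchmark run mentioned in the thread


def full_run_hours(seconds_per_test: float) -> float:
    """Wall-clock hours for one full run at a constant per-test time."""
    return seconds_per_test * PROBLEMS / 3600


# Fastest first, so the speed gap is visible next to the pass rate.
for name, pass_rate, secs in sorted(results, key=lambda r: r[2]):
    print(f"{name:<12} {pass_rate:5.1f}%  {full_run_hours(secs):5.1f} h")
```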

sgmv

12 hours ago

I want exactly this but for the 27B

EvilEnginer

12 hours ago

Try this script in Google Colab: https://pastebin.com/xEP68vss - it's pretty simple. Just replace the paths to the repositories and files, and pick a quant that works best on your hardware.

In the next cell, insert this script to upload the result to Hugging Face: https://pastebin.com/PwxCbvwK

After that you can download the model in LM Studio.

sotona-

10 hours ago

r = np.clip(a + (t - s), 0,) it's such a primitive merge! Why not use mergekit?
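
For anyone wondering what that one-liner amounts to, here is a minimal sketch of a delta-style merge. It assumes `a` is the model being upgraded, `t` is a fine-tune, and `s` is the base that fine-tune started from; those roles are a guess and are not taken from the pastebin script. The quoted line looks cut off after the lower clip bound, so `lo` and `hi` are hypothetical parameters here, and the toy arrays stand in for real weight tensors.

```python
import numpy as np


def delta_merge(a, t, s, lo=None, hi=None):
    """Add the fine-tune delta (t - s) onto a, optionally clipping the result.

    The roles of a, t, s and the clip bounds are assumptions for illustration.
    """
    r = a + (t - s)
    # Only clip when at least one bound is given.
    if lo is not None or hi is not None:
        r = np.clip(r, lo, hi)
    return r


# Toy tensors, not real model weights.
a = np.array([1.0, 2.0, 3.0])
s = np.array([0.5, 0.5, 0.5])
t = np.array([1.0, 0.0, 0.5])

merged = delta_merge(a, t, s)
clipped = delta_merge(a, t, s, lo=0.0, hi=2.0)
print(merged, clipped)
```

mergekit offers more elaborate merge methods; this only illustrates the plain element-wise version the comment is reacting to.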

jack-in-the-sack

10 hours ago

All these model names get me confused. Can I replace Claude Code with this model?

EvilEnginer

9 hours ago

I don't think so. This is just an experiment in upgrading Qwen 3.5 9B fine-tunes via merging. Goal: get a fully working, uncensored agent for programming and roleplay that runs on low-end consumer hardware.

bharathbunny

8 hours ago

Why is this NSFW?

EvilEnginer

8 hours ago

Because it's an uncensored model :)

siete82

8 hours ago

Uncensored means it can produce malware

jax_cooper

6 hours ago

red teaming goes brrrrrrr

mr_Owner

12 hours ago

Would this also improve non-reasoning mode?

EvilEnginer

11 hours ago

I think so. On my previous model, it improved non-reasoning mode a lot.

Jack_Moves

6 hours ago

Can someone please share a suggested Modelfile or instructions to get this running quickly in Ollama? Thanks!

Icy-Degree6161

11 hours ago

Interesting, I'll give it a whirl, thanks

EvilEnginer

11 hours ago

Nice👍.

oVerde

5 hours ago

Stop! I only have so much storage space!

eg7b

5 hours ago

Aren't Claude models proprietary? Are these distilled SFT models?

tough-dance

4 hours ago

I really don't mean this as a criticism, just genuinely curious. What is gained by having an Omnicoder be uncensored/NSFW? Is it to code mischievous things or to have surrounding conversation be spicy? Again, just genuinely curious

EvilEnginer

4 hours ago

Basically, the uncensored/NSFW part removes the refusal layers from the model. You get spicy, direct conversations, and of course the model is more creative without sounding too robotic.

EvilEnginer

4 hours ago

I uploaded the OmniClaw model. Basically, it's just a merge of Omnicoder with this one from empero-ai: https://huggingface.co/empero-ai/Qwen3.5-9B-Claude-Code-GGUF . That model was trained on real Claude Code / ChatGPT Codex agentic sessions from the DataClaw dataset collection. Feel free to take a look ^_^.
