all 139 comments

[–]mckirkus 268 points269 points  (20 children)

People will always upvote ideas that reinforce their existing beliefs. Truth is a distant second

[–]theUmo 128 points129 points  (11 children)

I believe this to be true. Have my upvote.

[–]JollyJoker3 15 points16 points  (3 children)

This one guy thought evidence would make people change their minds. I linked three papers showing that's not true. He still thought evidence would work.

[–]xly15 6 points7 points  (1 child)

Feelings are way more powerful than logic, reasoning, and evidence. Most people want things that confirm their beliefs because then they don't have to feel bad about holding incorrect beliefs. This is because most people integrate their beliefs into their overall identity, and boom - I feel bad when someone challenges my belief system.

[–]megacewl 1 point2 points  (0 children)

People have to be emotionally convinced first and foremost to come to a new opinion. That’s from their limbic system which is lower level and ‘older’ evolutionarily than anything else. Logic and reasoning in any shape or form whether it’s correct or incorrect, comes from the ‘newer’ prefrontal cortex, and it is only used after the fact to justify one’s own beliefs, decisions, and choices.

[–]crantob 0 points1 point  (0 children)

What percentage of readers got this joke and could explain in a complete sentence why it's funny?

[–]rm-rf-rm[S] 42 points43 points  (4 children)

I see what you did there..

[–]No-Significance4136 14 points15 points  (3 children)

i did what you saw there..

[–]flavio_geo 2 points3 points  (2 children)

I reinforce what was done there

[–]Kahvana 1 point2 points  (1 child)

I align with what was done there

[–]HadesTerminal 0 points1 point  (0 children)

This is true. - A distant second

[–]windozeFanboi 0 points1 point  (0 children)

I believe you to believe this to be true...

⬆️

[–]IrisColt -1 points0 points  (0 children)

heh

[–]zenmagnets 2 points3 points  (0 children)

Reddit in a nutshell

[–]gh0stwriter1234 2 points3 points  (0 children)

You are halfway there, people prefer convenient lies over inconvenient truths.

[–]anthonyg45157 4 points5 points  (0 children)

Upvote must be true

[–]BurntToast_Sensei 1 point2 points  (0 children)

This is my existing belief.

[–]Best-Echidna-5883 0 points1 point  (0 children)

This should be the site MOTTO.

[–]gamblingapocalypse 0 points1 point  (0 children)

Ironically I upvoted this comment.

[–]Tank_Gloomy 0 points1 point  (0 children)

I agree with you and OP, and honestly, there's nothing we can do about it. Stupid people have always existed, they just have a place to have a voice now that the internet is practically free, unfortunately.

[–]rm-rf-rm[S] 107 points108 points  (32 children)

P.S: I normally would have removed that post. I didn't because by the time I caught it, the damage was done (it already had several comments and upvotes). I instead changed the flair to Misleading and made this post, as I'm hoping "show, don't tell" will be more helpful than silently removing it after the fact.

[–]Impossible-Glass-487 47 points48 points  (27 children)

THIS IS THE PROBLEM - you NEED to remove these posts! This sub is becoming infected with these low effort no-thought posts.

[–]rm-rf-rm[S] 60 points61 points  (16 children)

I'm already removing a ton. If I'm a day late, most people who would see the post have already seen it, so removing it has marginal value.

[–]gh0stwriter1234 15 points16 points  (6 children)

Used to help mod r/Amd ... gave up, it was a waste of time. Now only approved posts show up; the amount of content is drastically reduced, but the quality is higher. We went from approving most posts to only approving a few because of the amount of reposts and low-quality benchmark posts similar to what we see here.

[–]Kornelius20 15 points16 points  (0 children)

Honestly, I don't think I'd mind if this sub also had lower quantity but higher quality posts. I've been coming here more often because of the new Qwen models, to see what people are trying out with them, and it feels like a ton of the posts I see are some variation of "I made an amazing tool/repo", only to turn out to be vibe-coded slop that barely had any thought behind it.

[–]Chromix_ 3 points4 points  (0 children)

Approving posts could be some sort of last resort (and a lot of work). Yet how do you quickly & reliably figure out whether some shared project is just a vibe-coded hallucination before approving it? The approach would help prevent duplicate postings during major events, though - and if posts don't get approved fast enough, mods have to sort out 100 duplicates on such an event.

Which reminds me, my recent "Qwen seagulls" picture would've probably never seen the light of day then; it collected 160 upvotes in 2 1/2 hours before being wiped, despite being posted early in the morning :-)

[–]tmvr 10 points11 points  (3 children)

Please do remove nonsense. I was already contemplating making a "Stop with the Qwen3.5 4B shilling!" post, because the amount of completely unhinged posts and comments about some mythical, otherworldly, cancer-curing capabilities of that model made my head spin. I was explaining it away as astroturfing because that was/is still a better option than people just being dumb. There were a lot of "what is going on here?!" feelings the last two or so days on the sub, all brought on by Qwen3.5 4B related content.

[–]rm-rf-rm[S] 11 points12 points  (2 children)

I've removed all the low-effort Qwen3.5 glazing posts. Left just a few up that have hundreds of comments - the discussion alone in them is valuable to the community.

I'm also concerned that it may be astroturfing, as I've never seen a wave this big - I'm consulting with the other mods. My gut tells me it's mostly organic, as Qwen has the largest userbase and the 3.5 family has genuinely cooked.

[–]Impossible-Glass-487 -3 points-2 points  (0 children)

The Qwen3.5 team genuinely cooked; the posts are not astroturfing - even I made a post thanking them for their work (albeit on the appropriate Qwen subreddit). The problem is the deeper-rooted one that is infecting both this sub and r/LocaLLM: there is no wait period to post, and the people posting don't even possess a basic understanding of the tools that they are blathering about. Removing the posts is a band-aid; you need to remove the ability to post without first OBSERVING.

[–]Born_Supermarket2780 6 points7 points  (1 child)

I get modding a busy sub is a lot of work. But it's still worth removing garbage since reddit shows up in search results for years to come.

[–]crantob 0 points1 point  (0 children)

In terms of filters, users may begin to migrate to more user-empowered filtering and searching (LLM + search) and slowly wean themselves off scrolling dumbly through endless distractions.

[–]Impossible-Glass-487 2 points3 points  (2 children)

Are you the only active mod? There are 10 mods listed on the main subs page?

[–]ttkciarllama.cpp 1 point2 points  (1 child)

Most of us are active, but to differing degrees, and different mods focus on different aspects of moderation. Not all of us have access to AutoModerator rules, for example.

[–]Impossible-Glass-487 -1 points0 points  (0 children)

Do you all have access to the "Delete" button?

[–]Chromix_ 16 points17 points  (6 children)

Being exposed to misleading information that's clearly labeled as misleading does help people become more sensitive to that kind of thing, though. Let's hope people notice the banner or read the first comment.

[–]Impossible-Glass-487 7 points8 points  (5 children)

You're really missing the point. This sub was the gold standard for local AI on the net. I used to get excited when I saw a new post from this sub; it was usually an open source project that someone spent time on, or a real question about a local setup. Now it's 50% "What model should I run on my potato?", another 20% covert ads, 5% scams, 10% LMStudio questions, and like 10% of the posts are actually useful. The accounts posting here are sometimes brand new.

The real issue is "The Irony". You are coming to a local AI sub that was once full of experts and hobbyists on the bleeding edge, and you expect them to kindly answer your idiotic question when you're too stupid, ignorant, and belligerent to use the very tool (that you are trying to run locally) on the cloud to answer your own question, and you simultaneously bring down the quality of the entire sub. The other day I pointed out that a user was too stupid to use the cloud tool themselves, and I was immediately brigaded by some idiot loudmouth, a 1% commenter, telling me I was lazy for not giving a lazy response and telling them LMStudio. It's happened on every "AI" sub, but the local LLM subs represent a fringe group of researchers, and there is nothing in place to keep the people who are only interested in the most surface-level discussion out of the mix. This sub grows more irritating with each passing day.

[–]rm-rf-rm[S] 6 points7 points  (2 children)

Will get it back to where it was

[–]Impossible-Glass-487 1 point2 points  (1 child)

This is only going to happen if and when you and the other mods implement strict and unprecedented mandatory minimums for account age / elapsed time of subreddit membership in order to post on the sub.

[–]crantob -1 points0 points  (0 children)

Against your logic stands only helpless flailing.

[–]Chromix_ 1 point2 points  (1 child)

Yes, discussion topics change once something becomes more mainstream. And yes, I would also very much prefer to have the high signal-to-noise ratio back that we had maybe 2+ years ago. I usually sort by /new so as to not miss the occasional nice thing that doesn't catch traction or is misunderstood - well, and to put an early "that doesn't do what you write there" underneath some of the postings. There's a ton of noise there now, while years ago almost every new posting was at least remotely interesting.

I was thinking, maybe we should have an auto-wiki bot that identifies and hides the newbie things and points the person to a FAQ, a main thread, or whatever. That would at least remove some noise. The covert ads, scams, and "I used ollama and my results look bad" postings would not be easy to auto-identify though, at least not reliably.

And no, I wasn't advocating for all misleading postings to stay up. It was specifically that high-profile one, where I agree on "damage was already done".
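The auto-wiki bot idea above could be sketched as a keyword triage pass. This is a hypothetical sketch, not an existing bot: the trigger phrases and FAQ URL are made up, and a real deployment would hook this into a mod bot (e.g. a PRAW submission stream) rather than run standalone.

```python
import re
from typing import Optional

# Hypothetical trigger phrases for common newbie questions.
# The FAQ URL is a placeholder, not a real wiki page.
FAQ_URL = "https://www.reddit.com/r/LocalLLaMA/wiki/faq"
NEWBIE_PATTERNS = [
    r"\bwhat model should i run\b",
    r"\bcan my (pc|laptop|potato) run\b",
    r"\bhow much v?ram\b",
    r"\bbest model for\b",
]

def triage(title: str, body: str = "") -> Optional[str]:
    """Return a canned FAQ reply if the post looks like a FAQ question, else None."""
    text = f"{title} {body}".lower()
    for pattern in NEWBIE_PATTERNS:
        if re.search(pattern, text):
            return f"This looks like a frequently asked question - see {FAQ_URL}"
    return None
```

As the comment says, covert ads and scams would slip right past a filter this crude; it only catches the formulaic questions.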

[–]Impossible-Glass-487 2 points3 points  (0 children)

2 years ago?!?!?! Try 2 months ago lol. The uptick corresponded directly with the OpenClaw hype.

Reddit posts stay up forever. Google something and a reddit post comes up years later, sometimes with bad information. Leaving bad posts up perpetuates the problematic information.

Furthermore, this is not a normal subreddit. This is a subreddit (along with r/localllm) that experts in the field look to and use as part of their day-to-day. The sub as a resource should be preserved, and the fight for preservation should be ongoing as this subreddit and the field grow in popularity. Expecting moderation of a sub like this to be a simple task would be foolish. I would imagine that this is one of the most complex, challenging, and nuanced subs to moderate, but for good reason; the challenges should be met head-on and not allowed to fester like this.

[–]PangurBanTheCat 0 points1 point  (0 children)

I'd advocate for keeping notable examples visible, paired with a moderator note for context. Leveraging these as educational opportunities will be more effective at shifting the community culture in the long run and thus will help prevent nonsense posts. Or, at the very least, it will help some people learn. Overall, human society needs to do that more, apparently. A lot more.

[–]sammcj🦙 llama.cpp 0 points1 point  (1 child)

We spend a lot of time removing so many posts like this and much worse.

[–]Impossible-Glass-487 0 points1 point  (0 children)

You guys should make a r/localllamacirclejerk for the ones that fall into the "much worse" category.

[–]silenceimpaired 2 points3 points  (0 children)

I mean… you seem to be supporting the title of the post. It is SCARY smart. Just smart enough to make fools of us. :) that’s scary.

[–]DinoAmino 0 points1 point  (2 children)

More often than not, the people who hide their post and comment history are getting paid for shilling and spamming. I know some legit people here hide too and I give them a pass because I have seen them around. But the only real way to save this sub is through strict gate-keeping - minimum karma requirements and open account histories required for posting. But nobody seems to want that.

[–]Impossible-Glass-487 0 points1 point  (1 child)

Fuck that, the less I have to expose to the internet the better. I'll just leave the sub to preserve my own anonymity if I have to choose between posting or making my account public again, that's not even a question. This sub is a depreciating asset, my personal information is not.

[–]DinoAmino 0 points1 point  (0 children)

Yeah, I totally understand that. I know some people are using one or more additional accounts for different types of subs, but that requires more effort than most would care for.

[–]Vusiwe 28 points29 points  (0 children)

I saw that post and just laughed yesterday

Practitioners here wouldn’t even trust Qwen 3 VL 235b with that type of task

A 4b VL post must be a parody is what I figured

[–]dieyoufool3 18 points19 points  (1 child)

Saw the post and made sure to report + upvote the callout posts, but the underlying reason for yesterday is that this sub is a trusted source of news, and many of us have outsourced our trust to communities like this.

[–]rm-rf-rm[S] 13 points14 points  (0 children)

Very true. Which is why keeping that bar high is super important.

This thought actually gives me more certainty in removing low effort posts!

[–]iMrParker 10 points11 points  (0 children)

I've noticed a ton of posts that present "findings" or results from AI, and comments will flood in with praise, sometimes minutes or seconds after a post. So clearly people aren't reading posts or articles before responding and upvoting.

[–]trejj 9 points10 points  (0 children)

The irony is that AI IS the tool to counter this problem - when used correctly

So requesting: a) Posters please validate before posting b) People critically evaluate posts

We all talk about how important it is to be critical of AI.

We all assume that we ourselves are critical, but others are accepting it at face value.

We all think AI is a great tool and hallucinations are not a problem for us since we can distinguish them, while others are proven to not be able to.

I think it will take a decade at least to make a dent in this fallacy, and in the meantime, we will keep repeating these lines in every passing thread.

[–]MammayKaiseHain 15 points16 points  (1 child)

I think the people upvoting plausible but incorrect things on reddit thereby corrupting the training data are the real heroes standing between greedy companies and ASI.

[–]Chromix_ 4 points5 points  (0 children)

You are assuming that the scraper bots and connected data pipelines would be smart enough to account for up/downvotes when using the data.
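For what it's worth, accounting for votes at scrape time would be trivial if a pipeline bothered; a minimal sketch, where the record shape is assumed, and where score filtering obviously doesn't catch upvoted-but-wrong content, which is the commenter's point:

```python
def filter_for_training(comments: list, min_score: int = 5) -> list:
    """Keep only comment bodies whose net score clears a threshold.

    `comments` is a stand-in for scraped records with 'body' and 'score'
    keys. Note the weakness: brigaded-but-wrong content passes anyway.
    """
    return [c["body"] for c in comments if c["score"] >= min_score]
```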

[–]Chromix_ 6 points7 points  (0 children)

Well, that's normal - unfortunately. Except that here, the comment explaining that (and why) it's wrong went to the top in time. Often (in other subs) it's buried 5 pages down. Verifying is expensive; blindly trusting what seems plausible is easy - like with a lot of the vibe-coded success projects shared here.

People see what matches their opinion and they upvote. Yes, some read the comments, but when you look at the view statistics per comment vs. per posting then you can see that it's not that many. For example one of my postings has 250k views, and my earliest and top-most comments underneath are between 2k and 10k.

Even when people read the comments, Reddit tends to sometimes collapse interesting comments, which is why I like "expand all".

[–]yuicebox 6 points7 points  (0 children)

I appreciate this crashout, thanks king

[–]toothpastespiders 5 points6 points  (2 children)

This is a stark example of something I think is deeply troubling - stuff is readily accepted without any validation/thought. AI/LLMs are exacerbating this as they are not fully reliable sources of information.

Wikipedia's been the biggest wakeup call for me. A while back I stumbled on a wikipedia article on a subject that probably doesn't come up too much in most people's lives but enough that it should get a steady stream of fresh eyes on it. What stuck out is that it's a subject that I have enough of an academic background in to consider myself competent to critique it. Within the first few paragraphs there was a mistake that was glaring in both how misleading it'd be to the reader and how unaware of the subject one would need to be in order to accept it. The citation for it was laughably bad. But I thought it'd be interesting to see how long it'd take for something so obvious to be corrected.

About two years later and it's still there. And it's really struck me that Wikipedia is pretty much 'the' go-to for general purpose information. And people obviously aren't checking the citations when reading it, just taking it in at face value. I mean, obviously anyone should know that Wikipedia isn't to be taken as authoritative. We know it intellectually. But I still find myself doing it too - just loading up a page to quickly check on something I don't know about.

[–]NoahFect 6 points7 points  (1 child)

Well, be the change you want to see, right?

The worst that will happen, and unfortunately it probably will happen, is that some officious moron will revert your change.

[–]ttkciarllama.cpp 1 point2 points  (0 children)

some officious moron will revert your change

That is exactly what happens. I try to be meticulous about my edits complying with Wikipedia's rules and standards, but still about two-thirds of my edits get reverted.

[–]mtmttuan 6 points7 points  (3 children)

Can we have a way for others to mark a post as potentially misleading? A flair, for example. Then people who actually read the post can re-vote on whether it's actually misleading or not.

[–]rm-rf-rm[S] 3 points4 points  (0 children)

Only mods can change the flair. It would be great if reddit had a feature like that, but I guess the reporting function covers this.

[–]PangurBanTheCat 3 points4 points  (0 children)

The entirety of the internet honestly needs a "Community Notes" feature. It's the only good thing to have ever come out of Twitter.

[–]ttkciarllama.cpp 0 points1 point  (0 children)

There's not a feature exactly like that, but if you report a post and then make a comment under it about why it is bad, a moderator will evaluate the post (eventually) and if your comment is readily visible it will (or should) be taken into account.

[–]wh33t 6 points7 points  (0 children)

The SLOP is so real.

[–]onil_gova 3 points4 points  (1 child)

People are going to be mad if you do and mad if you don't. I just want to thank you for the work that you do. This sub is still one of my favorite places on the internet, and that would not happen without dedicated mods like yourself.

[–]rm-rf-rm[S] 1 point2 points  (0 children)

thanks for the kind words!

[–]Yorn2 3 points4 points  (0 children)

This might be a crazy idea but is there a way to keep track of the number of posts that get X upvotes within Y minutes of posting and automatically tag ones being brigaded with "Brigading detected"? I'm not sure if that would have even helped here, but figured I'd ask to see if you have the metrics to find out.

I mean, I know our knee-jerk reaction is to downvote anything that seems to stink of manipulation, but I would like to think that posts brigaded in a positive way (meaning upvotes instead of downvotes) by a team of people actually bringing something truthful and new to the discussion would survive the tag, while posts brigaded in a positive way by people bringing something untruthful or old to the discussion would be judged a bit more harshly.

Obviously this would have to go through a testing phase to see if it actually produces the desired results. We wouldn't want Unsloth posts, for example, being downvoted as brigading just because there are a handful of people following Daniel, but I'd like to think that such posts would survive the tag.
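The "X upvotes within Y minutes" tag proposed above could be prototyped as a simple vote-velocity check. The thresholds below are invented for illustration; as the comment notes, they'd need a testing phase against the sub's real vote curves to avoid false positives:

```python
from dataclasses import dataclass

@dataclass
class PostStats:
    score: int          # net upvotes so far
    age_minutes: float  # time since posting

def brigade_suspect(stats: PostStats,
                    min_score: int = 100,
                    window_minutes: float = 30.0,
                    max_rate: float = 2.0) -> bool:
    """Flag a young post whose upvote velocity exceeds max_rate votes/minute."""
    if stats.age_minutes <= 0 or stats.age_minutes > window_minutes:
        return False  # only judge posts inside the early window
    rate = stats.score / stats.age_minutes
    return stats.score >= min_score and rate > max_rate
```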

[–]Impossible-Glass-487 7 points8 points  (2 children)

The mods of this sub have allowed anyone and everyone to post here with new accounts and no prior thought or investigation. The new people inherently either cannot understand that their questions are better suited to a cloud model, or they refuse to interact with AI for the simplest of questions, preferring that a human answer them instead.

So requesting: a) mods, please add a minimum amount of time (1 - 2 months) that a user must first be a member of the sub before being allowed to post, b) do a better job of removing obvious slop and shit posts that should be answered with a cloud model (as stated in OP's post as "the irony"), and c) you are the problem, mods, not the stupid users; you need to set up parameters to keep your sub from becoming the garbage that most other "AI" subs have become - this sub was the gold standard a month ago and now it's a mess.

[–]Xamanthas 4 points5 points  (1 child)

6 months minimum. Ideally before Covid so you know it’s not a normie but that would be draconian lol

[–]Impossible-Glass-487 1 point2 points  (0 children)

- So the darkness shall be the light, and the stillness the dancing.

[–]_Erilaz 2 points3 points  (0 children)

Critical thinking is both a nontrivial skill and a hell of an effort. Also, people are lazy. What else did you expect?

[–]mr_zerolith 2 points3 points  (0 children)

The IQ on this sub is dropping rapidly, probably due to growth.
Intervention is unfortunately necessary :(

[–]Bitter-Ebb-8932 4 points5 points  (4 children)

This is why I always run image claims through multiple models and reverse image search. Takes 30 seconds, saves credibility
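The "multiple models" part of that workflow amounts to a quorum vote. A minimal sketch, with plain callables standing in for real model API calls (none of the plumbing here is a real API):

```python
from collections import Counter
from typing import Callable, Sequence, Tuple

def cross_check(question: str,
                models: Sequence[Callable[[str], str]],
                quorum: float = 0.75) -> Tuple[str, bool]:
    """Ask several models the same question; trust the majority answer
    only if at least `quorum` of them agree."""
    answers = [ask(question) for ask in models]
    top_answer, count = Counter(answers).most_common(1)[0]
    return top_answer, count / len(answers) >= quorum
```

Agreement still isn't ground truth - models share training data and failure modes - which is why pairing this with reverse image search matters.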

[–]Temporary-Mix8022 3 points4 points  (3 children)

All this - if 5x models say it's true, then it must be...

The only true test is reality.. ie. your eyes (and as you say, reverse image search is a pretty decent shortcut)

5x SOTAs thought you should walk to a car wash to wash your car...

[–]EffectiveCeilingFan 3 points4 points  (1 child)

How are you supposed to find a single building if you don’t know what that building is? Not everyone is Rainbolt. Identifying things in images is a generally great use of AI, a 4B model is just wayyyy too small in this case, you need world knowledge.

Also, the car wash problem only exists to demonstrate the inherent limitations of transformers and attention mechanisms, same as “how many r’s are there in strawberry”. Furthermore, it’s a logic problem. The failing task was a vision and world knowledge problem. To compare the two doesn’t make sense.

[–]Temporary-Mix8022 3 points4 points  (0 children)

It's pretty easy - if the model says it is X, then cross check that. Easily disproved.

Granted - finding the actual building is less easy.

[–]NoahFect 1 point2 points  (0 children)

5x SOTAs thought you should walk to a car wash to wash your car...

Sigh. No, they did not. Gemini 3 Pro did not, and neither did Opus 4.6. Only the OpenAI models consistently flubbed that question.

Even Amazon's Nova model, which few people have even heard of, got it right when I tried it on its max-thinking setting.

Which 5 SOTA models failed, in your experience? From what I saw, most of the failures occurred in models a step or two behind frontier-level.

[–]teleprint-me 3 points4 points  (0 children)

We as human beings have limited cognitive bandwidth. When inundated with perpetually "infinite" information, we can be overwhelmed and fatigued.

It's not possible to validate and verify every piece of information we come across. We just don't have the time. This is why we rely on each other as a group to validate information.

Unfortunately, we also just accept information as presented to us from time to time, and this has become a cognitive loophole.

For example, there is a ton of information on YouTube. It is not physically possible or practical for every human to watch, validate, verify, and cross-check every piece of information presented to us. It would take multiple lifetimes to do so.

This is not to excuse it, but to illuminate the core issue. I upvoted it, but I'm feeling burnt out - so much so that I can barely keep up with the rapid pace at which current events are unfolding. I'm human and I need to take breaks to "refresh", which means I fall into this trap as most others do as well. Just because you understand something does not mean you can mitigate or prevent it (this is also a cognitive bias; see Wikipedia's list of cognitive biases for a general overview and light introduction).

We're not wired to handle these issues. But I'm sure it's possible to set up safeguards somehow; I'm just not sure what they are or what they would look like.

Regardless, I appreciate the attention to detail. As an aside, I've noticed that Qwen3.5 is not that great. It has potential, but it also has holes in its execution compared to previous releases. Not to say it's a total flop, but it's not great either.

[–]Abject-Tomorrow-652 1 point2 points  (0 children)

Super important

[–]pmttyji 1 point2 points  (0 children)

Patting myself on the back slowly for not upvoting that thread.

That said, I have no idea of that pic location, otherwise I would've pointed out or joined the top comment there.

[–]mantafloppyllama.cpp 1 point2 points  (0 children)

The number of posts Qwen has been getting since the 3.5 release is not organic/natural; it feels very anomalous and synthetic.

Sure, a big bump is expected, but these levels are wrong.

[–]valuat 1 point2 points  (0 children)

Your title is eerily accurate. You're good.

[–]zenmagnets 1 point2 points  (0 children)

But the problem you've highlighted is exactly what reddit is all about, hurrah

[–]simracerman 1 point2 points  (0 children)

Thanks OP. I think mods need to pin a comment at the top with a non-biased, source-based clarification so all new traffic to the post can downvote accordingly, or just read it and move on.

With Reddit data included in LLM training, we need mods' comments to help balance what's true. Bad data will continue to be fed into training, but hopefully some good content is there to counteract the damage.

[–]Merchant_Lawrencellama.cpp 1 point2 points  (0 children)

hahahahah I knew this was bound to happen, thanks mods for the hard work

[–]Kahvana 1 point2 points  (0 children)

Thank you for the hard work.

[–]Ill-Bison-3941 1 point2 points  (0 children)

I mean it's Reddit. Sometimes I scroll through at 3AM and upvote anything remotely interesting I glance at for 2 seconds... But yeah, I understand what this post is asking and why.

[–]GerchSimml 1 point2 points  (0 children)

@grok is this true

[–]Feztopia 1 point2 points  (0 children)

I don't know the building and the image is very small on mobile. I expect the poster to know about his own image. I looked at the comments and I have seen the comments calling it bullshit. I updated my trust for posts from this sub and continued with my life. 

[–]Honest-Debate-6863 0 points1 point  (0 children)

I sometimes upvote before I read the whole thing, because I like what the content is about - it validates my personal beliefs, assessments, and predictions and makes me look confident and stronger. Blame the system, not the human.

[–]GreenPastures2845 0 points1 point  (0 children)

There is a thing that happens where you perceive a leap in AI capability and you get all excited, and the first thought is to go share the excitement. Resist the urge, cool off for a few minutes and think critically.

Yeah, shit is amazing, but let's build on top of it rather than just drool over potential like some cult.

[–]The_IT_Dude_ 0 points1 point  (0 children)

I hope 4o hasn't been shut off as of yet. I disagree and need to ask it if I'm being crazy for not believing you.

/s

[–]sir_turlock 0 points1 point  (0 children)

I think the problem is that AIs talk like humans but hallucinate/make mistakes in a way that a human really doesn't. Our failure modes and self-correction capabilities are entirely different. One is a stochastic text generator; the other is the result of millions of years of evolution and is perfectly capable of doing hard/formal logic. There are even parts of the brain that light up during error detection and correction.

[–]artisticMink 0 points1 point  (1 child)

You prolly know it better than I do - but that's sort of the norm in r/LocalLLaMA.

There are still some good posts here. But the ones that rise quickly are sensationalist headlines put out by people with borderline 'chatbot-psychosis' going off on hallucinations, sprinkled in with the occasional "I built <product> that solves <problem> for F R E E".

[–]ttkciarllama.cpp 2 points3 points  (0 children)

We're removing those as fast as we can, but it's frequently hours after the fact.

Opening this sub to remove bot-spam is one of the first things I do in the morning, but a lot of bot-spam gets posted while I'm asleep. It would be nice to have some active moderators in Europe who are awake during those hours.

Bot-bouncer never sleeps, of course, and it catches a lot, but far from all.

[–]ForsookComparison 0 points1 point  (0 children)

The LinkedIn spam and infographics from people that have never used a local LLM in their life used to not be able to penetrate this sub. Something changed :'(

[–]EmergencyLabs411 0 points1 point  (0 children)

"PSA: Humans are scary stupid"

Say no more, fam

[–]ghulamalchik 0 points1 point  (0 children)

4B is way too tiny to retain much knowledge, so it's expected that it just hallucinated that info. I think 4B is perfect for tool use since it's very smart, but don't rely on it for knowledge and facts.

[–]LocoMod 0 points1 point  (0 children)

When people make claims like “2b model matches closed frontier models”, that could be a kid that is building a TODO app that even a lemon can generate. Could be a junior dev working on basic things. Or could be a senior that has no idea what a true frontier capability is because their use case doesn’t expose the edge case.

Consider that the level of experience is broad and that you’re not entitled to have an opinion for the sake of it, but should only be entitled to what you invested time and effort into understanding and what you can actually argue and justify, preferably in a manner that can be replicated (otherwise it has no value).

Wishful thinking, I know. But a reminder that the great majority of the world is less than 30 years old, a big portion of that is non-technical, and that the cost to truly test the frontier models at a scale where their utility can be discerned is untenable for an even greater number.

The best model is the one they can afford, but that has nothing to do with the capability of the models - only the capability of your wallet.

[–]Cool-Chemical-5629 0 points1 point  (0 children)

Is it so hard to figure out that we all pick favorites? It's the Qwen fans upvoting everything that praises Qwen models AND downvoting everything that even remotely criticizes them.

I'm glad you posted this so soon after the recent news. Apparently, despite the hype, it turns out that Qwen models were doing so well the team behind them nearly fell apart after a post-hype, sober reevaluation of the actual quality.

Don't get me wrong, I love Qwen models as much as the next guy here, if not for anything else, then from the principle that they are free and give us something in times when we already lost Llamas. However, there is no doubt they could have been much better and there's no point trying to downplay the weaknesses. Especially in the general knowledge department.

Apparently, it's not a miracle to achieve better knowledge at a comparable size, because other models have shown it's possible. That's something they can't just sweep under the rug anymore, and for the sake of further advancement of the Qwen models, the team will have to look into ways to improve it.

Hopefully the new ex-Gemini guy will help them to get there and make the Qwen models better than ever before.

[–]Best-Echidna-5883 0 points1 point  (1 child)

This happens every day on Reddit. You should know that. There are so many wacky posts and redundant "news" items that it gets out of control.

[–]rm-rf-rm[S] 0 points1 point  (0 children)

Yes, this is what prompted the post - I think it's important that we address it, or at the least do what we can to reduce or mitigate it

[–]the-ai-scientist 1 point2 points  (0 children)

the upvote-first-read-later pattern is genuinely getting worse. people see a confident output and their brain just accepts it. what's wild is that hallucination detection is actually a solvable problem - grounding responses in sources, flagging low-confidence outputs - but most people just don't bother setting that up. the tool exists, the defaults are just bad...
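To make the "flagging low-confidence outputs" idea concrete, here is a minimal sketch. It assumes an API that returns per-token log-probabilities alongside a completion (many LLM APIs can); the function name and the 0.8 threshold are illustrative choices, not part of any specific library.

```python
import math

def flag_low_confidence(token_logprobs, threshold=0.8):
    """Flag a response whose mean token probability falls below `threshold`.

    `token_logprobs` is the list of per-token log-probabilities that an
    LLM API can return alongside a completion. An empty list means we
    have no confidence signal at all, so we flag it for review too.
    """
    if not token_logprobs:
        return True
    # Geometric-mean probability of the generated tokens.
    mean_prob = math.exp(sum(token_logprobs) / len(token_logprobs))
    return mean_prob < threshold

# A confidently generated span (probabilities near 1.0) passes...
print(flag_low_confidence([math.log(0.99)] * 10))  # False
# ...while a hedged, low-probability span gets surfaced for review.
print(flag_low_confidence([math.log(0.4)] * 10))   # True
```

In practice you would route flagged responses to a retrieval/grounding step or show an uncertainty badge in the UI, rather than silently serving them.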

[–]sullenisme 0 points1 point  (0 children)

good username

[–]repair_and_privacy 0 points1 point  (0 children)

Be true to your username 😁

[–]Shensmobile 0 points1 point  (0 children)

When people say that LLMs make a ton of mistakes, I assume they're an AI bot that's trying to sow discord because any real human that's worked with other humans knows that humans make a TON of mistakes. I work in the space of deploying LLMs in healthcare where they can't hire anyone to do the boring clerical stuff, and when I'm finetuning these bots on "labelled" data, I would say that like 30% of medical records are entered into databases incorrectly. If an LLM can do it with a 10% error rate, that's already significantly better than anyone you could hire to do this work.

[–]Substantial_Work_559 -1 points0 points  (1 child)

The model was quite correct, in fact. It messed up the naming a bit but got the location quite right: Lisbon, Belém. It's the 'Igreja de Santa Maria de Belém'. I didn't notice the messed-up name; I just saw the picture and the location description, and because I had been there, recognized it as well. This is one of the most famous places in Lisbon, so I'm not too impressed. Streetview link: https://www.google.de/maps/@38.6972728,-9.2050589,3a,75y,311.25h,100.11t/data=!3m7!1e1!3m5!1s-KKCWytA3fLTbFkqMn5wVw!2e0!6shttps:%2F%2Fstreetviewpixels-pa.googleapis.com%2Fv1%2Fthumbnail%3Fcb_client%3Dmaps_sv.tactile%26w%3D900%26h%3D600%26pitch%3D-10.11131316065324%26panoid%3D-KKCWytA3fLTbFkqMn5wVw%26yaw%3D311.2455819877518!7i16384!8i8192?entry=ttu&g_ep=EgoyMDI2MDMwMS4xIKXMDSoASAFQAw%3D%3D

[–]rm-rf-rm[S] 2 points3 points  (0 children)

Thats like saying Roger Federer and Rafa Nadal are the same person.

[–]sine120 -2 points-1 points  (1 child)

Humans are scary stupid

Source??

[–]harlekinrains 0 points1 point  (0 children)

Propaganda - Edward Bernays (read it - if you don't, here's the short version: propaganda and public relations are essentially the same thing. Who knew? Not you? That's the point.)

Let's take this example.

  • Anthropic sees in their data (even if siloed, somehow) that the US is using Claude Code to plan the Iran war.
  • They go into crisis-PR mode by publicly stating they would not allow the US government to use Anthropic's models for mass domestic surveillance, nor for fully autonomous weapons. (The first is current domestic law, the second a worldwide convention.)
  • The press thinks this is the most moral thing they've heard in a year. Writes "how brave" articles.
  • The US administration threatens to avoid the dictate of the default, and probably for other unknown reasons.
  • It finally leaks to the press that Claude Opus was used for mission planning and simulations in the Iran war.

Public hears two things. And two things only.

CENTCOM is using an Anthropic subscription! Anthropic is Disney-princess goody-good. And mighty at war planning.

17k people cancel their ChatGPT subscription to get an Anthropic one.

The movement starts to trend on twitter.

Meanwhile, in fact-based land, Anthropic metadata is still subject to the same data protection/freeze/access laws as all of their competitors'.

Anthropic models were used to plan the Iran war.

Right?

[–]MrCoolest -2 points-1 points  (2 children)

Why would people use Qwen if it's that shit? I'd rather stick to ChatGPT or Claude. I guess Qwen might be good if you're cheating on your high school science homework?

[–]Savantskie1 0 points1 point  (1 child)

Qwen is fine if you prompt it not to trust its built-in knowledge and give it a way to verify its own data.

[–]MrCoolest 0 points1 point  (0 children)

Haha, "don't trust your own training data" lol. Might as well train your own LLM at that point.

[–]mantafloppyllama.cpp -1 points0 points  (1 child)

Thinking these are all humans was your first mistake.

[–]ttkciarllama.cpp 2 points3 points  (0 children)

Nah, they know. We've been removing literally dozens of bot-posts from this sub every single day.

rm-rf-rm is just talking about humans, here. The non-humans are a different issue.

[–]JayPSec -1 points0 points  (0 children)

Well... kinda. To be fair, even though it hallucinated a name, it correctly identified an architectural style from the 1500s and described the place, "Mosteiro dos Jerónimos", to an impressive degree of detail. So yes, at least measured against my expectations, the model is scary smart.

[–]nikgeo25 -4 points-3 points  (0 children)

How do we know this post isn't doing the same thing... reinforcing opinions in this sub