
[–]digitaltransmutation 17 points18 points  (1 child)

Lately I have been hearing that the free providers on openrouter (namely chutes) are de-prioritizing requests that come in through openrouter. I also see that deepinfra is quantizing at fp4, and openInference specifically makes a point of demanding the right to publish or redistribute your chats in what appears to be a hand-written privacy policy.

[–]Ok_Bug1610 1 point2 points  (0 children)

I can confirm that's true and you get a lot of rate limits now and other issues, but for the most part AI tools like RooCode just try again. It's not perfect, but it's free.
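The "just try again" behavior described above can be sketched as a retry loop with exponential backoff. This is only an illustration, not how RooCode actually does it: `RateLimited`, `call_with_retry`, and the `send` callable are hypothetical names, and the delays are arbitrary assumptions.

```python
import time


class RateLimited(Exception):
    """Hypothetical error that a client wrapper raises on an HTTP 429 response."""


def call_with_retry(send, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call send() until it succeeds or attempts run out, doubling the wait each time."""
    for attempt in range(max_attempts):
        try:
            return send()
        except RateLimited:
            # Give up on the last attempt instead of sleeping again.
            if attempt == max_attempts - 1:
                raise
            sleep(base_delay * 2 ** attempt)
```

Passing `sleep` as a parameter is a design choice that keeps the loop testable without real waiting; in normal use the default `time.sleep` applies.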

[–]foxdit 4 points5 points  (3 children)

Been using NVIDIA NIM for a few weeks with great success, but it went down for about 24 hours recently, and today I woke up and it's down again... for who knows how long.

[–]biggest_guru_in_town 2 points3 points  (0 children)

They won't register international users, so it gets a rotten tomato from me. To this day they haven't fixed the failure when you try to confirm registration by phone SMS.

[–]Omega-nemo[S] 1 point2 points  (0 children)

When too many users hit a provider at the same time, it's normal for this to happen.

[–]quackycoaster 1 point2 points  (0 children)

Yeah, Nvidia worked great for a while, then it just didn't. Gateway timeouts, and even when it was working it took an average of about 200 seconds to spit stuff out, when openrouter was at about 30 seconds... I keep it in my list for when openrouter is clogged up, but I wouldn't rely on it as your sole provider.
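Keeping a slower provider "in the list" as a backup amounts to an ordered fallback. A minimal sketch, assuming each provider is wrapped in a plain Python callable that raises on timeouts or gateway errors; `first_working` and the provider names in the usage note are hypothetical, not part of any real client library.

```python
def first_working(providers, prompt):
    """Try each (name, call) pair in order; return (name, reply) from the first success."""
    errors = {}
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # timeouts, gateway errors, rate limits
            errors[name] = exc
    # Every provider failed: surface all collected errors at once.
    raise RuntimeError(f"all providers failed: {errors}")
```

Usage would look like `first_working([("openrouter", ask_openrouter), ("nvidia", ask_nvidia)], "hello")`, where both `ask_*` functions are wrappers you write yourself with whatever per-request timeout you can tolerate.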

[–]TonyKhanIsAMoneyMark 5 points6 points  (2 children)

Is there even a reason to use Infermatic and other providers when so much shit is free these days?

[–]Minimum-Analysis-792 4 points5 points  (0 children)

Maybe stability. Most of the free hosting platforms eventually get overloaded with people, so they reduce the RPM/RPD limits and it sometimes gets hard to get a response.

[–]Zulfiqaar 0 points1 point  (0 children)

Zero Data Retention endpoints, better reliability, significantly higher generation speeds - all have their price

[–]Minimum-Analysis-792 6 points7 points  (0 children)

That's a great list! I also have one for free and cheap APIs: rentry.c o/LLMAPI

I'll also add the ones in yours so it can be one big free list and everyone has access to APIs.

[–]GullibleReturn4474 2 points3 points  (0 children)

Thanks for the info. It bothers me a little: I paid for the API to support them and to get something good for roleplay in return, but the official API now has the new version, which sucks for roleplay, so I have to switch to free options.

[–]jx2002 0 points1 point  (1 child)

I honestly don't understand how to get Vercel AI to work with SillyTavern. I have set up an account, got the free credits, made an API key and...it just never works. GPT keeps saying it's something with a firewall or whatever, but I can connect to any other service via API key just fine. Supposedly there's a unique URL you have to use...but I'll be damned if I can find it.

Also, if anyone can make AlibabaCloud work...I'd love to know how.

[–]Omega-nemo[S] 0 points1 point  (0 children)

Maybe it's the fault of the provider where they get the models from

[–]lawgun 0 points1 point  (2 children)

I wish there was a list like this for GLM 4.5 and Kimi 2 as well.

[–]Omega-nemo[S] 0 points1 point  (0 children)

Some of these providers like routeway have them

[–]biggest_guru_in_town 0 points1 point  (0 children)

Chutes has glm 4.5 air for free

[–]East_Piano2514 0 points1 point  (0 children)

They made a new model

[–]Ok_Bug1610 1 point2 points  (0 children)

Useful tidbit: the 50 free daily requests on Openrouter increase to 1,000 per day (and I've reached over 110 million daily tokens with this method) as long as you keep at least a $10 USD balance. You don't have to spend the balance, just keep it there to get the higher limits on the free models. You can also go to the models page and filter by "free".
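The same "filter by free" step can be done on a model list in code. A rough sketch under two assumptions: that the list is a sequence of dicts with an `id` field, and that free variants carry a `:free` suffix in that id. The model ids below are made-up placeholders, and `free_models` is a hypothetical helper.

```python
def free_models(models):
    """Return the ids of models whose id ends in ':free' (assumed naming convention)."""
    return [m["id"] for m in models if m.get("id", "").endswith(":free")]


# Placeholder data for illustration only; not real catalog entries.
sample = [
    {"id": "example/model-a:free"},
    {"id": "example/model-a"},
    {"id": "example/model-b:free"},
]
print(free_models(sample))
```

With the limits quoted above, a list like this is what you would rotate through while staying within the daily request cap.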

[–]IWEREN99 -1 points0 points  (2 children)

Add Nebula Block to the list. I used that provider to use Deepseek

[–]KING_IRQ 2 points3 points  (0 children)

Unfortunately, they don’t anymore.

[–]Omega-nemo[S] 0 points1 point  (0 children)

It does not have deepseek V3.1