Remember when putting your entire life & business into the cloud was good because they were all offering 5 9s of uptime?<p>That's rarely the case these days. It feels like we're lucky to get 2 9s anymore.
Honestly, speaking as one of the people behind <a href="https://downforeveryoneorjustme.com" rel="nofollow">https://downforeveryoneorjustme.com</a>, downtime has gotten way better. Compared to 10 years ago things are so much more redundant and harder to take down.
So then why does no one offer 99.999% uptime guarantees in writing?<p>It should be low risk to offer such guarantees then.
Well, (a) why would they? (b) "uptime" has shifted from a binary "site up/down" to "degraded performance", which itself indicates improvements to uptime since we're both pickier and more precise.
Thank you, finally.<p>Tired of all the people online with anxiety who project their own personal issues by spamming these kinds of doomer posts.
'The outage of a single server is a tragedy, the outage of an entire AWS region is a statistic.'<p>- Stalin probably
We had a ton of traffic coming in to check them:
<a href="https://downforeveryoneorjustme.com/anthropic" rel="nofollow">https://downforeveryoneorjustme.com/anthropic</a><p>Not one of the usual ones that has service problems :)
At this point you can stop worrying about downtime-free deployments so the devops becomes easier
I wonder how much is due to supply constraints, how much is standard growing pains, and if over-reliance on AI was the cause for any outages.
> Our uptime has a '9' in it! -- Anthropic
By now, I'm nearly certain that they'd be down to 0 9s of uptime if they counted it conservatively.
GitHub this month is very close to having 0 9s of reliability. (unless they want to argue that 89% has a 9 in it)
The comment you are replying to is carefully written in a way that allows 23.19%
I'm not sure I've had a day without GitHub hiccups this month, so that feels right.
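For anyone counting along, here is a rough sketch of how "number of nines" is usually worked out: take the floor of -log10 of the unavailable fraction. The function name `count_nines` is made up for illustration, and the percentage is passed as a string so `Decimal` can keep the arithmetic exact.

```python
import math
from decimal import Decimal


def count_nines(availability_pct: str) -> int:
    """Count the 'nines' in an availability percentage given as a string.

    Uses floor(-log10(unavailable fraction)), so 99.999 -> 5, 99 -> 2, 89 -> 0.
    Hypothetical helper for illustration, not any vendor's official metric.
    """
    unavailability = 1 - Decimal(availability_pct) / 100
    if unavailability <= 0:
        # 100% uptime would mean infinitely many nines.
        raise ValueError("availability must be below 100%")
    return math.floor(-unavailability.log10())


for pct in ("89", "99", "99.999"):
    print(f"{pct}% -> {count_nines(pct)} nines")
```

By this definition 89% really does come out to zero nines, even though it has a 9 in it.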
Or as the British would say "9 innit ?"
<a href="https://status.claude.com/" rel="nofollow">https://status.claude.com/</a>
I wouldn't be too harsh, scaling x10 YoY is a bit hard on the infra!
If you don't pay attention 99% may sound high, but it means up to about <i>22 hours</i> of downtime over the quarter.<p>Anthropic has had more than that.<p>Yikes.
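To check the arithmetic, here is a minimal sketch of the downtime budget an uptime percentage allows. It assumes a 90-day quarter (2,160 hours), and `downtime_budget_hours` is a made-up helper name. At 99% the budget is 1% of 2,160 hours, i.e. 21.6 hours, which lands near the figure above.

```python
HOURS_PER_DAY = 24


def downtime_budget_hours(availability_pct: float, days: float) -> float:
    """Maximum downtime in hours that an availability target allows over a window."""
    unavailable_fraction = 1 - availability_pct / 100
    return unavailable_fraction * days * HOURS_PER_DAY


# Assumption: a quarter is treated as 90 days.
QUARTER_DAYS = 90

for target in (99.0, 99.9, 99.999):
    hours = downtime_budget_hours(target, QUARTER_DAYS)
    print(f"{target}% over {QUARTER_DAYS} days allows {hours:.3f} hours of downtime")
```

The same function with `days=365` gives the familiar yearly budgets, e.g. a little over five minutes per year for five nines.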
I honestly feel like it's a more honest status measure than many status pages I know.
Probably vibe-coded their infrastructure
You can access Claude models with Google Cloud reliability via VertexAI. The caveat is that you cannot use your subscription, per-token pricing only.<p>I personally prefer per-token, it makes you more thoughtful about your setup and usage, instead of spray and pray.<p>You can also access the notable open weight models with VertexAI, only need to change the model id string.
I also use them per-token (and strongly prefer that due to a lack of lock-in).<p>However, from a game theory perspective, when there's a subscription, the model makers are incentivized to maximize problem solving in the minimum amount of tokens. With per-token pricing, the incentive is to maximize problem solving while increasing token usage.
I don't think this is quite right because it's the same model underneath. This problem can manifest more through the tooling on top, but it's still largely hard to separate without people catching you.<p>I do agree that Big AI has misaligned incentives with users, generally speaking. This is why I pay per-token with a custom agent stack.<p>I suspect the game theoretic aspects come into play more with the quantizing. I have not (anecdotally) experienced this in my API-based, per-token usage. I.e. I'm getting what I pay for.
You can use your subscription for Anthropic-hosted Claude models?
You mean Google Chaos Services as we call them?
I saw a funny skit where, if the free Claude instance was down for you, you could just ask Rufus, Amazon's shopping AI assistant, your math/coding question phrased as a question about a product, and it would just answer lol.
They seem to be a victim of their own success. Their response times are quite bad, and it's widely believed they are doing something to degrade service quality (quantizing?) in order to stretch resources. They just announced that they're cutting their usage limits down during peak hours as well.<p>They're at serious risk of losing their lead with this sort of performance.
> it's widely believed they are doing something to degrade service quality (quantizing?) in order to stretch resources<p>God, I wish this inane bullshit would just fucking die already.<p>Models are not "degrading". They're not being "secretly quantized". And no one is swapping out your 1.2T frontier behemoth for a cheap 120B toy and hoping you wouldn't notice!<p>It's just that humans are completely full of shit, and can't be trusted to measure LLM performance objectively!<p>Every time you use an LLM, you learn its capability profile better. You start using it more aggressively at what it's "good" at, until you find the limits and expose the flaws. You start paying attention to the more subtle issues you overlooked at first. Your honeymoon period wears off and you see that "the model got dumber". It didn't. You got better at pushing it to its limits, exposing the ways in which it was always dumb.<p>Now, will the likes of Anthropic just "API error: overloaded" you on any day of the week that ends in Y? Will they reduce your usage quotas and hope that you don't notice because they never gave you a number anyway? Oh, definitely. But that "they're making the models WORSE" bullshit lives in people's heads way more than in any reality.
It can't be worse than gemini-cli using a Pro account.
i just use gemini 3 flash via api with custom agent.<p>only people who do not even look at code anymore need anything more than that.
I can't speak on Gemini but OpenAI is far worse for free accounts at least
<p><pre><code> > this sort of performance
</code></pre>
They've been very proud of it.
>"They're in serious risk of losing their lead with this sort of performance."<p>Nobody goes there anymore, it's too crowded.
Victim of success.<p>They are the best.<p>ChatGPT is walmart.<p>Gemini is kroger.<p>Claude is... idk your local grocer that is always amazing and costs more?
MAKE NO MISTAKES!
DO NOT HALLUCINATE!
FIX IT!
This is not an outage, Claude just gets lazier on Fridays.<p>Sometimes Claude wants more lunch breaks, takes a half day and leaves the desk early just like any human would. (since AI boosters like comparing LLMs to humans all the time) /s
If you're concerned about humans anthropomorphizing AI models, you might want to steer well clear of Anthropic, as their entire positioning (starting with the product name and continuing with UX choices and model releases) is built to attract the kind of researchers who are prone to believe in sentient machines.<p>They are going in the "Claude is alive" direction already, and that line of communication is likely going full throttle in the near future.
You joke, but I think that's a fair summary of why people don't mind one 9 of uptime in a key component of their development workflow.