Groq stopped serving Kimi K2 (1T params) when they got aquihired by NVIDIA, so I guess NVIDIA took most of the hardware in addition to the employees. The largest model they serve now is the relatively minuscule gpt-oss-120b.<p>The community support forum is also getting retired and there haven't been any posts by support employees in forever anyway, so they are probably gone, too. Also, the number of issues have been piling up, suggesting that the developers are gone as well. <a href="https://community.groq.com/c/forum/4" rel="nofollow">https://community.groq.com/c/forum/4</a> (archive link for when it goes down <a href="https://web.archive.org/web/20260602064050/https://community.groq.com/c/forum/4" rel="nofollow">https://web.archive.org/web/20260602064050/https://community...</a>)<p>To me, it looks they are trying to raise 650M with a few remaining (ancient) LPUs and no employees.
Before they removed it, I was using groq Kimi K2 model for a chat bot in small community site/chat. It was really good, seemed to have incredibly vast general world knowledge and the fast speed (400tok/s if I remember right) meant that chat users got a response instantly which was a much better experience compared to other SOTA models at the time.<p>On the bright side it looks like Cerebras might be serving Kimi K2.6 at 1000tok/s soon <a href="https://www.cerebras.ai/blog/cerebras-kimi-k2-Enterprise" rel="nofollow">https://www.cerebras.ai/blog/cerebras-kimi-k2-Enterprise</a>
Somehow I never got the hype around Groq. They served models with fast inference speed - that sounded great in theory - and as a user I was looking forward to use them. But, when I did try, I discovered that they are quantizing the models underneath. And they dont even disclose it. So I stopped using them.<p>The whole thing never made any sense to me - but I guess AI hype is a thing.
I've was part of an acquire-to-remove-an-annoyance, including having the better product that didn't win for niche reasons - which vaguely matches the situation here. Yes, NVIDIA can wind down Groq (unambiguously a better product in this case) even though it doesn't make sense.<p>That being said, there's still a chance that NVIDIA engineering is in the process of stripping it for parts. Or the lawyers are - maybe they have too much momentum with GPUs and just want ASICs out of the market.<p>This kind of innovation stifling acquisition should have been blocked. NVIDIA is a serious monopoly threat.
I don't really get the value proposition of groq as a user, the performance is really poor for the token price. Data centres on the other hand are becoming a commodity, and I don't see any reason a priori to invest in groq specifically for something like that.
> the performance is really poor for the token price<p>That doesn’t match my experience or the numbers:<p><a href="https://openrouter.ai/openai/gpt-oss-120b?sort=throughput" rel="nofollow">https://openrouter.ai/openai/gpt-oss-120b?sort=throughput</a>
Groq is considerably faster and better at inference, they have a totally superior product to Nvidia for inference based tasks, which will be the dominant concern in the future.<p>Plausibly, take all the Nvidia hype and multiply that by a factor and that's what 'Groq' could be worth.<p>And there is no real commodification - there's Nvidia, Cerebras, Groq ... not many otheres.
> they have a totally superior product to Nvidia for inference based tasks<p>They're not really competing with Nvidia because 1) Nvidia owns their chips now, and 2) Nvidia is not really an inference provider.
Groq is a slicon maker, the inference provider stuff is a path to market, it's not really the reflection of their market potential.<p>Nvidia doesn't own them or all their IP now, we don't quite know the terms of the deal.
Define “totally superior”?<p>Was this comment created using quantized llama 3?<p>I love Groq, but across every single line break in your post there is a glaring issue that is easy to refute with in 15 seconds, even without 300t/s of throughput.
You wasted all of your commentary on snark and sadly unfunny humour, and yet still managed to add nothing.<p>Groq is more performant for the growing categories of inference-based tasks, wherein Nvidia's advantage in inference depends bulk/batch processing which will make up a smaller category over time, in relative terms.<p>The future of AI Silicon is inference, and the cost structure of AI data centres is constrained around the current necessity to have 'high GPU utilization' otherwise, the cost / amortization of the chips doesn't work out.<p>That cost structure is a limitation of Nvidia architecture.<p>Groq serves a lot faster, and without the limiting batching requirement, which opens hosting arrangements common in most classical hosting scenarios aka without necessarily the high utilization requirements.<p>Groq has bespoke hardware, lack of CUDA, much lower memory desnsity obviously and they don't have the deep distribution networks and leverage over TSMC that Nvidia has - but pound for pound, were we to be able to 'fire up a server' for our inference needs, it would be Groq, not Nvidia that we'd turn to.<p>Were they not a later market entrant and didn't have those barriers to entry, they'd be gigantic.
Google has been releasing a new TPU generation every year since 2023 and the eight generation consists of a training and an inference optimized design.<p>Google's eight generation TPU inference chip has 384 MB of on-chip SRAM vs 500 MB for Groq's third generation LPU.
Haha. Groq is trash. It can’t be used for anything where reliable work is required.<p>Groq lasted how long in the social contract experiment? Libertarians always forget that rules and government aren’t tyrannical they are the mechanism to ensure bullies don’t destroy everyone’s freedom.
This is not about xAi.
Groq != grok
My company had a really terrible experience trying to use Groq, and I would NOT recommend anyone use their service if you need reliability. So many random errors, so many silly quirks.
as soon as i saw they switched to "call us for quotes" for the new models, i knew they are over.
Maybe Tom Ellis can give another AMA <a href="https://news.ycombinator.com/item?id=39429047">https://news.ycombinator.com/item?id=39429047</a> so we can find out.
The most bizarre thing here is the reporters. They intentionally misrepresented what happened. All the news stories from last year claimed Nvidia "acquired" Groq - and in the same story, quoted what <i>actually</i> happened, and pretended it didn't. It's like the journalists had some kind of group psychosis, pretending it was an acquisition. It wasn't. It was a <i>really</i> white-glove product rental.<p>From the actual Groq PR release:<p><i>"Groq announced that it has entered into a non-exclusive licensing agreement with Nvidia"</i> - <i>"As part of this agreement, Jonathan Ross, Groq’s Founder, Sunny Madra, Groq’s President, and other members of the Groq team will join Nvidia to help advance and scale the licensed technology"</i> - <i>"Groq will continue to operate as an independent company with Simon Edwards stepping into the role of Chief Executive Officer"</i> (<a href="https://groq.com/newsroom/groq-and-nvidia-enter-non-exclusive-inference-technology-licensing-agreement-to-accelerate-ai-inference-at-global-scale" rel="nofollow">https://groq.com/newsroom/groq-and-nvidia-enter-non-exclusiv...</a>)<p>Nothing about an acquisition there. It says Nvidia is licensing it, and that others can too. The execs work for Nvidia to integrate it into Nvidia's... something. And Groq the company remains the same as before.<p>There's also no official source for the amount Nvidia paid for the tech, or two unofficial ones. Journalistically speaking, this is some bullshit.<p>Why's the deal like this? No idea. Does it make sense? No idea. But it's not odd that Groq is continuing to raise more money, because they never stopped being a normally operating company.<p>If you want an explanation for <i>why</i> Nvidia would do this deal, my best offer is here (<a href="https://openrouter.ai/rankings" rel="nofollow">https://openrouter.ai/rankings</a>): Of the top 10 fastest AI models, Groq is the provider of 4 of them. And of the <i>price</i> of those top 10 fastest AI models, Groq is #1, #2, #3, and #5. And you wonder why someone's giving them a measly half billion dollars? They're the fastest cheapest thing on the market. If you don't understand the value of that, you really don't understand AI.
> There's also no official source for the amount Nvidia paid for the tech,<p>Did you honestly believe that this kind of deal doesn't leave a trace in financial disclosures?<p>The February 2026 Nvidia 10-K has this:<p>Cash flows from investing activities: Groq. -$13B.<p>And this: "Total consideration consists of $13.0 billion paid at closing and $4 billion, inclusive of imputed interest, payable within one year included in Accrued and Other Current Liabilities on our Consolidated Balance
Sheets."
Deals like these are quite rare, would be nice to know more about how the funding worked, but for customers, what are they doing? Total silence from Groq for a very long time now.
I don’t understand one part of the licensing here: if it was just a license, can’t they relicense the software and hardware of LPU3 to AMD? Or hire new software and hardware people?<p>The new designs were their main asset besides the amazing talent that went to NVIDIA, not the remaining DCs.
what's the confusion. Groq offers a fast inference solution that is currently unique (why do you think Nvidia paid $8 billion to end-run around the SEC to acquire the technology). This is good news as it ensures that Groq customers can can be assured continuity to use their service.
Do they have any good models yet?
It’s not X AI, that would be Grok with a ‘k’.
The list is:<p>-- allam-2-7b<p>-- canopylabs/orpheus-arabic-saudi<p>-- canopylabs/orpheus-v1-english<p>-- groq/compound<p>-- groq/compound-mini<p>-- llama-3.1-8b-instant<p>-- llama-3.3-70b-versatile<p>-- meta-llama/llama-4-scout-17b-16e-instruct<p>-- meta-llama/llama-prompt-guard-2-22m<p>-- meta-llama/llama-prompt-guard-2-86m<p>-- openai/gpt-oss-120b<p>-- openai/gpt-oss-20b<p>-- openai/gpt-oss-safeguard-20b<p>-- qwen/qwen3-32b<p>-- whisper-large-v3<p>-- whisper-large-v3-turbo
I’m confused by the confusion. Groq licensed their technology (sold part of their business) to Nvidia for a large amount of money and distributed the spoils to their investors. Seems quite normal? But then the Axios article says…<p>“Existing shareholders will receive the remaining cash distributions and then have the opportunity to invest into a new company”<p>New company? But Groq still exists and continued to exist.<p>“The bottom line: Don't be surprised if this becomes a new transaction template in the AI private markets.”<p>A transaction template? I don’t follow what was novel about this situation. The Meta not-acquisition-acquisition of Scale seems more novel.<p>I guess I feel like Zach’s confusion is because of the way Axios has presented what is happening to Groq. Looking at why actually happened with Groq, it seems like Axios are reporting it weird.<p>Unless Groq really is starting a new company in which case I am equally as confused.<p>edit: when announced last year it was announced as an asset acquisition <a href="https://www.cnbc.com/2025/12/24/nvidia-buying-ai-chip-startup-groq-for-about-20-billion-biggest-deal.html" rel="nofollow">https://www.cnbc.com/2025/12/24/nvidia-buying-ai-chip-startu...</a>
The interesting thing here isn't "how, logistically, is the Groq corporate entity able to raise more money?". That's straightforward.<p>Rather, the interesting thing and the topic of most of the article is "how, after Nvidia hired most of Groq's team and licensed all their IP, did Groq manage to convince investors to invest in the remaining corporate entity?"
There's nothing normal at all about the Nvidia Groq deal, it's hard to read in terms of what it means. A straight licensing deal would have been easier to ingest.
[dead]
[dead]
[dead]
[dead]
[flagged]
[flagged]
i like groq but models seem to have stagnated - looks like the company isn't focused on b2c anymore?