28 comments

  • etra0 1 hour ago
    LLMs have certainly become extremely useful for software engineers. They&#x27;re very convincing (and pleasers, too), and I&#x27;m still unsure about the future of our day-to-day job.<p>But the thing that has scared me the most is the trust the general public places in LLM output. I believe that for software engineers it&#x27;s really easy to see whether it&#x27;s being useful or not -- we can just run the code and see if the output is what we expected; if not, iterate on it and continue. There&#x27;s still a professional looking at what it produces.<p>By contrast, the day-to-day usage by the general public is getting really scary. I&#x27;ve had multiple members of my family using AI to ask for medical advice, life advice, and stuff where I still see hallucinations daily, but at the same time they&#x27;re so convincing that it&#x27;s hard for them not to trust them.<p>I have also seen fake quotes, fake investigations, and fake news being spread by LLMs that have affected decisions (maybe not crucial ones yet, but time will tell), and that&#x27;s a danger that most software engineers just gloss over.<p>Accountability is a big asterisk that everyone seems to ignore.
    • santadays 50 minutes ago
      I get this take, but given the state of the world (the US anyways), I find it hard to trust anyone with any kind of profit motive. I feel like any information can’t be taken as fact, it can just be rolled into your world view and discarded if useful or not. If you need to make a decision that can’t be backed out of that has real world consequences I think&#x2F;hope most people are learning to do as much due diligence as reasonable. LLMs seem at this moment to be trying to give reliable information. When they’ve been fine tuned to avoid certain topics it’s obvious. This could change but I suspect it will be hard to fine tune them too far in a direction without losing capability.<p>That said, it definitely feels as though keeping a coherent picture of what is actually happening is getting harder, which is scary.
      • etra0 8 minutes ago
        &gt; I find it hard to trust anyone with any kind of profit motive.<p>As much as this is true, and doctors, for example, can certainly profit (here in my country they don&#x27;t get any type of sponsor money AFAIK, other than having very high rates), there is <i>still</i> accountability.<p>We have built a society based on rules and laws: if someone does something that can harm you, you can follow the path to <i>at least</i> hold someone accountable (or try).<p>The same cannot be said about LLMs.
      • twoodfin 43 minutes ago
        <i>I feel like any information can’t be taken as fact, it can just be rolled into your world view and discarded if useful or not.</i><p>The concern, I think, is that for many that “discard function” is not, “Is this information useful?”. Instead: “Does this information reinforce my existing world view?”<p>That feedback loop and where it leads is potentially catastrophic at societal scale.
        • RussianCow 23 minutes ago
          This was happening well before LLMs, though. If anything, I have hope that LLMs might break some people out of their echo chambers if they ask things like &quot;do vaccines cause autism?&quot;
    • joshribakoff 7 minutes ago
      With code, even when it looks correct, it can be subtly wrong and traditional search engines don’t sit there and repeatedly pressure you into merging the PR.
    • raincole 24 minutes ago
      &gt; using AI to ask for medical advice<p>So the number of anti-vaxxers is going to plummet drastically in the following decade, I guess.
      • etra0 14 minutes ago
        I haven&#x27;t tried with this specific topic, but with LLMs being the pleasers they are, I doubt someone so committed to being an anti-vaxxer will be convinced by an LLM. If anything, the LLM will agree with them at some point.
      • preisschild 15 minutes ago
        Depends if they use lobotomized bots like Grok...
        • andsoitis 9 minutes ago
          &gt;&gt; So the number of anti-vaxxers is going to plummet drastically in the following decade, I guess.<p>&gt; Depends if they use lobotomized bots like Grok...<p>What are you on about?<p>For instance, asking Grok &quot;are vaccines safe&quot;, it has a pretty good reply, starting with <i>&quot;Yes, vaccines are overwhelmingly safe and one of the most effective public health interventions in history. Extensive scientific evidence from decades of research, including rigorous clinical trials, post-licensure monitoring, and systematic reviews by organizations like the WHO, CDC, NIH, and independent bodies, shows that the benefits of vaccination far outweigh the risks for individuals and populations.&quot;</i> and then rounding out the conversation talking about Key Evidence on Safety and Benefits; Risks vs. Benefits; Addressing Concerns.<p><a href="https:&#x2F;&#x2F;grok.com&#x2F;share&#x2F;c2hhcmQtNA_69e20553-2558-46be-9f21-6ad92c470367" rel="nofollow">https:&#x2F;&#x2F;grok.com&#x2F;share&#x2F;c2hhcmQtNA_69e20553-2558-46be-9f21-6a...</a><p>When I then ask &quot;I heard vaccines cause autism&quot;, it replies: <i>&quot;No, vaccines do not cause autism. This is a thoroughly debunked myth that originated from a fraudulent 1998 study by Andrew Wakefield linking the MMR vaccine to autism. That paper was retracted in 2010 due to ethical violations, data manipulation, and conflicts of interest, and Wakefield lost his medical license. Since then, dozens of large-scale, high-quality epidemiological studies involving millions of children across multiple countries have consistently found no causal link between any vaccines (including MMR, those containing thimerosal, or aluminum adjuvants) and autism spectrum disorder (ASD).&quot;</i><p>Seems pretty good to me.
  • bachmeier 2 hours ago
    &gt; Programmers resistance to AI assisted programming has lowered considerably. Even if LLMs make mistakes, the ability of LLMs to deliver useful code and hints improved to the point most skeptics started to use LLMs anyway: now the return on the investment is acceptable for many more folks.<p>I&#x27;m not a fan of this phrasing. Use of the terms &quot;resistance&quot; and &quot;skeptics&quot; implies they were wrong. It&#x27;s important we don&#x27;t engage in revisionist history that allows people in the future to say &quot;Look at the irrational fear programmers had of AI, which turned out to be wrong!&quot; The change occurred because LLMs are useful for programming in 2025 and the earliest versions weren&#x27;t for most programmers. It was the technology that changed.
    • mjr00 1 hour ago
      &quot;Skeptics&quot; is also a loaded term; what does it actually mean? I find LLMs incredibly useful for various programming tasks (generating code, searching documentation, and yes with enough setup agents can accomplish some tasks), but I also don&#x27;t believe they have actual intelligence, nor do I think they will eviscerate programming jobs, the same way that Python and JavaScript didn&#x27;t eviscerate programming jobs despite lowering the barrier to entry compared to Java or C. Does that make me a skeptic?<p>It&#x27;s easy to declare &quot;victory&quot; when you&#x27;re only talking about the maximalist position on one side (&quot;LLMs are totally useless!&quot;) vs the minimalist position on the other side (&quot;LLMs can generate useful code&quot;). The AI maximalist position of &quot;AI is going to become superintelligent and make all human work and intelligence obsolete&quot; has certainly not been proven.
      • Aurornis 1 hour ago
        No, that doesn’t make you a skeptic in this context.<p>The LLM skeptics claim LLM usefulness is an illusion. That the LLMs are a fad, and they produce more problems than they solve. They cite cherry picked announcements showing that LLM usage makes development slower or worse. They opened ChatGPT a couple times a few months ago, asked some questions, and then went “Aha! I knew it was bad!” when they encountered their first bad output instead of trying to work with the LLM to iterate like everyone who gets value out of them.<p>The skeptics are the people in every AI thread claiming LLMs are a fad that will go away when the VC money runs out, that the only reason anyone uses LLMs is because their boss forces them to, or who blame every bug or security announcement on vibecoding.
        • candiddevmike 24 minutes ago
          Skeptic here: I do think LLMs are a fad for software development. They&#x27;re an interesting phenomenon that people have convinced themselves MUST BE USEFUL in the context of software development, either through ignorance or a sense of desperation. I do not believe LLMs will be used long term for any kind of serious software development use cases, as the maintenance cost of the code they produce will run development teams into bankruptcy.<p>I also believe the current generations of LLMs (transformers) are technical dead ends on the path to real AGI, and the more time we spend hyping them, the less research&#x2F;money gets spent on discovering new&#x2F;better paths beyond transformers.<p>I wish we could go back to complaining about Kubernetes, focusing on scaling distributed systems, and solving more interesting problems than comparing winnings on a stochastic slot machine. I wish our industry was held to higher standards than jockeying bug-ridden MVP code as quickly as possible.
          • AYBABTME 4 minutes ago
            In this year of 2025, in December, I find it untenable for anyone to hold this position unless they have not yet given LLMs a good enough try. They&#x27;re undeniably useful in software development, particularly on tasks that are amenable to structured software development methodologies. I&#x27;ve fixed countless bugs in a tiny fraction of the time, entirely accelerated by the use of LLM agents. I get the most reliable results simply making LLMs follow the &quot;red test, green test&quot; approach, where the LLM first creates a reproducer from a natural language explanation of the problem, and then cooks up a fix. This works extremely well and reliably in producing high quality results.
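            A minimal sketch of that red-then-green loop, in Python. The bug report, the `parse_duration` helper, and both versions of it are hypothetical, invented here for illustration; nothing below comes from a real project. Step one encodes the bug report as a reproducer that fails against the buggy code, step two applies a candidate fix and reruns the same test.

```python
# Hypothetical bug report: parse_duration("90m") returns 90 instead of 5400 seconds.

def parse_duration_buggy(text: str) -> int:
    """Original (hypothetical) implementation: ignores the unit suffix."""
    return int(text.rstrip("smh"))


def parse_duration_fixed(text: str) -> int:
    """Candidate fix: scales the number by its unit suffix."""
    units = {"s": 1, "m": 60, "h": 3600}
    suffix = text[-1]
    if suffix in units:
        return int(text[:-1]) * units[suffix]
    return int(text)  # a bare number means seconds


def reproducer(parse) -> bool:
    """The 'red' test: the bug report turned into an executable check."""
    return parse("90m") == 5400 and parse("2h") == 7200 and parse("45") == 45


def red_green(buggy, fixed, test) -> str:
    """Accept a fix only if the test fails before it and passes after it."""
    if test(buggy):
        return "reproducer does not fail: it is not testing the bug"
    if not test(fixed):
        return "still red: the fix does not work"
    return "red, then green"


print(red_green(parse_duration_buggy, parse_duration_fixed, reproducer))
```

            The first check in `red_green` is the part that is easy to skip: a reproducer that already passes on the unfixed code proves nothing about the fix.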
        • lowsong 17 minutes ago
          &gt; They cite cherry picked announcements showing that LLM usage makes development slower or worse. They opened ChatGPT a couple times a few months ago, asked some questions, and then went “Aha! I knew it was bad!” when they encountered their first bad output instead of trying to work with the LLM to iterate like everyone who gets value out of them.<p>&quot;Ah-hah you stopped when this tool blew your whole leg off. If you&#x27;d stuck with it like the rest of us you could learn to only take off a few toes every now and again, but I&#x27;m confident that in time it will hardly ever do that.&quot;
        • mjr00 59 minutes ago
          &gt; No, that doesn’t make you a skeptic in this context.<p>That&#x27;s good to hear, but I have been called an AI skeptic a lot on hn, so not everyone agrees with you!<p>I agree though, there&#x27;s a certain class of &quot;AI denialism&quot; which pretends that LLMs don&#x27;t do <i>anything</i> useful, which in almost-2026 is pretty hard to argue.
          • emp17344 43 minutes ago
            On the other hand, ever since LLMs came on the scene, there’s been a vocal group claiming that AI will become intelligent and rapidly bring about human extinction - think the r&#x2F;singularity crowd. This seems just as untenable a position to hold at this point. It’s becoming clear that these things are simply tools. Useful in many cases, but that’s it.
          • Aurornis 57 minutes ago
            &gt; That&#x27;s good to hear, but I have been called an AI skeptic a lot on hn, so not everyone agrees with you!<p>The context was the article quoted, not HN comments.<p>I’ve been called all sorts of things on HN and been accused of everything from being a bot to a corporate shill here. You can find people applying labels and throwing around accusations in every thread here. It doesn’t mean much after a while.
        • somewhereoutth 35 minutes ago
          Not just their usefulness, but LLMs themselves are <i>worse</i> than an illusion, they are illusions that people often believe in unquestioningly - perhaps are being <i>forced</i> to believe in unquestioningly (because of mandates, or short term time pressures as a kind of race to the bottom).<p>When the ROI in training the next model is realised to be zero or even negative, then yes the money will run out. Existing models will soldier on for a while as (bankrupt) operators attempt to squeeze out the last few cents&#x2F;pennies, but they will become more and more out of date, and so the &#x27;age of LLMs&#x27; will draw to a close.<p>I confess my skeptic-addled brain initially (in hope?) misread the title of the post as &#x27;Reflections on the end of LLMs in 2025&#x27;. Maybe we&#x27;ll get that for 2026!
    • Aurornis 1 hour ago
      &gt; The change occurred because LLMs are useful for programming in 2025<p>But the skeptics and anti-AI commenters are almost as active as ever, even as we enter 2026.<p>The debate about the usefulness of LLMs has grown into almost another culture war topic. I still see a constant stream of anti-AI comments on HN and every other social platform from people who believe the tools are useless, the output is always unusable, people who mock any idea that operator skill has an impact on LLM output, or even claims that LLMs are a fad that will go away.<p>I’m a light LLM user ($20&#x2F;month plan type of usage) but even when I try to share comments about how I use LLMs or tips I’ve discovered, I get responses full of vitriol and accusations of being a shill.
      • zahlman 37 minutes ago
        It absolutely is culture war. I can easily imagine a less critical version of myself having ended up in that camp. It comes across to me that the perspective is informed by core values and principles surrounding what &quot;intelligence&quot; is.<p>I butted heads with many earlier on, and they did nothing to challenge that frame meaningfully. What <i>did</i> change is my perception of the set of tasks that <i>don&#x27;t require</i> &quot;intelligence&quot;. And the intuition pump for that is pretty easy to start — I didn&#x27;t suppose that Deep Blue heralded a dawn of true &quot;AI&quot;, either, but chess (and now Go) programs have only gotten even more embarrassingly stronger. Even if researchers and puzzle enthusiasts might still find positions that are easier for a human to grok than a computer.
    • 20k 1 hour ago
      It&#x27;s also significantly lowered because management is forcing AI on everyone at gunpoint, and saying that you&#x27;ll lose your job if you don&#x27;t love AI<p>That&#x27;s a very easy way to get everyone to pinky promise that they absolutely love AI to the ends of the earth
    • mvkel 1 hour ago
      One only has to go read the original vibe coding thread[0] from ...ten months ago(!) to see the resistance and skepticism loud and clear. The very first comment couldn&#x27;t be more loud about it.<p>It was possible to create things in gpt-3.5. The difference now is it aligns with the -taste- of discerning programmers, which has a little, but not everything, to do with technological capability.<p>[0]<a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=42913909">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=42913909</a>
      • HarHarVeryFunny 57 minutes ago
        &quot;Look Ma, no hands!&quot; vibe coding, as described by Karpathy, where you never look at the code being generated, was never a good idea, and still isn&#x27;t. Some people are now misusing &quot;vibe coding&quot; to describe any use of LLMs for coding, but there is a world of difference between using LLMs in an intelligent considered way as part of the software development process, and taking a hit on the bong and &quot;vibe coding&quot; another &quot;how many calories in this plate of food&quot; app.
      • zahlman 1 hour ago
        &gt; The difference now is it aligns with the -taste- of discerning programmers<p>This... doesn&#x27;t match the field reports I&#x27;ve seen here, nor what I&#x27;ve seen from poking around the repos for AI-powered Show HN submissions.
    • ookblah 1 hour ago
      you just need to hop into any AI related thread (even this one) and it&#x27;s pretty clear no one is revising anything, skepticism is there lol.
    • nl 1 hour ago
      There is some limited truth in this but we still see claims that LLMs are &quot;just next token predictors&quot; and &quot;just regurgitate code they read online&quot;. These are just uninformed and <i>wrong</i> views. It&#x27;s fair to say that these people were (are!) wrong.
      • mjr00 35 minutes ago
        &gt; we still see claims that LLMs are &quot;just next token predictors&quot; and &quot;just regurgitate code they read online&quot;. These are just uninformed and wrong views. It&#x27;s fair to say that these people were (are!) wrong.<p>I don&#x27;t think it&#x27;s fair to say that at all. How are LLMs <i>not</i> statistical models that predict tokens? It&#x27;s a big oversimplification but it doesn&#x27;t seem <i>wrong</i>, the same way that &quot;computers are electricity running through circuits&quot; isn&#x27;t a wrong statement. And in both cases, those statements are orthogonal to how useful they are.
      • zahlman 58 minutes ago
        <i>Objecting</i> to these claims is missing their point. Saying these things is really about denying that the LLMs &quot;think&quot; in any meaningful sense. (And the retorts I&#x27;ve seen in those discussions often imply very depressing and self-deprecating views of what it actually means to be human.)
        • emp17344 39 minutes ago
          Leave it to HN to be militantly misanthropic to sell chatbots.
    • HarHarVeryFunny 1 hour ago
      Yes, it&#x27;s a strange take. It&#x27;s not that programmers have changed their mind about unchanging LLMs, but rather that LLMs have changed and are now useful for coding, not just CoPilot autocomplete like the early ones.<p>What changed was the use of RLVR training for programming, resulting in &quot;reasoning&quot; models that are now attempting to optimize for a long-horizon goal (i.e. bias generation towards &quot;reasoning steps&quot; that during training led to a verified reward), as opposed to earlier LLMs where RL was limited to RLHF.<p>So, yeah, the programmers who characterized early pre-RLVR coding models as of limited use were correct. Now the models are trained differently and developers find them much more useful.
      • zahlman 56 minutes ago
        I thought I&#x27;d read a lot of these threads this year, and also discussed off-site the use of coding agents and the technology behind them; but this is genuinely the first time I&#x27;ve seen the term &quot;RLVR&quot;.
  • mrdependable 5 minutes ago
    These comments are a bit scary. It feels like LLMs managed to exploit some fault in the human psyche. I think the biggest danger of this technology is that people are not mentally equipped to handle it.
  • mwkaufma 30 minutes ago
    A list of unverifiable claims, stated authoritatively. The lady doth protest too much.
  • jimmydoe 24 minutes ago
    &gt; * The fundamental challenge in AI for the next 20 years is avoiding extinction.<p>sorry, I say it&#x27;s folding the laundry. with an aging population, that&#x27;s the most, if not only, useful thing.
  • dhpe 6 hours ago
    I have programmed 30K+ hours. Do LLMs make bad code: yes all the time (at the moment zero clue about good architecture). Are they still useful: yes, extremely so. The secret sauce is that you&#x27;d know exactly what to do without them.
    • dejv 2 hours ago
      &quot;Do LLMs make bad code: yes all the time (at the moment zero clue about good architecture). Are they still useful: yes, extremely so.&quot;<p>Well, let&#x27;s see how all the economics will play out. LLMs might be really useful, but as far as I can see all the AI companies are not making money on inference alone. We might be hitting a plateau in capabilities while money is being raised on the vision of this being a godlike tech that will change the world completely. Sooner or later the costs will have to meet reality.
      • Aurornis 1 hour ago
        &gt; but as far as I can see all the AI companies are not making money on inference alone<p>The numbers aren’t public, but from what companies have indicated it seems inference itself would be profitable if you could exclude all of the R&amp;D and training costs.<p>But this debate about startups losing money happens endlessly with every new startup cycle. Everyone forgets that losing money is an expected operating mode for a high growth startup. The models and hardware continue to improve. There is so much investment money accelerating this process that we have plenty of runway to continue improving before companies have to switch to full profit focus mode.<p>But even if we ignore that fact and assume they had to switch to profit mode tomorrow, LLM plans are currently so cheap that even a doubling or tripling isn’t going to be a problem. So what if the monthly plans start at $40 instead of $20 and the high usage plans go from $200 to $400 or even $600? The people using these for their jobs paying $10K or more per month can absorb that.<p>That’s not going to happen, though. If all model progress stopped right now the companies would still be capturing cheaper compute as data center buildouts were completed and next generation compute hardware was released.<p>I see these predictions as the current equivalent of all of the predictions that Uber was going to collapse when the VC money ran out. Instead, Uber quietly settled into steady operation, prices went up a little bit, and people still use Uber a lot. Uber did this without the constant hardware and model improvements that LLM companies benefit from.
      • NitpickLawyer 12 minutes ago
        &gt; but as far as I can see all the AI companies are not making money on inference alone.<p>This was the missed point on why GPT5 was such an important launch (quality of models and vibes aside). It brought the model sizes (and hence inference cost) to more sustainable numbers. Compared to previous SotA (GPT4 at launch, or o1&#x2F;3 series), GPT5 is 8x-12x cheaper! I feel that a lot of people never re-calibrated their views on inference.<p>And there&#x27;s also another place where you can verify your take on inference - the 3rd party providers that offer &quot;open&quot; models. They have 0 incentive to subsidise prices, because people that use them often don&#x27;t even know who serves them, so there&#x27;s 0 brand recognition (say when using models via openrouter).<p>These 3rd party providers have all converged towards a price point per billion parameters. And you can check those prices, and get an idea of what would be profitable and at what sizes. Models like dsv3.2 are really really cheap to serve, for what they provide (at least gpt5-mini equivalent I&#x27;d say).<p>So yes, labs could totally become profitable with inference alone. But they don&#x27;t want that, because there&#x27;s an argument to be made that the best will &quot;keep it all&quot;. I hope, for our sake as consumers that it isn&#x27;t the case. And so far this year it seems that it&#x27;s not the case. We&#x27;ve had all 4 big labs one-up each other several times, and they&#x27;re keeping each other honest. And that&#x27;s good for us. We get frontier level offerings at $10-25&#x2F;MTok (Opus, gpt5.2, gemini3pro, grok4), and we get highly capable yet extremely cheap models at $1.5-3&#x2F;MTok (gemini3-flash, gpt-minis, grok-fast, etc)
      • Workaccount2 2 hours ago
        If the tech plateaus today, LLM plans will go to $60-80&#x2F;mo, Chinese-hosted Chinese models will be banned (national security will be the given reason), and the AI companies will be making ungodly money.<p>I&#x27;m not gonna dig out the math again, but if AI usage follows the popularity path of cell phone usage (which seems to be the case), then the trillions invested have a payback period of 5-7 years. Not bad at all.
        • iLoveOncall 2 hours ago
          OpenAI would still lose money if the basic subscriptions were costing $500 and they had the same amount of subscribers as right now. There&#x27;s not a single model shop who&#x27;s ever making any money, let alone ungodly amounts.
          • Workaccount2 2 hours ago
            These costs you are referencing are training&#x2F;R&amp;D costs. Take those largely away, and you are left with inference costs, which are dirt cheap.<p>Now you have a world of people who have become accustomed to using AI for tons of different things, and the enshittification starts ramping up, and you find out how much people are willing to pay for their ChatGPT therapist.
      • ImprobableTruth 2 hours ago
        They&#x27;re not making money on inference alone because they blow ungodly amounts on R&amp;D. Otherwise it&#x27;d be a very profitable business.
      • nl 1 hour ago
        Anthropic - for one - is making lots of money on inference.
      • 20k 1 hour ago
        This is one of the reasons why I&#x27;m surprised to see so many people jump on board. We&#x27;re clearly in the &quot;release product for free&#x2F;cheap to gain customers&quot; portion of the enshittification plan, before the company starts making it completely garbage to extract as much money as possible from the userbase<p>Having good quality dev tools is non negotiable, and I have a feeling that a lot of people are going to find out the hard way that reliability, and the tool not being owned by a profit seeking company, is the #1 thing you want in your environment
    • qsort 5 hours ago
      One of the mental frameworks that convinced me is how much of a &quot;free action&quot; it is. Have the LLM (or the agent) churn on some problem and do something else. Come back and review the result. If you had to put significant effort into each query, I agree it wouldn&#x27;t be worth it, but you can just type something into the textbox and wait.
    • _rpxpx 5 hours ago
      OK, maybe. But how many programmers will know this in 10 years&#x27; time as use of LLMs is normalized? I&#x27;d like to hear what employers are already saying about recent graduates.
      • bartread 5 hours ago
        They’d have to be hiring recent graduates for you to hear that perspective.<p>And, as much as what I’ve just said is hyperbolically pessimistic, there is some truth to it.<p>In the UK a bunch of factors have coincided to put the brakes on hiring, especially smaller and mid-size businesses. AI is the obvious one that gets all the press (although how much it’s really to blame is open to question in my view), but the recent rise in employer AI contribution, and now (anecdotally) the employee rights bill have come together to make companies quite gunshy when it comes to hiring.
        • bartread 38 minutes ago
          *Employer NI contribution, not employer AI contribution - a pox be upon autocorrect
      • spaceman_2020 3 hours ago
        This is nothing new - entire industries and skills died out as the apprenticeship system and guilds were replaced by automation and factories
      • energy123 2 hours ago
        I&#x27;m uncertain that programming will be a major profession in 10 years.<p>Programming is more like math than creative writing. It&#x27;s largely verifiable, which is where RL is repeatedly proven to eventually achieve significantly better than human intelligence.<p>Our saving grace, for now, is that it&#x27;s not entirely verifiable because things like architectural taste are hard to put into a test. But I would not bet against it.
      • nutjob2 2 hours ago
        If they don&#x27;t learn that they won&#x27;t get very far.<p>This is true for everything, any tool you might use. Competent users of tools understand how they work and thus their limitations and how they&#x27;re best put to work.<p>Incompetents just fumble around and sometimes get things working.
      • QuiDortDine 2 hours ago
        hahah what are you talking about, there&#x27;s no such thing as long term!
    • bilsbie 1 hour ago
      I mean if you leaned heavily on stack overflow before AI then nothing really changes.<p>It’s basically the same idea but faster.
    • feverzsj 5 hours ago
      So, it&#x27;s like taking off your pants to fart.
  • danielfalbo 7 hours ago
    &gt; There are certain tasks, like improving a given program for speed, for instance, where in theory the model can continue to make progress with a very clear reward signal for a very long time.<p>This makes me think: I wonder if Goodhart&#x27;s law[1] may apply here. I wonder if, for instance, optimizing for speed may produce code that is faster but harder to understand and extend. Should we care or would it be ok for AI to produce code that passes all tests and is faster? Would the AI become good at creating explanations for humans as a side effect?<p>And if Goodhart&#x27;s law doesn&#x27;t apply, why not? Is it because we&#x27;re only doing RLVR fine-tuning on the last layers of the network so all the generality of the pre-training is not lost? And if this is the case, could this be a limitation in not being able to be creative enough to come up with move 37?<p>[1] <a href="https:&#x2F;&#x2F;wikipedia.org&#x2F;wiki&#x2F;Goodhart&#x27;s_law" rel="nofollow">https:&#x2F;&#x2F;wikipedia.org&#x2F;wiki&#x2F;Goodhart&#x27;s_law</a>
    • lemming 6 hours ago
      <i>I wonder if, for instance, optimizing for speed may produce code that is faster but harder to understand and extend.</i><p>This is generally true for code optimised by humans, at least for the sort of mechanical low level optimisations that LLMs are likely to be good at, as opposed to more conceptual optimisations like using better algorithms. So I suspect the same will be true for LLM-optimised code too.
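      A small illustration of that tradeoff, sketched in Python. Both functions are hypothetical examples written for this note. The second is the branch-free bit-counting style often used in C; in Python it is not expected to be faster, and it is shown only to make the point that a mechanical rewrite can pass exactly the same tests while becoming much harder to read and extend.

```python
def popcount_readable(x: int) -> int:
    """Count the set bits of a 32-bit value the obvious way."""
    return bin(x & 0xFFFFFFFF).count("1")


def popcount_bit_tricks(x: int) -> int:
    """Same result via branch-free parallel sums (valid for 32 bits only)."""
    x &= 0xFFFFFFFF
    x = x - ((x >> 1) & 0x55555555)                 # sums of adjacent bit pairs
    x = (x & 0x33333333) + ((x >> 2) & 0x33333333)  # sums per 4-bit group
    x = (x + (x >> 4)) & 0x0F0F0F0F                 # sums per byte
    return ((x * 0x01010101) & 0xFFFFFFFF) >> 24    # add the four bytes together


# The same test suite accepts both versions, so a reward based only on
# "tests pass and it runs faster" cannot tell them apart on readability.
samples = [0, 1, 2, 3, 0b1011, 255, 0x80000000, 0xFFFFFFFF, 12345678]
print(all(popcount_readable(v) == popcount_bit_tricks(v) for v in samples))
```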
    • username223 6 hours ago
      &gt; I wonder if, for instance, optimizing for speed may produce code that is faster but harder to understand and extend.<p>Superoptimizers have been around since 1987: <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Superoptimization" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Superoptimization</a><p>They generate fast code that is not meant to be understood or extended.
      • progval 6 hours ago
        But their output is (usually) executable code, and is not committed in a VCS. So the source code is still readable.<p>When people use LLMs to improve their code, they commit their output to Git to be used as source code.
        • Wowfunhappy 3 hours ago
          ...hmm, at some point we&#x27;ll need to find a new place to draw the boundaries, won&#x27;t we?<p>Until ~2022 there was a clear line between human-generated code and computer-generated code. The former was generally optimized for readability and the latter was optimized for speed at all cost.<p>Now we have computer-generated code in the human layer and it&#x27;s not obvious what it should be optimized for.
          • erichocean 1 hour ago
            &gt; <i>it&#x27;s not obvious what it should be optimized for</i><p>It should be optimized for readability by AI. If a human wants to know what a given bit of code does, they can just ask.
    • franktankbank 3 hours ago
      Ehh I think if it ends up being a half good architecture you wind up with a difficult to understand kernel that never needs touching.
  • torlok 6 hours ago
    This is a bunch of &quot;I believe&quot; and &quot;I think&quot; with no sources by a random internet person.
    • ctoth 6 hours ago
      Ah, I see you have discovered blogs! They&#x27;re a cool form of writing from like ~20 years ago which are still pretty great. Good thing they show up on this website, it&#x27;d be rather dull with only newspapers and journal articles doncha think?
    • ajoseps 6 hours ago
      he’s not a “random internet person”, he created Redis. Despite that, I don’t know how authoritative of a figure he is with respect to AI research. He’s definitely a prolific programmer though.
      • nurettin5 hours ago
        To be fair, you may find equally capable random people in this thread, doesn&#x27;t mean they speak with any kind of authority.
      • megous6 hours ago
        That still qualifies as a random internet person, wrt the topic. And I think the emphasis is on no sources and I beliefs and I thinks, in any case :)
      • XorNot6 hours ago
        There are plenty of Nobel laureates who well, do rest on their laurels and dive deep into pseudoscience after that.<p>Accomplishment in one field does not make one an expert, nor even particularly worth listening to, in any other. Certainly it doesn&#x27;t remove the burden of proof or necessity to make an actual argument based on more than simply insisting something is true.
        • 2snakes2 hours ago
          Careful with the scientism. The job of science is to explain the nature of reality, but we can only describe what we experience.
    • desbo6 hours ago
      Yeah, it’s called “Reflections”.
    • jacquesm2 hours ago
      Indeed, and, what do you &#x27;believe&#x27; or &#x27;think&#x27; in response?
    • dgellow2 hours ago
      It&#x27;s the personal blog of a famous internet person
    • matthewmacleod6 hours ago
      That is what a blog post is. Someone documenting what they think about a topic.<p>It&#x27;s not the case that every form of writing has to be an academic research paper. Sometimes people just think things, and say them – and they may be wrong, or they may be right. And they sometimes have some ideas that might change how you think about an issue as a result.
    • echelon6 hours ago
      &gt; by a random internet person.<p>The creator of Redis.
      • cinntaile6 hours ago
        Sure but quite a few claims in the article are about AI research. He does not have any qualifications there. If the focus was more on usefulness, that would be a different discussion and then his experience does add weight.
        • djdishsv2 hours ago
          &gt; smart, intelligent person gives opinion<p>&gt; woah buddy this persons opinion isn’t worth anything more than a random homeless person off the street. they’re not an expert in this field<p>Is there a term for this kind of pedantry? Obviously we can put more weight behind the words a person says if they’ve proven themselves trustworthy in prior areas - and we should! We want all people to speak and let the best idea win. If we fall back to allowing only expert opinions, that’s asking to get exploited. And it’s also important to know if antirez feels comfortable spouting nonsense.<p>This is like a basic cornerstone of a functioning society. Though, I realize this “no man is innately better than another, evaluate on merit” is mostly a western concept which might be some of my confusion.
          • blibble1 hour ago
            &gt; Obviously we can put more weight behind the words a person says if they’ve proven themselves trustworthy in prior areas - and we should!<p>no, you shouldn&#x27;t<p>this is how you end up with crap like vaccine denialism going mainstream<p>&quot;but he&#x27;s a doctor!&quot;
      • nutjob22 hours ago
        Don&#x27;t see how that gives him more credibility wrt AI.<p>His entirely unsupported statements about AGI are pretty useless, for instance.<p>So many people assume AGI is possible, yet no one has a concrete path to it or even a concrete definition of what it is or what form it might take.
    • dist-epoch6 hours ago
      What is a &quot;source&quot;? Isn&#x27;t it just &quot;another random internet person&quot;?
  • pton_xd1 hour ago
    &gt; For years, despite functional evidence and scientific hints accumulating, certain AI researchers continued to claim LLMs were stochastic parrots: probabilistic machines that would: 1. NOT have any representation about the meaning of the prompt. 2. NOT have any representation about what they were going to say. In 2025 finally almost everybody stopped saying so.<p>It&#x27;s interesting that Terence Tao just released his own blog post stating that they&#x27;re best viewed as stochastic generators. True he&#x27;s not an AI researcher, but it does sound like he&#x27;s using AI frequently with some success.<p>&quot;viewing the current generation of such tools primarily as a stochastic generator of sometimes clever - and often useful - thoughts and outputs may be a more productive perspective when trying to use them to solve difficult problems&quot; [0].<p>[0] <a href="https:&#x2F;&#x2F;mathstodon.xyz&#x2F;@tao&#x2F;115722360006034040" rel="nofollow">https:&#x2F;&#x2F;mathstodon.xyz&#x2F;@tao&#x2F;115722360006034040</a>
    • antirez52 minutes ago
      What happened recently is that all the serious AI researchers who were on the stochastic parrot side changed their point of view but, incredibly, people without a deep understanding of such matters, previously exposed to such arguments, are lagging behind and still repeat arguments that the people who popularized them would not repeat again.<p>Today there is no top AI scientist that will tell you LLMs are just stochastic parrots.
      • visarga0 minutes ago
        The stochastic parrot framing makes some assumptions, one of them being that LLMs generate from minimal input prompts, like &quot;tell me about Transformers&quot; or &quot;draw a cute dog&quot;. But when input provides substantial entropy or novelty, the output will not look like any training data. And longer sessions with multiple rounds of messages also deviate OOD. The model is doing work outside its training distribution.
      • geraneum37 minutes ago
        Now that you’re here, what do you mean by “scientific hints” in your first paragraph?
  • seu6 hours ago
    &gt; And I&#x27;ve vibe coded entire ephemeral apps just to find a single bug because why not - code is suddenly free, ephemeral, malleable, discardable after single use. Vibe coding will terraform software and alter job descriptions.<p>I&#x27;m not super up-to-date on all that&#x27;s happening in AI-land, but in this quote I can find something that most techno-enthusiasts seem to have decided to ignore: no, code is <i>not</i> free. There are immense resources (energy, water, materials) that go into these data centers in order to produce this &quot;free&quot; code. And the material consequences are terribly damaging to thousands of people. With the further construction of data centers to feed this free vibe coding style, we&#x27;re further destroying parts of the world. Well done, AGI loverboys.
    • fourside1 hour ago
      My guess is that “free” is meant in terms of the old definition where you’re not having to pay someone to create and maintain it. But yes, it’s important to realize there really is a cost here and one that can’t just be captured by a dollar amount.
    • dwaltrip2 hours ago
      Can you provide numbers relative to things many of us already do?<p>- drive to the store or to work<p>- take a shower<p>- eat meat<p>- fly on vacation<p>And so on... thanks!
    • Hendrikto6 hours ago
      You know what uses roughly 80 times more water in the US alone than water used by AI data centers world wide? Corn.
      • raddan6 hours ago
        Assuming your fact is true, that corn merely uses an order of magnitude or two more water than AI is surprising, given the utility of corn. It feeds the entire US (hundreds of millions of people), is used as animal feed (thus also feeding us), and is widely exported to feed other people. In the spirit of the “I think”s and “I believe”s of this blog post, I think that corn has a lot more utility than AI.
        • Hendrikto4 hours ago
          &gt; It feeds the entire US (hundreds of millions of people), is used as animal feed (thus also feeding us), and is widely exported to feed other people.<p>Not really. Most corn grown in the US isn’t even fit for consumption. It is primarily used for fermenting bioethanol.
  • piker6 hours ago
    &gt; There are certain tasks, like improving a given program for speed, for instance, where in theory the model can continue to make progress with a very clear reward signal for a very long time.<p>Super skeptical of this claim. Yes, if I have some toy poorly optimized python example or maybe a sorting algorithm in ASM, but this won’t work in any non-trivial case. My intuition is that the LLM will spin its wheels at a local minimum, the performance of which is overdetermined by millions of black-box optimizations in the interpreter or compiler, the signal from which is not fed back to the LLM.
    • andy996 hours ago
      There was a discussion the other day where someone asked Claude to improve a code base 200x <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=46197930">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=46197930</a>
      • exitb5 hours ago
        That’s most definitely not the same thing, as „improving a codebase” is an open ended task with no reliable metrics the agent could work against.
    • dist-epoch6 hours ago
      <a href="https:&#x2F;&#x2F;github.com&#x2F;algorithmicsuperintelligence&#x2F;openevolve" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;algorithmicsuperintelligence&#x2F;openevolve</a>
      • piker5 hours ago
        <a href="https:&#x2F;&#x2F;chatgpt.com&#x2F;backend-api&#x2F;estuary&#x2F;public_content&#x2F;enc&#x2F;eyJpZCI6Im1fNjk0Njg0MjkzNTEwODE5MWE2NzY5MmE4YWRjNTZiMTA6ZmlsZV8wMDAwMDAwMDljNzg3MWZkYTExODc2MDgxZDllYjAyOSIsInRzIjoiMjA0NDIiLCJwIjoicHlpIiwiY2lkIjoiMSIsInNpZyI6IjIxMDJlMDkzMGExNjNkYWY3OWI4ZTI4YmNhZDE5OThlNGFjYmQxNjQzNzQ2ODRiYmM3NDFlZmE1OGViMjQ5NzgiLCJ2IjoiMCIsImdpem1vX2lkIjpudWxsLCJjcyI6bnVsbCwiY3AiOm51bGwsIm1hIjpudWxsfQ==" rel="nofollow">https:&#x2F;&#x2F;chatgpt.com&#x2F;backend-api&#x2F;estuary&#x2F;public_content&#x2F;enc&#x2F;e...</a>
  • abricq5 hours ago
    &gt; * Programmers resistance to AI assisted programming has lowered considerably. Even if LLMs make mistakes, the ability of LLMs to deliver useful code and hints improved to the point most skeptics started to use LLMs anyway: now the return on the investment is acceptable for many more folks.<p>Could not agree more. I myself started 2025 being very skeptical, and finished it very convinced about the usefulness of LLMs for programming. I have also seen multiple colleagues and friends go through the same change of appreciation.<p>I noticed that for certain tasks, our productivity can be multiplied by 2 to 4. Hence my doubts: are we going to be too many developers &#x2F; software engineers? What will happen to the rest of us?<p>I assume that other fields (other than software-related) should also benefit from the same productivity boosts. I wonder if our society is ready to accept that people should work less. I think the more likely continuation is that companies will either hire less, or fire more, instead of accepting to pay the same for fewer hours of human work.
    • danielfalbo5 hours ago
      &gt; Are we going to be too many developers &#x2F; software engineers ? What will happen for the rests of us?<p>I propose that we should raise the bar for the quality of software now.
      • abricq5 hours ago
        Yes, certainly agree. A few days ago there was a blog post here claiming that formal verification would become much more widely used with AI. The author claimed that AI will help us overcome the difficulty barrier of writing formal proofs.
    • antihipocrat5 hours ago
      I like to think of it as adding new lanes to a highway. More will be delivered until it all jams up again.
  • fleebee6 hours ago
    &gt; The fundamental challenge in AI for the next 20 years is avoiding extinction.<p>That&#x27;s a weird thing to end on. Surely it&#x27;s worth more than one sentence if you&#x27;re serious about it? As it stands, it feels a bit like the fearmongering Big Tech CEOs use to drive up the AI stocks.<p>If AI is really that powerful and I should care about it, I&#x27;d rather hear about it without the scare tactics.
    • Recursing6 hours ago
      I think <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Existential_risk_from_artificial_intelligence#History" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Existential_risk_from_artifici...</a> has much better arguments than the LessWrong sources in other comments, and they weren&#x27;t written by Big Tech CEOs.<p>Also &quot;my product will kill you and everyone you care about&quot; is not as great a marketing strategy as you seem to imply, and Big Tech CEOs are not talking about risks anymore. They currently say things like &quot;we&#x27;ll all be so rich that we won&#x27;t need to work and we will have to find meaning without jobs&quot;
    • tejohnso2 hours ago
      What makes it a scare tactic? There are other areas in which extinction is a serious concern and people don&#x27;t behave as though it&#x27;s all that scary or important. It&#x27;s just a banal fact. And for all of the extinction threats, AI included, it&#x27;s very easy to find plenty of deep dive commentary if you care.
    • grodriguez1006 hours ago
      I would say yes, everyone should care about it.<p>There is plenty of material on the topic. See for example <a href="https:&#x2F;&#x2F;ai-2027.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;ai-2027.com&#x2F;</a> or <a href="https:&#x2F;&#x2F;www.lesswrong.com&#x2F;posts&#x2F;uMQ3cqWDPHhjtiesc&#x2F;agi-ruin-a-list-of-lethalities" rel="nofollow">https:&#x2F;&#x2F;www.lesswrong.com&#x2F;posts&#x2F;uMQ3cqWDPHhjtiesc&#x2F;agi-ruin-a...</a>
      • dkdcio6 hours ago
        fear mongering science fiction, you may as well cite Dune or Terminator
        • defrost6 hours ago
          There&#x27;s arguably more dread and quiet constrained horror in <i>With Folded Hands ...</i> (1947)<p><pre><code> Despite the humanoids&#x27; benign appearance and mission, Underhill soon realizes that, in the name of their Prime Directive, the mechanicals have essentially taken over every aspect of human life. No humans may engage in any behavior that might endanger them, and every human action is carefully scrutinized. Suicide is prohibited. Humans who resist the Prime Directive are taken away and lobotomized, so that they may live happily under the direction of the humanoids. </code></pre> ~ <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;With_Folded_Hands_" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;With_Folded_Hands_</a>...
          • XorNot6 hours ago
            This hardly disproves the point: no one is taking this topic seriously. They&#x27;re just making up a hostile scenario from science fiction and declaring that&#x27;s what&#x27;ll happen.
        • lm284695 hours ago
          Lesswrong looks like a forum full of terminally online neckbeards who discovered philosophy 48 hours ago, you can dismiss most of what you read there don&#x27;t worry
    • dist-epoch5 hours ago
      Yeah, well known marketing trick that Big Companies do.<p>Oil companies: we are causing global warming with all this carbon emissions, are you scared yet? so buy our stock<p>Pharma companies: our drugs are unsafe, full of side effects, and kill a lot of people, are you scared yet? so buy our stock<p>Software companies: our software is full of bugs, will corrupt your files and make you lose money, are you scared yet? so buy our stock<p>Classic marketing tactics, very effective.
    • VladimirGolovin6 hours ago
      This has been well discussed before, for example in this book: <a href="https:&#x2F;&#x2F;ifanyonebuildsit.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;ifanyonebuildsit.com&#x2F;</a>
  • register5 hours ago
    Where can I learn more about how chain of thought really affects LLM performance? I read the seminal paper but all it says is that it&#x27;s basically another prompt engineering technique that improves accuracy.
    • HarHarVeryFunny33 minutes ago
      Chain of thought, now including &quot;reasoning&quot;, is basically a workaround for the simplistic nature of the Transformer neural network architecture that all LLMs are based on.<p>The two main limitations of the Transformer that it helps with are:<p>1) A Transformer is just a fixed-size stack of layers, with a one-way flow of data through the layers from input to output. The fixed number of layers equates to how many &quot;thought&quot; steps the LLM can put into generating each word of output, but good responses to harder questions may require many more steps and iterative thinking...<p>The idea of &quot;think step by step&quot;, aka chain of thought, is to have the model break its response down into a sequence of steps, each building on what came before, so that the scope of each step is within the capability of the fixed number of layers of the transformer.<p>2) A Transformer has extremely limited internal memory from one generated word to the next, so telling the model to go one step at a time, feeding its own output back in as input, in effect makes the model&#x27;s output a kind of memory that makes up for this.<p>So, chain of thought prompting ultimately gives the model more thinking steps (more words generated), together with memory of what it is thinking, in order to be able to generate a better response.
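A toy sketch of the two points above, under loud assumptions: `step` and `generate` are hypothetical names, and `step` is not a Transformer, just a stateless function with a fixed compute budget (at most one addition per call). It can only sum a list because each intermediate total is written to the output and fed back in as input on the next call.

```python
def step(prompt, scratchpad):
    """One fixed-budget 'forward pass' (a stand-in, not a real model).

    The function keeps no state between calls. Everything it knows
    comes from the prompt plus the outputs generated so far.
    """
    consumed = len(scratchpad)           # prompt numbers already folded in
    if consumed == len(prompt):
        return None                      # nothing left to add: stop
    running = scratchpad[-1] if scratchpad else 0
    return running + prompt[consumed]    # the single addition allowed per step


def generate(prompt):
    """Autoregressive loop: each output is appended and fed back as input."""
    scratchpad = []
    while (nxt := step(prompt, scratchpad)) is not None:
        scratchpad.append(nxt)           # the output doubles as memory
    return scratchpad


print(generate([3, 4, 5]))  # running totals, the last one is the full sum
```

Without the scratchpad, a single call to `step` could never sum more than two numbers; with it, the number of steps grows with the problem while the work per step stays fixed.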
  • Fraterkes6 hours ago
    It’s interesting that half the comments here are talking about the extinction line when, now that we’re nearly entering 2026, I feel the 2027 predictions have been shown to be pretty wrong so far.
    • squidbeak1 hour ago
      &gt; I feel the 2027 predictions have been shown to be pretty wrong so far<p>Does your clairvoyance go any further than 2027?
      • AnimalMuppet1 hour ago
        I don&#x27;t know that it&#x27;s &quot;clairvoyance&quot;. We&#x27;re two weeks from 2026. We might be able to see somewhat more than we do now if this was going to turn into AGI by 2027.<p>If you assume that we&#x27;re only one breakthrough away (or zero breakthroughs - just need to train harder), then the step could happen any time. If we&#x27;re more than one away, though, then where are they? Are they all going to happen in the next two years?<p>But everybody&#x27;s guessing. We don&#x27;t <i>know</i> right now whether AGI is possible at current hardware levels. If it is N breakthroughs away, we all have our own guesses of approximately what N is.<p>My guess is that we are more than one breakthrough away. Therefore, one can look at the current state of affairs and say that we are unlikely to get to AGI by 2027.
  • Aiisnotabubble6 hours ago
    What also happens, irrespective of AGI: global RL<p>Around the world people ask an LLM and get a response.<p>Just grouping and analysing these questions and solving them once centrally and then making the solution available again is huge.<p>Linearly solving the most asked questions and then the next one then the next will make whatever system is behind it smarter every day.
    • danielfalbo6 hours ago
      Exactly. The singularity is already here. It&#x27;s just &quot;programmers + AI&quot; as a whole, rather than independent self-improvements of the AI.<p>I wonder how a &quot;programmers + AI&quot; self-improving loop is different from an &quot;AI only&quot; one.
      • bryanrasmussen6 hours ago
        The AI only one presumably has a much faster response time. The singularity is thus not here because programmer time is still the bottleneck, whereas as I understand in the singularity time is no longer a bottleneck component.
      • Aiisnotabubble5 hours ago
        AGI will be faster as it doesn&#x27;t need an initial question.<p>AGI will also be generic.<p>LLMs are already very impressive though
  • agumonkey6 hours ago
    There&#x27;s videos about Diffusion LLMs too, apparently getting rid of the linear token generation. But I&#x27;m no ML engineer.
    • nephanth4 hours ago
      As someone who worked on transformer-based diffusion models before (not for language though), i can say one thing: they&#x27;re hard.<p>Denoising diffusion models benefited a lot from the u-net, which is a pretty simple network (compared to a transformer) and very well-adapted to the denoising task. Plus diffusion on images is great to research because it&#x27;s very easy to visualize, and therefore to wrap your head around<p>Doing diffusion on text is a great idea, but my intuition is it will prove more challenging, and probably take a while before we get something working
      • agumonkey2 hours ago
        Thanks. Do you see that part of the field as plateauing or ramping up (even taking into account the difficulty).<p>If you know labs &#x2F; researchers on the topic, i&#x27;d love to read their page &#x2F; papers
  • erichocean1 hour ago
    &gt; <i>1. NOT have any representation about the meaning of the prompt.</i><p>This one is bizarre, if true (I&#x27;m not convinced it is).<p>The entire purpose of the attention mechanism in the transformer architecture is to build this representation, in many layers (conceptually: in many layers <i>of abstraction</i>).<p>&gt; <i>2. NOT have any representation about what they were going to say.</i><p>The only place for this to go is in the model weights. More parameters means &quot;more places to remember things&quot;, so clearly that&#x27;s <i>at least</i> a representation.<p>Again: who was pushing this belief? Presumably not researchers, these are fundamental properties of the transformer architecture. To the best of my knowledge, they are not disputed.<p>&gt; <i>I believe [...] it is not impossible they get us to AGI even without fundamentally new paradigms appearing.</i><p>Same, at least for the OpenAI AGI definition: &quot;An AI system that is at least as intelligent as a normal human, and is able to do any economically valuable work.&quot;
    • zahlman26 minutes ago
      &gt; This one is bizarre, if true (I&#x27;m not convinced it is).<p>&gt; The entire purpose of the attention mechanism in the transformer architecture is to build this representation, in many layers (conceptually: in many layers of abstraction).<p>I think this is really about a hidden (i.e. not readily communicated) difference in what the word &quot;meaning&quot; means to different people.
  • ctoth6 hours ago
    &gt; The fundamental challenge in AI for the next 20 years is avoiding extinction.<p>So nice to see people who think about this seriously converge on this. Yes. Creating something smarter than you was always going to be a sketchy prospect.<p>All of the folks insisting it just couldn&#x27;t happen or ... well, there have just been so many objections. The goalposts have walked from one side of the field to the other, and then left the stadium, went on a trip to Europe, got lost in a beautiful little village in Norway, and decided to move there.<p>All this time though, the prospect of instantiating something smarter than you (and yes, it will be smarter than you even if it&#x27;s at human level because of electronic speeds...) This whole idea is just cursed and we should not do the thing.
    • cheschire6 hours ago
      &quot;Your scientists were so preoccupied with whether or not they could, they didn&#x27;t stop to think if they should.&quot;
  • a_bonobo6 hours ago
    &gt;* For years, despite functional evidence and scientific hints accumulating, certain AI researchers continued to claim LLMs were stochastic parrots: probabilistic machines that would: 1. NOT have any representation about the meaning of the prompt. 2. NOT have any representation about what they were going to say. In 2025 finally almost everybody stopped saying so.<p>Man, Antirez and I walk in very different circles! I still feel like LLMs fall over backwards once you give them an &#x27;unusual&#x27; or &#x27;rare&#x27; task that isn&#x27;t likely to be presented in the training data.
    • oersted6 hours ago
      LLMs certainly struggle with tasks that require knowledge that is not provided to them (at significant enough volume&#x2F;variance to retain it). But this is to be expected of any intelligent agent, it is certainly true of humans. It is not a good argument to support the claim that they are Chinese Rooms (unthinking imitators). Indeed, the whole point of the Chinese Room thought experiment was to consider if that distinction even mattered.<p>When it comes to being able to do novel tasks on known knowledge, they seem to be quite good. One also needs to consider that problem-solving patterns are also a kind of (meta-)knowledge that needs to be taught, either through imitation&#x2F;memorisation (Supervised Learning) or through practice (Reinforcement Learning). They can be logically derived from other techniques to an extent, just like new knowledge can be derived from known knowledge in general, and again LLMs seem to be pretty decent at this, but only to a point. Regardless, all of this is definitely true of humans too.
      • feverzsj6 hours ago
        In most cases, LLMs have the knowledge (data). They just can&#x27;t generalize from it like humans do. They can only reflect explicit things that are already there.
        • oersted5 hours ago
          I don&#x27;t think that&#x27;s true. Consider that the &quot;reasoning&quot; behaviour trained with Reinforcement Learning in the last generation of &quot;thinking&quot; LLMs is trained on quite narrow datasets of olympiad math &#x2F; programming problems and various science exams, since exact unambiguous answers are needed to have a good reward signal, and you want to exercise it on problems that require non-trivial logical derivation or calculation. Then this reasoning behaviour gets generalised very effectively to a myriad of contexts the user asks about that have nothing to do with that training data. That&#x27;s just one recent example.<p>Generally, I use LLMs routinely on queries definitely no-one has written about. Are there similar texts out there that the LLM can put together and get the answer by analogy? Sure, to a degree, but at what point are we gonna start calling that intelligent? If that&#x27;s not generalisation I&#x27;m not sure what is.<p>To what degree can you claim as a human that you are not just imitating knowledge patterns or problem-solving patterns, abstract or concrete, that you (or your ancestors) have seen before? Either via general observation or through intentional trial-and-error. It may be a conscious or unconscious process, many such patterns get baked into what we call intuition.<p>Are LLMs as good as humans at this? No, of course, sometimes they get close. But that&#x27;s a question of degree, it&#x27;s no argument to claim that they are somehow qualitatively lesser.
    • jmfldn6 hours ago
      &quot;In 2025 finally almost everybody stopped saying so.&quot;<p>I haven&#x27;t.
      • dist-epoch5 hours ago
        Some people are slower to understand things.
        • jmfldn5 hours ago
          Well exactly ;)
    • barnabee5 hours ago
      I don’t think this is quite true.<p>I’ve seen them do fine on tasks that are clearly not in the training data, and it seems to me that they struggle when some particular type of task or solution or approach might be something they haven’t been exposed to, rather than the exact task.<p>In the context of the paragraph you quoted, that’s an important distinction.<p>It seems quite clear to me that they are getting at the meaning of the prompt and are able, at least somewhat, to generalise and connect aspects of their training to “plan” and output a meaningful response.<p>This certainly doesn’t seem all that deep (at times frustratingly shallow) and I can see how at first glance it might look like everything was just regurgitated training data, but my repeated experience (especially over the last ~6-9 months) is that there’s something more than that happening, which feels like what Antirez was getting at.
    • Kiro2 hours ago
      Give me an example of one of those rare or unusual tasks.
      • recursive2 minutes ago
        Set the font size of a simple field in openxml. Doesn&#x27;t even seem that rare. It said to add a run inside and set the font there. Didn&#x27;t do anything. I ended up reverse engineering the output out of ms word. This happened yesterday.
  • bgwalter46 minutes ago
    They are very advanced stochastic parrots that allow AI invested authors to suddenly write in perfect English.<p>If Antirez has never gotten an LLM to perform an absolutely embarrassing mistake, he must be very lucky or we should stop listening to him.<p>Programmers&#x27; resistance has not weakened. Since the ORCL drop of 40% anti-LLM opinions are censored and downvoted here. Many people have given up, and we always get articles from the same LLM influencers.
  • lowsong27 minutes ago
    I&#x27;m impressed that such a short post can be so categorically incorrect.<p>&gt; For years, despite functional evidence and scientific hints accumulating, certain AI researchers continued to claim LLMs were stochastic parrots<p>&gt; In 2025 finally almost everybody stopped saying so.<p>There is still no evidence that LLMs are anything beyond &quot;stochastic parrots&quot;. There is no proof of any &quot;understanding&quot;. This is seeing faces in clouds.<p>&gt; I believe improvements to RL applied to LLMs will be the next big thing in AI.<p>With what proof or evidence? Gut feeling?<p>&gt; Programmers resistance to AI assisted programming has lowered considerably.<p>Evidence is the opposite, most developers do not trust it. <a href="https:&#x2F;&#x2F;survey.stackoverflow.co&#x2F;2025&#x2F;ai#2-accuracy-of-ai-tools" rel="nofollow">https:&#x2F;&#x2F;survey.stackoverflow.co&#x2F;2025&#x2F;ai#2-accuracy-of-ai-too...</a><p>&gt; It is likely that AGI can be reached independently with many radically different architectures.<p>There continues to be no evidence beyond &quot;hope&quot; that AGI is even possible, let alone that Transformer models are the path there.<p>&gt; The fundamental challenge in AI for the next 20 years is avoiding extinction.<p>Again, nothing more than a gut feeling. Much like all the other AI hype posts this is nothing more than &quot;well LLMs sure are impressive, people say they&#x27;re not, but I think they&#x27;re wrong and we will make a machine god any day now&quot;.
  • ur-whale7 hours ago
    Not sure I understand the last sentence:<p>&gt; The fundamental challenge in AI for the next 20 years is avoiding extinction.
    • danielfalbo7 hours ago
      I think he&#x27;s referring to AI safety.<p><a href="https:&#x2F;&#x2F;lesswrong.com&#x2F;posts&#x2F;uMQ3cqWDPHhjtiesc&#x2F;agi-ruin-a-list-of-lethalities" rel="nofollow">https:&#x2F;&#x2F;lesswrong.com&#x2F;posts&#x2F;uMQ3cqWDPHhjtiesc&#x2F;agi-ruin-a-lis...</a>
      • grodriguez1006 hours ago
        For a perhaps easier to read intro to the topic, see <a href="https:&#x2F;&#x2F;ai-2027.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;ai-2027.com&#x2F;</a>
        • dkdcio6 hours ago
          or read your favorite sci-fi novel, or watch Terminator. this is pure bs by a charlatan
    • chrishare7 hours ago
      He&#x27;s referring to humanity, I believe
      • A_D_E_P_T6 hours ago
        It&#x27;s ambiguous. It could go the other way. He could be referring to that oldest of science fiction tropes: The Butlerian Jihad, the human revolt against thinking machines.
        • AnimalMuppet1 hour ago
          Meh. I think the more likely scenario is the <i>financial</i> extinction of the AI companies.
  • alexgotoi6 hours ago
    &gt; * The fundamental challenge in AI for the next 20 years is avoiding extinction.<p>This reminded me of the movie Don’t Look Up, where they basically gambled with human extinction.
  • rckt6 hours ago
    &gt; Even if LLMs make mistakes, the ability of LLMs to deliver useful code and hints improved to the point most skeptics started to use LLMs anyway<p>Here we go again. Statements with the single source in the head of the speaker. And it’s also not true. The llms still produce bad&#x2F;irrelevant code at such rate that you can spend more time prompting than doing things yourself.<p>I’m tired of this overestimation of llms.
    • barnabee6 hours ago
      Even where they are not directly using LLMs to write the most critical or core code, nearly every skeptic I know has started using LLMs at the very least to do things like write tests, build tools, write glue code, help to debug or refactor, etc.<p>Your statement not only suffers from the same problem of coming solely from your own head, with no evidence that you&#x27;ve actually tried to learn to use these tools, but it also goes against the weight of evidence that I see both in my professional network and online.
      • rckt5 hours ago
        I just want people making statements like the author’s to be more specific about how exactly the LLMs are being used. Otherwise they contribute to this belief that LLMs are a magical tool that can do anything.<p>I am aware of simple routine tasks that LLMs can do. This doesn’t change anything about what I said.
        • danielbln1 hour ago
          All you had to do was scroll down further and read the next couple of posts, where the author is more specific about how they used LLMs.<p>I swear, the so-called critics need everything spoon-fed.
        • Kiro1 hour ago
          Sorry, but we&#x27;re way past that. It&#x27;s you who need to provide examples of tasks it can&#x27;t do.
      • AnimalMuppet1 hour ago
        You need to meet more skeptics. (Or maybe I do.) In my world, it&#x27;s much more rare than you say.
    • locknitpicker7 minutes ago
      &gt; Here we go again. Statements with the single source in the head of the speaker. And it’s also not true.<p>You&#x27;re making the same sort of baseless claim you are criticising the blogger for making. Spewing baseless claims hardly moves any discussion forward.<p>&gt; The llms still produce bad&#x2F;irrelevant code at such rate that you can spend more time prompting than doing things yourself.<p>If that is your personal experience then I regret to tell you that it is only the reflection of your own inability to work with LLMs and coding agents. Meanwhile, I personally manage to effectively use LLMs for anything from small refactoring needs to large software architecture designs, including generating fully working MVPs in one-shot agent prompts. From this alone it&#x27;s rather obvious whose statements are more aligned with reality.
    • iamflimflam16 hours ago
      But you have just repeated what you are complaining about.
      • rckt5 hours ago
        Do you want me to spend time coming up with a quality response to a lazy statement? It’s like fighting windmills. I’m fine with having my say the way I did.
    • xiconfjs6 hours ago
      My personal experience: if I can find a solution on Stack Overflow etc., the LLM will produce working and fundamentally correct code. If I can’t find an existing solution on these sites, the LLM hallucinates like crazy (non-existent functions&#x2F;modules&#x2F;plugins, protocol features which aren’t specified, and even GitHub repos which never existed). So, as stated by many people online before: for low-hanging fruit, LLMs are a totally viable solution.
      • danielbln1 hour ago
        I don&#x27;t remember the last time Claude Code hallucinated some library, as it will check the packages, verify with the linter, run a test import and so on.<p>Are you talking about punching something into some LLM web chat that&#x27;s disconnected from your actual codebase and has tooling like web search disabled? If so, that&#x27;s not really the state of the art of AI assisted coding, just so you know.
  • HellDunkel5 hours ago
    [flagged]
    • danielbln1 hour ago
      Must feel nice to let yourself be coddled by in-group&#x2F;out-group thinking like that. &quot;I&#x27;ve decided that AI is bad and useless, therefore anyone disagreeing must be an AI bro&quot;.
  • feverzsj6 hours ago
    Seems they also want some AI money[0]. Guess I&#x27;ll keep using Valkey.<p>[0] <a href="https:&#x2F;&#x2F;redis.io&#x2F;redis-for-ai&#x2F;" rel="nofollow">https:&#x2F;&#x2F;redis.io&#x2F;redis-for-ai&#x2F;</a>
    • danielfalbo6 hours ago
      &gt; they<p>I&#x27;m not sure antirez is involved in any business decision making process at Redis Ltd.<p>He may not be part of &quot;they&quot;.
      • antirez4 hours ago
        I&#x27;m not involved in business decisions, and while I&#x27;m very AI positive, I believe Redis as a company should focus on Redis fundamentals, so my piece has zero alignment with what I hope for the company.
    • sibellavia4 hours ago
      In any case, what would be the problem? The page you mentioned simply illustrates how the product can be used in a specific domain; it doesn&#x27;t seem forced to me.
    • bgwalter43 minutes ago
      Conflict of interest and disclosure posts are frequently downvoted.