2 comments

  • charcircuit39 minutes ago
    Why didn&#x27;t this author compare Llama 3 with GLM 5.2 (released 1 week ago) which is a more standard attention based LLM? To compare 2 separate families of LLMs and then pointing out that they are different is not a surprising result and detracts from the point the author is trying to make.<p><a href="https:&#x2F;&#x2F;sebastianraschka.com&#x2F;llm-architecture-gallery&#x2F;?compare=llama-3-8b%2Cglm-5-2#architecture-diff-tool" rel="nofollow">https:&#x2F;&#x2F;sebastianraschka.com&#x2F;llm-architecture-gallery&#x2F;?compa...</a><p>If you look at it, the diagrams are very similar, but the main differences are that the feedforward is replaced with a MoE (router to multiple feedforwards) and the model has a different attention implementation.
    • lproven3 minutes ago
      &gt; If you look at it, the diagrams are very similar,<p>The page <i>links to the same site you do</i>. No wonder it is similar -- the source is the same!
      • charcircuit2 minutes ago
        The source is the same in the original article too. He is using different those 2 diagrams from the same site are to justify his point on how much more complicated things have become by showing a complicated diagram on the right.
    • christopherwxyz18 minutes ago
      It’s written by AI.
      • lproven4 minutes ago
        [[citation needed]]<p>I am a professional writer and have been for over 30 years. (I do not use any form of LLM ever.) This means I read <i>a lot</i>. This also means that I have 30+ years of experience of readers not understanding what I wrote, or not getting further than the title, or not getting the main message, or inverting it in their heads, or inserting their own message and then complaining when I diverge, and an endless list of Ways People Do Not Get It.<p>I am also a trained TESOL teacher. Ability to capture gist is a skill we test for and measure, and many, maybe the majority, of native speakers don&#x27;t have it and don&#x27;t know.<p>In recent years I <i>constantly</i> see people going &quot;this is written by AI&quot; and I have yet to see <i>a single of of them</i> able to coherently prove their point. It&#x27;s all just feelings and hunches.<p>So I am calling you on this:<p>How do you know? Show your working. Demonstrate your case.
      • alecco3 minutes ago
        Grammarly and GPTZero say 0% AI.