4 comments

  • kristianp3 hours ago
    Recent model released a couple of weeks ago. &quot;Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token&quot;. Beats Kimi K2.5 and GLM 4.7 on more benchmarks than it loses to them.<p>Edit: there are 4 bit quants that can be run on an 128GB machine like a GB10 [1], AI Max+ 395, or mac studio.<p>[1] <a href="https:&#x2F;&#x2F;forums.developer.nvidia.com&#x2F;t&#x2F;running-step-3-5-flash-on-single-spark&#x2F;359457&#x2F;12" rel="nofollow">https:&#x2F;&#x2F;forums.developer.nvidia.com&#x2F;t&#x2F;running-step-3-5-flash...</a>
  • danieltanfh951 hour ago
    Hallucinates like crazy. use with caution. Tested it with a simple &quot;Find me championship decks for X pokemon&quot;, &quot;How does Y deck work&quot;. Opus 4.6, Deepseek and Kimi all performed well as expected.
  • wmf2 hours ago
    That reverse x axis sure is confusing.
    • esafak12 minutes ago
      I imagine they thought they&#x27;d look better this way. I don&#x27;t think they do.
  • SilverElfin1 hour ago
    So who exactly is StepFun? What is their business (how do they make money)? Each time I click “About Stepfun” somewhere on their website, it sends me to a generic landing page in a loop.
    • 0x19971 hour ago
      <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;StepFun" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;StepFun</a>
      • SilverElfin32 minutes ago
        Thanks. Do they sell any of these products today or is it more like research? I am not able to find anything relating to pricing on their website. Just a chatbot.
    • deaux1 hour ago
      Might want to give it a search.