Step 3.5 Flash: Fast Enough to Think. Reliable Enough to Act

(static.stepfun.com)

30 points by kristianp 3 hours ago

4 comments

kristianp 3 hours ago
Recent model released a couple of weeks ago. "Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token". Beats Kimi K2.5 and GLM 4.7 on more benchmarks than it loses to them.

Edit: there are 4-bit quants that can be run on a 128GB machine like a GB10 [1], AI Max+ 395, or Mac Studio.

[1] https://forums.developer.nvidia.com/t/running-step-3-5-flash-on-single-spark/359457/12
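As a rough check on the 128GB claim, here is a minimal back-of-the-envelope sketch. It assumes exactly 4 bits per weight and counts weights only; KV cache, activations, and quantization metadata (scales, zero points) are ignored, so real quant files will be somewhat larger. The function name `weight_memory_gb` is made up for illustration.

```python
# Back-of-the-envelope weight-memory estimate for a quantized model.
# Assumptions (not from the model card): exactly 4 bits per weight,
# weights only, no KV cache, activations, or quantization metadata.

def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB, using 1 GB = 1e9 bytes."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9

# All 196B parameters must be resident in memory, even though only
# about 11B are activated per token.
total_gb = weight_memory_gb(196, 4)
active_gb = weight_memory_gb(11, 4)

print(f"All weights at 4-bit:    ~{total_gb:.1f} GB")
print(f"Active weights per token: ~{active_gb:.1f} GB")
```

Under these assumptions the full weights come to roughly 98 GB, which is why a 128GB machine is plausible, while the per-token active set is only about 5.5 GB.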
danieltanfh95 1 hour ago
Hallucinates like crazy. Use with caution. Tested it with a simple "Find me championship decks for X pokemon", "How does Y deck work". Opus 4.6, Deepseek and Kimi all performed well as expected.
wmf 2 hours ago
That reverse x axis sure is confusing.
- esafak 12 minutes ago
  I imagine they thought they'd look better this way. I don't think they do.
SilverElfin 1 hour ago
So who exactly is StepFun? What is their business (how do they make money)? Each time I click “About Stepfun” somewhere on their website, it sends me to a generic landing page in a loop.
- 0x1997 1 hour ago
  https://en.wikipedia.org/wiki/StepFun
  - SilverElfin 32 minutes ago
    Thanks. Do they sell any of these products today or is it more like research? I am not able to find anything relating to pricing on their website. Just a chatbot.
- deaux 1 hour ago
  Might want to give it a search.