1 comments

  • stared25 minutes ago
    Thank you for sharing benchmark. However, the results are selective.<p>Why no Opus 4.7? Why Gemini 3.1 Pro is missing?<p>If there is some other criterion (e.g. models within certain time or budget), great - just make it explicit.<p>When I see &quot;Top 5 at a glance&quot; and it missed key frontier models, I am (at best) confused.
    • Flux15911 minutes ago
      Agree that the choices are strange. Sonnet 4.6 was tested, but no Opus 4.6.<p>Gemini 3.1 and GLM 5 came out around the same time as Sonnet 4.6 (~Feb 2026) so it&#x27;s strange that they are missing, but Gemini 2.5 Flash, Gemini 3 Flash, and GLM 4.7 are there.