3 comments

  • Cynddl 28 minutes ago
    Is it just me, or do they very carefully avoid reporting performance on GPT-5.4 Pro, only the default GPT-5.4? They also very carefully left Anthropic models out of their comparison.
    I went back to the BixBench benchmark which they mentioned. I couldn't find official results for Anthropic models, but I found a project taking Opus 4.6 from 65.3% to 92.0% (which would be above GPT-Rosalind) with nearly 200 carefully crafted skills [1]. There also appear to be competing models with scores on par with this tuned GPT.
    [1] https://github.com/jaechang-hits/SciAgent-Skills
    • jadusm 7 minutes ago
      BixBench seems like a really interesting/useful idea, but most of the value for a layperson (like me) is in comparing the results of different models on the benchmark. From what I can find, there is no centralised and up-to-date set of model results. Shame.
  • furyofantares 1 hour ago
    I'm all for naming things in honor of Rosalind Franklin, but this seems like incredible misplaced hubris instead.
    • peyton 27 minutes ago
      > GPT‑Rosalind is now available … for qualified customers …
      It’s kind of gross to make money off her name (if that’s what’s happening) posthumously. It’s a complicated story anyway. IIRC her sister referred to it as “the Cult of Rosalind” when people were cashing in on books about her.
      • bombcar 22 minutes ago
        I'd rather the AI companies make up names, or name their products things like "Clod", than use *my* name (if they were to ask), as no matter how good it looks today, eventually it'll be some form of laughingstock.
  • 34pasKj 2 hours ago
    [flagged]
    • mrcwinn 2 hours ago
      Is society's behavior determined by the administration? Odd way to live your life. This model is a tool, not a servant, but in any case I think paying homage to someone who made incredible contributions is a positive. Eye of the beholder, I suppose.
      • 3asuH 2 hours ago
        [flagged]
        • ceejayoz 1 hour ago
          > Rosalind, make me a coffee! There are other ways to pay homage.
          Isn't this more akin to "Rosalind! You are a respected world-class expert! Can you help me?"