5 comments

  • jerpint1 minute ago
    Solutions like these are really cementing the view that LLMs are becoming a commodity
  • kristjansson16 minutes ago
    > The phrase "frontier model" is starting to mean two things. One is a checkpoint. The other is a system boundary.

    LLM-isms aside, I don't think we want this to be the case? An LLM, for all its complexity, is something that can be reasoned about. It's picking the next token, until it hits an EOS. The semantics imposed on those tokens (reasoning, tool call, etc.) are up to the user('s harness) to decide and act on. The more that's pushed behind the facade, the harder it is to achieve sufficient understanding of the model's behavior s.t. one can compose it into larger abstractions. Perhaps the performance (and the adherence to an interface/contract) compensate? But swapping from Opus or 5.5 to this or Fugu seems like a much bigger change than swapping between different 'base' models.
    • Xx_crazy420_xX8 minutes ago
      I might be wrong, but I strongly suspect that Fable 5 is already something in this shape, considering its long time to first token while having normal throughput.
  • droidjj55 minutes ago
    Can we please stop submitting fully AI-generated text to HN?
    • tensegrist30 minutes ago
      at least 50% of the front page would disappear if this were enforced
      • folkrav3 minutes ago
        I'd be perfectly okay with that.
  • alchemist1e954 minutes ago
    This should help with better utilizing a heterogeneous collection of inference hardware.
  • ShizuhaLabs1 hour ago
    [flagged]