6 comments

  • ctoth16 minutes ago
    Something I found really helpful when reading this was having read The Void essay:<p><a href="https:&#x2F;&#x2F;github.com&#x2F;nostalgebraist&#x2F;the-void&#x2F;blob&#x2F;main&#x2F;the-void.md" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;nostalgebraist&#x2F;the-void&#x2F;blob&#x2F;main&#x2F;the-voi...</a>
  • devradardev36 minutes ago
    Stabilizing character is crucial for tool-use scenarios. When we ask LLMs to act as &#x27;Strict Architects&#x27; versus &#x27;Creative Coders&#x27;, the JSON schema adherence varies significantly even with the same temperature settings. It seems character definition acts as a strong pre-filter for valid outputs.
  • t0md4n21 minutes ago
    Pretty cool. I wonder what the reduction looks like in the bigger SOTA models.<p>The harmful responses remind me of &#x2F;r&#x2F;MyBoyfriendIsAI
  • dataspun35 minutes ago
    Is the Assistant channeling Uncharles?
  • aster0id51 minutes ago
    This is incredible research. So much harm can be prevented if this makes it into law. I hope it does. Kudos to the anthropic team for making this public.