5 comments

  • jarmitage24 days ago
    In case you want to try Aristotle, I asked Claude Code to make a plugin for it here <a href="https:&#x2F;&#x2F;github.com&#x2F;afhverjuekki&#x2F;claude-code-aristotle-plugin" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;afhverjuekki&#x2F;claude-code-aristotle-plugin</a>
  • RGamma24 days ago
    This is part of the work that lead to Aristotle, the system that performed at Gold level at IMO: <a href="https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2510.01346" rel="nofollow">https:&#x2F;&#x2F;arxiv.org&#x2F;abs&#x2F;2510.01346</a>
  • auggierose24 days ago
    Very interesting. Do I get this right, running 500000 instances for 1 hour can be done for about $5000, or are there many hidden costs? (500000 * $0.01).
  • whattheheckheck23 days ago
    Lean4 with a mathlib project seems really slow has anyone else experienced that?
  • ncgl24 days ago
    am i understanding it right that this is used to validate the output of llms? any other uses for distributed lean? genuinely curious
    • UltraSane24 days ago
      Lean is an automated theorem prover. It decides if a given proof is true or not. This uses LLMs to try to write proofs for a given problem