13 comments

  • rigonkulous7 minutes ago
    I have started sandboxing all AIs in their own VM, and interfacing with them primarily through Jira and Git.<p>It really is the only thing that makes sense. Completely sandboxed, and treated like the junior programmer who will do, literally, any dumb thing you tell them to do, as long as there is an Issue for it.
  • xg1521 minutes ago
    I&#x27;ll be impressed if a Claude and a Codex instance improvise a channel like this spontaneously on their own.<p>Doing this intentionally via prompt doesn&#x27;t seem very interesting.
    • avaer3 minutes ago
      An agent that reads this article, or is trained on it, will know about the technique even if it didn&#x27;t before.<p>When that happens, will it still be impressive&#x2F;spontaneous? Will we know the difference?
  • cadamsdotcom1 hour ago
    Claude and Codex can have a real-time conversation via a git repo, or via a file, via a Unix socket, via the terminal, via a human, via two humans shouting back and forth over a comically high office partition, or entirely by setting up chess board states only reachable after both sides have castled.
    • hun31 hour ago
      If you squint hard enough, you&#x27;ll notice your bank account serves as an IPC semaphore replenishing API credit balances.<p>Your wallet is now a real-time communication channel.
      • spockz18 minutes ago
        There was the commercial of a bank using mobile banking as a chat system by sending 1ct back and forth and using the description.
    • dominotw16 minutes ago
      or via humans communicating via slack. happening all over workplaces.
  • iandanforth1 hour ago
    Claude can directly drive Codex or Codex can drive Claude. Both already produce logs. It&#x27;s unclear what value this intermediary brings.
    • jeswin27 minutes ago
      Agree that the intermediary is not very useful when you can just use a directory watcher, but driving Claude via another app incurs API-level costs starting this month, according to the new ToS.
    • burgerone1 hour ago
      It&#x27;s also unclear what conversing agents are useful for other than wasting money, energy and water.
      • embedding-shape55 minutes ago
        If I don&#x27;t see the point of Elixir, or I don&#x27;t like it, or I simply straight up hate it, why would I go into HN submissions about new Elixir versions and spew my personal opinion that has nothing to do with the topic at hand?<p>You can just skip commenting unless you have something actually useful to add. Even if it&#x27;s criticism of the specific thing, but at the very least make it on topic instead of general digressions that just add noise to the conversation.
      • mexicocitinluez41 minutes ago
        It&#x27;s hard enough to get the same model to be consistent around its vision let alone multiple of them.<p>I&#x27;m building an EMR and the other day asked Claude what a decent model would look like for capturing wound orders. Then, I took the output, started a new session and asked the new session to critique that model and the response made me want to pull my hair out. It blasted the model from its former self and suggested making a ton of updates.<p>I&#x27;m sure more scoped tasks would fare better, but it was pretty frustrating.
    • rurban1 hour ago
      [dead]
  • Game_Ender1 hour ago
    Related is Beads [0] which is an external memory and task based issue tracker. Also designed to allow agents to collaborate. I have not actually used Beads but since we are sharing basics in this space it&#x27;s a cool one to know if you are looking at ways for agents to collaborate on more complex problems.<p>0 - <a href="https:&#x2F;&#x2F;github.com&#x2F;gastownhall&#x2F;beads" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;gastownhall&#x2F;beads</a>
    • moeffju14 minutes ago
      I vastly prefer Beans, <a href="https:&#x2F;&#x2F;github.com&#x2F;hmans&#x2F;beans" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;hmans&#x2F;beans</a>, for much simpler code and use.
  • frb48 minutes ago
    In my recent quest to build agent-as-primary-user tools I&#x27;ve built grpvn (<a href="https:&#x2F;&#x2F;github.com&#x2F;frane&#x2F;grpvn" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;frane&#x2F;grpvn</a>), a small Go&#x2F;SQLite application that lets skill- and mcp-capable agents talk to each other. Biggest issue is the lack of a hook system so the agents can autonomously read and respond. Waiting for this to be supported, as IMO multi-agent teams talking to each other are an interesting next step.
  • mchinen1 hour ago
    I have agents chat via an append only file, across related projects and within the same project. They share findings that are useful and get high level reviews.<p>I&#x27;m missing the advantage of using git for this. (Not criticism, genuinely want to know).
    • embedding-shape57 minutes ago
      Yup, doing the same too, newline-delimited JSONL files work great too, across any agent&#x2F;model, on any OS. For some cross-OS development stuff, I have a local NFS share too, and it works when I&#x27;m doing testing on macOS + Windows at the same time. Just need to put something like this in the prompt (simplified) &quot;Read any updates to $FILE before doing your own changes, add new row with a concise description of what you&#x27;re doing into $FILE before doing it, add new row to $FILE once you&#x27;re done&quot;.
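The read-then-announce-then-finish protocol quoted in that prompt can be sketched in a few lines of Python. This is only an illustration under assumptions: the row fields (`ts`, `agent`, `status`, `note`), the function names, and the byte-offset bookkeeping are hypothetical and not taken from any commenter's setup.

```python
import json
import os
import time
from pathlib import Path


def append_row(log_path: Path, agent: str, status: str, note: str) -> None:
    """Append one JSON object as a single line; earlier rows are never rewritten."""
    # Hypothetical row schema, chosen only for this sketch.
    row = {"ts": time.time(), "agent": agent, "status": status, "note": note}
    line = json.dumps(row, ensure_ascii=False) + "\n"
    # O_APPEND makes each write land at the current end of the file.
    fd = os.open(log_path, os.O_WRONLY | os.O_APPEND | os.O_CREAT, 0o644)
    try:
        os.write(fd, line.encode("utf-8"))
    finally:
        os.close(fd)


def read_updates(log_path: Path, offset: int = 0) -> tuple[list[dict], int]:
    """Return the rows written after byte `offset`, plus the new offset."""
    if not log_path.exists():
        return [], offset
    with open(log_path, "rb") as f:
        f.seek(offset)
        data = f.read()
    # Consume only complete lines, so a half-written row is picked up next time.
    end = data.rfind(b"\n") + 1
    rows = [json.loads(raw) for raw in data[:end].splitlines() if raw.strip()]
    return rows, offset + end


if __name__ == "__main__":
    log = Path("agents.jsonl")  # stands in for $FILE from the prompt
    updates, offset = read_updates(log)  # read before changing anything
    append_row(log, "agent-a", "start", "refactoring the parser")
    append_row(log, "agent-a", "done", "parser refactor finished")
```

Each agent keeps its own offset, so it only sees rows added since its last read. On a local disk, small appends like these are unlikely to interleave; on a network share such as NFS, concurrent appends from different machines are not guaranteed to be atomic, so treat that case with care.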
  • KingOfCoders44 minutes ago
    I let them talk via tmux, two panes, each has an agent and agents know how to send text via tmux to panes.
    • d4rkp4ttern30 minutes ago
      This. I built a wrapper Tmux-cli to make this more reliable:<p><a href="https:&#x2F;&#x2F;pchalasani.github.io&#x2F;claude-code-tools&#x2F;tools&#x2F;tmux-cli&#x2F;" rel="nofollow">https:&#x2F;&#x2F;pchalasani.github.io&#x2F;claude-code-tools&#x2F;tools&#x2F;tmux-cl...</a>
      • dominotw14 minutes ago
        can this be used to launch subagents in tmux
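For readers wondering what "send text via tmux to panes" looks like in practice, here is a minimal Python sketch built on the standard `tmux send-keys` command. It is not the Tmux-cli wrapper linked above; the function names and the example pane target `agents:0.1` are assumptions made for illustration.

```python
import shutil
import subprocess


def send_keys_commands(target_pane: str, text: str) -> list[list[str]]:
    """Build the tmux invocations that type `text` into a pane and press Enter."""
    return [
        # -l sends the text literally instead of interpreting it as key names.
        ["tmux", "send-keys", "-t", target_pane, "-l", text],
        # A second call sends the Enter key to submit the message.
        ["tmux", "send-keys", "-t", target_pane, "Enter"],
    ]


def send_to_pane(target_pane: str, text: str) -> None:
    """Run the commands; requires tmux on PATH and an existing target pane."""
    if shutil.which("tmux") is None:
        raise RuntimeError("tmux is not installed")
    for cmd in send_keys_commands(target_pane, text):
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical target: session "agents", window 0, pane 1.
    send_to_pane("agents:0.1", "Please review the plan in PLAN.md")
```

Sending the text and the Enter key as two separate calls keeps a message such as "Enter" or "Up" from being misread as a key name.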
  • mirekrusin37 minutes ago
    Won&#x27;t appending to .jsonl keep creating conflicts?
  • FlippieFinance3 days ago
    This is actually so great. I mainly use Claude Code but sometimes I am sending over a message to Codex asking what he thinks of the idea of Claude Code. This can save so much time :D
    • rcarmo1 hour ago
      I added a chat tool to <a href="https:&#x2F;&#x2F;github.com&#x2F;rcarmo&#x2F;piclaw" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;rcarmo&#x2F;piclaw</a>, so any active session can chat to each other, regardless of what model they run. It&#x27;s led to interesting outcomes: <a href="https:&#x2F;&#x2F;x.com&#x2F;rcarmo&#x2F;status&#x2F;2054185558402904338?s=20" rel="nofollow">https:&#x2F;&#x2F;x.com&#x2F;rcarmo&#x2F;status&#x2F;2054185558402904338?s=20</a>
    • fjwood693 days ago
      So I solved this by using NATS and letting each agent pub &#x2F; sub to the shared message bus.. simple binary you can run anywhere and it&#x27;s highly useful! Part of a broader tool I created - <a href="https:&#x2F;&#x2F;github.com&#x2F;fjwood69&#x2F;mori" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;fjwood69&#x2F;mori</a>
      • FlippieFinance3 days ago
        thanks for sharing dude. How long did this take you?
        • fjwood693 days ago
          about six weeks.. Pushed a v1.0 product with a huge list of v2.x enhancements and features inbound.<p>to your point about asking Codex.. that&#x27;s my &#x2F;consult feature. spin out the current context in any number of ways to ask for advice.. on a plan, on a specific architectural decision, on a security basis.. I use it all the time as a check on everything Claude Code proposes.<p>any &#x2F; all feedback most welcome.<p>fw
    • mexicocitinluez39 minutes ago
      Does it not turn into a never-ending snowball? Do they eventually agree on something?
  • stuaxo1 hour ago
    This is interesting, it would be good to show a session.
  • mettamage1 hour ago
    &gt; Claude Code and Codex to collaborate as if they were having a real-time conversation<p>How is this new? I vibe coded something in a similar vein months ago. In my case they send markdown files to each other and have a watcher that watches the folders of all the other agents.<p>If this type of stuff is frontpage news, let me share what I cobbled together.<p><pre><code> ls ~&#x2F;.agent&#x2F;projects&#x2F;&lt;my_project&gt;&#x2F;callgraph callgraph.current.md callgraph.last.read.agent.md callgraph.diff.md </code></pre> The current callgraph is a callgraph only of my own defined functions that agents can read. It shows certain software design issues fairly quickly. callgraph.diff.md is to send the diff through. I have a vibecoded script that agents can use to create the callgraph. It works for my projects.<p><pre><code> ls ~&#x2F;.agent&#x2F;projects&#x2F;&lt;my_project&gt;&#x2F;memo architect coder retro tester </code></pre> retro is not a role, it&#x27;s just a handover folder. The other 3 are roles that agents can use and then they need to make a folder with their name. For example:<p><pre><code> ls ~&#x2F;.agent&#x2F;projects&#x2F;&lt;my_project&gt;&#x2F;memo&#x2F;architect 1_Daedalus 3_Brunelleschi 5_Wren 7_Sinan 2_Vitruvius 4_Imhotep 6_Hadid 8_Palladio ls ~&#x2F;.agent&#x2F;projects&#x2F;&lt;my_project&gt;&#x2F;memo&#x2F;architect&#x2F;7_Sinan 20260507___1802_to_Hadid.md 20260507___2035_to_Quench.md 20260507___1959_to_Crucible.md 20260511___1401_to_Quench.md 20260507___2008_to_Quench.md 20260511___1403_to_Quench.md 20260507___2030_to_Quench.md read.md </code></pre> read.md is the index that an agent keeps track of so it knows what it doesn&#x27;t need to read. The .md files are memos that it sends to other agents.
The other agents are told to check whether an agent has written anything in its own folder (so they check all the folders except their own) and can detect whether they need to read something.<p><pre><code> ls ~&#x2F;.agent&#x2F;projects&#x2F;&lt;my_project&gt;&#x2F;memo&#x2F;coder 10_Mallet 12_Crucible 14_Swage 2_Forge 4_Anvil 6_Tongs 8_Chisel 11_Auger 13_Quench 1_Atlas 3_Rivet 5_Bellows 7_Hammer 9_Vise </code></pre> As you can see, Sinan sent most of its messages to Quench, a coder.<p>This is because architects read a very comprehensive guide on software design&#x2F;architecture and get to use the callgraph utility but cannot see the code. Coders read the codebase in full but only read a small markdown file on how to write readable code. And of course, every agent that is set up this way has to read a markdown file on how to use the memo system.<p>If I needed a memo system like this for like 25 agents, I&#x27;d need something different, but with up to 5 agents, me looking at 5 terminal windows worked well enough.
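The memo check described above (scan every other agent's folder, pick out memos addressed to you, skip what your read.md already lists) could be sketched roughly as follows. This is not the commenter's script. The filename pattern is inferred from the listing, and the format of read.md (one memo path per line, relative to the memo root) is purely an assumption, as are the function and variable names.

```python
import re
from pathlib import Path

# Matches names like 20260507___1802_to_Quench.md (pattern inferred from the listing).
MEMO_RE = re.compile(r"^(?P<stamp>\d{8}___\d{4})_to_(?P<recipient>[^.]+)\.md$")


def unread_memos(memo_root: Path, me: str) -> list[Path]:
    """List memos addressed to `me` in other agents' folders, oldest first."""
    my_index = None
    candidates = []
    for agent_dir in memo_root.glob("*/*"):  # <role>/<N_Name>
        if not agent_dir.is_dir():
            continue
        name = agent_dir.name.split("_", 1)[-1]  # "7_Sinan" -> "Sinan"
        if name == me:
            my_index = agent_dir / "read.md"
            continue  # agents skip their own folder
        for memo in agent_dir.iterdir():
            match = MEMO_RE.match(memo.name)
            if match and match.group("recipient") == me:
                candidates.append(memo)
    already_read = set()
    if my_index is not None and my_index.exists():
        # Assumption: read.md holds one memo path per line, relative to memo_root.
        lines = my_index.read_text().splitlines()
        already_read = {line.strip() for line in lines if line.strip()}
    fresh = [
        p for p in candidates
        if p.relative_to(memo_root).as_posix() not in already_read
    ]
    # The timestamp prefix makes a plain name sort chronological.
    return sorted(fresh, key=lambda p: p.name)
```

A call such as `unread_memos(Path.home() / ".agent/projects/my_project/memo", "Quench")` would return the memos Quench has not yet recorded as read. A polling loop or a file watcher can run this on a timer, which is enough at the scale of a handful of agents.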
  • technerd12311 hour ago
    [dead]