One thing I don't understand with all of these, is how they're handling different worktree infrastructure spin-up?<p>For example, if I have a webapp, I want each of the worktrees to spin up its own infrastructure, and be accessible on its own unique local url, so that I can see the changes locally for each worktree, or I can have agents automate visual checks using something like agent-browser.<p>Currently I use docker for my infrastructure, each service running in its own container. I have a script that has a ./app worktree create worktreename. That creates a worktree as "worktreename" and spins up all of my docker infrastructure with prefixes for things like "WORKTREENAME", and I can access all my urls at worktreename.myapp.test (or just myapp.test for the main worktree).<p>This is working fine for now, but it'd be cool if one of these apps was compatible with this concept so I could move over to that.
Nice if didnt have the same name with an Apache project
<a href="https://superset.apache.org/" rel="nofollow">https://superset.apache.org/</a>
I was looking exactly for something like that. I tried installing the Linux version and it runs but:<p>1. it's behind a login wall | 2. tries to download its own OpenCode instead of using the one installed on my machine<p>I also tried to create a new workspace. It asked what I want to do. I tried with "create user accounts", and it proceeded to create the git worktree (without asking for its name) and sent that prompt straight to OpenCode without allowing me to choose the LLM model.<p>I plaud the effort - I really do - but it doesn't really seems a great experience for now.
That still is not it for me, for now iTerm2 tabs / tmux is plenty. This, or tools like Warp, feel so heavy for me and I get overwhelmed.<p>The Codex/Antigravity desktop app alternate route feels more like the direction I can buy into it, but I still feel like there's room for another novel UX not yet done.<p>Perhaps the biggest thing I feel like I wish existed is a "Scratchpad" mode inside a session, Claude recently came up with `/btw` and it's been really useful. Same with `!` for terminal commands within session.<p>Nested thinking spaces within a chat window UX is a very tricky thing to reason about.
yeah us and similar categories of tools prescribe a lot of our specific workflow for worktrees, review, code editing where iterm + tmux is much more flexible. we have to tow the line of what's useful as a built-in feature vs being very agnostic.<p>i do agree that there's a more novel UX than the same left sidebar with worktrees and middle chat/terminal that is not yet explored.<p>we've been spiking on what the UI is like to program and monitor the specific steps in the workflow from triggering agent -> planning -> coding -> review -> merge.
Wow - this looks like a fork of my project <a href="https://github.com/frenchie4111/harness" rel="nofollow">https://github.com/frenchie4111/harness</a><p>It's crazy to see how we have independently landed in the same place<p>Good luck to your project! Excited to see how our projects differ in the future
looks awesome! very interesting how similar these tools all look. i do think we're at an intermediate state here with the current UI where it doesn't scale as well for the future where triggering agents and review becomes much more important than the writing code part.
Amazing, this is exactly what I've been looking for a while now, especially the remote dev box part. I'll give it a try, thank you!
Looks great, even has Linux support!
And headless server support! I have gotten a lot of request to let people run the agent on a separate dev box (in the cloud or some pc gathering dust in the corner). The frontend/backend are decoupled so you can run the agents on a totally separate machine than the GUI is running.<p>If you have any other problems/concerns/issues/suggestions/etc, reach out!
I've been using Superset as my daily driver for about 2 months and I'm loving it.<p>i have created a simple setup script that reserves a couple of ports per workspace/worktree.
with that, each ticket/workspace can starts it's own webserver. Allowing me to work on multiple things in parallel.
that also allows me to use Claude+Chrome MCP to e2e test each implementation in parallel.<p>Besides my own repo I also have a couple open source repos open. so I can quickly ask Claude questions about it instead of looking at the docs, which are usually outdated.<p>All in all, I usually have ~5 workspace/tickets/agents running at the same time.
And one as Q&A agents per OSS repo
I always feel like I want to use something like this, but then realize my NeoVim setup + tmux + Ghostty is good enough and I am not ready to learn a whole another system for modest gains.<p>The friction I have currently is obviously things like port forwarding, in app browser etc.<p>I keep thinking to try out cmux but haven't gotten around to it.
I've been using this for the past few months, and I love it! It's built exactly around my workflow with many worktrees in various repos open at the same time, sometimes with different agents working side-by-side. Before Superset I just used terminal tabs but simply couldn't manage more than like 20 terminal tabs without losing track, so i coudn't scale further. Now i'm running probably 40-50 agent sessions over several repos simultaneously without any issues and losing track!
Keep up the good work guys!
Could you elaborate a little on what you are doing with 40-50 agents? I use Claude, I've employed sub-agents, but I still can't wrap my head around how people are using them to that extent.
ha, I didn't mean they are all running at the same time. sorry if it sounded like this.
I open a new worktree/agent session for each new feature or bugfix. Usually I start with ideation and brainstorming before doing the actual implementation. However my priorities shift daily, so I could be starting a feature on monday, but then stop during the ideation already, pick up another task on tuesday and won't go back to the initial one until a week or a month later. Since I got it in a neat worktree in superset, I still see my open branches easily, have access to its claude code session and can resume the work much easier than before having terminal tabs.
Because of this setup I currently have 17 backend sessions, 25 frontend sessions, plus several other repos open and ready, when i need to.
awesome to hear this! 40-50 is definitely on the high end. Are you adding any workflow on top to manage that many?
glad to hear it! thank you!
one issue i'm having, maybe you know what's causing this, in long running terminal sessions I'm getting rendering issues. Like some characters show as something else. I have to resize the window to fix it, usually opening and closing the code sidebar fixes it for some time, but it never fully goes away.
I think i also have it more on the big screen (4k) than on the laptop screen.
Nice. In the right track. I made something similar, but focused on local agents, but we both have issue tracking for managing multiple project and agents in parallel. It works, I think people will be surprised when they start using systems like this.<p>It is very different from current editors and the direction they are going in. In a way, it undermines the direction they are going. Current editors aim to make engineers 10x or 100x. These editors aim at a different target than the engineers. I will leave it to the imagination on who.
Just switched from Conductor to Superset and I'm a big fan. I really didn't like the extra stuff conductor added to the UX (the text rendering always drove me nuts).<p>So far so good with Superset - even as a non-engineer builder.
Why Mac only?<p>Also - one issue I've seen with other tools doing worktree stuff is they don't deal with merge conflicts automatically. IMO the agents should just automatically resolve conflicts & rebase on their own, is that a thing here?
We technically have a linux build, but dont maintain it as well as mac. We are built on electron so eventually we want to more heavily support linux and windows but we dont currently have anyone on the team that daily drives linux and windows the maintenance overhead is a bit too high. Once we do have someone on the team driving linux/windows we'll have broader publicized support for all platforms.<p>And on the worktree note, I find when working in a worktree the agent has a much easier time solving merge conflicts and as the number of feature branches scale and it just makes it so i dont have to worry about conflicts while working on a feature.
Surely that's something you just instruct the model to do in your CLAUDE.md / AGENTS.md right? Not really the domain of the IDE.
If you try to just instruct them they might get it wrong. If the surrounding software forces them to do it, it'll always work. E.g. it can check for merge conflict markers like <<<<<<< and re-invoke codex/claude to merge again if the previous resolution failed for whatever reason (e.g. AI hallucinated, threw up, whatever).<p>Also you'll need a wrapper to actually detect when merge conflicts will occur and when rebasing is necessary.<p>Generally, the less you rely on the AI the better. Make the AI write the code, sure, but don't make the AI be the process.
Personally, IDE for the agent era is just Linux.<p>Kitty with oh-my-zsh, lazyvim and an agent. The entire thing is an ide. If I need to refactor, query data and interact with the system I just use native tools like rg+fastmod, bash, awk, jq... Either writing myself of asking an agent to do the heavy lifting.<p>Linux in the agent era is a breeze to operate and reason about, so the whole thing becomes a single development environment that's really light on resources and effective.
What I've started to do is to use Zulip. You can have different agents in different chats. You can upload files and you update from your mobile phone. At first I thought it was crazy but it's nice not to have 3 different AI agents running in tmux
That's an interesting take! Basically Linux / a computer is everything you need to ship code.<p>If I could provide one gentle pushback - the same way there's utility in OMZ, lazyvim etc., there may be utility in us shipping our CLI etc. - there must exist some software we can build that'll be useful to you as well :)
Loving superset! Congrats on the launch! Excited to try remote workspaces so I don’t have to leave my computer open while I have autoresearch running.
What's the difference between you and emdash and conductor and t3
Conductor and t3 - We don't build on top of the SDKs and don't plan to. As a terminal first we give you the flexibility to work between the different CLI agents CC, Opencode, Codex, etc. to get the latest features.<p>Emdash more similar but we're planning on investing more into allowing you to script and automate the app + agents more with our CLI.
Sadly there are so few solutions, if even one, that is trying to offer a real remote agent interaction.
That's why I've build my own. No IDE per se rather than a discord bridge, with all interaction possibilities that makes sense and a way for my agent to quickly send me files or host mockups. My usecase is to build tools, reports, do research, mock up ideas and build prototypes. That's why I don't need to see the code that was written.
At first glance, it looks similar to Conductor (<a href="https://www.conductor.build/">https://www.conductor.build/</a>). It seems like a lot of these tools are converging on the same general ideas.<p>Could you share a comparison with the other tools out there?
For me the greatest difference is that superset is terminal-centric while conductor is chat-centric.<p>GUIs slow you down, in my opinion. But having the nice visual diff is something we can't really do well in TUIs, so very welcome.<p>So, superset, for me (been using for quite some time now) is basically to organize my agent and terminal sessions per task and project.<p>I can switch context much easier and can also resume working on something days later, with all my tabs nicely available and separated.<p>This was consuming me before, a dozen or more tabs and windows in my computer that I don't really remember to which task each belongs to.
Note that Anthropic specifically called out that usage through Conductor will be metered as "programmatic usage" in their June 15th pricing change: <a href="https://news.ycombinator.com/item?id=48126438">https://news.ycombinator.com/item?id=48126438</a>
My colleagues are planning to get around this by using Conductor's big terminal mode. At which point I think it's basically just a git worktree manager?
If you want to do that you mine as well give Superset a try, we've put a lot of time into giving a good terminal experience, with baked in notifications and a daemon that runs in the background so your terminal sessions dont die when the app updates or closes.
I don’t see any specific mention of Conductor, is this confirmed?
yeah there is a lot of overlap, we are more terminal first than conductor so you can do can use any cli agent you want. We have a lot more quality of life features around the terminal like notifications, and some things similar to tmux where if you kill the app or update your sessions stay alive and running. We also recently released remote workspaces so you can setup cloud workspaces for your agents. Id say if you like the chat experience conductor is still a bit more polished, we'll get to that level of polish soon, but if you care more about the terminal and cloud and more new integrations we are shipping superset is better.
Very excited about the remote workspaces! Perfect for me as I've set up a remote Mac as a Claude terminal. I know it's only just released but the typing is quite laggy... have you considered something like mosh? I've been using this from the terminal and it makes it feel as fast as running locally.
When you say “terminal first”, are you terminal-first enough that I could use vim buffers for editing?
Is anyone actually using agent swarms for anything real?
Yeah, how many agents can you people even run at once and how much does it cost you? In company we used the monthly token quota and nowadays it's basically unusable with claude opus 4.6 on high reasoning. You can basically burn through 100% usage through a single day. How does it even scale for you with N agents and which magical plans or models do you use, where tools like this are even viable?
for clarification we're less of a agent swarm tool, and more of a launch a bunch of independent agents in isolated environments and have some nice UX to manage them too. I also havent had as much luck with agent swarms or ralph loops, but i'm sure the laps will improve them with time
[flagged]
How do you guys plan to sustain the business, given that your product here is open source & already has many competitors doing similar things?
So far we've been growing pretty healthily all things considered!<p>I think one thing to remember is that the other side of us having dozens of competitors is that if the space couldn't sustain more than 1-2 parallel agent companies, a lot fewer of us would exist. We also will have a lot of time to continue creating value for our customers in the future in new ways :)
we monetize on teams and, in the future, cloud. the bet is that teams will want to centralize their set up for this type of work, especially shared Linear, GitHub, skills, etc.
I'd love a comparison to what's already out there. Don't vscode, antigravity, cursor etc all have agents too?
Sorry if I'm missing something but is there a way to create a new Claude workspace that has dangerously-skip-permissions but starts in Plan mode? There's no mode selector in the new workspace modal
I agree with the hard part being managing state, especially environments and ports. I've never used lsof so much in my life.<p>Question on Remote Workspace: Can the remote machine port forward so I can use a browser to see / test current state of the app on the remote machine?
I switched over to Superset from Conductor a few months ago and haven't looked back - it's really nice to be able to use the native Codex/Claude Code TUIs without any of the bloat<p>Can't wait to see what else you guys cook up!
That’s awesome to hear!<p>Definitely some exciting stuff coming with better automations, mobile and remote workspaces
Glad to hear it thanks for the support!
What are you doing with multiple agents, i.e. how do you have so much work to do? I can't come up with new ideas faster than the dude is coding.
is it terminal on steroids some kind of? so you can manage mutiple coding agents? how many coding agents you can manage in parallel that it is still comfortable to work and code changes are meaningful
yes, we surface agent states automatically so you can see what's running or needs attention across the different workspaces. there's a set of tasks where having 5-6 running in parallel is still productive for me such as running spikes and fixing small issue.<p>As we're investing more into integration test and self-validating for the agents we're able to increase the number without sacrificing quality.
Why do folks choose you guys over Conductor?
biggest thing is flexibility on what harness you use. we don't build on the claude code /codex sdk. instead, we just let you bring whatever CLI agents you prefer and just optimize the terminal experience for using CLI agents (track running / needs attention state, set up worktrees, split pane, etc)
This uses separate git worktrees. If we have a local dev setup involving multiple docker services, is there a recommended solution for managing those envs? I didn't see.
We have a concept of setup and teardown scripts if you're interested in checking them out! Together with worktrees, you can make it pretty automatic to making copies of your repo: <a href="https://docs.superset.sh/setup-teardown-scripts">https://docs.superset.sh/setup-teardown-scripts</a><p>Ours are a bit complex but here's an example:
<a href="https://github.com/superset-sh/superset/tree/main/.superset" rel="nofollow">https://github.com/superset-sh/superset/tree/main/.superset</a>
Still misses entirely the way software is constructed. These whack-a-mole tools are pointless.
definitely understand the perspective. I feel like theres some dev work that completely requires me to be single threaded and just think hard, but there other modes where i can jump into whack-a-mole mode and try to work on a bunch of github issues in parallel, or sentry issues, or little ui bugs and the thinking goes more into reviewing plans and prs and i dont have to be as attentive to each individual session
Care to elaborate
Even with the strongest guardrails, no llm can be left unsupervised. And believing an llm can supervise itself is madness. For every agent you deploy as a “team” you will have multiplied diminishing returns.<p>I’ve iterated the process hundreds of times and even with a strong specification and tests, an llm can meet the spec and pass all the tests and still build the wrong thing.<p>This multi-agent workflow idea is worse than web3.<p>But it’s a YC thing so someone will make money.
Why support each agent individually instead of using ACP and get much better agent coverage?
we're actually currently evaluating integrating acp for our chat, but when you start digging a bit deeper its not a complete silver bullet.
ACP is pretty much dead for Claude subscription usage
Binding the shell <-> local git clone automatically feels like the future. Great work.
can you make Superset work on folders that aren't git repositories?
I like this app, but 40 year old me says I will buy it for onetime payment rather than 20 dollars monthly. Great app
I used Superset for quite a while until a month ago. There were some annoying issues, with freezing and terminal not being rendered how it should be. And they did repeated fixes that didn't really solve it. Since I had work to do I moved on.<p>I installed Zellij on my server where most of work is happening and local machine and this works well for me. There are other issues I have now, but overall flow is fairly natural to what I am doing.<p>I liked that they did integrate a lot of agent workflow in Superset but my experience was that it would just take too many resources and especially with glitches, it wasn't worth it continuing. I had a period where i enjoyed working in it. It is vibe coded electron app, 2GB! is too much for this kind of app.<p>I just updated to their new version... it supposedly imported my projects but I can't find anything... so... I guess this is it.
zed , orca , /.+mux.*/ , ...<p>they all look incredibly / increasingly the same?
yeah i think theres a lot of ux conventions that are starting to get figured out, but we do want to be different. At least right now most dont well support remote workspace, issue tracking, or review. I bet most of the current ux patterns will look very different in a year
No linear integration in free version and taxing it 20$/m is a bit steep.
It's nice to see people building things, but honestly I found the demo video a bit disappointing. A bit too slow, a bit too choppy, a bit hand wavy. It didn't make me grasp why I needed this in my life.
The FAQ says <i>"Superset has a free tier. The source code is available on GitHub under Elastic License 2.0 (ELv2), so you can inspect and self-host it subject to the license terms."</i> - what is self hosting in this context, isn't it a desktop app? Is this why it wants me to sign into something? What exactly am I signing in to?
So we also ship a cloud service along with Superset, which enables our Linear integration, Slack integration, and our multiplayer capabilities / remote workspaces.<p>When you sign in, you're signing into our cloud service!
How does this compare to Cursor?<p>What happens if Cursor makes the exact same features as your product?
Actually Cursor is starting to converge with us as we speak! You can look at their new agents mode (which is now their default for new users) as an example.<p>For what happens, in our heads the end goal is building a software factory where dozens to hundreds of agents are always running - something that nobody has nailed the experience for yet. Until that's a solved problem I hope we have room to grow and build!
Windddddddoooowwwwwssssss
[flagged]
I thought this was somehow related to Apache Superset.<p><a href="https://superset.apache.org/" rel="nofollow">https://superset.apache.org/</a>
Confusing name. Superset is already an established analytics tool.
[flagged]
[flagged]
> <i>Just realized that I have done all my work in @superset_sh since Dec 26.</i><p>Just FYI the first quote on your site references a date we haven’t reached yet!
[flagged]
[flagged]
[flagged]
[flagged]
[dead]
How many "IDEs for the agentic era" do we need?
As many as we can new idea's from and figure out how to do this more efficiently. It's ok to stick with vim, of course!
Fair pushback! The space will settle down eventually, it's just clear to a lot of people that there's a lot of value to be created in this space that hasn't been created yet :)
We live in this era when folks can vibecode entire startups without ever making a simple Google Search.<p><a href="https://github.com/apache/superset" rel="nofollow">https://github.com/apache/superset</a>
"<i>Please don't complain about tangential annoyances—e.g. article or website formats, name collisions, or back-button breakage. They're too common to be interesting.</i>"<p><a href="https://news.ycombinator.com/newsguidelines.html">https://news.ycombinator.com/newsguidelines.html</a>
Always preferred the name Caravel...
yeah I was confused coming across this headline - "Huh? Apache Superset is doing a Launch HN?"