Oh this is really neat for the Bazel community, as depending on tree-sitter to build a gazelle language extension, with Gazelle written in Go, requires you to use CGO.<p>Now perhaps we can get rid of the CGO dependency and make it pure Go instead.
I have pinged some folks to take a look at it.
would also be nice to have this support gopackagesdriver backend
thanks so much for the note! i really appreciate it. i built this precisely for folks like yourself with this specific pain, thanks again!
Can someone please explain what's the connection between this and LSP? For example in Helix can one use this instead of various language servers?
Tree-sitter is merely a tool for generating an AST for a given language. LSPs on the other hand have way more capabilities (formatting, diagnostics, project-wise go to definition, inlay hints, documentation on hover, etc.) as you can see in its specification.[0] They can't really replace one another.<p>[0]: <a href="https://microsoft.github.io/language-server-protocol/specifications/lsp/3.17/specification/" rel="nofollow">https://microsoft.github.io/language-server-protocol/specifi...</a>
I also have a tree-sitter Rust rewrite. Though I cannot find it more useful for end users... <a href="https://github.com/HerringtonDarkholme/tree-sitter" rel="nofollow">https://github.com/HerringtonDarkholme/tree-sitter</a>
Wouldn't `got` be confused with OpenBSD's Got: <a href="https://gameoftrees.org/index.html" rel="nofollow">https://gameoftrees.org/index.html</a>
Why would people be confused with something that the vast majority never heard of. Naming shouldn't care about none mainstream project.
oh wow! i really thought i was being too clever but i shouldve assumed nothing new under the sun. well im taking name suggestions now!
Well, find and sed have modern "fd" and "sd" alternatives. Naming it "gt" allows you to claim that your version save 33% compared to typing "git".
Goty McGotface
uGOT / uGOTme? (sort of like the idea behind uTorrent) but I will agree that sbankowi's idea of Yet another got is great as well. +1 to that as well.
YAGOT (Yet Another GOT)
This is great, I was looking for something like this, thanks for making this!<p>I imagine this can very useful for Go-based forges that need syntax highlighting (i.e. Gitea, Forgejo).<p>I have a strict no-cgo requirement, so I might use it in my project, which is Git+JJ forge <a href="https://gitncoffee.com" rel="nofollow">https://gitncoffee.com</a>.
Do you have an equivalent of TreeCursors or tree-sitter-generate?<p>There are at least some use cases where neither queries nor walks are suitable. And I have run into cases where being able to regenerate and compile grammars on the fly is immeasurably helpful.<p>At least for my use cases, this would be unusable.<p>Also, what the hell is this:<p>> partial [..] missing external scanner<p>Why do you have a parsing mode that guarantees incorrect outputs on some grammars (html comes to mind) and then use it as your “90x faster” benchmark figure?
<i>Claude attempted a treesitter to go port</i><p>Better title
I work on a revision control system project, except merge is CRDT. On Feb 22 there was a server break-in (I did not keep unencrypted sources on the client, server login was YubiKey only, but that is not 100% guarantee). I reported break-in to my Telegram channel that day.<p>My design docs <a href="https://replicated.wiki/blog/partII.html" rel="nofollow">https://replicated.wiki/blog/partII.html</a><p>I used tree-sitter for coarse AST. Some key parts were missing from the server as well, because I expected problems (had lots of adventures in East Asia, evil maids, various other incidents on a regular basis).<p>When I saw "tree-sitter in go" title, I was very glad at first. Solves some problems for me. Then I saw the full picture.
Wait, are you suggesting that OP broke in to your server and stole code and is republishing it as these repos?<p>I have questions. Have you reviewed the code here to see if it matches? What, more specifically, do you mean when you say someone broke in? What makes you think that this idea (which is nice but not novel) is worth stealing? If that sounds snarky, it’s not meant to; just trying to understand what’s going on. Why is that more likely than someone using Claude to vibe up some software along the same lines?
1. Just saying, strange coincidence<p>2. How can we compare Claude's output in a different language?<p>3. Detecting break-ins and handling evil-maids: unless the trick is already known on the internets, I do not disclose. Odds are not in my favor.<p>4. Maybe worth, maybe not. I have my adaptations. Trying to make it not worthy of stealing, in fact.
Also, evil maids, what?
I can't speak for the specificity of parent's "evil maids" phrase but the concept of an "Evil maid" is used in security scenarios.<p>A maid tends to be an example of a person who's mostly a stranger, but is given unmonitored access to your most private spaces for prolonged periods of time. So they theoretically become a good vector for a malicious actor to say "hey I'll give you $$ if you just plug in this USB drive in his bedroom laptop next time you're cleaning his house" - it's often used in the scenario of "ok what if someone has physical access to your resource for a prolonged period of time without you noticing? what are your protections there?"<p>I wonder if that's what OP meant? :-)
"Evil maids" (example): I put my laptop into a safe, seal the safe, seal the room, go to breakfast. On return, I see there was cleaning (not the usual time, I know the hotel), the cleaner looks strangely confused, the seal on the safe is detached (that is often done by applying ice; adhesive hardens, seal goes off). This level of paranoia was not my norm. Had to learn these tricks cause problems happened (repeatedly). In fact, I frequented that hotel, knew customs and the staff, so noticed irregularities.
Ah right, thanks! But it seems he meant literal evil maids. Which I guess count as the figurative kind too.
This was my first thought as well, just from reading the title.
How is OP using Claude relevant?
OK for prototyping. Not OK for prod use if noone actually read it line by line.
I’m just curious, what would need to happen for you to change your opinion about this? Are you basically of the opinion that it’s not good enough today, never will be good enough in the future, and we should just wind back the clock 3 years and pretend these tools don’t exist?<p>It feels to me like a lot of this is dogma. If the code is broken or needs more testing, that can be solved. But it’s orthogonal: the LLM can be used to implement the unit testing and fuzz testing that would beat this library into shape, if it’s not already there. It’s not about adding a human touch, it’s about pursuing completeness. And that’s true for all new projects going from zero to one, you have to ask yourself whether the author drove it to completeness or not. That’s always been true.<p>You want people to hedge their projects with disclaimers that it probably sucks and isn’t production worthy. You want them to fess up to the fact that they cheated, or something. But they’re giving it away for free! You can just not use it if you don’t want to! They owe you <i>nothing</i>, not even a note in the readme. And you don’t deserve more or less hacker points depending on whether you used a tool to generate the code or whether you wrote it by hand, because hacker points don’t exist, because the value of all of this is (and always will be) subjective.<p>To the extent that the modern tools and models can’t oneshot anything, they’re going to keep improving. And it doesn’t seem to me like there’s any identifiable binary event on the horizon that would make you change your mind about this. You’re just against LLMs, and that’s the way it is, and there’s nothing that anyone can do to change your mind?<p>I mean this in the nicest way possible: the world is just going to move on without you.
ii am trying to not take issue with this comment because im aware of the huge stigma around ai generated code.<p>i needed this project so i made it for my use case and had to build on top of it. the only way to ensure quality is to read it all line by line.<p>if you give me code that you yourself have not reviewed i will not review it for you.
That ship has sailed, man…
Sorry but these are just not accurate as blanket statements anymore, given how good the models have gotten.<p>As other similar projects have pointed out, if you have a good test suite and a way for the model to validate its correctness, you can get very good results. And you can continue to iterate, optimize, code review, etc.
People should say what models/tools they used in even show the prompts.
Because the entire README doesn't even mention it, and it is an important factor in deciding whether it is ready for production use.<p>I, for one, am definitely not going to use this project for anything serious unless I have thoroughly reviewed the code myself. Prototyping is fine.
Because OP obviously downplayed this important fact, which typically shows lower quality/less tested code.
maintenance burden
AI often produces nonsense that a human wouldn't. If a project was written using AI the chances that it is a useless mess are significantly higher than if it was written by a human.
well how did it do?
Hard to say. Claude’s very good at writing READMEs. In fact, Copilot often complains about docs that sound like they’re about current capabilities when in fact they’re future plans or just plan aspirational.<p>Without downloading and testing out your software, how can we know if it’s any good? Why would we do that if it’s obviously vibed? The dilemma.<p>I’m not at all against vibe coding. I’m just pointing out that having a nice README is trivial. And the burden of proof is on you.
Shouldn't you be able to answer that?
That is very very interesting. I work on a similar project <a href="https://replicated.wiki/blog/partII.html" rel="nofollow">https://replicated.wiki/blog/partII.html</a><p>I use CRDT merge though, cause 3-way metadata-less merges only provide very incremental improvements over e.g. git+mergiraf.<p>How do you see got's main improvement over git?
primarily, got is structural VCS intended for concurrent edits of the same file.<p>it does this via gotreesitter and gts-suite abstractions that enable it to:
- have entity-aware diffs
- not line by line but function by function
- structural blame
- attribution resolution for the lifetime of the entity
- semver from structure
- it can recommend bumps because it knows what is breaking change vs minor vs patch
- entity history
- because entities are tracked independently, file renames or moves dont affect the entity's history<p>when gotreesitter cant parse a language, the 3way text merge happens as a fallback. what the structural merge enables is no conflicts unless same entity has conflicting changes
I think I understand the situation.
gah,. sincere apologies for formatting of this post. i ahve been on HN for basically 10 years now without ever having made a post (:
It looks like porting the custom C lexers is a big part of the trouble you had to go to do this.
Interesting. I have a similar usecase but intended to use CGo tree-sitter with Zig<p>Are these pretty up-to-date grammars? I'm awfully tempted to switch to your project<p>How large are your binaries getting? I was concerned about the size of some of the grammars
How about making 'got' compatible with git repos like jujutsu? It would be a lot easier to try out.
it is interoperable with git. we like git when its good but attempted to ease the pains in UX somewhat. you can take advantage of got locally but still push it to git remote forges jsut the same. when you pull stuff in this way, got will load the entity history into the git repo ensuring that you can still do got stuff locally (inspect entity histories, etc)
"rewrite" a nice code base without mentioning it is vibe coded is not great.<p>Essentially you use AI to somehow re-implement the original code base in a different language, made it somehow work, and claim it is xx times faster. It is misleading.
Is it a go-ism that source for implementation and test code lives in the root of the repo or is this an LLM thing?
yeah the tests live with the implementation code always (Go thing) and the repo root thing is like a preference, main is an acceptable package to put stuff in (Go thing), i see this a lot with smaller projects or library type projects