14 comments

  • goda9020 days ago
    A few years ago I set out to refactor some of my team's code that I wasn't particularly familiar with, but we wanted to modularize and re-use in more places. The primary file alone was 18k+ lines of Typescript that was a terrible mess of spaghetti. Most of it had been written in JavaScript but later converted haphazardly. I ended up writing myself a little app that used the Typescript compiler APIs to help me just explore all the many branches of the code and annotate how I would refactor different parts. It helped a bit, but I never got time to add some of the more intelligent features I wanted like finding every execution path between two points.
    • henryhale20 days ago
      give depgraph a try - <a href="https:&#x2F;&#x2F;github.com&#x2F;henryhale&#x2F;depgraph" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;henryhale&#x2F;depgraph</a> - i&#x27;d like to learn about how i could improve it.
      • flowerbreeze19 days ago
        I gave it a try on my current codebase out of curiosity. Definitely useful. It worked well and fast, but it has a lot of duplicates that get rendered as exports in the NodeJS modules based codebase. I think it can sometimes be caused by me just being haphazard about re-exporting them, but other times I&#x27;m not sure.<p>Eg authenticatedMenu() appears 4 times in authenticatedMenu.js, only one of them is imported by 2 different files and 3 are just there alone. There&#x27;s a single export in the file and a number of other files import it through an index.js that re-exports several files other files too.<p>In my case I think it&#x27;d help, if I could disable the duplicates as they don&#x27;t really provide any useful information when exploring the codebase.<p>Also, if there was optionally a way to ignore the files that re-export functions&#x2F;classes and collapse those paths, it&#x27;d make the graph a lot smaller and more easy to understand. Maybe it&#x27;s already something that depgraph does, but the duplicates confuse things, so I&#x27;m not sure.
        • henryhale19 days ago
          &gt; I think it can sometimes be caused by me just being haphazard about re-exporting them, but other times I&#x27;m not sure.<p>I think so too. I guess that&#x27;s how your project is structured and duplicates maybe inevitable.<p>The graph shows exactly how the project is organized. Right - &quot;duplicates confuse things&quot; - this would suggest eliminating &quot;files that re-export functions&#x2F;classes&quot; or passing an option (-i) for ignoring specific paths would help. Otherwise, this issue is noted for further analysis.<p>Thanks for trying depgraph.
  • dcreater20 days ago
    you say &quot;local-first&quot; but have placed voyage API for embeddings as the default (had to go to the website and dig to find that you can infact use local embedding models). Please fix
    • ofriw20 days ago
      Thank you, yes the docs are overdue for a refresh. It&#x27;s in the works
      • wiml19 days ago
        Presumably it could update its own docs
        • ofriw19 days ago
          Exactly. There&#x27;s an autodoc feature coming up in the next version
    • esafak20 days ago
      It would be convenient if it could load local SLMs itself, otherwise I&#x27;ll have to manually start the LLM server before I can use it, and it&#x27;s not something I leave running all the time.
  • henryhale20 days ago
    I have been working on depgraph (<a href="https:&#x2F;&#x2F;github.com&#x2F;henryhale&#x2F;depgraph" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;henryhale&#x2F;depgraph</a>) for a while now. It is truly local with several output options(json, mermaid, jsoncanvas). Mutliple languages are supported (js, go, c) - expanding the list slowly but sure.
  • romperstomper20 days ago
    I don&#x27;t understand how&#x2F;why all of this is local-first if all these providers are supported and used - could you elaborate what is sent to them?
    • ofriw20 days ago
      The DB is stored locally, and any embedding, reranker and LLM will work. It&#x27;s up to you if you self host these or bring them externally from one SaaS or the other
  • Neywiny20 days ago
    Might give this a try to experiment if it&#x27;s really free to use (I&#x27;ll have to read up on that I guess). The qemu codebase is huge and every contributer seems to solve problems in slightly different ways. Would be nice if this tool could help distill it.
    • ofriw20 days ago
      Completely free, MIT licensed. You can fully self host it if you have the hardware to run Qwen3-embedding and reranker models
  • dogman12320 days ago
    Is there a way to have the model inside of codex to make use of chunkhound instead of its “built in” search&#x2F;explore functionality with rg? Whenever I spin up a new agent using xhigh thinking it spins its wheels for a while to get up to speed — wondering if chunkhound can make this process faster.
    • esafak20 days ago
      That&#x27;s what the MCP is for, if you can get the LLM to use it. Sometimes they just like to do it their own way :)
      • strainer_spoon8 days ago
        The ChunkHound docs are a bit confusing for making it available as an MCP server for Codex. How exactly do you do it? I got up to the indexing step and now need to let Codex be able to use it.
  • conception20 days ago
    I have chunckhound is a few projects and it’s noted in both the agent md file as well as mcp and claude never uses it. Ever. Never once.<p>Is there a prompt special sauce y’all use to get it to use it?
    • ofriw20 days ago
      Just add to your prompt something like &quot;use code research&quot;, but yes there&#x27;s a PR in the works that fixes that and optimizes the MCP tools interface - <a href="https:&#x2F;&#x2F;github.com&#x2F;chunkhound&#x2F;chunkhound&#x2F;pull&#x2F;150" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;chunkhound&#x2F;chunkhound&#x2F;pull&#x2F;150</a>
  • potamic18 days ago
    I followed the docs for ollama configuration, but it says unknown LLM provider when I try running the research command.
  • apgwoz20 days ago
    Perhaps I am missing something, but this seems to require a Lemon (LLM)? Is the idea that the Lemon is used to help build an index AOT that can be queried locally, after?<p>I want to figure out how to build advanced tools, potentially by leveraging Lemons to iterate quickly, that allow us all to rely _less_ on Lemons, but still get 10,20,30x efficiency gains when building software, without needing to battle the ethics of it all.
    • ofriw20 days ago
      ChunkHound does it a bit differently, since at true enterprise scale it&#x27;s very slow and costly to pass all code chunks through an LLM during indexing time. Instead, ChunkHound implements a customized &quot;deep research&quot; algorithm that&#x27;s been optimized for code exploration so it can answer, on demand, any deep technical question about the indexed codebase. This research agent can be powered by a lower tier LLM (think Haiku, Codex low, etc) that&#x27;s already included in your subscription.
  • dmos6220 days ago
    Will try this out. Was always envious of how Augment was able to do this. Kudos.
  • bravura20 days ago
    Can you please expose the functionality as a self-documenting CLI command with machine readable output? (Or did I misunderstand that MCP isn&#x27;t the <i>only</i> way to use it?)<p>I am curious to try it but do not want to adopt MCP servers.<p>Telling Claude to call the CLI tool is more efficient.
    • dcreater20 days ago
      Agree. And to make the CLI usage more effective&#x2F;efficient, if you can publish a skill that would be excellent
      • esafak20 days ago
        That&#x27;s why we&#x27;re asking for the CLI; so we can write the skills.
    • ofriw20 days ago
      `chunkhound search &lt;query&gt;`, `chunkhound search --regex &lt;query&gt;` and `chunkhound research &lt;query&gt;` are the main cli entry points that you can already use today
    • blackqueeriroh20 days ago
      Am I confused or is this not an open-source project on GitHub?<p>You have every ability to make these modifications yourself; is there a reason you feel the need to require the creator to do so?
      • from_memory20 days ago
        I think the term is &quot;Instrumentalism&quot;.
  • CamperBob220 days ago
    Looks like the tutorial link is broken.
    • ofriw20 days ago
      Fixed, thank you