6 comments

  • alenmangattu3 days ago
    I’ve spent the last 3 months building a crawler to index the public parts of Telegram (<a href="https:&#x2F;&#x2F;telehunt.org" rel="nofollow">https:&#x2F;&#x2F;telehunt.org</a>). The native search is essentially a black box that favors the top 0.1% of bot almost invisible. The Tech: I had to deal with rate limits and the lack of a global &#x27;sitemap&#x27;. I’m currently using a hybrid approach of metadata scraping to keep the index fresh. The Goal: It’s an experiment in making &#x27;un-indexable&#x27; bot data discoverable.
    • Antibabelic4 hours ago
      Where is the search engine? The site says that it&#x27;s a bot directory.
      • renegat0x02 hours ago
        wikipedia &quot;A search engine is a software system that provides hyperlinks to web pages, and other relevant information on the Web in response to a user&#x27;s query&quot;.<p>I think there can be different expectation connected to this term. It seems to be a &quot;search engine&quot; for bots. Bot directory does not have to have &quot;search&quot; functionality, right?
    • duskwuff3 hours ago
      You may be overestimating the number of bots that meaningfully exist. The vast majority of bots (and public channels) on the platform are nonfunctional and&#x2F;or spam.
  • lovegrenoble2 hours ago
    It&#x27;s all about Bot directories... (((
  • hiprob2 hours ago
    This is cool. Telegram also has a Premium feature which crawls the contents of (presumably) all public channels on the platform. It&#x27;s limited to 10 searches per day and doesn&#x27;t search for old content if there are too many retrieved posts.
  • renegat0x02 hours ago
    - &quot;I built a search engine&quot; sounds cool on hacker news, but in reality it is a &quot;company product&quot;, right?<p>- do the links in the footer work? I tried clicking on github icon, and it appears to be broken
  • jadengeller1 hour ago
    what do you verify about the bots?
  • copoitzq3 days ago
    [dead]