13 comments

  • nickjantz28 minutes ago
    Am I missing something other commenters are seeing about this not being an ad? The domain is on Burla, which hosted the compute needed for this. There&#x27;s a giant airbnb x burla logo at the top. People are saying there&#x27;s a lawsuit pending, it&#x27;s against guidelines, what&#x27;s the point, etc..<p>It&#x27;s content marketing plain and simple for Burla towards people that view this site. It was highly likely done by employees at both Burla and AirBNB together as a joint project.
    • jperryjperry19 minutes ago
      One of the Burla founders here. Not a joint project with Airbnb. I’ve been experimenting with giving agents access to Burla clusters and letting them run with analysis ideas I find interesting. This was one of the results.<p>The branding is a bit much, fair call, but the intent here was just to explore what these agents can actually build when you give them access to large amounts of compute.
      • add-sub-mul-div16 minutes ago
        How many accounts do you have spamming your projects here?
        • zamadatix4 minutes ago
          Looks like just 2 accounts with 11 total submissions in the last year, both with disclosures in the comments and&#x2F;or profile <a href="https:&#x2F;&#x2F;hn.algolia.com&#x2F;?dateRange=all&amp;page=0&amp;prefix=true&amp;query=burla&amp;sort=byDate&amp;type=story" rel="nofollow">https:&#x2F;&#x2F;hn.algolia.com&#x2F;?dateRange=all&amp;page=0&amp;prefix=true&amp;que...</a>.<p>This post is a bit lighter on that disclosure than I&#x27;d like (and isn&#x27;t as obvious as a Show HN would be) but I feel I missing some big portion of the backstory to this comment?
  • NoLinkToMe37 minutes ago
    What a waste of energy (money&#x2F;resources)... Scraping and AI-scanning 2 million photos to identify animals in the advertisement pictures? What&#x27;s the point.<p>As an exercise a sample of 1000 photos would&#x27;ve been enough. As a database, knowing a listing has a cat in the picture or a funny review doesn&#x27;t offer any real value.<p>I wonder what the footprint is of such an exercise.
    • ericmcer14 minutes ago
      I dunno there are literally 100s of millions (billions?) of people who spend more than an hour per day just scrolling through social media feeds.<p>How much does it cost to send a billion people an hour of video every day? Almost all of the resources tech uses is for pointless or even negative things.<p>What % of compute&#x2F;bandwidth do you think is used for &quot;real value&quot;? I would guess it is well below 1%.
    • jperryjperry18 minutes ago
      The pet detection part isn’t the point, that’s just a visible output. The actual goal was to stress test agents + distributed compute on something non-trivial.
  • dwroberts4 minutes ago
    “Drug den vibes” and they’re mostly just small rooms?
  • wheelerwj1 hour ago
    This thing is ripe for a lawsuit and has terrible methodology as far as I can tell.
    • smrtinsert15 minutes ago
      On what grounds is there a lawsuit? Hasn&#x27;t scraping been classified as legal?
  • htrp36 minutes ago
    This seems like an advertisement for an open source package<p>&gt;Scale Python across 1,000 CPUs or GPUs in 1 second. Burla is a high-performance parallel processing library with an extremely fast developer experience. Scale batch processing, vector embeddings, inference, or build pipelines with dynamic hardware.<p>Edit: Author comment was flagged dead. They work at burla which is a managed cloud service for parallelizing python
    • andai29 minutes ago
      Looks like it was hit by some sort of automated ChatGPT detector.
  • xrd36 minutes ago
    Airbnb was actually started by two guys who created an opium den for Obama&#x27;s convention so this doesn&#x27;t surprise me.
  • danhon58 minutes ago
    &quot;Looking at every public Airbnb listing in Inside Airbnb&#x27;s open data dump, all at once, on Burla&quot;<p>This Inside Airbnb?<p>Community Guidelines<p>Please:<p>Only take the data you need. Do not scrape data from the site, if you would like to subscribe to the data directly, please email data@insideairbnb.com
    • yodon43 minutes ago
      &gt;Everything was parallelized on Burla, on a single dynamic cluster that scaled to ~1.7K CPU workers for photo download and CLIP, with 20 A100 GPUs running embedding clusters in parallel on the same cluster.<p>That&#x27;s a lot of budget - would have been nice if they&#x27;d made an actual donation to the project, instead of pounding the project&#x27;s servers and bandwidth when there are much better ways to interact with the data.
      • jperryjperry16 minutes ago
        Totally fair callout. This wasn’t intended to put unnecessary load on anything. We were experimenting with running agents + distributed compute end to end on a real dataset, and this happened to be a good candidate.<p>That said, if this ended up stressing the project’s infrastructure more than expected, happy to make a donation to support it. Appreciate you raising it!
        • danhon1 minute ago
          ... so you&#x27;d only end up making a donation if you ended up &quot;stressing the project&#x27;s infra more than expected&quot;?!
  • devmor12 minutes ago
    The author makes some pretty insane leaps in logic for classification, and it’s apparent in the photos.<p>“Drug-Den vibes” apparently means the owner is poor or a photo is obscured or badly lit.
  • gavmor1 hour ago
    These are amazing! Some are probably offensive, because I saw a cozy, if kitschy, British den labeled as &quot;did-someone-just-leave&quot; vibes which... unfair.
    • jperryjperry15 minutes ago
      do you know the listing number? will remove that one haha
  • add-sub-mul-div18 minutes ago
    This vanity scraping is fucking up the internet for everyone else.<p>It&#x27;s hardly the only thing, but it&#x27;s part of the problem.
    • jperryjperry11 minutes ago
      Fair feedback. Definitely more backlash than I expected. The intent was to experiment with large-scale analysis, not add noise or put strain on shared resources. I’ll be more thoughtful about this kind of thing going forward.
  • xikrib42 minutes ago
    Ah yes, let&#x27;s price the world out of the real estate market and then use insanely powerful AI models to systematically mock the living conditions of the poors.
  • guywithahat1 hour ago
    This is pretty great, the reviews at the bottom are the best part. I&#x27;m impressed they were able to scrape so much data
  • jmp10624 hours ago
    [dead]