Re: Rate Limits, see<p><a href="https://rachelbythebay.com/w/2024/05/27/feed/" rel="nofollow">https://rachelbythebay.com/w/2024/05/27/feed/</a><p>but coming from an aggressively anticommercial world view. She collects evidence that real world feed readers don't implement RSS correctly<p><a href="https://rachelbythebay.com/w/2026/02/23/readers/" rel="nofollow">https://rachelbythebay.com/w/2026/02/23/readers/</a><p>Her problems are the problems of a polling-based protocol and really if she does not like the RSS protocol she should stop publishing it and stand up an ActivityPub or PubSubHubBub service instead.<p>A big part of the value of Google Reader and the ecosystem around it was that Google could poll your RSS feed once and everyone could read it... A huge win for the Rachels!
I have almost 40 feeds I subscribe to and they're my primary way of getting information I care about without being exposed to ads or other things I don't want to see.
I built a site that's similar in concept to Hacker News, but is entirely fed by RSS feed content, that is then bullet-pointed summarized on the article page: <a href="https://engineered.at/" rel="nofollow">https://engineered.at/</a><p>But I also extract topics automatically from the content too with LLMs, to allow for dynamic topic pages that users can separately subscribe to to tune their feeds.<p>Haven't promoted it much, but it's pretty amazing what you can do for a couple bucks a month. And my main thesis with this site is that by locking the content to only rss feeds of known blogs, you dramatically reduce the spam submission risk (basically eliminate it). Doesn't handle the spam comment side of things, but that's a different problem.<p>EDIT: I also open sourced a Rails engine I made to power this site if anyone is interested: <a href="https://github.com/dchuk/source_monitor" rel="nofollow">https://github.com/dchuk/source_monitor</a>
AI agents don't need RSS. What they need is some representation in text. The XML/RSS markup is completely unnecessary.
What's old is new again. The solution RSS offered was structure for an otherwise unstructured challenge (trying to figure out updates on a site). That value grew exponentially when connected to AI (providing the signals of when do I need to look at this site/podcast again). Smart marketing.
I kinda don't like RSS because I often want like a whole blog archive downloaded if I add a new feed and it usually has limits how far back of posts it will download (randomly configured by each site)<p>Unless someone has a fix of whatever settings I've been using
Sorry, I don't have a solution. But I use RSS for <i>everything</i>, and I can confidently say: RSS is not designed for your use case.
For the Premium Archive tier, NewsBlur attempts to download a blog's entire backlog to backfill stories, whether it's exposed through paging or RFC 5005. Here's more info about how NewsBlur does it: <a href="https://blog.newsblur.com/2022/07/01/premium-archive-subscription/" rel="nofollow">https://blog.newsblur.com/2022/07/01/premium-archive-subscri...</a>
I use Elfeed for Emacs and it stores the history as you download updates so you can always go back and read an old post.
I guess if you want your content all slurped up and served as coming from AI with no backlinks.
Anyone know the best practices for keeping AI crawlers off your RSS feeds? I know robots.txt works for the well-behaved bots. Other tools like interstitial captchas don't as the feed readers break if you send them anything but XML.<p>Putting just the post intro in the feed and linking to the website feels like a safer approach, assume you have bot protections on the website, but that's a poor experience for people who want to read in their feed reader.
You mean scraping instead of reading it? Reddit does not like the sound of that at all and are mulling to remove RSS support due to scrapers [0]<p>[0] <a href="https://www.reddit.com/r/modnews/comments/1tq9vxo/protecting_communities_from_scrapers_and_platform/" rel="nofollow">https://www.reddit.com/r/modnews/comments/1tq9vxo/protecting...</a>
>RSS was declared dead in 2013<p>Where? Not within the homelab space.
They probably meant it hyperbolically, but RSS was on a downward slope during that period. The recent uptick is fascinating.<p><a href="https://trends.google.com/explore?q=%2Fm%2F0n5tx&date=all&geo=Worldwide" rel="nofollow">https://trends.google.com/explore?q=%2Fm%2F0n5tx&date=all&ge...</a>
I must add that I self-host FreshRSS to fetch news and GitHub repos updates so I can update my stuff, everything in-house, controlled by me.<p>RSS makes life so much easier, some only provide the bare minimal while others, provide the whole post so I can read everything right there without opening a website.<p>Also, some podcast support it so I have a list of podcast that I list and can go back without having to go from website to website.<p>One place to govern them all, RSS still king.
i mean, i still read hacker news primarily via RSS in feedly. i kind of never stopped using it, and everybody is much more generous with their feeds nowadays than back in google reader times. bearblog, etc. RSS rules