85 points by julienreszka 3 days ago | 65 comments | View on ycombinator
analogpixel 3 days ago |
eugeneonai 3 days ago |
phyzix5761 3 days ago |
daxfohl 3 days ago |
alextillman 3 days ago |
nreece 3 days ago |
Systems and agents need to monitor and extract public web content into fresh structured data for their ingestion, intelligence workflows and analysis.
* Shameless plug * Our data infrastructure layer for businesses and AI turns continuously updated websites into a stream of structured data.
dchuk 3 days ago |
But I also extract topics automatically from the content too with LLMs, to allow for dynamic topic pages that users can separately subscribe to to tune their feeds.
Haven't promoted it much, but it's pretty amazing what you can do for a couple bucks a month. And my main thesis with this site is that by locking the content to only rss feeds of known blogs, you dramatically reduce the spam submission risk (basically eliminate it). Doesn't handle the spam comment side of things, but that's a different problem.
EDIT: I also open sourced a Rails engine I made to power this site if anyone is interested: https://github.com/dchuk/source_monitor
PaulHoule 3 days ago |
https://rachelbythebay.com/w/2024/05/27/feed/
but coming from an aggressively anticommercial world view. She collects evidence that real world feed readers don't implement RSS correctly
https://rachelbythebay.com/w/2026/02/23/readers/
Her problems are the problems of a polling-based protocol and really if she does not like the RSS protocol she should stop publishing it and stand up an ActivityPub or PubSubHubBub service instead.
A big part of the value of Google Reader and the ecosystem around it was that Google could poll your RSS feed once and everyone could read it... A huge win for the Rachels!
ramaseshanms 1 day ago |
sperandeo 3 days ago |
rvz 3 days ago |
[0] https://www.reddit.com/r/modnews/comments/1tq9vxo/protecting...
hparadiz 3 days ago |
https://github.com/hparadiz/technexus/blob/release/src/Contr...
I would enjoy a JSON based refresh of the format.
b3ing 3 days ago |
erelong 3 days ago |
Unless someone has a fix of whatever settings I've been using
grobibi 3 days ago |
Can someone reccomend a way to create an rss feed from a site that has none?
h4kunamata 3 days ago |
Where? Not within the homelab space.
_pdp_ 3 days ago |
themafia 3 days ago |
Get your rapacious hands away from my website please.
> and actively degrades programmatic access.
That's your problem. You choose these tools. If they can't function without ripping everyone else off then why do you persist in using them?
amai 3 days ago |
Nowadays AI agents also don't read ads. Let's see how that is going, but the ad industry isn't amused about that.
0gs 3 days ago |
notnullorvoid 3 days ago |
hendler 3 days ago |
tokenfaucet 3 days ago |
overfits-ai 3 days ago |
I'm not sure why I keep reading HN, 99% of the content is uninteresting, probably 99.9% now that every article is about AI. maybe I just like clicking on things.