fattyfoods@feddit.nl to Open Source@lemmy.ml · 13 days agoThe Open-Source Software Saving the Internet From AI Bot Scraperswww.404media.coexternal-linkmessage-square106linkfedilinkarrow-up1582arrow-down113cross-posted to: opensource@programming.devtechnology@beehaw.org
arrow-up1569arrow-down1external-linkThe Open-Source Software Saving the Internet From AI Bot Scraperswww.404media.cofattyfoods@feddit.nl to Open Source@lemmy.ml · 13 days agomessage-square106linkfedilinkcross-posted to: opensource@programming.devtechnology@beehaw.org
minus-squarekcweller@feddit.nllinkfedilinkarrow-up83·12 days agoRobots.txt expects that the client is respecting the rules, for instance, marking that they are a scraper. AI scrapers don’t respect this trust, and thus robots.txt is meaningless.
Robots.txt expects that the client is respecting the rules, for instance, marking that they are a scraper.
AI scrapers don’t respect this trust, and thus robots.txt is meaningless.