
That makes no sense.

There is no reason for AI scrapers to use tens of thousands of IPs to scrape one site over and over.

That just sounds like a classic DDoS.





Sure there is: scrapers do that to defeat throttling. 10,000 requests is less than 3 hours of scraping at 1 request per second.

It's not 10k requests, it's 10k IPs

Having lots of IPs is helpful for scraping, but you don't need 10k. That's a botnet.


The way it works is this: you can sign up for a proxy rotator service that works like a regular proxy, except that every request you make goes through a different IP address. Is that a botnet? Yes. Is it also typically used in a scraping project? Yes.
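For illustration, a minimal sketch of how such a service is typically used (the proxy host, credentials, and target URLs are placeholders, not a real provider or site). From the client's side it's one proxy endpoint; the provider rotates the exit IP per request.

    # Minimal sketch of typical rotating-proxy usage; host, credentials and
    # target URLs are placeholders.
    import requests

    PROXY = "http://USER:PASS@rotating-proxy.example.com:8000"
    proxies = {"http": PROXY, "https": PROXY}

    for page in range(1, 6):
        # Same proxy URL every time; the provider picks a new exit IP per request,
        # so per-IP throttling on the target only ever sees a few hits per address.
        r = requests.get(f"https://example.com/page/{page}", proxies=proxies, timeout=10)
        print(page, r.status_code)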

Yeah I know, I've done scraping too.

It can absolutely be that, but that requires a confluence of multiple factors: a misconfigured scraper hitting the site over and over, a big botnet-like proxy setup that is way overkill for scraping, a setup sophisticated enough to do all that yet simultaneously stupid enough not to cope with a site that is mostly text and a couple of gigs at most, and all of that over an extended timeframe without anyone realising their scraper is stuck.

Or, alternative explanation: it's a DDoS.


Except that I think it's clear that the motive was getting the data, not taking the site offline. The evidence for that is that it stopped on its own, without them doing anything to mitigate it.

Also, I don't know why you think this is sophisticated; it's probably 40 lines of Python code, max.
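For the sake of argument, here is roughly what such a naive crawler looks like (the start URL is a placeholder, and there is deliberately no rate limiting or robots.txt handling, which is exactly how a small site gets hammered):

    # Rough sketch of a naive breadth-first crawler; the start URL is a
    # placeholder. No delays, no robots.txt, no caching -- naive by design.
    from collections import deque
    from urllib.parse import urljoin, urlparse

    import requests
    from bs4 import BeautifulSoup

    START = "https://example.com/"
    HOST = urlparse(START).netloc

    seen, queue = {START}, deque([START])
    while queue:
        url = queue.popleft()
        try:
            resp = requests.get(url, timeout=10)
        except requests.RequestException:
            continue
        if "text/html" not in resp.headers.get("Content-Type", ""):
            continue
        # Follow every same-host link we haven't seen yet.
        for a in BeautifulSoup(resp.text, "html.parser").find_all("a", href=True):
            link = urljoin(url, a["href"]).split("#")[0]
            if urlparse(link).netloc == HOST and link not in seen:
                seen.add(link)
                queue.append(link)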


No, DDoS attacks do stop on their own too.

The fact that it stopped is absolutely not "evidence" that the motive was grabbing data. Honestly...


Ok so they spent all that money to... mildly inconvenience users temporarily? Lol.

If you call it a DDoS you can't capitalize on AI hate.

It likely is AI scrapers essentially doing a DDoS. They use separate IPs (and vary the UA) to prevent blocking.

I have a site which is currently being hit (over 10k requests today), and it looks like scrapers, as every URL is different. If it were a DDoS, they would target costly pages like my search, not every single URL.

SQLite had the same thing (https://sqlite.org/forum/forumpost/7d3eb059f81ff694), as have a few other open source repositories. It looks like badly written crawlers trying to crawl sites as fast as possible.
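One rough way to check which of the two you're looking at is URL diversity per client in the access log: a crawl spreads requests across the whole URL space, while a DDoS hammers a handful of expensive endpoints. A minimal sketch, assuming a standard combined-format log at a placeholder path:

    # Sketch: count distinct IPs and URLs in an access log to gauge whether
    # traffic looks like a crawl or a DDoS. Log path and combined log format
    # are assumptions.
    import re
    from collections import Counter

    LINE = re.compile(r'^(\S+) \S+ \S+ \[[^\]]+\] "(?:GET|POST|HEAD) (\S+)')

    ips, urls = Counter(), Counter()
    with open("access.log") as f:  # placeholder path
        for line in f:
            m = LINE.match(line)
            if m:
                ip, path = m.groups()
                ips[ip] += 1
                urls[path] += 1

    total = sum(urls.values())
    print(f"{total} requests, {len(ips)} distinct IPs, {len(urls)} distinct URLs")
    # A crawl spreads thin across URLs; a DDoS concentrates on a few hot paths.
    print("Top URLs:", urls.most_common(5))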



