Flutterby™! : So in the past day someone at



2025-06-04 20:45:02.462166+02 by Dan Lyke 3 comments

So in the past day, someone at 34.132.153.18 running Scrapy has made 106,033 requests to Flutterby.com. Which, you know, on the one hand "whatever", on the other hand it also seems to be making requests with weird characters.

But I think the open web was a mistake.
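A tally like the one above (requests per client address, filtered by user agent) could be produced along these lines. This is only a minimal sketch: the post does not say which server or log format is in use, so the Apache/nginx "combined" log format is an assumption, and the function name `count_requests_by_ip` and the sample lines are made up for illustration.

```python
import re
from collections import Counter

# Assumed layout (combined log format):
#   ip ident user [time] "request" status size "referer" "user-agent"
# Quoted fields may contain backslash-escaped characters such as \".
QUOTED = r'"((?:[^"\\]|\\.)*)"'
LOG_LINE = re.compile(
    r'^(\S+) \S+ \S+ \[[^\]]*\] ' + QUOTED + r' (\d{3}) \S+ ' + QUOTED + ' ' + QUOTED
)


def count_requests_by_ip(lines, agent_substring="scrapy"):
    """Count requests per client IP whose user agent contains agent_substring.

    Lines that do not match the assumed format are skipped.
    """
    counts = Counter()
    needle = agent_substring.lower()
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue
        ip, _request, _status, _referer, agent = match.groups()
        if needle in agent.lower():
            counts[ip] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical sample lines using documentation-range addresses.
    sample = [
        '192.0.2.10 - - [04/Jun/2025:20:00:00 +0200] "GET / HTTP/1.1" 200 512 "-" "Scrapy/2.11 (+https://scrapy.org)"',
        '192.0.2.20 - - [04/Jun/2025:20:00:01 +0200] "GET / HTTP/1.1" 200 512 "-" "Mozilla/5.0"',
    ]
    for ip, hits in count_requests_by_ip(sample).most_common():
        print(ip, hits)
```

Passing an open log file instead of the sample list works the same way, since the function only iterates over lines.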

[ related topics: History Sports ]

Comments in ascending chronological order:

#Comment Re: So in the past day someone at made: 2025-06-04 22:33:47.553766+02 by: spc476

I too got hit with 172,186 hits from Scrapy. Not from the same IP as you, but it's from the same ASN---Google Cloud Platform (doesn't surprise me). And it's weird, because over the past year or so, I've had very few hits from Scrapy.

#Comment Re: So in the past day someone at made: 2025-06-04 22:33:47.553766+02 by: spc476

Good Lord! The companies using Scrapy (mentioned on the Scrapy home page) aren't even hiding what they're doing. The first one I looked at: "Overcome anti-scraping blockers: DataFlirt web scraping services overcome IP blocking, browser fingerprints, and captcha anti-blocking technologies for high quality data extraction."

Jesus Christ! Can't someone take the companies out behind the shed?

#Comment Re: So in the past day someone at made: 2025-06-04 22:33:47.553766+02 by: Dan Lyke

Holy shit. I cleaned off some log space and restarted some things, and in just over half an hour it hit me over 74k times.

Kinda thinking it might be time to move this whole thing over to the Gemini protocol...



Flutterby™ is a trademark claimed by Dan Lyke for the web publications at www.flutterby.com and www.flutterby.net.