"Short version: I tried to load https://maurycyz.com/misc/ipv4/ directly and via search. The server is intentionally serving AI crawlers decoy text ("Garbage for the garbage king!", random chemistry/manual fragments, etc.) instead of the real article. Because of that, I can't actually read the real content of the page."
Seems like this poison-pill strategery is a non-starter if a chatbot can reliably identify the page as nonsense. The most you're going to do is burn bandwidth to trap a spider.
I mean, how does it know that, though? How would you know whether the set of possible texts is garbage without running them? Honestly, it feels like you're saying LLMs solved the halting problem for programs, which seems dishonest; granted, you could probably guess with high efficiency.
Not a clue. But apparently it does. Try a few nonsense texts yourself, see if it rejects them.
I'm saying that if you're spidering the whole web and then training an LLM on that corpus, asking an existing LLM "does this page make sense?" is a comparatively small additional load.
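Roughly what that filter pass could look like, as a minimal sketch (the OpenAI-compatible client, model name, and prompt are all illustrative assumptions, not anyone's actual pipeline):

```python
# Sketch: gate crawled pages on a cheap "is this coherent prose?" LLM check.
# Assumes an OpenAI-compatible API; model choice and prompt are placeholders.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def looks_coherent(page_text: str) -> bool:
    """Return True if the model judges the page to be coherent prose."""
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # assumption: any cheap instruct model will do
        messages=[{
            "role": "user",
            "content": (
                "Answer YES or NO only. Is the following text coherent, "
                "human-written prose rather than filler or word salad?\n\n"
                + page_text[:4000]  # a prefix is usually enough to judge
            ),
        }],
        max_tokens=3,
    )
    return resp.choices[0].message.content.strip().upper().startswith("Y")
```

One such call per page is small next to the cost of training on that page, which is the asymmetry I mean.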
> guess with high efficiency
Yes, I think that's basically what's happening. Markov nonsense is cheap to produce but also easy to classify. A more subtle strategy might be more successful (for example, someone downthread mentions using LLM-generated text, and we know that's quite a hard thing to classify).
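To make "cheap to produce, easy to classify" concrete, here's a toy sketch of both halves: a bigram Markov babbler, and GPT-2 perplexity as a stand-in classifier. The actual filters crawlers run aren't public; perplexity is just one classic signal, and babble typically scores far above real prose on it:

```python
# Sketch: generate Markov word salad, then score it with a small LM.
# GPT-2 perplexity is a stand-in detector, not any crawler's real filter.
import random
from collections import defaultdict

import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

def markov_babble(corpus: str, n_words: int = 80, seed: int = 0) -> str:
    """Emit word salad by sampling bigram transitions from a source text."""
    rng = random.Random(seed)
    words = corpus.split()
    chain = defaultdict(list)
    for a, b in zip(words, words[1:]):
        chain[a].append(b)
    out = [rng.choice(words)]
    for _ in range(n_words - 1):
        # fall back to the full vocabulary if a word has no recorded successor
        out.append(rng.choice(chain.get(out[-1]) or words))
    return " ".join(out)

def perplexity(text: str) -> float:
    """Per-token perplexity under GPT-2; lower means more prose-like."""
    tok = GPT2TokenizerFast.from_pretrained("gpt2")
    model = GPT2LMHeadModel.from_pretrained("gpt2").eval()
    ids = tok(text, return_tensors="pt").input_ids
    with torch.no_grad():
        loss = model(ids, labels=ids).loss  # mean cross-entropy over tokens
    return float(torch.exp(loss))
```

Threshold the perplexity gap between a page and typical prose and you have a filter. LLM-generated decoys sidestep exactly this signal, since that text is sampled to be low-perplexity in the first place.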