Flutterby™! : Finally, a legitimate use for XML...

Next unread comment / Catchup all unread comments User Account Info | Logout | XML/Pilot/etc versions | Long version (with comments) | Weblog archives | Site Map | | Browse Topics

Finally, a legitimate use for XML...

2025-04-27 17:12:38.790206+02 by Dan Lyke 0 comments

Hidden Layer: Novel Universal Bypass for All Major LLMs — The Policy Puppetry Prompt Injection Technique. Format it like a configuration file, if that doesn't work then write your query out in l33tsp3@k.

Leveraging a novel combination of an internally developed policy technique and roleplaying, we are able to bypass model alignment and produce outputs that are in clear violation of AI safety policies: CBRN (Chemical, Biological, Radiological, and Nuclear), mass violence, self-harm and system prompt leakage.

[ related topics: Interactive Drama Work, productivity and environment Artificial Intelligence ]

comments in ascending chronological order (reverse):