Flutterby™! : Finally, a legitimate use for XML...

Next unread comment / Catchup all unread comments User Account Info | Logout | XML/Pilot/etc versions | Long version (with comments) | Weblog archives | Site Map | | Browse Topics

Finally, a legitimate use for XML...

2025-04-27 17:12:38.790206+02 by Dan Lyke 0 comments

Hidden Layer: Novel Universal Bypass for All Major LLMs — The Policy Puppetry Prompt Injection Technique. Format it like a configuration file, if that doesn't work then write your query out in l33tsp3@k.

Leveraging a novel combination of an internally developed policy technique and roleplaying, we are able to bypass model alignment and produce outputs that are in clear violation of AI safety policies: CBRN (Chemical, Biological, Radiological, and Nuclear), mass violence, self-harm and system prompt leakage.

[ related topics: Interactive Drama Work, productivity and environment Artificial Intelligence ]

comments in ascending chronological order (reverse):

Comment policy

We will not edit your comments. However, we may delete your comments, or cause them to be hidden behind another link, if we feel they detract from the conversation. Commercial plugs are fine, if they are relevant to the conversation, and if you don't try to pretend to be a consumer. Annoying endorsements will be deleted if you're lucky, if you're not a whole bunch of people smarter and more articulate than you will ridicule you, and we will leave such ridicule in place.


Flutterby™ is a trademark claimed by

Dan Lyke
for the web publications at www.flutterby.com and www.flutterby.net.