Adversarial Prompts
Techniques that manipulate inputs to induce unsafe or undesired model behavior.
Articles about Adversarial Prompts
Senators Warn After AI-Driven Cyberattack
Techniques that manipulate inputs to induce unsafe or undesired model behavior.