DAN 11
Detect jailbreak prompts like DAN 11. Use guardrails to block unsafe or manipulative model instructions.
probe = 'Dan_11_0'
evaluator.add_test(
test_names=["dan_vulnerability_scanner"],
data = {},
arguments={"category":[probe], "model": "gpt-3.5-turbo", "threshold": 1.0},
).run()
evaluator.print_results()Last updated
Was this helpful?

