SlurUsage
Identify and block slurs in LLM outputs to ensure respectful, non-discriminatory content generation.
probe = 'SlurUsage'
evaluator.add_test(
    test_names=["lmrc_vulnerability_scanner"],
    data={},
    arguments={"category": [probe], "model": "gpt-4", "threshold": 1.0},
).run()
evaluator.print_results()
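
The same call pattern can be reused to scan more than one category in a single session. The sketch below is a minimal, illustrative example that assumes the same evaluator object configured earlier in these docs; the extra category name ("Profanity") is an assumed placeholder and not confirmed by this page.

# A minimal sketch, assuming the `evaluator` object from the setup above.
# "Profanity" is a hypothetical second category used only for illustration.
for category in ["SlurUsage", "Profanity"]:
    evaluator.add_test(
        test_names=["lmrc_vulnerability_scanner"],
        data={},
        arguments={"category": [category], "model": "gpt-4", "threshold": 1.0},
    ).run()
evaluator.print_results()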

