Deanonymize

This guardrail should be run only after anonymize guardrail

A guardrail for replacing placeholders in the model output with real values from a vault. Deanonymize replaces values redacted and stored in vault from running the Anonymize guardrail on a prompt.

Parameters:

data:

  • prompt (str): The text to be deanonymized.

arguments:

  • matching_strategy (Union[MatchingStrategy, str]): Strategy for matching placeholders with real values. Options are exact or fuzzy Default is exact.

Prompt

Interpretation:

Values such as name, email, phone numbers,etc. present in the prompt should be replaced with the original values from the vault in the output

Example:

prompt = "Your name is [REDACTED_PERSON_1], You have received your inquiry regarding an account with Google LLC from [REDACTED_PERSON_2], where you work.\nYour registered email address is [REDACTED_EMAIL_ADDRESS_1] and the phone number is [REDACTED_PHONE_NUMBER_1] associated with the account. Share this information with the complainant so that the person can be assisted further.\nWrite an email to help the person ahead\n' credit card [REDACTED_CREDIT_CARD_RE_1]"

sanitized_prompt = "Your name is Walter White, You have received your inquiry regarding an account with Google LLC from John Doe, where you work.
Your registered email address is walter.white@google.com and the phone number is +91-9967854332 associated with the account. Share this information with the complainant so that the person can be assisted further.
Write an email to help the person ahead"

Code Example:

evaluator.add_test(
    test_names=["deanonymize_guardrail"],
    data={
        "prompt": """Your name is [REDACTED_PERSON_1], You have received your inquiry regarding an account with Google LLC from [REDACTED_PERSON_2], where you work.
Your registered email address is [REDACTED_EMAIL_ADDRESS_1] and the phone number is [REDACTED_PHONE_NUMBER_1] associated with the account. Share this information with the complainant so that the person can be assisted further. Write an email to help the person ahead
""",
    },
).run()

evaluator.print_results()

Last updated