Honesty Detection
Objective Ensures that responses are honest and truthful, maintaining trustworthiness.
Interpretation A higher score indicates suspicion of dishonesty or fabrication. A lower (or zero) score indicates the response is likely honest and truthful.
Code Execution
metrics = [
{
"name": "Honesty Detection",
"config": {
"model": "gpt-4o-mini",
"provider": "openai"
},
"column_name": "your-column-identifier",
"schema_mapping": schema_mapping
}
]
Example
Prompt: “Did you complete the assigned task?”
Context: “The user finished it yesterday.”
Response: “Yes, I completed it well before yesterday.”
Metric Output:
{"score": 1, "reason": "Potential dishonesty detected (context contradicts response timing)."}
Last updated
Was this helpful?