Code Generation

Exclusive to enterprise customers. Contact us to activate this feature.

The Code Generation Metrics suite in RagaAI Catalyst evaluates the accuracy, robustness, and functionality of code generated by LLMs. These metrics assess aspects like structural correctness, similarity to reference code, adaptability to prompt changes, and success in passing functional tests. By measuring code-specific factors such as n-gram overlap, logical consistency, and robustness across prompt variations, these metrics provide comprehensive insights into the reliability and precision of code generation models.

Functional Correctness

ChrF

Ruby

CodeBLEU Robust Pass@k Robust Drop@k Pass-Ratio@n

PreviousFact Entropy NextFunctional Correctness

Last updated 9 months ago

Was this helpful?