Code Generation
Exclusive to enterprise customers. Contact us to activate this feature.
The Code Generation Metrics suite in RagaAI Catalyst evaluates the accuracy, robustness, and functionality of code generated by LLMs. The metrics assess structural correctness, similarity to reference code, stability under prompt changes, and success in passing functional tests. Because they measure code-specific factors such as n-gram overlap, logical consistency, and robustness across prompt variations, they indicate how reliable and precise a code generation model is.
Metrics in this suite: Functional Correctness, ChrF, Ruby, CodeBLEU, Robust Pass@k, Robust Drop@k, Pass-Ratio@n.
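
To make "success in passing functional tests" concrete, here is a minimal sketch of how a pass-rate style metric could be computed outside the platform. It is not the RagaAI Catalyst API: the helper names `passes_tests` and `pass_at_k`, the sample candidates, and the tests are all hypothetical. The formula is the commonly used unbiased pass@k estimator, and it is an assumption that the suite's pass-based metrics resemble it.

```python
from math import comb


def passes_tests(candidate_src: str, test_src: str) -> bool:
    """Return True if the candidate code loads and every assertion in test_src holds.

    Hypothetical helper. exec() is NOT a sandbox: only run trusted code this way.
    """
    namespace: dict = {}
    try:
        exec(candidate_src, namespace)  # define the candidate function(s)
        exec(test_src, namespace)       # run assert-based tests against them
    except Exception:
        # Syntax errors, runtime errors, and failed assertions all count as a fail.
        return False
    return True


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate: 1 - C(n - c, k) / C(n, k).

    n = samples generated, c = samples that passed, k = samples drawn.
    """
    if not (0 < k <= n and 0 <= c <= n):
        raise ValueError("require 0 < k <= n and 0 <= c <= n")
    if n - c < k:
        # Fewer than k failing samples: any draw of k contains a passing one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


# Four hypothetical generations for the prompt "write add(a, b)".
candidates = [
    "def add(a, b):\n    return a + b",
    "def add(a, b):\n    return a - b",   # wrong logic
    "def add(a, b):\n    return b + a",
    "def add(a, b) return a + b",         # syntax error
]
tests = "assert add(1, 2) == 3\nassert add(-1, 1) == 0"

results = [passes_tests(src, tests) for src in candidates]
n, c = len(results), sum(results)

for k in (1, 2, 3):
    print(f"pass@{k} = {pass_at_k(n, c, k):.3f}")
```

With two of the four candidates passing, the estimate rises from 0.5 at k = 1 toward 1.0 as k grows, because drawing more samples makes it more likely that at least one of them is correct.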