# OCR

<figure><img src="/files/A6DLeBjJUq3yuKyy65rN" alt=""><figcaption><p><a href="https://platform.raga.ai/">Try the RagaAI Platform!</a></p></figcaption></figure>

The OCR Project on the sample workspace is an example of how the RagaAI Testing Platform can help with the following tasks:

* Data Quality Checks before training a new model
* Model Quality Checks to identify performance gaps and perform regression analysis
* End-to-end pipeline level tests beyond AI models

The RagaAI Testing Platform is designed to add science to the art of detecting AI issues, performing root cause analysis, and providing actionable recommendations. It does this through an automated suite of tests on the platform.

An overview of the tests for the sample project is given below.

#### 1. Outlier Detection

<figure><img src="/files/W8c0o2peXiG5owOf0fFo" alt=""><figcaption><p>Detecting outliers for OCR data on the RagaAI Testing Platform</p></figcaption></figure>

**Goal** - Identify scenarios in the field data that are drastically different (out-of-distribution, OOD) from the training dataset. The AI model is prone to generating erroneous predictions on such datapoints.

**Methodology** - RagaAI automatically detects OOD datapoints using the embeddings from the RagaAI DNA technology.

**Insight** - In this case, the platform identifies data drift for images that are rotated or have different lighting conditions, because the model has only been trained on portrait images.

**Impact -** This automated test helps users assess whether the data in the production setting has shifted and the model needs to be retrained.
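To make the idea of embedding-based outlier detection concrete, here is a minimal sketch in plain Python. It is not the RagaAI implementation: the RagaAI DNA embeddings are not reproduced here, so the sketch assumes embeddings are already available as lists of floats, and it swaps in a generic k-nearest-neighbour distance score. The function names, the toy 2-D vectors, and the `k` and `quantile` defaults are all hypothetical choices made for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_distance(point, reference, k, skip_self=False):
    """Mean distance from `point` to its k nearest neighbours in `reference`."""
    dists = sorted(euclidean(point, r) for r in reference)
    if skip_self:
        dists = dists[1:]  # drop the zero distance of a point to itself
    return sum(dists[:k]) / k


def fit_threshold(train_embeddings, k=3, quantile=0.95):
    """Pick a score threshold from the training set's own k-NN distances."""
    scores = sorted(
        knn_distance(e, train_embeddings, k, skip_self=True)
        for e in train_embeddings
    )
    index = min(len(scores) - 1, int(quantile * len(scores)))
    return scores[index]


def find_outliers(train_embeddings, field_embeddings, k=3, quantile=0.95):
    """Return indices of field embeddings that sit farther from the
    training data than the chosen quantile of training points do."""
    threshold = fit_threshold(train_embeddings, k, quantile)
    return [
        i
        for i, e in enumerate(field_embeddings)
        if knn_distance(e, train_embeddings, k) > threshold
    ]


# Toy stand-ins for real embeddings (assumption: 2-D vectors for readability).
train = [(i / 10, j / 10) for i in range(3) for j in range(3)]
field = [(0.05, 0.05), (5.0, 5.0)]  # the second point is far from the training data

print(find_outliers(train, field))  # prints [1]
```

In practice the threshold would be tuned on held-out data, and a rotated or poorly lit image would be expected to land far from the training cluster in embedding space, much like the second toy point above.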

For more details, please refer to the detailed [outlier detection documentation](/ragaai-prism/test-inventory/ocr/outlier-detection.md).

#### 2. Missing Detections

<figure><img src="/files/4qGgGiQ4hEYHmQtNicdn" alt=""><figcaption><p>Detecting missing values in OCR data on the RagaAI Testing Platform</p></figcaption></figure>

**Goal** - Identify label drift among the model predictions

**Methodology** - RagaAI automatically detects OOD datapoints using the embeddings from the RagaAI DNA technology.

**Insight** - In this case, the platform identifies label drift for images that are rotated or have different lighting conditions, because the model has only been trained on portrait images.

**Impact -** This automated test helps users assess whether the label distribution in the production setting has shifted and the model needs to be retrained.
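As a rough illustration of what a shift in label distribution means for OCR output, the sketch below compares character frequencies between training labels and production predictions. This is a different and much simpler technique than the embedding-based method described above, and it is not the platform's test. The function names, the choice of total variation distance, and the `threshold` value of 0.2 are assumptions made for the example.

```python
from collections import Counter


def label_distribution(texts):
    """Relative frequency of each character across a list of OCR strings."""
    counts = Counter(ch for text in texts for ch in text)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {ch: n / total for ch, n in counts.items()}


def total_variation(p, q):
    """Total variation distance between two discrete distributions (0 to 1)."""
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)


def has_label_drift(train_texts, field_texts, threshold=0.2):
    """Flag drift when the character distributions differ by more than
    `threshold` (an illustrative cut-off, not a recommended value)."""
    distance = total_variation(
        label_distribution(train_texts),
        label_distribution(field_texts),
    )
    return distance > threshold


# Hypothetical labels: training text vs. two batches of production predictions.
train_labels = ["AB12", "BA21"]
similar_batch = ["21BA"]
shifted_batch = ["ZZZZ"]

print(has_label_drift(train_labels, similar_batch))
print(has_label_drift(train_labels, shifted_batch))
```

A distance near 0 means the two sets use characters in similar proportions, while a distance near 1 means they barely overlap. A real check would likely work at the level of words, fields, or detection counts rather than single characters.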

For more details, please refer to the detailed [missing values documentation](/ragaai-prism/test-inventory/ocr/missing-value-test.md).

