> For the complete documentation index, see [llms.txt](https://docs.raga.ai/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.raga.ai/ragaai-prism/test-inventory/instance-segmentation/scenario-imbalance.md).

# Scenario Imbalance

### Execute Test:

The  code executes the Class Imbalance Test using two different metrics, namely Jensen-Shannon Divergence and Chi-Squared Test, to evaluate the distribution of scenarios within a dataset.

```python
rules = SBRules()
rules.add(metric="js_divergence", ideal_distribution="uniform", metric_threshold=0.1)
rules.add(metric="chi_squared_test", ideal_distribution="uniform", metric_threshold=0.1)


# clustering is required only at cluster level
cls_default = clustering(test_session=test_session,
                         dataset_name=dataset_name,
                         method="k-means",
                         embedding_col="Embedding",
                         level="image",
                         args={"numOfClusters": 4}
                         )


edge_case_detection = scenario_imbalance(test_session=test_session,
                                            dataset_name = dataset_name,
                                            test_name = run_name,
                                            type = "scenario_imbalance",
                                            output_type="cluster",
                                            embedding= "Embedding",
                                            rules = rules,
                                            clustering = cls_default
                                             )
test_session.add(edge_case_detection)
test_session.run()
```

1. **Initialize Scenario Imbalance Rules**:
   * Use the `SBRules()` function to initialize the rules for the test.
2. **Add Rules**:
   * Use the `rules.add()` function to add specific rules with the following parameters:
     * `metric`: The metric used to evaluate scenario distribution (e.g., js\_divergence, chi\_squared\_test).
     * `ideal_distribution`: The ideal distribution assumption for the metric (e.g., "uniform").
     * `metric_threshold`: The threshold for the metric, indicating when the scenario distribution is considered imbalanced.
3. **Configure Clustering**:
   * Perform clustering on the dataset to group similar scenarios together using the desired method and parameters.
     * Use the `clustering()` function with parameters such as `method`, `embedding_col`, `level`, and `args`.
4. **Execute Test**:
   * Use the `scenario_imbalance_test()` function to execute the test with the following parameters:
     * `test_session`: The session object managing tests.
     * `dataset_name`: Name of the dataset to be tested.
     * `test_name`: Name of the test run.
     * `type`: Type of test, which should be set to "scenario\_imbalance".
     * `output_type`: Type of output expected from the model.
     * `annotation_column_name`: Name of the column containing annotations.
     * `rules`: Predefined rules for the test.
5. **Add Test to Session**:
   * Use the `test_session.add()` function to register the test with the test session.
6. **Run Test**:
   * Use the `test_session.run()` function to start the execution of all tests added to the session, including the Scenario Imbalance Test.

By following these steps, you can effectively evaluate scenario distribution within your dataset using the Scenario Imbalance Test.

### Interpreting the Results

The Scenario Imbalance Test provides insights into the distribution of scenarios or contexts within a dataset. The results are presented in three segments:

**Bar Chart Comparison**

* The bar chart compares the distribution of scenarios between the training dataset and the dataset under evaluation.
* This visualisation highlights any discrepancies in scenario distribution between the two datasets.

Use the bar chart to compare scenario distributions and identify any significant disparities between the training dataset and the dataset under evaluation.

**Data Grid View:** Helps visualise annotations with images sorted by mistake scores.

**Image View:** Delve into detailed analyses for each image

By leveraging these features, you can effectively evaluate scenario distribution within your datasets using the Scenario Imbalance Test.

####


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.raga.ai/ragaai-prism/test-inventory/instance-segmentation/scenario-imbalance.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.