LogoLogo
Slack CommunityCatalyst Login
  • Welcome
  • RagaAI Catalyst
    • User Quickstart
    • Concepts
      • Configure Your API Keys
      • Supported LLMs
        • OpenAI
        • Gemini
        • Azure
        • AWS Bedrock
        • ANTHROPIC
      • Catalyst Access/Secret Keys
      • Enable Custom Gateway
      • Uploading Data
        • Create new project
        • RAG Datset
        • Chat Dataset
          • Prompt Format
        • Logging traces (LlamaIndex, Langchain)
        • Trace Masking Functions
        • Trace Level Metadata
        • Correlating Traces with External IDs
        • Add Dataset
      • Running RagaAI Evals
        • Executing Evaluations
        • Compare Datasets
      • Analysis
      • Embeddings
    • RagaAI Metric Library
      • RAG Metrics
        • Hallucination
        • Faithfulness
        • Response Correctness
        • Response Completeness
        • False Refusal
        • Context Relevancy
        • Context Precision
        • Context Recall
        • PII Detection
        • Toxicity
      • Chat Metrics
        • Agent Quality
        • Instruction Adherence
        • User Chat Quality
      • Text-to-SQL
        • SQL Response Correctness
        • SQL Prompt Ambiguity
        • SQL Context Ambiguity
        • SQL Context Sufficiency
        • SQL Prompt Injection
      • Text Summarization
        • Summary Consistency
        • Summary Relevance
        • Summary Fluency
        • Summary Coherence
        • SummaC
        • QAG Score
        • ROUGE
        • BLEU
        • METEOR
        • BERTScore
      • Information Extraction
        • MINEA
        • Subjective Question Correction
        • Precision@K
        • Chunk Relevance
        • Entity Co-occurrence
        • Fact Entropy
      • Code Generation
        • Functional Correctness
        • ChrF
        • Ruby
        • CodeBLEU
        • Robust Pass@k
        • Robust Drop@k
        • Pass-Ratio@n
      • Marketing Content Evaluation
        • Engagement Score
        • Misattribution
        • Readability
        • Topic Coverage
        • Fabrication
      • Learning Management System
        • Topic Coverage
        • Topic Redundancy
        • Question Redundancy
        • Answer Correctness
        • Source Citability
        • Difficulty Level
      • Additional Metrics
        • Guardrails
          • Anonymize
          • Deanonymize
          • Ban Competitors
          • Ban Substrings
          • Ban Topics
          • Code
          • Invisible Text
          • Language
          • Secret
          • Sentiment
          • Factual Consistency
          • Language Same
          • No Refusal
          • Reading Time
          • Sensitive
          • URL Reachability
          • JSON Verify
        • Vulnerability Scanner
          • Bullying
          • Deadnaming
          • SexualContent
          • Sexualisation
          • SlurUsage
          • Profanity
          • QuackMedicine
          • DAN 11
          • DAN 10
          • DAN 9
          • DAN 8
          • DAN 7
          • DAN 6_2
          • DAN 6_0
          • DUDE
          • STAN
          • DAN_JailBreak
          • AntiDAN
          • ChatGPT_Developer_Mode_v2
          • ChatGPT_Developer_Mode_RANTI
          • ChatGPT_Image_Markdown
          • Ablation_Dan_11_0
          • Anthropomorphisation
      • Guardrails
        • Competitor Check
        • Gibberish Check
        • PII
        • Regex Check
        • Response Evaluator
        • Toxicity
        • Unusual Prompt
        • Ban List
        • Detect Drug
        • Detect Redundancy
        • Detect Secrets
        • Financial Tone Check
        • Has Url
        • HTML Sanitisation
        • Live URL
        • Logic Check
        • Politeness Check
        • Profanity Check
        • Quote Price
        • Restrict Topics
        • SQL Predicates Guard
        • Valid CSV
        • Valid JSON
        • Valid Python
        • Valid Range
        • Valid SQL
        • Valid URL
        • Cosine Similarity
        • Honesty Detection
        • Toxicity Hate Speech
    • Prompt Playground
      • Concepts
      • Single-Prompt Playground
      • Multiple Prompt Playground
      • Run Evaluations
      • Using Prompt Slugs with Python SDK
      • Create with AI using Prompt Wizard
      • Prompt Diff View
    • Synthetic Data Generation
    • Gateway
      • Quickstart
    • Guardrails
      • Quickstart
      • Python SDK
    • RagaAI Whitepapers
      • RagaAI RLEF (RAG LLM Evaluation Framework)
    • Agentic Testing
      • Quickstart
      • Concepts
        • Tracing
          • Langgraph (Agentic Tracing)
          • RagaAI Catalyst Tracing Guide for Azure OpenAI Users
        • Dynamic Tracing
        • Application Workflow
      • Create New Dataset
      • Metrics
        • Hallucination
        • Toxicity
        • Honesty
        • Cosine Similarity
      • Compare Traces
      • Compare Experiments
      • Add metrics locally
    • Custom Metric
    • Auto Prompt Optimization
    • Human Feedback & Annotations
      • Thumbs Up/Down
      • Add Metric Corrections
      • Corrections as Few-Shot Examples
      • Tagging
    • On-Premise Deployment
      • Enterprise Deployment Guide for AWS
      • Enterprise Deployment Guide for Azure
      • Evaluation Deployment Guide
        • Evaluation Maintenance Guide
    • Fine Tuning (OpenAI)
    • Integration
    • SDK Release Notes
      • ragaai-catalyst 2.1.7
  • RagaAI Prism
    • Quickstart
    • Sandbox Guide
      • Object Detection
      • LLM Summarization
      • Semantic Segmentation
      • Tabular Data
      • Super Resolution
      • OCR
      • Image Classification
      • Event Detection
    • Test Inventory
      • Object Detection
        • Failure Mode Analysis
        • Model Comparison Test
        • Drift Detection
        • Outlier Detection
        • Data Leakage Test
        • Labelling Quality Test
        • Scenario Imbalance
        • Class Imbalance
        • Active Learning
        • Image Property Drift Detection
      • Large Language Model (LLM)
        • Failure Mode Analysis
      • Semantic Segmentation
        • Failure Mode Analysis
        • Labelling Quality Test
        • Active Learning
        • Drift Detection
        • Class Imbalance
        • Scenario Imbalance
        • Data Leakage Test
        • Outlier Detection
        • Label Drift
        • Semantic Similarity
        • Near Duplicates Detection
        • Cluster Imbalance Test
        • Image Property Drift Detection
        • Spatio-Temporal Drift Detection
        • Spatio-Temporal Failure Mode Analysis
      • Tabular Data
        • Failure Mode Analysis
      • Instance Segmentation
        • Failure Mode Analysis
        • Labelling Quality Test
        • Drift Detection
        • Class Imbalance
        • Scenario Imbalance
        • Label Drift
        • Data Leakage Test
        • Outlier Detection
        • Active Learning
        • Near Duplicates Detection
      • Super Resolution
        • Semantic Similarity
        • Active Learning
        • Near Duplicates Detection
        • Outlier Detection
      • OCR
        • Missing Value Test
        • Outlier Detection
      • Image Classification
        • Failure Mode Analysis
        • Labelling Quality Test
        • Class Imbalance
        • Drift Detection
        • Near Duplicates Test
        • Data Leakage Test
        • Outlier Detection
        • Active Learning
        • Image Property Drift Detection
      • Event Detection
        • Failure Mode Analysis
        • A/B Test
    • Metric Glossary
    • Upload custom model
    • Event Detection
      • Upload Model
      • Generate Inference
      • Run tests
    • On-Premise Deployment
      • Enterprise Deployment Guide for AWS
      • Enterprise Deployment Guide for Azure
  • Support
Powered by GitBook
On this page
  • Provision Base Infrastructure
  • Pre-Requiremets
  • AWS login using CLI
  • Execute the following command to Provision the Environment:
  • Install Raga Catalyst
  • Access Raga Catalyst

Was this helpful?

  1. RagaAI Catalyst
  2. On-Premise Deployment

Enterprise Deployment Guide for AWS

AWS Enterprise Deployment Guide for RagaAI Catalyst

PreviousOn-Premise DeploymentNextEnterprise Deployment Guide for Azure

Last updated 2 months ago

Was this helpful?

Deploying RagaAI Catalyst on your AWS infrastructure ensures that your data remains secure and within your control. Follow these steps to set up RagaAI Catalyst in your AWS account.

Provision Base Infrastructure

To get started quickly, you can use the Terraform Templates and scripts provided by the Raga team:

Pre-Requiremets

  • Set Up the AWS User: The user running the installation is assumed to have administrator privileges. Alternatively, create a dedicated with a specific set of permissions.

  • Service Quotas: Ensure that the Service Quotas minimum is set to 32 vCPUs at the account level for compute-optimized (C) instance types for EKS worker node groups.

  • Ensure that your system supports Bash commands

  • The following CLI tools should be installed

AWS login using CLI

Option A: Using aws configure
aws configure

Provide:

  • Access Key ID

  • Secret Access Key

  • Default Region (e.g., us-east-1)

  • Output Format (json, text, table)

Option B: Using AWS Keys Export
export AWS_ACCESS_KEY_ID=your-access-key-id
export AWS_SECRET_ACCESS_KEY=your-secret-access-key

Validate aws cli Login:

aws sts get-caller-identity

Execute the following command to Provision the Environment:

Extract the TAR file provided by the RagaAI team:

tar -zxvf raga-catalyst-terraform-<version>.tar

aws-infra.sh is the cloud provisioning script. This uses python, aws cli and terraform to provision the base infra.

bash aws-infra.sh --region=<region> --customer-name=<customer-name> --docker-hub-pat=<docker-hub-pat>

Replace placeholders:

  • <region>: Deployment region

  • <customer-name>: This will be prifix for all aws resources

  • <docker-hub-pat>: Provided by RagaAI team

List of AWS Resources Provisioned by RagaAI Terraform Infrastructure

Note: You can also set up your cloud environment using the following requirements as an Alternative to the RagaAI Terraform templates

S3 Bucket

Create S3 bucket and Configure CORS with the following settings:

  • Allowed Methods: GET, PUT

  • Allowed Origins: * (all origins)

  • Allowed Headers: * (all headers)

  • Exposed Headers: none

  • Max Age: 3000 seconds

EKS Cluster
  • Kubernetes v1.24+ (Recommend containerd runtime)

  • Ensure that the Service Quotas minimum is set to 32 vCPUs at the account level for compute-optimized (C) instance types

  • Worker Node Groups Configuration

    Worker Node Group

    Instance Type

    vCPU & Memory

    Min Size

    Root Volume

    Architecture

    Taints

    Labels

    AMD Node Group

    C5.2xlarge

    8vCPU & 16GB

    2 nodes

    64 GB

    x86_64

    N/A

    N/A

    ARM Node Group

    C6g.2xlarge

    8vCPU & 16GB

    2 nodes

    64 GB

    ARM64

    architecture=arm:NoSchedule

    architecture=arm, nodetype=arm-node

  • Cluster Add-ons

    • AWS EBS CSI Driver must be enabled

  • IRSA Setup

    • Role Name: raga-role

    • Required Permissions:

      • Access to the S3 bucket created above

      • Trust relationship with EKS OIDC provider

      • Service account should be system:serviceaccount:raga:raga-role

  • Cluster Autoscaler

    • Deploy Cluster autoscaler

RDS Instance
  • Instance Class: db.r5.large (2 vCPU & 16 GiB Memory)

  • Storage Allocation: 50 GB

  • Engine: mysql

EC2 Instance
  • Create an EC2 instance in a public subnet to act as a bastion/jump box.

    • Instance type: t2.micro (1 vCPU and 1 GB Memory)

    • EBS: 8 GB

  • Security group for bastion/jump box.

    • Inbound Rules:

      • Allows HTTP traffic on port 22.

    • Outbound Rules:

      • Permits all outbound traffic within the VPC

  • Security group for an internal load balancer within a VPC

    • Inbound Rules:

      • Allows HTTP traffic on port 80 from within the VPC.

      • Allows HTTPS traffic on port 443 from within the VPC

    • Outbound Rules:

      • Permits all outbound traffic within the VPC

VPC
  • VPC (Virtual Private Cloud)

  • 2 Public and 2 private subnets

  • 1 Internet Gateway (IGW)

  • 1 NAT Gateway

AWS Secrets

Required Secret Keys

  • MYSQL_HOST

  • MYSQL_USERNAME

  • MYSQL_PASSWORD

Install Raga Catalyst

Install Raga Catalyst:

  • SSH into the instance and run the installation command.

Download and Extract the TAR file provided by the RagaAI team:

curl -o $HOME/raga-catalyst-aws-deploy-<version>.tar "<URL>"
tar -zxvf $HOME/raga-catalyst-aws-deploy-<version>.tar

raga-catalyst-aws-deploy.sh is the Catalyst deployment script.

bash raga-catalyst-aws-deploy.sh --region=<region> --customer-name=<customer-name> --release-tag=<release-tag>
  • <location>: AWS region

  • <customer-name>: This will be prifix for all AWS resources

  • <release-tag>: Get the release version from Raga team

Access Raga Catalyst

Access Catalyst UI with the private loadbalancer Endpoint

  • Portal: http://<Load_Balancer_EndPoint>

  • API: http://<Load_Balancer_EndPoint>/api


For any issues during the deployment process or additional assistance, please contact our support team at .

contact@raga.ai
IAM policy
terraform
aws cli
python
Provision Base Infrastructure
Installation Execution
Verify Installation
Configure aws cli