Reducto

Install Reducto on EKS using Terraform.

Overview

The project creates Helm Release for Reducto on EKS in reducto namespace. And creates following required dependencies:

RDS instance
S3 bucket
Keda (for autoscaling of Reducto workers in-cluster)
Auto scaling of cluster nodes (Karpenter is configured, however you can use any cluster autoscaling tool)
AWS Load balancer controller or Ingress Nginx (however you can use any ingress controller)

This project demonstrates fully working cluster that's needed to run Reducto. Cloudflare is not a requirement, however its used here to setup TLS along with cert-manager.

Helm Chart

To obtain or inspect Helm Chart and available configurations in values.yaml

# Login
helm registry login registry.reducto.ai \
    --username <your-username>  \
    --password <your-password>

# Get latest Helm Chart
helm pull oci://registry.reducto.ai/reducto-api/reducto

Security

All worklods are only created in private subnet, including NLB for ingress-nginx.

For bootstrapping of the cluster both public and private endpoints are enabled, public endpoint access can be restricted or removed after provisioning:

Remove public endpoint cluster_endpoint_public_access = false.
Restrict public endpoint cluster_endpoint_public_access_cidrs = [ vpc_cidr ]

Terraform State

To use a bucket for Terraform state, create a bucket and update backend.tf.

OR you can skip this to quickly run Terraform plan and apply with locally managed terraform.tfstate state file for testing purposes.

Configuration

Make sure variables.tf has configuration that you desire, like restricting EKS public endpoint, avoiding VPC CIDR collisions, or database instance type.

Create terraform.tfvars with following contents:

reducto_helm_repo_username = "todo"
reducto_helm_repo_password = "todo"
reducto_host = "reducto.example.com"
cloudflare_api_token = "token"

# For alerting
slack_webhook_url = "todo"

Provisioning

Apply Terraform

terraform init
terraform plan
terraform apply

Configure Cloudflare DNS

Cloudflare DNS is used to obtain TLS certificate from Letsencrypt via cert-manager using dns01 solver.

Check the private LB hostname created by cluster for Nginx Ingress Controller and use it to create CNAME DNS record on Cloudflare to point to value provided in reducto_host.

Access Reducto

Reducto will be accessible on ingress-nginx NLB via hostname configured in reducto_host

For checking Reducto service health without public endpoint: port forward your local 4567 to Reducto service:

kubectl port-forward service/reducto-reducto-http 4567:80 -n reducto

# Access Reducto
curl localhost:4567

New AWS account

For Karpenter to request spot instances, create the service-linked role:

aws iam create-service-linked-role --aws-service-name spot.amazonaws.com

Notes on Destroy

To terraform destroy, comment out the lifecycle block in reducto-bucket.tf and remove deletion protection from DB.

You can remove deletion protection by setting var.db_deletion_protection = false and terraform apply.

terraform destroy may not finish because VPC will contain resources created outside of Terraform managment:

NLB for nginx controller created by AWS load balancer controller
EKS Nodes from autoscaling by Karpenter
Bucket not empty

So along side terraform destroy you'll need to manually delete above resources from AWS console.

Notes on NLB for Nginx

To customize NLB configuration:

See AWS Load Balancer controller annotations for Service, and Ingress Nginx Helm Chart configuration.
For NLB TLS Termination with ACM ssl cert (without cert-manager), configure target port in values/ingress-nginx-controller.yaml.
```
service:
  targetPorts:
    https: http
```

Monitoring

Reducto internal job queue length is a good indicator of overall worker health. And 5xx metric from Reducto ingress is a good indicator of API health.

PrometheusRule in manifests/prometheus/rules/01-reducto.yaml monitors internal queue length and 5xx metrics. When queue doesn't go down for a long duration OR API returns 5xx status for a long duration, alerts are sent to configured Slack channel.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
manifests/prometheus/rules		manifests/prometheus/rules
values		values
.gitignore		.gitignore
.terraform.lock.hcl		.terraform.lock.hcl
README.md		README.md
aws-load-balancer-controller.tf		aws-load-balancer-controller.tf
backend.tf		backend.tf
cert-manager.tf		cert-manager.tf
eks.tf		eks.tf
ingress-nginx-controller.tf		ingress-nginx-controller.tf
karpenter.tf		karpenter.tf
keda.tf		keda.tf
main.tf		main.tf
monitoring.tf		monitoring.tf
outputs.tf		outputs.tf
reducto-architecture-large.png		reducto-architecture-large.png
reducto-bucket.tf		reducto-bucket.tf
reducto-db.tf		reducto-db.tf
reducto-helm-release.tf		reducto-helm-release.tf
reducto-iam.tf		reducto-iam.tf
telegraf.tf		telegraf.tf
variables.tf		variables.tf
vpc.tf		vpc.tf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reducto

Overview

Helm Chart

Security

Terraform State

Configuration

Provisioning

Configure Cloudflare DNS

Access Reducto

New AWS account

Notes on Destroy

Notes on NLB for Nginx

Monitoring

About

Releases

Packages

Languages

reductoai/reducto-onprem-infra

Folders and files

Latest commit

History

Repository files navigation

Reducto

Overview

Helm Chart

Security

Terraform State

Configuration

Provisioning

Configure Cloudflare DNS

Access Reducto

New AWS account

Notes on Destroy

Notes on NLB for Nginx

Monitoring

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages