CACOP

SOLOONGOING

Chaos engineering cost analysis for Kubernetes

0 stars·0 forks·Updated 15 days ago

Overview

Most teams know that downtime is expensive, but nobody can tell you exactly how much a pod crash costs in CPU waste, memory spikes, and recovery time. CACOP puts a number on it.

The system injects controlled failures into a local Kubernetes cluster using Chaos Mesh - pod kills, network partitions, CPU stress tests. While the chaos runs, a FastAPI backend collects metrics from Prometheus, correlates resource spikes with the injected failure, and converts the inefficiency into simulated dollar costs using configurable cloud pricing models.

The PLG stack (Prometheus, Loki, Grafana) watches everything in real-time, and the dashboard lets you compare costs across different failure scenarios. It turns chaos engineering from "did the system recover?" into "how much did that recovery cost?"

Key Features

01

Controlled chaos injection via Chaos Mesh

02

Real-time cost simulation with configurable cloud pricing

03

Prometheus metrics correlation with failure events

04

PLG stack (Prometheus, Loki, Grafana) observability

05

Comparative cost analysis across failure scenarios

06

Local Kubernetes cluster via Minikube - no cloud bill surprises

Built With

Kubernetes
Minikube
Chaos Mesh
FastAPIFastAPI
PrometheusPrometheus
LokiLoki
GrafanaGrafana

Project Info

StatusOngoing
RoleSolo Developer
TypeInfrastructure