ScaleOps, which auto-manages Kubernetes compute in real time, raised $130M at an $800M valuation to fix AI infrastructure over-provisioning.
ScaleOps announced a $130M Series C led by Insight Partners at an $800M valuation. The company builds software that dynamically reallocates computing resources across Kubernetes clusters in real time, replacing static configurations. Founded in 2022 by a former Run:ai engineer, ScaleOps claims up to 80% reduction in cloud and AI infrastructure costs. The company reports 3x YoY growth and plans to more than triple headcount by year-end.
ScaleOps targets the exact pain point of DevOps and platform engineers: Kubernetes clusters with static resource configs that over-provision compute and leave GPUs idle. If your inference workloads have variable traffic patterns, you're almost certainly paying for compute you're not using. Dynamic reallocation at the cluster level — not just autoscaling at the pod level — is the architectural gap ScaleOps is filling.
Pull your last 30 days of Kubernetes resource utilization metrics this week using kubectl top nodes and compare requested vs. actual CPU/GPU usage — if the gap exceeds 40%, you have a quantifiable case to evaluate ScaleOps or similar tooling against your current setup.
Open your terminal with kubectl configured against your production cluster
Tags
Signals by role
Also today
Tools mentioned