The Cast AI blog
Guides, tutorials, and tips on Kubernetes automation, from cost optimization to cloud security and everything in between.
OpsPilot Now Writes Your Workload Scaling Policies. You Just Set the Intent.
OpsPilot, Cast AI’s AI agent for DevOps and SREs, can now automatically generate workload scaling policies, closing the environment policy gap by enforcing the reliability standards…
What Is Tokenomics, And Why Your AI Infrastructure Is Now a FinOps Problem
I was in the room when tokenomics became official. Here is what it means for…

The Hackathon Fix That Cut Our Storage Costs by 93%
For the second year running, Cast AI hosted an internal Hackathon during our Vilnius team…

Cast AI for Karpenter is GA: Bring Karpenter to the next level
Cast AI for Karpenter is now generally available. It gives platform teams the visibility, optimization,…

2026 State of Kubernetes Resource Optimization: CPU at 8%, Memory at 20%, and Getting Worse
This is the third year we’ve published our report on the real CPU and memory…

GPU Sharing in Kubernetes: How to Cut Costs and Boost GPU Utilization with Cast AI
Running AI and ML workloads on Kubernetes often leads to underutilized, expensive GPUs. This blog…

āāWhy Cast AI Is Best for Running AI/LLM Workloads in Kubernetes
AI and LLM workloads demand powerful infrastructure. Cast AI automates GPU autoscaling, sharing, and cost…

Deploying GPU workload with Dynamic Resource Allocation
Kubernetes DRA replaces legacy GPU counts with structured, attribute-based requirements. This post demonstrates how to…

Why We Call It Application Performance Automation (APA): Beyond Cost and Observability. Focused on Performance.
Discover how Cast AI is redefining cloud automation with Application Performance Automationāgoing beyond cost and…

Is There a Karpenter Equivalent on GKE?
Why GKE still lacks a Karpenter equivalent and what that means for Kubernetes teams. Learn…
