The Cast AI blog
Guides, tutorials, and tips on Kubernetes automation, from cost optimization to cloud security and everything in between.
Karpenter Cost Optimization: Consolidation Benchmark Results (7-Day Run)
Explore four approaches to Karpenter cost optimization in this benchmarking study showcasing the impact of Cast AI’s automation.

Tokens Are the New Cloud Bill
At FinOps X 2026, a major shift became clear: teams are now spending massively on…

OpsPilot Now Writes Your Workload Scaling Policies. You Just Set the Intent.
OpsPilot, Cast AI’s AI agent for DevOps and SREs, can now automatically generate workload scaling…

What Is Tokenomics, And Why Your AI Infrastructure Is Now a FinOps Problem
I was in the room when tokenomics became official. Here is what it means for…

The Hackathon Fix That Cut Our Storage Costs by 93%
For the second year running, Cast AI hosted an internal Hackathon during our Vilnius team…

The Karpenter Enterprise Suite is GA: Bring Karpenter to the next level
The Karpenter Enterprise Suite is now generally available. It gives platform teams the visibility, optimization,…

2026 State of Kubernetes Resource Optimization: CPU at 8%, Memory at 20%, and Getting Worse
This is the third year we’ve published our report on the real CPU and memory…

GPU Sharing in Kubernetes: How to Cut Costs and Boost GPU Utilization with Cast AI
Running AI and ML workloads on Kubernetes often leads to underutilized, expensive GPUs. This blog…

Why Cast AI Is Best for Running AI/LLM Workloads in Kubernetes
AI and LLM workloads demand powerful infrastructure. Cast AI automates GPU autoscaling, sharing, and cost…

Deploying GPU workload with Dynamic Resource Allocation
Kubernetes DRA replaces legacy GPU counts with structured, attribute-based requirements. This post demonstrates how to…
