Automated Workload Rightsizing & PrecisionPack for Kubernetes

Kubernetes resource management and scheduling are no walk in the park. Accurately forecasting your workload’s needs is complex, leading many teams to under and over-provisioning, while the scheduler’s decisions can incur unnecessary expenses. CAST AI’s new features – Workload Rightsizing & PrecisionPack – solve these challenges with automation and innovative algorithms.

From day one, CAST AI has been on a mission to deliver a fully automated Kubernetes experience. We’re building a platform where all elements – from cost reporting to instance management – work in unison to cut costs, improve performance, and boost DevOps productivity.

The two new platform capabilities were unveiled today at KubeCon Chicago 2023, right after announcing that we closed a $35 million Series B round. This 1-2 punch underscores how new funding is accelerating CAST AI’s innovation and getting us closer to automating your Kubernetes.

Why is workload rightsizing key for cutting cloud costs?

Workload rightsizing refers to setting your workloads to request the right amount of resources to run smoothly.

In K8s, workloads are rightsized using requests and limits for CPU and memory. This step helps you avoid issues like Pod eviction, CPU starvation, or running OOM. But workload rightsizing is also essential for reducing the cost of running K8s applications.

By limiting your resource usage, you can stop overprovisioning and cut unnecessary expenses.

While CAST AI was able to provide exact workload rightsizing recommendations before, the platform still required engineers to set them manually – until now.

Why automate workload rightsizing?

Automating workload rightsizing is a fast track to optimized resource allocation, cost-efficiency, as well as improved performance and scalability.

By reducing manual tasks, you can save time and effort while ensuring precise adjustments to align your setup with your workload’s actual needs. Moreover, automation helps you avoid human error that could compromise your security and compliance.

CAST AI’s new Workload Rightsizing capability automatically scales your workload requests up and down, ensuring optimal performance and saving you money. It also adds extra overhead to remedy instability if the platform detects any Out-of-Memory container status.

As a result, you can enjoy better performance at a much lower cost – and without adding extra tasks for your engineers.

How to use CAST AI’s automated Workload Rightsizing

By default, the platform generates configurable recommendations for each of your workloads every 30 minutes.

You can find them in the new section that now appears in the main product menu:

You can specify additional overhead for CPU and RAM, adjust their percentile values, and set a threshold for automatically applying the recommendations you get. For example, your recommendations can be applied to the workload only after exceeding certain thresholds.

CAST AI’s team plans to expand automated workload rightsizing further. In the future, the platform will introduce seasonality models for resources to better anticipate hourly, daily, weekly and monthly cycles to improve response time and availability.

Stay tuned, as many exciting developments are coming your way!

PrecisionPack for more efficient K8s scheduling

Alongside the Workload Rightsizing functionality, CAST AI has also introduced PrecisionPack.

This new approach to Kubernetes scheduling focuses on eliminating random pod placement decisions.

Powered by an advanced bin-packing algorithm, PrecisionPack ensures strategic pod positioning onto designated nodes to maximize resource utilization while boosting efficiency and predictability across clusters.

Moreover, the new feature helps to reduce workload movement to improve both uptime and reliability of workloads and optimize costs along the way.

See automated Workload Rightsizing and PrecisionPack in action

We are now rolling out the new features for all CAST AI customers. So, if you already use our cloud cost optimization, head to your console and see automated Workload Rightsizing and PrecisionPack in action.

If you are new to CAST AI, book a short technical demo with our engineers to discover what these new features can do for you.

This will be the best-spent 30 minutes this week that will provide you with actionable cost optimization ideas for your K8s cluster!

CAST AI clients save an average of 63%
on their Kubernetes bills

Book a call to see if you too can get low & predictable cloud bills.

Book a demo

Automated Workload Rightsizing & PrecisionPack: Accelerating Towards Fully Automated Kubernetes

Why is workload rightsizing key for cutting cloud costs?

Why automate workload rightsizing?

How to use CAST AI’s automated Workload Rightsizing

PrecisionPack for more efficient K8s scheduling

See automated Workload Rightsizing and PrecisionPack in action

CAST AI clients save an average of 63%
on their Kubernetes bills

Leave a reply

Recent posts

How Automation Reduces Large Language Model Costs

Spot Instance Availability Demystified: AWS, Azure, and GCP

Only 13% of Provisioned CPUs End Up Being Used

Platform

Providers

Available on

Industries

Company

Resources

Automated Workload Rightsizing & PrecisionPack: Accelerating Towards Fully Automated Kubernetes

Why is workload rightsizing key for cutting cloud costs?

Why automate workload rightsizing?

How to use CAST AI’s automated Workload Rightsizing

PrecisionPack for more efficient K8s scheduling

See automated Workload Rightsizing and PrecisionPack in action

CAST AI clients save an average of 63%on their Kubernetes bills

Leave a reply

Recent posts

How Automation Reduces Large Language Model Costs

Spot Instance Availability Demystified: AWS, Azure, and GCP

Only 13% of Provisioned CPUs End Up Being Used

Platform

Providers

Available on

Industries

Company

Resources

CAST AI clients save an average of 63%
on their Kubernetes bills