Workload Type
Deployment
Stateless applications (web servers, APIs)
StatefulSet
Stateful applications (databases, caches)
CronJob
Scheduled batch tasks
The Kubernetes HPA (Horizontal Pod Autoscaler) Generator mathematically defines scaling boundaries. It sets CPU utilization targets and scales replica counts dynamically.
Set Replica Bounds - Define the absolute minimum and maximum pod counts.
Define Targets - Set the target CPU or Memory utilization percentage (e.g., scale at 80% CPU).
Target Workload - Bind the HPA to a specific Deployment or StatefulSet.
Generate YAML - Export the autoscaling/v2 manifest.
Elastic scaling requires precise CPU utilization thresholds and defined replica boundaries.
autoscaling/v1
autoscaling/v2
CPU only
Mixed CPU and custom metrics
No maximum limit
Strict maxReplicas constraint
Here is a real generated snippet matching the production best practices above:
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
name: api-hpaScroll back up to the generator and export your production-ready configuration in seconds.
Start Building