Skip to main content

DataOps Kubernetes Sizing Recommendations

Kubernetes sizing guidance

When running jobs, the underlying runner requirements will vary based on what the jobs in your pipelines are doing.

We expect a typical pipeline workload to be more CPU intensive than memory. The exception will be jobs where you are processing large amounts of data as part of the job instead of orchestrating data processing with a 3rd party service. The Data Prep Orchestrator is one example of a memory-bound orchestrator.

Our recommendations for the physical infrastructure still apply, yet for cloud-specific recommendations, see below.

AWS recommendation

On AWS, for a managed cluster, we recommend an instance type of t3a.xlarge or better for cluster nodes. We recommend at least 4 Cores and 16GB of memory.

Azure recommendation

On Azure Kubernetes Service, we recommend a VM size of Standard_D4s_v3.