Skip to main content

5 docs tagged with "compute-pool"

View all tags

Create and Manage Compute Pools

This guide describes how to create and manage Compute Pools. Compute Pools are Kubernetes clusters where AI and ML applications run. You can create Dedicated clusters or Shared clusters. Refer to the Compute Pool concepts page to understand the different variants and their use cases.

Create and Manage Scaling Policies

This guide describes how to create and manage ScalingPolicy resources using YAML manifests and kubectl. A Scaling Policy defines autoscaling behavior for a Compute Pool by configuring CPU and GPU utilization thresholds, scaling durations, resource bounds, and cooldown periods.

Deploy a Single-Node Compute Pool

This guide describes how to deploy a single-node Compute Pool. A single-node Compute Pool uses one Edge host for both the Kubernetes control plane and workload scheduling. To achieve this, you configure a Compute Config that marks the control plane node as worker-eligible, then create a Compute Pool that references it. You can deploy a single-node Compute Pool using the PaletteAI UI or YAML Kubernetes manifests.