ComputePool Configuration Reference
This page provides technical reference information for configuring ComputePools.
This page provides technical reference information for configuring ComputePools.
This guide describes how to create and manage Compute Pools. Compute Pools are Kubernetes clusters where AI and ML applications run. You can create Dedicated clusters or Shared clusters. Refer to the Compute Pool concepts page to understand the different variants and their use cases.
This guide describes how to create and manage ScalingPolicy resources using YAML manifests and kubectl. A Scaling Policy defines autoscaling behavior for a Compute Pool by configuring CPU and GPU utilization thresholds, scaling durations, resource bounds, and cooldown periods.
This guide describes how to deploy a single-node Compute Pool. A single-node Compute Pool uses one Edge host for both the Kubernetes control plane and workload scheduling. To achieve this, you configure a Compute Config that marks the control plane node as worker-eligible, then create a Compute Pool that references it. You can deploy a single-node Compute Pool using the PaletteAI UI or YAML Kubernetes manifests.
This page provides troubleshooting guidance for common Compute Pool issues.