Create and Manage Tenant Settings

Use Tenant Settings to manage Tenant-scoped configuration in the PaletteAI UI, including display information, integrations, OpenID Connect (OIDC) group access, per-Tenant and per-Project GPU limits, Tenant compute inventory, Compute Configs, Scaling Policies, and Model as a Service Mappings. In the left navigation, tabs are grouped under General, Access Control, Infrastructure, and Model Management. The Settings Ref, User Access, and Resource Limits tabs each include Tenant and Projects sub-tabs so you can manage Tenant-wide configuration and review Project-specific values from one page.

For background on Tenant and Project organization, refer to Tenants and Projects.

Prerequisites

At least one Project exists in the Tenant, and you can access that Project in the PaletteAI UI. Tenant visibility in the UI depends on Project access.
To update Tenant-wide settings, you must have the Tenant admin role. For details, refer to Tenant Role Permissions.
To edit entries in the Projects sub-tabs, you must have the required Project permissions. For details, refer to Project Role Permissions.

Configure Tenant Settings

Log in to PaletteAI and select the Tenant you want to manage.
From the Projects Menu in the top-left corner, select All Projects.
From the left main menu, select Tenant Settings.

Manage Basic Information

Use the Basic Information tab to update Tenant display values that appear elsewhere in the PaletteAI UI.

Update Tenant Name or Description. To add or remove Annotations and Labels, expand Metadata.
Select Save changes.

Create Tenant Settings Ref

A Tenant Settings Ref holds the integrations shared across the Projects in your Tenant. Each Tenant has at most one Settings Ref. Create the Settings Ref once, then add or update integrations as your environment grows.

For background on how Settings inheritance works, refer to Settings and Integrations.

Enablement

When no Tenant Settings Ref exists, the Tenant sub-tab shows an empty state titled Tenant-level settings are not available.

On the Settings Ref tab, on the Tenant sub-tab, select Create Tenant-level Settings.
Enter a Settings Name.

The name must be 63 characters or fewer and contain only lowercase alphanumeric characters or hyphens, beginning and ending with an alphanumeric character. The name cannot be changed after creation.
Add at least one Palette, Hugging Face, or NVIDIA Inference Microservices (NIMs) integration. The create dialog requires at least one integration before you can save. For field-level guidance per integration type, refer to Add an Integration.
Select Confirm.

After the Tenant Settings Ref is created, the Tenant sub-tab shows the Settings Ref header and the Integrations table. Continue with Manage Tenant Settings Ref to add more integrations or update existing ones.
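
The Settings Name rule described in the steps above can be checked before you open the dialog. The following is a minimal sketch in Python, not PaletteAI's own validation. The function name is hypothetical, and the sketch assumes that an empty name is rejected, which the rule implies but does not state.

```python
import re

# Pattern for the documented rule: 1 to 63 characters, lowercase alphanumerics
# or hyphens, beginning and ending with an alphanumeric character.
_SETTINGS_NAME = re.compile(r"[a-z0-9]([a-z0-9-]{0,61}[a-z0-9])?")


def is_valid_settings_name(name: str) -> bool:
    """Return True if name satisfies the documented Settings Name rule."""
    # fullmatch requires the whole string to match, so stray characters
    # (including a trailing newline) cause a rejection.
    return _SETTINGS_NAME.fullmatch(name) is not None


if __name__ == "__main__":
    for candidate in ("acme-settings", "Acme", "-acme", "acme-"):
        print(candidate, is_valid_settings_name(candidate))
```

Because the name cannot be changed after creation, a quick local check like this can help you settle on a valid name first.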

Manage Tenant Settings Ref

Add or update the integrations in the Tenant Settings Ref, and review the integrations configured in each Project. The Settings Ref tab opens to the Tenant sub-tab by default. The page header shows Settings Ref followed by the name you assigned, with a Ready or Not Ready status badge. Not Ready indicates that one or more integrations do not have a valid secret.

Add an Integration

Each integration represents a connection to an external system, such as Palette, Hugging Face, or NVIDIA NIMs. Each integration type can be added once per Tenant Settings Ref. Hugging Face and NVIDIA NIMs integrations can be shared with specific Projects in the Tenant and optionally locked so Projects cannot override them.

On the Tenant sub-tab, select Add Integration to open the Add Integration drawer.
Choose an Integration Type — Palette, Hugging Face, or NVIDIA NIMs. The form fields update to match the selected type.
Complete the fields for the chosen integration type:
- Palette integration — used to provision compute resources from your Palette tenant. Provide:
  - Integration Name — display name, for example, Palette-Acme.
  - Host URL — the Palette host URL, for example, https://console.spectrocloud.com.
  - Tenant — the Palette tenant slug, for example, acme.
  - Project ID — the default Palette project UID that PaletteAI uses when provisioning clusters.
  - (Optional) Toggle Skip SSL Certificate Verification on to bypass certificate validation when connecting to the Palette server.
  - API Key — the Palette API key. Select Validate to verify the credentials against the Palette server.
  - (Optional) Toggle Cert Secret Ref on to reference a Kubernetes Secret containing a custom certificate authority bundle for the Palette server.
- Hugging Face integration — enables Hugging Face model management features at the Project level. Provide:
  - Integration Name — display name, for example, HuggingFace-Acme.
  - API Key — your Hugging Face Hub API key, which begins with hf_. Select Validate to verify the key against the Hugging Face API.
- NVIDIA NIMs integration — enables NVIDIA NGC model management features at the Project level. Provide:
  - Integration Name — display name, for example, NVIDIA-Acme.
  - API Key — your NVIDIA NGC API key, which begins with nvapi-. Select Validate to check the key format. The Validate action does not yet contact NVIDIA NGC.
(Optional) Expand Metadata to add Kubernetes labels and annotations to the integration.
(Hugging Face and NVIDIA NIMs only) Configure shared access for the integration:
- Under Projects, choose which Projects in the Tenant can use this integration.
- To prevent Projects from configuring their own version of the integration, enable Enforce tenant integration across all projects. Locked integrations cannot be replaced by a Project-level configuration.
When you enforce the integration and one or more selected Projects already have the same integration type configured, PaletteAI shows a warning. Select Go to impacted projects in the warning to switch to the Projects sub-tab filtered to those Projects.
Select Confirm to add the integration.
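
The API key prefixes listed in the steps above (hf_ for Hugging Face and nvapi- for NVIDIA NGC) lend themselves to a quick local format check before you paste a key into the drawer. The sketch below is illustrative only. The dictionary keys and the function name are hypothetical labels, not PaletteAI identifiers, and the check does not contact any external service or replace the Validate action.

```python
# Prefixes taken from the field descriptions above. The dictionary keys are
# hypothetical labels chosen for this sketch.
EXPECTED_PREFIX = {
    "huggingface": "hf_",
    "nvidia-nims": "nvapi-",
}


def has_expected_prefix(integration_type: str, api_key: str) -> bool:
    """Return True if api_key starts with the documented prefix and has content after it."""
    prefix = EXPECTED_PREFIX.get(integration_type)
    if prefix is None:
        # Palette API keys have no documented prefix, so they are not covered here.
        raise ValueError(f"no documented prefix for {integration_type!r}")
    key = api_key.strip()  # tolerate whitespace picked up when copying the key
    return key.startswith(prefix) and len(key) > len(prefix)
```

A passing check only means the key looks well-formed. It does not confirm that the key is active or has the required permissions.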

Edit or Delete an Integration

Use the action menu on each row of the Integrations table to edit or delete an integration.

Edit & Share opens the integration drawer so you can update fields, change which Projects the integration is shared with, and toggle the lock setting. To replace the API key for an existing integration, clear the masked value and re-enter the key, and then select Validate before saving.
Delete removes the integration. When the integration is used by active model deployments, PaletteAI requires you to confirm by typing the integration name. Affected deployments are marked out of policy until they are updated or deleted.

Palette integrations cannot be deleted from the Tenant Settings Ref. To rotate Palette credentials, use Edit & Share to update the existing entry.

Edit Basic Information

To update the Tenant Settings Ref's labels and annotations, select the Settings dropdown next to the Settings Ref name, and then select Edit basic information. The Settings Ref name itself cannot be changed after creation.

Review Project Integrations

Use the Projects sub-tab to review the integrations configured in each Project, including Tenant integrations that have been shared into a Project.

Select the Projects sub-tab.
Search or filter to locate the Project. To open or close every Project row at once, select Expand all or Collapse all.
Expand a Project to review its integrations table. Each row shows the Integration Name, Type, Last modified timestamp, Scope badge (Tenant for an integration shared from the Tenant Settings Ref, Project for an integration owned by the Project), and Shared with Projects.
(Optional) Select Edit in the action menu on a Project-scoped row to update a Project integration. Tenant-scoped rows are read-only here; manage them from the Tenant sub-tab.

Manage User Access

On the User Access tab, control which OIDC groups administer the Tenant and which OIDC groups map to Project Viewer, Editor, and Admin roles.

On the Tenant sub-tab, add or remove OIDC groups in Tenant Admin Groups, and then select Save changes.

Blank entries are ignored. Configure at least one OIDC group before you save changes.
Select the Projects sub-tab to review each Project's OIDC Viewer, Editor, and Admin group assignments.
Search or filter to locate the Project, and then select Edit to update that Project's OIDC group assignments.

The edit drawer updates the Viewer Groups, Editor Groups, and Admin Groups fields for the selected Project.
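
The Tenant Admin Groups rule above (blank entries are ignored, and at least one group is required) can be sketched as a small helper. This is an illustration under stated assumptions rather than PaletteAI's implementation: the function name is hypothetical, and treating whitespace-only entries as blank is an assumption.

```python
def clean_admin_groups(entries: list[str]) -> list[str]:
    """Drop blank entries and require at least one remaining OIDC group."""
    # Assumption: an entry containing only whitespace counts as blank.
    groups = [entry.strip() for entry in entries if entry.strip()]
    if not groups:
        raise ValueError("configure at least one OIDC group before saving")
    return groups
```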

Manage Resource Limits

On the Resource Limits tab, set GPU limits and requests for the Tenant, and review or adjust per-Project GPU quotas.

On the Tenant sub-tab, update GPU Limits and GPU Requests, and then select Save changes.

Each entry must specify a unique GPU family and a limit value. If a warning indicates that Projects exceed the Tenant GPU limits, increase the Tenant limits or reduce Project limits.
Select the Projects sub-tab to review GPU limits and requests across Projects.
Search or filter to locate the Project, and then select Edit to adjust GPU limits and requests.
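
The two checks described above, unique GPU families per entry and Project limits that stay within the Tenant limits, can be sketched as follows. This is a hedged illustration: the function names are hypothetical, and the sketch assumes that Project limits are summed per GPU family before being compared with the Tenant limit. The page does not specify how PaletteAI performs the comparison.

```python
from collections import Counter


def duplicate_families(entries: list[tuple[str, int]]) -> list[str]:
    """Return GPU families that appear more than once in (family, limit) entries."""
    counts = Counter(family for family, _ in entries)
    return sorted(family for family, n in counts.items() if n > 1)


def families_over_tenant_limit(
    tenant_limits: dict[str, int],
    project_limits: dict[str, dict[str, int]],
) -> dict[str, int]:
    """Return {family: excess} where summed Project limits exceed the Tenant limit.

    Assumptions: Project limits are summed per family, and a family without a
    Tenant entry is treated as having a Tenant limit of 0.
    """
    totals = Counter()
    for limits in project_limits.values():
        totals.update(limits)  # adds each Project's per-family limit to the total
    return {
        family: total - tenant_limits.get(family, 0)
        for family, total in sorted(totals.items())
        if total > tenant_limits.get(family, 0)
    }
```

A non-empty result from the second function corresponds to the situation the warning describes, which you resolve by raising the Tenant limits or lowering Project limits.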

Manage Compute

On the Compute tab, review the worker and control plane nodes that the Tenant's Projects have discovered through their Palette integrations. The tab opens to the Tenant sub-tab by default and is read-only.

On the Tenant sub-tab, review every node across the Tenant.

The table includes Node Name, Status (Free or In use), Type (Worker or Control Plane), Architecture, GPU family, GPU count, GPU memory, CPU, and Memory.
Filter the list with Filter by name, or with the Status, Compute Pool, and GPU family selectors.
Select the Projects sub-tab to filter the same nodes by their owning Project.

Create Compute Config

Create a Tenant-scoped Compute Config that Projects can adopt as a shared infrastructure baseline. Each Compute Config captures cluster defaults such as Secure Shell (SSH) keys, Network Time Protocol (NTP) servers, and the network overlay.

Enablement

On the Compute Config tab, on the Tenant sub-tab, select Create Compute Config to open the Create tenant compute config wizard.
Complete the wizard steps:
- General Information — set the Compute Config Name and Deletion Policy, optionally add SSH Keys, and pick the Projects that can use this Compute Config under Shared access.
- Edge Configuration — configure NTP servers and the network overlay.
- Control plane defaults — set the node count, architecture, CPU, memory, labels, annotations, taints, single-node cluster mode, and worker-node eligibility.
- Worker Pool Defaults — define one or more worker pools with their node defaults.
- Review — confirm the configuration and select Create.
When the Tenant has at least one Project, you must share the Compute Config with at least one Project under Shared access.

Manage Compute Configs

Review and adjust the Tenant Compute Configs and inspect Project-scoped Compute Configs. The Compute Config tab opens to the Tenant sub-tab by default.

On the Tenant sub-tab, review the Tenant Compute Configs.

The table includes the Compute Config Name, the virtual IP (VIP), Control plane nodes, Deployment Type, Shared with, and an action menu.
Edit, clone, or delete a Compute Config from the action menu on each row.

Cloning populates the wizard from the source. Deletion is blocked while a Compute Config is in use by a Project.
Select the Projects sub-tab to inspect Compute Configs scoped to each Project.

Search by name or filter by Project, then expand a Project row to review its Project-scoped Compute Configs together with any Tenant Compute Configs shared into the Project. The Scope column distinguishes Tenant from Project sources.

Create Scaling Policy

Create a Tenant Scaling Policy that Projects can reference from their Compute Pools to autoscale workloads.

For background on autoscaling and the YAML reference, refer to Create and Manage Scaling Policies.

Enablement

On the Autoscaling Policies tab, on the Tenant sub-tab, select Create Autoscaling Policy at the top of the table to open the Create Auto-Scaling Policy dialog.
Provide the following:
- Scaling Policy Name — a unique Kubernetes-compliant name.
- Metrics — add one or more CPU or GPU metrics with scale-up and scale-down thresholds and durations. At least one CPU or GPU metric is required.
- Resource Bounds — optional minimum and maximum CPU counts, plus minimum and maximum GPU counts per family. Each GPU family in Resource Bounds must have a matching entry in Metrics.
- Cool Down Duration — minimum wait time between scaling actions. Defaults to 15 minutes.
- Timeout Duration — maximum duration for an ongoing scaling operation. Defaults to 30 minutes.
- Shared access — under Projects, choose which Projects can use this policy.
Select Create to save the policy.
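
The dialog rules above can be summarized in a short validation sketch. The class and field names below are hypothetical and only mirror the dialog labels. They are not the Scaling Policy schema, which is covered in Create and Manage Scaling Policies. The sketch encodes three facts from this page: at least one CPU or GPU metric is required, each GPU family in Resource Bounds needs a matching metric, and the cool down and timeout default to 15 and 30 minutes.

```python
from dataclasses import dataclass, field
from datetime import timedelta


@dataclass
class ScalingPolicyDraft:
    # Hypothetical field names that mirror the dialog labels.
    name: str
    has_cpu_metric: bool = False
    gpu_metric_families: set[str] = field(default_factory=set)
    gpu_bound_families: set[str] = field(default_factory=set)
    cool_down: timedelta = timedelta(minutes=15)  # documented default
    timeout: timedelta = timedelta(minutes=30)    # documented default


def validate(policy: ScalingPolicyDraft) -> list[str]:
    """Return a list of rule violations; an empty list means none were found."""
    problems = []
    if not policy.has_cpu_metric and not policy.gpu_metric_families:
        problems.append("at least one CPU or GPU metric is required")
    # Every GPU family with Resource Bounds must also appear in Metrics.
    unmatched = sorted(policy.gpu_bound_families - policy.gpu_metric_families)
    if unmatched:
        problems.append("Resource Bounds families without a metric: " + ", ".join(unmatched))
    return problems
```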

Manage Scaling Policies

Review the Tenant Scaling Policies, inspect their adoption across Compute Pools, and edit, clone, or delete existing policies. The Autoscaling Policies tab opens to the Tenant sub-tab by default.

On the Tenant sub-tab, review the Tenant Scaling Policies.

The table includes Scaling Policy Name, Shared with, Metrics, In use Compute Pools, Cool Down, Timeout, Scope, and an action menu. System policies appear with a system Scope badge and support only Clone.
Select the In use Compute Pools count to open a drawer that lists the Compute Pools referencing the policy along with each pool's owning Project.
Edit, clone, or delete a policy from the action menu on each row.

Editing modifies the policy in place. Cloning opens the dialog populated from the source policy. Deletion is disabled while the policy is referenced by any Compute Pool.
Select the Projects sub-tab to inspect Scaling Policies scoped to each Project.

Search by name or filter by Project, then expand a Project row to view its Project-scoped Scaling Policies. Edit, clone, or delete a Project policy from the action menu on each row.

Create Model as a Service Mappings

Create a mapping on the Model as a Service Mappings tab to route incoming Model Deployments to a chosen Profile Bundle based on the model's source and attributes. When a Model Deployment matches the source and filters defined for a mapping, PaletteAI deploys the model using the target Profile Bundle. The tab has Tenant and Projects sub-tabs; you create new mappings on the Tenant sub-tab, while the Projects sub-tab lets you review, edit, or delete existing Project-level mappings.

Enablement

On the Model as a Service Mappings tab, on the Tenant sub-tab, select Add Mapping to open the Add Mapping drawer. The drawer has two columns.
In the Model Source column on the left, choose Hugging Face or NVIDIA NIMs as the source type, then define the Model Match filter. Each mapping supports one filter, which is a key-value pair that a model must match for the mapping to apply.

For Hugging Face, the filter key is apps, and the value is either vllm or ollama. For NVIDIA NIMs, the filter is fixed to nims = true and cannot be edited.
In the Target Profile Bundle column on the right, pick the Profile Bundle that PaletteAI uses when a model matches the filter.

The table lists each Profile Bundle's Name, Version, and Type. Only Application and Fullstack Profile Bundles are eligible; Infrastructure bundles do not appear.
Select Confirm to save the mapping, or Cancel to discard it. Confirmed mappings are saved to the Tenant automatically.
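
Conceptually, a mapping pairs a model source and one key-value filter with a target Profile Bundle. The sketch below illustrates that matching idea in Python. It is not PaletteAI's resolution logic: all names are hypothetical, model attributes are modeled as a flat dictionary of strings, and the first matching mapping wins, which is an assumption because this page does not define precedence between mappings.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Mapping:
    # Hypothetical stand-ins for the drawer fields.
    source_type: str     # "huggingface" or "nvidia-nims"
    filter_key: str      # "apps" for Hugging Face, "nims" for NVIDIA NIMs
    filter_value: str    # "vllm" or "ollama" for apps, "true" for nims
    profile_bundle: str  # name of the target Profile Bundle


def resolve_bundle(
    mappings: list[Mapping], source_type: str, attributes: dict[str, str]
) -> Optional[str]:
    """Return the Profile Bundle of the first mapping whose source and filter match."""
    for mapping in mappings:
        same_source = mapping.source_type == source_type
        filter_matches = attributes.get(mapping.filter_key) == mapping.filter_value
        if same_source and filter_matches:
            return mapping.profile_bundle
    return None  # no mapping applies to this model
```

For example, with a mapping of Hugging Face models where apps is vllm to a bundle named vllm-bundle (an invented name), a Hugging Face model with apps set to ollama would not match and the function would return None.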

Manage Model as a Service Mappings

Review existing mappings, and edit or delete them. Changes are saved to the Tenant automatically.

On the Tenant sub-tab, review the mappings table.

The table includes Target Profile Bundle, Source Type, Model Filters, and an action menu. Filter the list by Source Type or use Filter by name to search.
Edit or delete an existing mapping from the action menu on each row.
Select the Projects sub-tab to review Project-level mappings. You can edit or delete existing Project mappings from the action menu, but you cannot create new mappings on this sub-tab.

Validate

Refresh the Tenant Settings page and confirm your saved changes persist on the applicable Tenant or Projects sub-tab.

Next Steps

Settings and Integrations to learn how Tenant-level and Project-level Settings inheritance works.
Create and Manage Projects to add Projects under the Tenant and configure Project-specific settings.
Create and Manage Tenants to review Tenant creation, namespaces, and default resources.
Troubleshooting Tenants to diagnose issues with the Tenant Settings page, Tenant admin OIDC groups, or Tenant-scoped Settings.
Compute Config to learn how Compute Configs define cluster deployment defaults.
Create and Manage Scaling Policies for the Scaling Policy YAML reference and field definitions.
