GoogleCloudAiplatformV1ResourcePoolAutoscalingSpec
import type { GoogleCloudAiplatformV1ResourcePoolAutoscalingSpec } from "https://googleapis.deno.dev/v1/aiplatform:v1.ts";
The minimum and maximum number of replicas allowed when autoscaling is enabled.
interface GoogleCloudAiplatformV1ResourcePoolAutoscalingSpec {
maxReplicaCount?: bigint;
minReplicaCount?: bigint;
}
Properties
maxReplicaCount?: bigint
Optional. Maximum number of replicas in the node pool. Must be ≥ replica_count and > min_replica_count; otherwise an error is thrown.
minReplicaCount?: bigint
Optional. Minimum number of replicas in the node pool. Must be ≤ replica_count and < max_replica_count; otherwise an error is thrown. For Ray on Vertex with autoscaling enabled, the min_replica_count of a resource_pool may be 0, matching OSS Ray behavior (https://docs.ray.io/en/latest/cluster/vms/user-guides/configuring-autoscaling.html#cluster-config-parameters). For Persistent Resource, min_replica_count must be > 0; a corresponding validation exists in CreatePersistentResourceRequestValidator.java.
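The constraints described for the two properties can be checked on the client before a request is sent. The following is a minimal sketch, not part of the generated client: the local interface mirrors the one documented here so the snippet runs without a network import, and the helper name `checkAutoscalingSpec`, its `replicaCount` argument, and the `allowZeroMin` flag are hypothetical names introduced for illustration. The server remains the authority on what is accepted.

```typescript
// Local mirror of the documented interface (assumption: kept local so the
// sketch is self-contained; the real type comes from the aiplatform module).
interface ResourcePoolAutoscalingSpec {
  maxReplicaCount?: bigint;
  minReplicaCount?: bigint;
}

// Hypothetical helper: returns the list of violated constraints (empty if none).
// `replicaCount` is the pool's replica_count. `allowZeroMin` is an assumed flag:
// true for autoscaling-enabled Ray on Vertex, false for Persistent Resource.
function checkAutoscalingSpec(
  spec: ResourcePoolAutoscalingSpec,
  replicaCount: bigint,
  allowZeroMin: boolean,
): string[] {
  const problems: string[] = [];
  const min = spec.minReplicaCount;
  const max = spec.maxReplicaCount;

  if (min !== undefined) {
    // min_replica_count must be <= replica_count.
    if (min > replicaCount) {
      problems.push("minReplicaCount must be <= replicaCount");
    }
    // Persistent Resource requires min_replica_count > 0.
    if (!allowZeroMin && min <= 0n) {
      problems.push("minReplicaCount must be > 0");
    }
  }

  if (max !== undefined) {
    // max_replica_count must be >= replica_count.
    if (max < replicaCount) {
      problems.push("maxReplicaCount must be >= replicaCount");
    }
  }

  // min_replica_count must be strictly less than max_replica_count.
  if (min !== undefined && max !== undefined && min >= max) {
    problems.push("minReplicaCount must be < maxReplicaCount");
  }

  return problems;
}
```

Called with `{ minReplicaCount: 0n, maxReplicaCount: 3n }` and a replica count of `1n`, the helper reports no problems when `allowZeroMin` is true and one problem when it is false.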