GoogleCloudAiplatformV1FeaturestoreOnlineServingConfigScaling
import type { GoogleCloudAiplatformV1FeaturestoreOnlineServingConfigScaling } from "https://googleapis.deno.dev/v1/aiplatform:v1.ts";
Online serving scaling configuration. If min_node_count and max_node_count are set to the same value, the cluster is configured with a fixed number of nodes (no autoscaling).
interface GoogleCloudAiplatformV1FeaturestoreOnlineServingConfigScaling {
cpuUtilizationTarget?: number;
maxNodeCount?: number;
minNodeCount?: number;
}

Properties
cpuUtilizationTarget?: number
Optional. The CPU utilization that the Autoscaler should try to achieve. This number is on a scale from 0 (no utilization) to 100 (total utilization), and is limited to values between 10 and 80. When a cluster's CPU utilization exceeds the target that you have set, Bigtable immediately adds nodes to the cluster. When CPU utilization is substantially lower than the target, Bigtable removes nodes. If not set or set to 0, it defaults to 50.
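
The rules described above (default of 50, allowed range of 10 to 80, and fixed sizing when both node counts match) can be sketched as a small client-side check. This is only an illustration: `ScalingConfig` is a local mirror of the interface so the snippet runs without a network import, and `effectiveCpuTarget` and `isFixedSize` are hypothetical helpers that are not part of the library. The sketch assumes the 10 to 80 bounds are inclusive, and it throws on out-of-range values, which may not match how the service itself handles them.

```typescript
// Local mirror of the interface documented above (assumption: same three optional fields).
interface ScalingConfig {
  cpuUtilizationTarget?: number;
  maxNodeCount?: number;
  minNodeCount?: number;
}

const DEFAULT_CPU_TARGET = 50; // used when the field is unset or 0
const MIN_CPU_TARGET = 10; // assumed inclusive lower bound
const MAX_CPU_TARGET = 80; // assumed inclusive upper bound

// Hypothetical helper: resolves the target the Autoscaler would aim for.
function effectiveCpuTarget(scaling: ScalingConfig): number {
  const target = scaling.cpuUtilizationTarget;
  if (target === undefined || target === 0) {
    return DEFAULT_CPU_TARGET;
  }
  if (!Number.isFinite(target) || target < MIN_CPU_TARGET || target > MAX_CPU_TARGET) {
    throw new RangeError(
      `cpuUtilizationTarget must be between ${MIN_CPU_TARGET} and ${MAX_CPU_TARGET}, got ${target}`,
    );
  }
  return target;
}

// Hypothetical helper: true when min and max node counts are both set and equal,
// which the description above treats as a fixed-size cluster (no autoscaling).
function isFixedSize(scaling: ScalingConfig): boolean {
  return scaling.minNodeCount !== undefined &&
    scaling.minNodeCount === scaling.maxNodeCount;
}

// Example configurations (values chosen for illustration only).
const autoscaled: ScalingConfig = { minNodeCount: 1, maxNodeCount: 5, cpuUtilizationTarget: 60 };
const fixed: ScalingConfig = { minNodeCount: 3, maxNodeCount: 3 };
```

With these definitions, `effectiveCpuTarget(fixed)` falls back to the default because the field is unset, and `isFixedSize(fixed)` reports a fixed-size cluster because both node counts are 3.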