PendingProductionVariantSummary
import type { PendingProductionVariantSummary } from "https://aws-api.deno.dev/v0.3/services/sagemaker.ts?docs=full";
The production variant summary for a deployment when an endpoint is creating or updating with the "CreateEndpoint" or "UpdateEndpoint" operations.
Describes the VariantStatus, weight and capacity for a production variant associated with an endpoint.
§Properties
The size of the Elastic Inference (EI) instance to use for the production variant. EI instances provide on-demand GPU computing for inference. For more information, see Using Elastic Inference in Amazon SageMaker.
The serverless configuration for the endpoint.
Note: Serverless Inference is in preview release for Amazon SageMaker and is subject to change. We do not recommend using this feature in production environments.
An array of DeployedImage objects that specify the Amazon EC2 Container Registry paths of the inference images deployed on instances of this ProductionVariant.
The number of instances requested in this deployment, as specified in the endpoint configuration for the endpoint. The value is taken from the request to the "CreateEndpointConfig" operation.
The serverless configuration requested for this deployment, as specified in the endpoint configuration for the endpoint.
Note: Serverless Inference is in preview release for Amazon SageMaker and is subject to change. We do not recommend using this feature in production environments.
The requested weight for the variant in this deployment, as specified in the endpoint configuration for the endpoint. The value is taken from the request to the "CreateEndpointConfig" operation.
The type of instances associated with the variant.
The endpoint variant status which describes the current deployment stage status or operational status.
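As a rough illustration of how the properties above might be read together, here is a minimal sketch that formats the status, requested weight, requested instance count and instance type of a pending variant. The field names (VariantName, DesiredWeight, DesiredInstanceCount, InstanceType, VariantStatus, Status) are assumptions based on SageMaker API naming, since this page does not list them. The PendingVariantLike interface and the describePendingVariant helper are hypothetical and declared locally, so the sketch does not depend on the module's actual type.

```typescript
// Hypothetical local mirror of the properties described above.
// Field names are assumptions; check them against the real
// PendingProductionVariantSummary type before relying on them.
interface VariantStatusEntry {
  Status: string;
  StatusMessage?: string | null;
}

interface PendingVariantLike {
  VariantName: string;
  DesiredWeight?: number | null;
  DesiredInstanceCount?: number | null;
  InstanceType?: string | null;
  VariantStatus?: VariantStatusEntry[] | null;
}

// Builds a one-line, human-readable summary of a pending variant.
// Missing or null fields are rendered as "n/a" (or "unknown" for status).
function describePendingVariant(v: PendingVariantLike): string {
  const statuses = (v.VariantStatus ?? []).map((s) => s.Status);
  const status = statuses.length > 0 ? statuses.join(", ") : "unknown";
  const weight = v.DesiredWeight ?? "n/a";
  const count = v.DesiredInstanceCount ?? "n/a";
  const type = v.InstanceType ?? "n/a";
  return `${v.VariantName}: status=${status}, desiredWeight=${weight}, ` +
    `desiredInstances=${count}, instanceType=${type}`;
}

// Example input with made-up values.
const example: PendingVariantLike = {
  VariantName: "variant-a",
  DesiredWeight: 1,
  DesiredInstanceCount: 2,
  InstanceType: "ml.m5.large",
  VariantStatus: [{ Status: "Creating" }],
};
console.log(describePendingVariant(example));
```

Treating every field except the name as optional reflects that a variant may report an instance-based or a serverless configuration, so callers should not assume any one capacity field is populated.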