CreateEndpointConfigInput
import type { CreateEndpointConfigInput } from "https://aws-api.deno.dev/v0.4/services/sagemaker.ts?docs=full";
§Properties
Specifies configuration for how an endpoint performs asynchronous inference. This is a required field in order for your Endpoint to be invoked using InvokeEndpointAsync.
The name of the endpoint configuration. You specify this name in a "CreateEndpoint" request.
A member of CreateEndpointConfig
that enables explainers.
The Amazon Resource Name (ARN) of a Amazon Web Services Key Management Service key that SageMaker uses to encrypt data on the storage volume attached to the ML compute instance that hosts the endpoint.
The KmsKeyId can be any of the following formats:
- Key ID:
1234abcd-12ab-34cd-56ef-1234567890ab
- Key ARN:
arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
- Alias name:
alias/ExampleAlias
- Alias name ARN:
arn:aws:kms:us-west-2:111122223333:alias/ExampleAlias
The KMS key policy must grant permission to the IAM role that you specify in your CreateEndpoint
, UpdateEndpoint
requests.
For more information, refer to the Amazon Web Services Key Management Service sectionUsing Key Policies in Amazon Web Services KMS
Note:
Certain Nitro-based instances include local storage, dependent on the instance type.
Local storage volumes are encrypted using a hardware module on the instance.
You can't request a KmsKeyId
when using an instance type with local storage.
If any of the models that you specify in the ProductionVariants
parameter use nitro-based instances with local storage, do not specify a value for the KmsKeyId
parameter.
If you specify a value for KmsKeyId
when using any nitro-based instances with local storage, the call to CreateEndpointConfig
fails.
For a list of instance types that support local instance storage, see Instance Store Volumes.
For more information about local instance storage encryption, see SSD Instance Store Volumes.
An array of ProductionVariant
objects, one for each model that you want to host at this endpoint.
An array of ProductionVariant
objects, one for each model that you want to host at this endpoint in shadow mode with production traffic replicated from the model specified on ProductionVariants
.
If you use this field, you can only specify one variant for ProductionVariants
and one variant for ShadowProductionVariants
.
An array of key-value pairs. You can use tags to categorize your Amazon Web Services resources in different ways, for example, by purpose, owner, or environment. For more information, see Tagging Amazon Web Services Resources.