Module Values_1.CreateInferenceComponentOutput

Creates an inference component, which is a SageMaker AI hosting object that you can use to deploy a model to an endpoint. In the inference component settings, you specify the model, the endpoint, and how the model utilizes the resources that the endpoint hosts. You can optimize resource utilization by tailoring how the required CPU cores, accelerators, and memory are allocated. You can deploy multiple inference components to an endpoint, where each inference component contains one model and the resource utilization needs for that individual model. After you deploy an inference component, you can directly invoke the associated model when you use the InvokeEndpoint API action.

type nonrec t = {
  inferenceComponentArn : InferenceComponentArn.t option;
    (* The Amazon Resource Name (ARN) of the inference component. *)
}
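Because inferenceComponentArn is an option, callers have to handle a response that carries no ARN. The following is a minimal, self-contained sketch of that pattern. It uses a local stand-in record rather than the real module, assumes InferenceComponentArn.t is representable as a string, and the helper name describe and the sample ARN are hypothetical.

```ocaml
(* Stand-in for the documented record. ASSUMPTION: the real field type
   InferenceComponentArn.t is treated as a plain string here. *)
type t = { inferenceComponentArn : string option }

(* Hypothetical helper: turn the output into a log-friendly message,
   covering both the present and the absent ARN. *)
let describe (output : t) : string =
  match output.inferenceComponentArn with
  | Some arn -> "created inference component " ^ arn
  | None -> "response carried no inference component ARN"

let () =
  (* Sample ARN is illustrative only. *)
  let sample =
    "arn:aws:sagemaker:us-east-1:123456789012:inference-component/example"
  in
  print_endline (describe { inferenceComponentArn = Some sample });
  print_endline (describe { inferenceComponentArn = None })
```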
type nonrec error = [
  | `ResourceLimitExceeded of Values_0.ResourceLimitExceeded.t
  | `Unknown_operation_error of string * string option
]
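The error type is a closed polymorphic variant, so a match over it can be exhaustive. The sketch below shows one way to render both cases as text. It is written against local stand-in types: the shape of Values_0.ResourceLimitExceeded.t is not shown on this page, so a record with an optional message field is assumed, and reading the two components of `Unknown_operation_error as an error code and an optional detail is also an assumption. The name string_of_error is hypothetical.

```ocaml
(* ASSUMPTION: stand-in for Values_0.ResourceLimitExceeded.t, whose real
   definition is not shown here. *)
type resource_limit_exceeded = { message : string option }

(* Local mirror of the documented error variant. *)
type error = [
  | `ResourceLimitExceeded of resource_limit_exceeded
  | `Unknown_operation_error of string * string option
]

(* Hypothetical helper: exhaustive match over both error cases. *)
let string_of_error (e : error) : string =
  match e with
  | `ResourceLimitExceeded { message } ->
    "resource limit exceeded: "
    ^ Option.value message ~default:"(no message)"
  | `Unknown_operation_error (code, detail) ->
    (* ASSUMPTION: first component is a code, second an optional detail. *)
    Printf.sprintf "unknown operation error %s: %s" code
      (Option.value detail ~default:"(no detail)")
```

Keeping the match free of a wildcard case means the compiler flags this function if a new error case is added to the variant.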
val make : ?inferenceComponentArn:??? -> unit -> t
val error_of_json : string -> Yojson.Safe.t -> [> `ResourceLimitExceeded of Values_0.ResourceLimitExceeded.t | `Unknown_operation_error of string * string option ]
val error_of_xml : string -> Awso.Xml.t -> [> `ResourceLimitExceeded of Values_0.ResourceLimitExceeded.t | `Unknown_operation_error of string * string option ]
val error_to_json : error -> Yojson.Safe.t
val to_value : t -> [> `Structure of (string * [> `String of InferenceComponentArn.t ]) list ]
val to_query : t -> Awso.Client.Query.t
val of_xml : Awso.Xml.t -> t
val of_string : string -> t
val of_json : Yojson.Safe.t -> t
val to_json : t -> Yojson.Safe.t
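The signature of to_value says the record is flattened into a `Structure holding a list of (name, `String value) pairs. The sketch below illustrates a function with that result shape using a local stand-in record. It is not the library's implementation: the key name "InferenceComponentArn", the choice to omit the pair when the field is None, and the string representation of the ARN are all assumptions.

```ocaml
(* Stand-in record. ASSUMPTION: the ARN is a plain string. *)
type t = { inferenceComponentArn : string option }

(* Sketch of a conversion with the documented result shape.
   ASSUMPTIONS: the key is "InferenceComponentArn" and an absent field
   produces no entry at all. *)
let to_value (x : t) =
  `Structure
    (match x.inferenceComponentArn with
     | Some arn -> [ ("InferenceComponentArn", `String arn) ]
     | None -> [])
```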