Values_0.CreateAIBenchmarkJobRequestSourceCreates a benchmark job that runs performance benchmarks against inference infrastructure using a predefined AI workload configuration. The benchmark job measures metrics such as latency, throughput, and cost for your generative AI inference endpoints.
type nonrec t = {aIBenchmarkJobName : AIEntityName.t;The name of the AI benchmark job. The name must be unique within your Amazon Web Services account in the current Amazon Web Services Region.
*)benchmarkTarget : AIBenchmarkTarget.t;The target endpoint to benchmark. Specify a SageMaker endpoint by providing its name or Amazon Resource Name (ARN).
*)outputConfig : AIBenchmarkOutputConfig.t;The output configuration for the benchmark job, including the Amazon S3 location where benchmark results are stored.
*)aIWorkloadConfigIdentifier : AIResourceIdentifier.t;The name or Amazon Resource Name (ARN) of the AI workload configuration to use for this benchmark job.
*)roleArn : RoleArn.t;The Amazon Resource Name (ARN) of an IAM role that enables Amazon SageMaker AI to perform tasks on your behalf.
*)networkConfig : AIBenchmarkNetworkConfig.t option;The network configuration for the benchmark job, including VPC settings.
*)}val make :
?networkConfig:??? ->
?tags:??? ->
aIBenchmarkJobName:AIEntityName.t ->
benchmarkTarget:AIBenchmarkTarget.t ->
outputConfig:AIBenchmarkOutputConfig.t ->
aIWorkloadConfigIdentifier:AIResourceIdentifier.t ->
roleArn:RoleArn.t ->
unit ->
tval to_value :
t ->
[> `Structure of
(string
* [> `List of
[> `Structure of (string * [> `String of TagKey.t ]) list ] list
| `String of AIEntityName.t
| `Structure of
(string
* [> `String of S3Uri.t
| `Structure of
(string
* [> `List of
[> `String of SecurityGroupId.t
| `Structure of
(string * [> `String of AIResourceIdentifier.t ])
list ]
list
| `String of AIResourceIdentifier.t ])
list ])
list ])
list ]