Values_1.InferenceComponentComputeResourceRequirements

Defines the compute resources to allocate to run a model, plus any adapter models, that you assign to an inference component. These resources include CPU cores, accelerators, and memory.
type nonrec t = {
  numberOfCpuCoresRequired : NumberOfCpuCores.t option;
    (* The number of CPU cores to allocate to run a model that you assign to an inference component. *)
  numberOfAcceleratorDevicesRequired : NumberOfAcceleratorDevices.t option;
    (* The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and Amazon Web Services Inferentia. *)
  minMemoryRequiredInMb : MemoryInMb.t;
    (* The minimum MB of memory to allocate to run a model that you assign to an inference component. *)
  maxMemoryRequiredInMb : MemoryInMb.t option;
    (* The maximum MB of memory to allocate to run a model that you assign to an inference component. *)
}

val make :
  ?numberOfCpuCoresRequired:??? ->
  ?numberOfAcceleratorDevicesRequired:??? ->
  ?maxMemoryRequiredInMb:??? ->
  minMemoryRequiredInMb:MemoryInMb.t ->
  unit ->
  t

val to_value :
  t ->
  [> `Structure of
       (string * [> `Float of NumberOfCpuCores.t | `Integer of MemoryInMb.t ])
       list ]
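
The signatures above can be exercised with a small self-contained sketch. It does not call the real library: the three scalar modules are local stand-ins, and their aliases (float for CPU cores and accelerator devices, int for memory) are assumptions inferred from the `Float and `Integer tags in to_value. The optional argument types shown as ??? are assumed to be the underlying field types. The key strings in to_value and the choice to omit None fields are hypothetical; the actual serialization may differ.

```ocaml
(* Local stand-ins for the library's scalar modules (assumed aliases). *)
module NumberOfCpuCores = struct type t = float end
module NumberOfAcceleratorDevices = struct type t = float end
module MemoryInMb = struct type t = int end

type t = {
  numberOfCpuCoresRequired : NumberOfCpuCores.t option;
  numberOfAcceleratorDevicesRequired : NumberOfAcceleratorDevices.t option;
  minMemoryRequiredInMb : MemoryInMb.t;
  maxMemoryRequiredInMb : MemoryInMb.t option;
}

(* Only minMemoryRequiredInMb is mandatory; the trailing unit lets the
   optional arguments be omitted at the call site. *)
let make ?numberOfCpuCoresRequired ?numberOfAcceleratorDevicesRequired
    ?maxMemoryRequiredInMb ~minMemoryRequiredInMb () =
  { numberOfCpuCoresRequired;
    numberOfAcceleratorDevicesRequired;
    minMemoryRequiredInMb;
    maxMemoryRequiredInMb }

(* Hypothetical serialization: key names are assumptions, and fields that
   are None are left out of the structure. *)
let to_value (r : t) =
  let opt name f = function
    | None -> []
    | Some v -> [ (name, f v) ]
  in
  `Structure
    (List.concat
       [ opt "NumberOfCpuCoresRequired" (fun v -> `Float v)
           r.numberOfCpuCoresRequired;
         opt "NumberOfAcceleratorDevicesRequired" (fun v -> `Float v)
           r.numberOfAcceleratorDevicesRequired;
         [ ("MinMemoryRequiredInMb", `Integer r.minMemoryRequiredInMb) ];
         opt "MaxMemoryRequiredInMb" (fun v -> `Integer v)
           r.maxMemoryRequiredInMb ])

(* Render a single field value for printing. *)
let show = function
  | `Float f -> Printf.sprintf "%g" f
  | `Integer i -> string_of_int i

let () =
  (* Two CPU cores, at least 1024 MB, no accelerator and no memory cap. *)
  let r = make ~numberOfCpuCoresRequired:2.0 ~minMemoryRequiredInMb:1024 () in
  match to_value r with
  | `Structure fields ->
    List.iter (fun (k, v) -> Printf.printf "%s = %s\n" k (show v)) fields
```

Because minMemoryRequiredInMb is the only non-optional field, make ~minMemoryRequiredInMb:512 () is the smallest valid call under these assumptions; every other field then defaults to None.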