Values.DatasetInputDataConfigSourceSpecifies the format and location of the input data for the dataset.
type nonrec t = {augmentedManifests : DatasetAugmentedManifestsList.t option;A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
*)dataFormat : DatasetDataFormat.t option;COMPREHEND_CSV: The data format is a two-column CSV file, where the first column contains labels and the second column contains documents. AUGMENTED_MANIFEST: The data format
*)documentClassifierInputDataConfig : DatasetDocumentClassifierInputDataConfig.t
option;The input properties for training a document classifier model. For more information on how the input file is formatted, see Preparing training data in the Comprehend Developer Guide.
*)entityRecognizerInputDataConfig : DatasetEntityRecognizerInputDataConfig.t
option;The input properties for training an entity recognizer model.
*)}val to_value :
t ->
[> `Structure of
(string
* [> `Enum of string
| `List of
[> `Structure of
(string
* [> `Enum of string
| `List of [> `String of AttributeNamesListItem.t ] list
| `String of S3Uri.t ])
list ]
list
| `Structure of
(string
* [> `String of S3Uri.t
| `Structure of
(string * [> `Enum of string | `String of S3Uri.t ]) list ])
list ])
list ]