Values.CreateDataSourceRequest
Creates a data source connector that you want to use with an Amazon Kendra index. You specify a name, data source connector type, and description for your data source. You also specify configuration information for the data source connector. CreateDataSource is a synchronous operation. The operation returns 200 if the data source was successfully created. Otherwise, an exception is raised. For an example of creating an index and data source using the Python SDK, see Getting started with Python SDK. For an example of creating an index and data source using the Java SDK, see Getting started with Java SDK.
type nonrec t = {
  name : DataSourceName.t;
      (** A name for the data source connector. *)
  indexId : IndexId.t;
      (** The identifier of the index you want to use with the data source connector. *)
  type_ : DataSourceType.t;
      (** The type of data source repository. For example, SHAREPOINT. *)
  configuration : DataSourceConfiguration.t option;
      (** Configuration information to connect to your data source repository. You can't specify the Configuration parameter when the Type parameter is set to CUSTOM. If you do, you receive a ValidationException exception. The Configuration parameter is required for all other data sources. *)
  vpcConfiguration : DataSourceVpcConfiguration.t option;
      (** Configuration information for an Amazon Virtual Private Cloud to connect to your data source. For more information, see Configuring a VPC. *)
  description : Description.t option;
      (** A description for the data source connector. *)
  schedule : ScanSchedule.t option;
      (** Sets the frequency for Amazon Kendra to check the documents in your data source repository and update the index. If you don't set a schedule, Amazon Kendra will not periodically update the index. You can call the StartDataSourceSyncJob API to update the index. Specify a cron-format schedule string or an empty string to indicate that the index is updated on demand. You can't specify the Schedule parameter when the Type parameter is set to CUSTOM. If you do, you receive a ValidationException exception. *)
  roleArn : RoleArn.t option;
      (** The Amazon Resource Name (ARN) of an IAM role with permission to access the data source and required resources. For more information, see IAM access roles for Amazon Kendra. You can't specify the RoleArn parameter when the Type parameter is set to CUSTOM. If you do, you receive a ValidationException exception. The RoleArn parameter is required for all other data sources. *)
  clientToken : ClientTokenName.t option;
      (** A token that you provide to identify the request to create a data source connector. Multiple calls to the CreateDataSource API with the same client token will create only one data source connector. *)
  languageCode : LanguageCode.t option;
      (** The code for a language. This allows you to support a language for all documents when creating the data source connector. English is supported by default. For more information on supported languages, including their codes, see Adding documents in languages other than English. *)
  customDocumentEnrichmentConfiguration : CustomDocumentEnrichmentConfiguration.t option;
      (** Configuration information for altering document metadata and content during the document ingestion process. For more information on how to create, modify and delete document metadata, or make other content alterations when you ingest documents into Amazon Kendra, see Customizing document metadata during the ingestion process. *)
}
val make :
  ?configuration:??? ->
  ?vpcConfiguration:??? ->
  ?description:??? ->
  ?schedule:??? ->
  ?roleArn:??? ->
  ?tags:??? ->
  ?clientToken:??? ->
  ?languageCode:??? ->
  ?customDocumentEnrichmentConfiguration:??? ->
  name:DataSourceName.t ->
  indexId:IndexId.t ->
  type_:DataSourceType.t ->
  unit ->
  t
val to_value :
  t ->
  [> `Structure of
     (string
     * [> `Enum of string
       | `List of
         [> `Structure of (string * [> `String of TagKey.t ]) list ] list
       | `String of DataSourceName.t
       | `Structure of
         (string
         * [> `List of
              [> `String of SubnetId.t
              | `Structure of
                (string
                * [> `Boolean of Boolean.t
                  | `Structure of
                    (string
                    * [> `Boolean of Boolean.t
                      | `Enum of string
                      | `String of DocumentAttributeKey.t
                      | `Structure of
                        (string
                        * [> `List of [> `String of String_.t ] list
                          | `Long of Long.t
                          | `String of DocumentAttributeStringValue.t
                          | `Timestamp of Timestamp.t ])
                        list ])
                    list ])
                list ]
              list
            | `String of RoleArn.t
            | `Structure of
              (string
              * [> `Boolean of Boolean.t
                | `Enum of string
                | `Float of MaxContentSizePerPageInMegaBytes.t
                | `Integer of CrawlDepth.t
                | `List of
                  [> `Enum of string
                  | `String of
                    DataSourceInclusionsExclusionsStringsMember.t
                  | `Structure of
                    (string
                    * [> `Enum of string
                      | `List of
                        [> `Structure of
                           (string
                           * [> `String of DataSourceFieldName.t ])
                           list ]
                        list
                      | `String of DataSourceFieldName.t ])
                    list ]
                  list
                | `String of S3BucketName.t
                | `Structure of
                  (string
                  * [> `Boolean of Boolean.t
                    | `Enum of string
                    | `Integer of Port.t
                    | `List of
                      [> `Enum of string
                      | `String of SubnetId.t
                      | `Structure of
                        (string
                        * [> `Enum of string
                          | `Integer of Port.t
                          | `List of
                            [> `Structure of
                               (string
                               * [> `String of DataSourceFieldName.t ])
                               list ]
                            list
                          | `String of DataSourceFieldName.t ])
                        list ]
                      list
                    | `String of S3ObjectKey.t
                    | `Structure of
                      (string
                      * [> `Enum of string
                        | `List of
                          [> `String of SeedUrl.t
                          | `Structure of
                            (string
                            * [> `String of DataSourceFieldName.t ])
                            list ]
                          list
                        | `Long of Long.t
                        | `String of DataSourceFieldName.t
                        | `Timestamp of Timestamp.t ])
                      list ])
                  list ])
              list ])
         list ])
     list ]
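
The following is a minimal, self-contained sketch of how a constructor shaped like make is typically called, together with a client-side check of the CUSTOM rules described in the field documentation. It does not use the real library: the record below is a cut-down stand-in with plain strings in place of DataSourceName.t, IndexId.t and the other generated types, the two-case data_source_type variant is an assumption about how the enum might look, and check_custom_rules is a hypothetical helper, not part of the documented API. The Configuration field is left out for brevity, although the same rule applies to it.

```ocaml
(* Stand-in types: the real modules (DataSourceName, IndexId, ...) are
   replaced by plain strings, and only two enum cases are modelled. *)
type data_source_type = SHAREPOINT | CUSTOM

type t = {
  name : string;
  indexId : string;
  type_ : data_source_type;
  description : string option;
  schedule : string option;
  roleArn : string option;
}

(* Same shape as the documented make: optional labelled arguments,
   then required labelled arguments, then a final unit so that the
   optional arguments can be omitted at the call site. *)
let make ?description ?schedule ?roleArn ~name ~indexId ~type_ () =
  { name; indexId; type_; description; schedule; roleArn }

(* Hypothetical client-side check of the documented rules:
   Schedule and RoleArn must be absent for CUSTOM, and RoleArn is
   required for every other type. The service itself reports
   violations as a ValidationException. *)
let check_custom_rules (r : t) : (t, string) result =
  match r.type_ with
  | CUSTOM when r.schedule <> None ->
      Error "Schedule cannot be set when Type is CUSTOM"
  | CUSTOM when r.roleArn <> None ->
      Error "RoleArn cannot be set when Type is CUSTOM"
  | CUSTOM -> Ok r
  | SHAREPOINT when r.roleArn = None ->
      Error "RoleArn is required unless Type is CUSTOM"
  | SHAREPOINT -> Ok r

let () =
  (* All identifiers and the ARN below are placeholder values. *)
  let sharepoint =
    make ~name:"team-docs" ~indexId:"example-index-id" ~type_:SHAREPOINT
      ~roleArn:"arn:aws:iam::123456789012:role/ExampleKendraRole" ()
  in
  assert (check_custom_rules sharepoint = Ok sharepoint);
  (* A CUSTOM data source with a schedule violates the documented rule. *)
  let custom_with_schedule =
    make ~name:"custom-source" ~indexId:"example-index-id" ~type_:CUSTOM
      ~schedule:"placeholder-cron-string" ()
  in
  assert (Result.is_error (check_custom_rules custom_with_schedule))
```

Checking these rules before sending the request is a design choice rather than a requirement: it only surfaces the documented ValidationException cases earlier, and the service remains the authority on what is accepted.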