Awso_glue.Values_0
val structure_to_value_aux :
('a * 'b option) list ->
f:(('a * 'b) list -> 'c) ->
[> `Structure of 'c ]
val structure_to_wrapped_value :
wrapper:'a ->
response:'a ->
('b * 'c option) list ->
[> `Structure of ('a * [> `Structure of ('b * 'c) list ]) list ]
Access to a resource was denied.
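The signature of `structure_to_value_aux` at the top of this page suggests that it drops fields whose value is `None` and passes the remaining pairs to `~f`. The following is a minimal sketch of one body that type-checks against that signature. It is an assumption for illustration, not the library's actual implementation, and the field names in the usage are hypothetical.

```ocaml
(* Hypothetical sketch: one implementation consistent with the published
   signature of structure_to_value_aux; the library's real body may differ. *)
let structure_to_value_aux fields ~f =
  (* Keep only the fields whose optional value is present. *)
  let present =
    List.filter_map
      (fun (key, value) -> Option.map (fun v -> (key, v)) value)
      fields
  in
  `Structure (f present)

let () =
  (* The identity function stands in for whatever encoder ~f would be. *)
  match
    structure_to_value_aux
      [ ("Name", Some "orders"); ("Description", None) ]
      ~f:(fun kept -> kept)
  with
  | `Structure kept ->
      List.iter (fun (k, v) -> Printf.printf "%s=%s\n" k v) kept
```

In this sketch only the `Name` pair reaches `~f`, because `Description` carries `None`.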
Specifies configuration properties of a notification.
Specifies the set of parameters needed to perform aggregation in the aggregate transform.
Specifies a transform that groups rows by chosen fields and computes the aggregated value by specified function.
An object representing a value allowed for a property.
A resource to be created or added already exists.
Specifies an optional value when connecting to the Redshift cluster.
Specifies an Amazon Redshift node.
Specifies an Amazon Redshift source.
Specifies an Amazon Redshift target.
A failed annotation.
A timestamped inclusion annotation.
A Statistic Annotation.
module Mapping : sig ... end
Specifies the mapping of data property keys.
module Mappings : sig ... end
Specifies a transform that maps data property keys in the data source to data property keys in the data target. You can rename keys, modify the data types for keys, and choose which keys to drop from the dataset.
Specifies a single column in a Glue schema definition.
Specifies a user-defined schema when a schema cannot be determined by Glue.
Specifies a connector to an Amazon Athena data source.
A structure containing the Lake Formation audit context.
An object that defines a connection type for a compute environment.
The authentication configuration for a connection returned by the DescribeConnectionType API.
The OAuth2 client app used for the connection.
A structure containing properties for OAuth2 authentication.
A structure containing the authentication configuration.
The credentials used when the authentication type is OAuth2 authentication.
The set of properties required for the OAuth2 AUTHORIZATION_CODE grant type workflow.
A structure containing properties for OAuth2 in the CreateConnection request.
For supplying basic auth credentials when not providing a SecretArn value.
A structure containing the authentication configuration in the CreateConnection request.
Specifies configuration options for automatic data quality evaluation in Glue jobs. This structure enables automated data quality checks and monitoring during ETL operations, helping to ensure data integrity and reliability without manual intervention.
Contains a list of values defining partitions.
A list of errors that can occur when registering partition indexes for an existing table. These errors give the details about why an index registration failed and provide a limited number of partitions in the response, so that you can fix the partitions at fault and try registering the index again. The most common errors are categorized as follows: EncryptedPartitionError: The partitions are encrypted. InvalidPartitionTypeDataError: The partition value doesn't match the data type for that partition column. MissingPartitionValueError: The partitions are encrypted. UnsupportedPartitionCharacterError: Characters inside the partition value are not supported. For example: U+0000, U+0001, U+0002. InternalError: Any error which does not belong to other error codes.
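The error categories listed above lend themselves to a closed variant type. The sketch below is illustrative only: the type name, constructor names, and the `partition_index_error_of_string` function are hypothetical and are not taken from this module. Only the code strings come from the description.

```ocaml
(* Hypothetical variant mirroring the error categories named in the
   description; these identifiers are not part of the documented module. *)
type partition_index_error =
  | Encrypted_partition_error
  | Invalid_partition_type_data_error
  | Missing_partition_value_error
  | Unsupported_partition_character_error
  | Internal_error

(* Map a raw error-code string to the variant; unknown codes yield None. *)
let partition_index_error_of_string = function
  | "EncryptedPartitionError" -> Some Encrypted_partition_error
  | "InvalidPartitionTypeDataError" -> Some Invalid_partition_type_data_error
  | "MissingPartitionValueError" -> Some Missing_partition_value_error
  | "UnsupportedPartitionCharacterError" ->
      Some Unsupported_partition_character_error
  | "InternalError" -> Some Internal_error
  | _ -> None
```

Returning an option keeps unrecognized codes explicit instead of silently folding them into `Internal_error`.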
Defines a property configuration for connection types, default values, and where the property should be used in requests.
Basic authentication configuration that defines the username and password properties for HTTP Basic authentication.
Specifies a target that uses a Glue Data Catalog table.
Specifies skewed values in a table. Skewed values are those that occur with very high frequency.
Information about a serialization/deserialization program (SerDe) that serves as an extractor and loader.
An object that references a schema stored in the Glue Schema Registry.
Describes the physical storage of table data.
The structure used to create and update a partition.
Creates one or more partitions in a batch operation.
A resource numerical limit was exceeded.
Contains details about an error.
Contains information about a partition error.
The operation timed out.
The input provided was not valid.
An internal service error occurred.
An encryption operation failed.
A specified entity does not exist.
Creates one or more partitions in a batch operation.
Deletes a list of connection definitions from the Data Catalog.
Deletes a list of connection definitions from the Data Catalog.
Deletes one or more partitions in a batch operation.
Deletes one or more partitions in a batch operation.
Deletes multiple tables at once. After completing this operation, you no longer have access to the table versions and partitions that belong to the deleted table. Glue deletes these "orphaned" resources asynchronously in a timely manner, at the discretion of the service. To ensure the immediate deletion of all related resources, before calling BatchDeleteTable, use DeleteTableVersion or BatchDeleteTableVersion, and DeletePartition or BatchDeletePartition, to delete any resources that belong to the table.
An error record for table operations.
A resource was not ready for a transaction.
Deletes multiple tables at once. After completing this operation, you no longer have access to the table versions and partitions that belong to the deleted table. Glue deletes these "orphaned" resources asynchronously in a timely manner, at the discretion of the service. To ensure the immediate deletion of all related resources, before calling BatchDeleteTable, use DeleteTableVersion or BatchDeleteTableVersion, and DeletePartition or BatchDeletePartition, to delete any resources that belong to the table.
Deletes a specified batch of versions of a table.
An error record for table-version operations.
Deletes a specified batch of versions of a table.
Retrieves information about a list of blueprints.
When there are multiple versions of a blueprint and the latest version has some errors, this attribute indicates the last successful blueprint definition that is available with the service.
Retrieves information about a list of blueprints.
Returns a list of resource metadata for a given list of crawler names. After calling the ListCrawlers operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that use tags.
A policy that specifies update and deletion behaviors for the crawler.
When crawling an Amazon S3 data source after the first crawl is complete, specifies whether to crawl the entire dataset again or to crawl only folders that were added since the last crawler run. For more information, see Incremental Crawls in Glue in the developer guide.
Specifies data lineage configuration settings for the crawler.
Status and error information about the most recent crawl.
Specifies Lake Formation configuration settings for the crawler.
Specifies a data store in Amazon Simple Storage Service (Amazon S3).
Specifies an Amazon DocumentDB or MongoDB data store to crawl.
Specifies a JDBC data store to crawl.
Specifies an Apache Iceberg data source where Iceberg tables are stored in Amazon S3.
Specifies an Apache Hudi data source.
Specifies an Amazon DynamoDB table to crawl.
Specifies a Delta data store to crawl one or more Delta tables.
Specifies a Glue Data Catalog target.
Specifies data stores to crawl.
Specifies a crawler program that examines a data source and uses classifiers to try to determine its schema. If successful, the crawler records metadata concerning the data source in the Glue Data Catalog.
Returns a list of resource metadata for a given list of crawler names. After calling the ListCrawlers operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that use tags.
Retrieves the details for the custom patterns specified by a list of names.
An object representing a custom pattern for detecting sensitive data across the columns and rows of your structured data.
Retrieves the details for the custom patterns specified by a list of names.
Retrieves a list of data quality results for the specified result IDs.
The database and table in the Glue Data Catalog that is used for input or output data.
SQL Query of SparkSQL format that can be used to pre-process data before running Data Quality Operations.
The database and table in the Glue Data Catalog that is used for input or output data for Data Quality Operations.
A data source (a Glue table) for which you want data quality results.
Describes the result of the evaluation of a data quality rule.
Describes the data quality metric value according to the analysis of historical data.
Describes the metric-based observation generated from evaluated data quality metrics.
Describes the observation generated after evaluating the rules and analyzers.
Describes the result of the evaluation of a data quality analyzer.
A summary of metrics showing the total counts of processed rows and rules, including their pass/fail statistics based on row-level results.
Describes a data quality result.
Retrieves a list of data quality results for the specified result IDs.
Returns a list of resource metadata for a given list of development endpoint names. After calling the ListDevEndpoints operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that use tags.
A development endpoint where a developer can remotely debug extract, transform, and load (ETL) scripts.
Returns a list of resource metadata for a given list of development endpoint names. After calling the ListDevEndpoints operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that use tags.
Returns a list of resource metadata for a given list of job names. After calling the ListJobs operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that use tags.
The details for a source control configuration for a job, allowing synchronization of job artifacts to or from a remote repository.
Specifies code that runs when a job is run.
An execution property of a job.
Specifies the connections used by a job.
Specifies a transform that combines the rows from two or more datasets into a single result.
Specifies a transform that splits data property keys into two DynamicFrames. The output is a collection of DynamicFrames: one with selected data property keys, and one with the remaining data property keys.
Specifies a transform that writes samples of the data to an Amazon S3 bucket.
Specifies a transform where you enter a SQL query using Spark SQL syntax to transform the data. The output is a single DynamicFrame.
Specifies a target that uses an Apache Spark connector.
Specifies a connector to an Apache Spark data source.
Specifies configuration for Snowflake nodes in Glue Studio.
Specifies a Snowflake target.
Specifies a Snowflake data source.
Specifies a transform that chooses one DynamicFrame from a collection of DynamicFrames. The output is the selected DynamicFrame.
Specifies a transform that chooses the data property keys that you want to keep.
Specifies additional connection options for the Amazon S3 data store.
Specifies an Apache Parquet data store stored in Amazon S3.
Specifies a JSON data store stored in Amazon S3.
A policy that specifies update behavior for the crawler.
Specifies a target that writes to an Iceberg data source in Amazon S3.
A policy that specifies update behavior for the crawler.
Specifies an Apache Iceberg catalog target that writes data to Amazon S3 and registers the table in the Glue Data Catalog.
Specifies a HyperDirect data target that writes to Amazon S3.
Specifies a Hudi data source stored in Amazon S3.
Specifies a target that writes to a Hudi data source in Amazon S3.
Specifies a target that writes to a Hudi data source in the Glue Data Catalog.
Specifies a data target that writes to Amazon S3 in Apache Parquet columnar storage.
Specifies an S3 Excel data source.
Specifies a data target that writes to Amazon S3.
Specifies a Delta Lake data source stored in Amazon S3.
Specifies a target that writes to a Delta Lake data source in Amazon S3.
Specifies a target that writes to a Delta Lake data source in the Glue Data Catalog.
Specifies a comma-separated value (CSV) data store stored in Amazon S3.
Specifies a data target that writes to Amazon S3 using the Glue Data Catalog.
Specifies additional connection options for the Amazon S3 data store.
Specifies an Amazon S3 data store in the Glue Data Catalog.
Specifies an Apache Iceberg data source that is registered in the Glue Data Catalog. The Iceberg data source must be stored in Amazon S3.
Specifies a Hudi data source that is registered in the Glue Data Catalog. The Hudi data source must be stored in Amazon S3.
Specifies a Delta Lake data source that is registered in the Glue Data Catalog. The data source must be stored in Amazon S3.
Represents a single entry in the list of values for a FilterExpression.
Specifies a filter expression.
Specifies a group of filters with a logical operator that determines how the filters are combined to evaluate routing conditions.
Specifies a route node that directs data to different output paths based on defined filtering conditions.
Specifies a transform that renames a single data property key.
Specifies a Relational database data source in the Glue Data Catalog.
The options to configure an upsert operation when writing to a Redshift target.
Specifies a target that uses Amazon Redshift.
Specifies an Amazon Redshift data store.
Actions defined in the Glue Studio data preparation recipe node.
Condition expression defined in the Glue Studio data preparation recipe node.
A recipe step used in a Glue Studio data preparation recipe node.
A reference to a Glue DataBrew recipe.
Specifies a target that uses PostgreSQL.
Specifies a PostgreSQL data source in the Glue Data Catalog.
Specifies a transform that identifies, removes or masks PII data.
Specifies a target that uses Oracle SQL.
Specifies an Oracle data source in the Glue Data Catalog.
Specifies a target that uses MySQL.
Specifies a MySQL data source in the Glue Data Catalog.
Specifies a target that uses Microsoft SQL.
Specifies a Microsoft SQL Server data source in the Glue Data Catalog.
Specifies a transform that merges a DynamicFrame with a staging DynamicFrame based on the specified primary keys to identify records. Duplicate records (records with the same primary keys) are not de-duplicated.
Specifies a column to be joined.
Specifies a transform that joins two datasets into one dataset using a comparison phrase on the specified data property keys. You can use inner, outer, left, right, left semi, and left anti joins.
Specifies a data target that writes to Amazon S3 in Apache Parquet columnar storage.
Additional connection options for the connector.
Specifies a connector to a JDBC data source.
Specifies a data target that writes to Amazon S3 using the Glue Data Catalog.
Specifies the data store in the governed Glue Data Catalog.
Specifies a transform that splits a dataset into two, based on a filter condition.
Specifies a transform that locates records in the dataset that have missing values and adds a new field with a value determined by imputation. The input data set is used to train the machine learning model that determines what the missing value should be.
Options to configure how your job will stop if your data quality evaluation fails.
Options to configure how your data quality evaluation results are published.
Specifies your data quality evaluation criteria.
Specifies your data quality evaluation criteria.
Specifies connection options for DynamoDB ELT (Extract, Load, Transform) operations. This structure contains configuration parameters for connecting to and extracting data from DynamoDB tables using the ELT connector.
Specifies a DynamoDB ELT connector source for extracting data from DynamoDB tables.
Specifies additional options for DynamoDB ELT catalog operations.
Specifies a DynamoDB data source in the Glue Data Catalog.
Specifies the parameters in the config file of the dynamic transform.
Specifies the set of parameters needed to perform the dynamic transform.
Represents a custom null value, such as zeros or another value used as a null placeholder unique to the dataset.
Represents whether certain values are recognized as null values for removal.
Specifies a transform that removes columns from the dataset if all values in the column are 'null'. By default, Glue Studio will recognize null objects, but some values such as empty strings, strings that are "null", -1 integers or other placeholders such as zeros, are not automatically recognized as nulls.
Specifies a transform that chooses the data property keys that you want to drop.
Specifies a transform that removes rows of repeating data from a data set.
Specifies options related to data preview for viewing a sample of your data.
Additional options for the Amazon Kinesis streaming data source.
Specifies a direct Amazon Kinesis data source.
Additional options for streaming.
Specifies an Apache Kafka data store.
Specifies the direct JDBC source connection.
Specifies a transform that uses custom code you provide to perform the data transformation. The output is a collection of DynamicFrames.
Specifies a target generated with standard connection options.
Specifies a source generated with standard connection options.
Specifies a data store in the Glue Data Catalog.
Specifies a Kinesis data source in the Glue Data Catalog.
Specifies an Apache Kafka data store in the Data Catalog.
Specifies an Apache Iceberg data source that is registered in the Glue Data Catalog.
Specifies a Hudi data source that is registered in the Glue Data Catalog.
Specifies a Delta Lake data source that is registered in the Glue Data Catalog.
CodeGenConfigurationNode enumerates all valid Node types. One and only one of its member variables can be populated.
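The "one and only one member populated" rule can be checked by counting the populated optional fields. The sketch below uses a hypothetical three-field record with placeholder `string` payloads. The real node type has many more members, and its field names and payload types are not shown on this page, so every identifier here is an assumption.

```ocaml
(* Hypothetical, heavily reduced stand-in for a node record: each member is
   optional, and a well-formed node populates exactly one of them. *)
type node = {
  athena_connector_source : string option;
  jdbc_connector_source : string option;
  spark_connector_source : string option;
}

(* Count how many members are populated. *)
let populated_count node =
  [
    Option.is_some node.athena_connector_source;
    Option.is_some node.jdbc_connector_source;
    Option.is_some node.spark_connector_source;
  ]
  |> List.filter Fun.id
  |> List.length

(* A node satisfies the rule when exactly one member is populated. *)
let is_valid_node node = populated_count node = 1
```

A sum type with one constructor per node kind would enforce the same rule at compile time. The record form is shown because the description speaks of member variables.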
Returns a list of resource metadata for a given list of job names. After calling the ListJobs operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that use tags.
A structure used as a protocol between query engines and Lake Formation or Glue. Contains both a Lake Formation generated authorization identifier and information from the request's authorization context.
Retrieves partitions in a batch request.
An error that indicates your data is in an invalid state.
A federation source failed, but the operation may be retried.
A federation source failed.
Retrieves partitions in a batch request.
Represents a table optimizer to retrieve in the BatchGetTableOptimizer operation.
Contains details on one of the errors in the error list returned by the BatchGetTableOptimizer operation.
Returns the configuration for the specified table optimizers.
The throttling threshold was exceeded.
Metrics for the optimizer run. This structure is deprecated. See the individual metric members for compaction, retention, and orphan file deletion.
Snapshot retention metrics for Iceberg for the optimizer run.
A structure that contains retention metrics for the optimizer run.
Orphan file deletion metrics for Iceberg for the optimizer run.
A structure that contains orphan file deletion metrics for the optimizer run.
Compaction metrics for Iceberg for the optimizer run.
A structure that contains compaction metrics for the optimizer run.
Contains details for a table optimizer run.
An object that describes the VPC configuration for a table optimizer. This configuration is necessary to perform optimization on tables that are in a customer VPC.
The configuration for an Iceberg snapshot retention optimizer.
The configuration for a snapshot retention optimizer.
The configuration for an Iceberg orphan file deletion optimizer.
The configuration for an orphan file deletion optimizer.
The configuration for an Iceberg compaction optimizer. This configuration defines parameters for optimizing the layout of data files in Iceberg tables.
The configuration for a compaction optimizer. This configuration defines how data files in your table will be compacted to improve query performance and reduce storage costs.
Contains details on the configuration of a table optimizer. You pass this configuration when creating or updating a table optimizer.
Contains details about an optimizer associated with a table.
Contains details for one of the table optimizers returned by the BatchGetTableOptimizer operation.
Returns the configuration for the specified table optimizers.
Returns a list of resource metadata for a given list of trigger names. After calling the ListTriggers operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that use tags.
Defines the predicate of the trigger, which determines when it fires.
Batch condition that must be met (specified number of events received or batch time window expired) before the EventBridge event trigger fires.
Returns a list of resource metadata for a given list of trigger names. After calling the ListTriggers operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that use tags.
Returns a list of resource metadata for a given list of workflow names. After calling the ListWorkflows operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that use tags.
Workflow run statistics provides statistics about the workflow run.
The details of a Trigger node present in the workflow.
A job run that was used in the predicate of a conditional trigger that triggered this job run.
The details of a Job node present in the workflow.
The details of a Crawler node present in the workflow.
A node represents a Glue component (trigger, crawler, or job) on a workflow graph.
An edge represents a directed connection between two Glue components that are part of the workflow the edge belongs to.
A workflow graph represents the complete workflow containing all the Glue components present in the workflow and all the directed connections between them.
The batch condition that started the workflow run. Either the number of events in the batch size arrived, in which case the BatchSize member is non-zero, or the batch window expired, in which case the BatchWindow member is non-zero.
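The rule described above, where a non-zero BatchSize means the event count was reached and a non-zero BatchWindow means the window expired, can be expressed as a small classifier. The record and variant below are hypothetical simplifications. They use plain `int` fields for illustration, and the actual field types in this module may be optional or differ in other ways.

```ocaml
(* Hypothetical simplification of the batch condition: both members are
   plain integers here, with 0 meaning "not the cause". *)
type starting_event_batch_condition = {
  batch_size : int;
  batch_window : int;
}

type start_reason =
  | Batch_size_reached of int
  | Batch_window_expired of int
  | Unknown_reason

(* Classify why the workflow run started. If both members are non-zero,
   which the description does not anticipate, batch_size takes precedence
   in this sketch. *)
let start_reason condition =
  if condition.batch_size <> 0 then Batch_size_reached condition.batch_size
  else if condition.batch_window <> 0 then
    Batch_window_expired condition.batch_window
  else Unknown_reason
```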
A workflow run is an execution of a workflow providing all the runtime information.
The details of a blueprint.
A workflow is a collection of multiple dependent Glue jobs and crawlers that are run to complete a complex ETL task. A workflow manages the execution and monitoring of all its jobs and crawlers.
Returns a list of resource metadata for a given list of workflow names. After calling the ListWorkflows operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that use tags.
An Inclusion Annotation.
Annotate datapoints over time for a specific data quality statistic. The API requires both profileID and statisticID as part of the InclusionAnnotation input. The API only works for a single statisticId across multiple profiles.
Annotate datapoints over time for a specific data quality statistic. The API requires both profileID and statisticID as part of the InclusionAnnotation input. The API only works for a single statisticId across multiple profiles.
Records an error that occurred when attempting to stop a specified job run.
Stops one or more job runs for a specified job definition.
Records a successful request to stop a specified JobRun.
Stops one or more job runs for a specified job definition.
Contains information about a batch update partition error.
A structure that contains the values and structure used to update a partition.
Updates one or more partitions in a batch operation.
Updates one or more partitions in a batch operation.
Defines column statistics supported for bit sequence data values.
The details of a blueprint run.
Defines column statistics supported for Boolean data columns.
Cancels the specified recommendation run that was being used to generate rules.
Cancels the specified recommendation run that was being used to generate rules.
Cancels a run where a ruleset is being evaluated against a data source.
Cancels a run where a ruleset is being evaluated against a data source.
Cancels (stops) a task run. Machine learning task runs are asynchronous tasks that Glue runs on your behalf as part of various machine learning workflows. You can cancel a machine learning task run at any time by calling CancelMLTaskRun with a task run's parent transform's TransformID and the task run's TaskRunId.
Cancels (stops) a task run. Machine learning task runs are asynchronous tasks that Glue runs on your behalf as part of various machine learning workflows. You can cancel a machine learning task run at any time by calling CancelMLTaskRun with a task run's parent transform's TransformID and the task run's TaskRunId.
Cancels the statement.
The session is in an invalid state to perform a requested operation.
Cancels the statement.
Specifies the supported authentication types returned by the DescribeConnectionType API.
A structure that describes a target catalog for resource linking.
The Lake Formation principal.
Permissions granted to a principal.
A catalog that points to an entity outside the Glue Data Catalog.
A structure that contains the output properties of Iceberg table optimization configuration for your catalog resource in the Glue Data Catalog.
The output properties of the data lake access configuration for your catalog resource in the Glue Data Catalog.
Property attributes that include configuration properties for the catalog resource.
The catalog object represents a logical grouping of databases in the Glue Data Catalog or a federated source. You can now create a Redshift-federated catalog or a catalog containing resource links to Redshift databases in another account or region.
Specifies a table definition in the Glue Data Catalog.
A structure containing migration status information.
A structure that specifies Iceberg table optimization properties for the catalog, including configurations for compaction, retention, and orphan file deletion operations.
Input properties to configure data lake access for your catalog resource in the Glue Data Catalog.
A structure that specifies data lake access properties and other custom properties.
A structure that describes catalog properties.
Validates the supplied schema. This call has no side effects; it simply validates the supplied schema using DataFormat as the format. Since it does not take a schema set name, no compatibility checks are performed.