CleanRoomsService / Client / populate_intermediate_table

populate_intermediate_table

CleanRoomsService.Client.populate_intermediate_table(**kwargs)

Executes the stored query of an intermediate table to materialize data into managed storage. With this operation, you can perform initial population and subsequent refreshes. Each call creates a new version. The returned analysis ID can be tracked using GetProtectedQuery. Only the intermediate table owner can call this operation.

See also: AWS API Documentation

Request Syntax

response = client.populate_intermediate_table(
    intermediateTableIdentifier='string',
    membershipIdentifier='string',
    parameters={
        'string': 'string'
    },
    computeConfiguration={
        'queryComputeConfiguration': {
            'type': 'CR.1X'|'CR.4X',
            'number': 123,
            'properties': {
                'spark': {
                    'string': 'string'
                }
            }
        }
    },
    analysisPayerAccountId='string'
)
Parameters:
  • intermediateTableIdentifier (string) –

    [REQUIRED]

    The unique identifier of the intermediate table to populate.

  • membershipIdentifier (string) –

    [REQUIRED]

    The unique identifier of the membership that contains the intermediate table.

  • parameters (dict) –

    The runtime parameter values that override the defaults in the stored query.

    • (string) –

      • (string) –

  • computeConfiguration (dict) –

    The compute configuration for the population query execution.

    Note

    This is a Tagged Union structure. Only one of the following top level keys can be set: queryComputeConfiguration.

    • queryComputeConfiguration (dict) –

      The configuration of the compute resources for workers running an analysis with the Clean Rooms SQL analytics engine.

      • type (string) –

        The worker compute configuration type.

      • number (integer) –

        The number of workers.

        SQL queries support a minimum value of 2 and a maximum value of 400.

        PySpark jobs support a minimum value of 4 and a maximum value of 128.

      • properties (dict) –

        The configuration properties for the worker compute environment. These properties allow you to customize the compute settings for your Clean Rooms workloads.

        Note

        This is a Tagged Union structure. Only one of the following top level keys can be set: spark.

        • spark (dict) –

          The Spark configuration properties for SQL and PySpark workloads. This map contains key-value pairs that configure Apache Spark settings to optimize performance for your data processing jobs. You can specify up to 50 Spark properties, with each key being 1-200 characters and each value being 0-500 characters. These properties allow you to adjust compute capacity for large datasets and complex workloads.

          • (string) –

            • (string) –

  • analysisPayerAccountId (string) – The account ID of the member that pays for the analysis compute costs.

Return type:

dict

Returns:

Response Syntax

{
    'analysisId': 'string',
    'analysisType': 'QUERY',
    'versionId': 'string'
}

Response Structure

  • (dict) –

    • analysisId (string) –

      The identifier for the protected query execution. Use this value with GetProtectedQuery to track the population progress.

    • analysisType (string) –

      The type of analysis performed to populate the intermediate table.

    • versionId (string) –

      The unique identifier of the version created by this population operation.

Exceptions

  • CleanRoomsService.Client.exceptions.ConflictException

  • CleanRoomsService.Client.exceptions.ServiceQuotaExceededException

  • CleanRoomsService.Client.exceptions.ResourceNotFoundException

  • CleanRoomsService.Client.exceptions.InternalServerException

  • CleanRoomsService.Client.exceptions.ValidationException

  • CleanRoomsService.Client.exceptions.ThrottlingException

  • CleanRoomsService.Client.exceptions.AccessDeniedException