CleanRoomsService / Client / populate_intermediate_table

populate_intermediate_table¶

CleanRoomsService.Client.populate_intermediate_table(**kwargs)¶

Executes the stored query of an intermediate table to materialize data into managed storage. With this operation, you can perform initial population and subsequent refreshes. Each call creates a new version. The returned analysis ID can be tracked using GetProtectedQuery. Only the intermediate table owner can call this operation.

Request Syntax

response = client.populate_intermediate_table(
    intermediateTableIdentifier='string',
    membershipIdentifier='string',
    parameters={
        'string': 'string'
    },
    computeConfiguration={
        'queryComputeConfiguration': {
            'type': 'CR.1X'|'CR.4X',
            'number': 123,
            'properties': {
                'spark': {
                    'string': 'string'
                }
            }
        }
    },
    analysisPayerAccountId='string'
)

Parameters:

intermediateTableIdentifier (string) –
[REQUIRED]

The unique identifier of the intermediate table to populate.
membershipIdentifier (string) –
[REQUIRED]

The unique identifier of the membership that contains the intermediate table.
parameters (dict) –
The runtime parameter values that override the defaults in the stored query.
- (string) –
  - (string) –
computeConfiguration (dict) –
The compute configuration for the population query execution.

Note
This is a Tagged Union structure. Only one of the following top level keys can be set: queryComputeConfiguration.
- queryComputeConfiguration (dict) –
  
  The configuration of the compute resources for workers running an analysis with the Clean Rooms SQL analytics engine.
  - type (string) –
    
    The worker compute configuration type.
  - number (integer) –
    
    The number of workers.
    
    SQL queries support a minimum value of 2 and a maximum value of 400.
    
    PySpark jobs support a minimum value of 4 and a maximum value of 128.
  - properties (dict) –
    
    The configuration properties for the worker compute environment. These properties allow you to customize the compute settings for your Clean Rooms workloads.
    
    Note
    This is a Tagged Union structure. Only one of the following top level keys can be set: spark.
    - spark (dict) –
      
      The Spark configuration properties for SQL and PySpark workloads. This map contains key-value pairs that configure Apache Spark settings to optimize performance for your data processing jobs. You can specify up to 50 Spark properties, with each key being 1-200 characters and each value being 0-500 characters. These properties allow you to adjust compute capacity for large datasets and complex workloads.
      - (string) –
        
        (string) –
analysisPayerAccountId (string) – The account ID of the member that pays for the analysis compute costs.

Return type:

dict

Returns:

Response Syntax

{
    'analysisId': 'string',
    'analysisType': 'QUERY',
    'versionId': 'string'
}

Response Structure

(dict) –
- analysisId (string) –
  
  The identifier for the protected query execution. Use this value with GetProtectedQuery to track the population progress.
- analysisType (string) –
  
  The type of analysis performed to populate the intermediate table.
- versionId (string) –
  
  The unique identifier of the version created by this population operation.

Exceptions

CleanRoomsService.Client.exceptions.ConflictException
CleanRoomsService.Client.exceptions.ServiceQuotaExceededException
CleanRoomsService.Client.exceptions.ResourceNotFoundException
CleanRoomsService.Client.exceptions.InternalServerException
CleanRoomsService.Client.exceptions.ValidationException
CleanRoomsService.Client.exceptions.ThrottlingException
CleanRoomsService.Client.exceptions.AccessDeniedException