CleanRoomsService / Client / populate_intermediate_table
populate_intermediate_table¶
- CleanRoomsService.Client.populate_intermediate_table(**kwargs)¶
Executes the stored query of an intermediate table to materialize data into managed storage. With this operation, you can perform initial population and subsequent refreshes. Each call creates a new version. The returned analysis ID can be tracked using
GetProtectedQuery. Only the intermediate table owner can call this operation.See also: AWS API Documentation
Request Syntax
response = client.populate_intermediate_table( intermediateTableIdentifier='string', membershipIdentifier='string', parameters={ 'string': 'string' }, computeConfiguration={ 'queryComputeConfiguration': { 'type': 'CR.1X'|'CR.4X', 'number': 123, 'properties': { 'spark': { 'string': 'string' } } } }, analysisPayerAccountId='string' )
- Parameters:
intermediateTableIdentifier (string) –
[REQUIRED]
The unique identifier of the intermediate table to populate.
membershipIdentifier (string) –
[REQUIRED]
The unique identifier of the membership that contains the intermediate table.
parameters (dict) –
The runtime parameter values that override the defaults in the stored query.
(string) –
(string) –
computeConfiguration (dict) –
The compute configuration for the population query execution.
Note
This is a Tagged Union structure. Only one of the following top level keys can be set:
queryComputeConfiguration.queryComputeConfiguration (dict) –
The configuration of the compute resources for workers running an analysis with the Clean Rooms SQL analytics engine.
type (string) –
The worker compute configuration type.
number (integer) –
The number of workers.
SQL queries support a minimum value of 2 and a maximum value of 400.
PySpark jobs support a minimum value of 4 and a maximum value of 128.
properties (dict) –
The configuration properties for the worker compute environment. These properties allow you to customize the compute settings for your Clean Rooms workloads.
Note
This is a Tagged Union structure. Only one of the following top level keys can be set:
spark.spark (dict) –
The Spark configuration properties for SQL and PySpark workloads. This map contains key-value pairs that configure Apache Spark settings to optimize performance for your data processing jobs. You can specify up to 50 Spark properties, with each key being 1-200 characters and each value being 0-500 characters. These properties allow you to adjust compute capacity for large datasets and complex workloads.
(string) –
(string) –
analysisPayerAccountId (string) – The account ID of the member that pays for the analysis compute costs.
- Return type:
dict
- Returns:
Response Syntax
{ 'analysisId': 'string', 'analysisType': 'QUERY', 'versionId': 'string' }
Response Structure
(dict) –
analysisId (string) –
The identifier for the protected query execution. Use this value with
GetProtectedQueryto track the population progress.analysisType (string) –
The type of analysis performed to populate the intermediate table.
versionId (string) –
The unique identifier of the version created by this population operation.
Exceptions
CleanRoomsService.Client.exceptions.ConflictExceptionCleanRoomsService.Client.exceptions.ServiceQuotaExceededExceptionCleanRoomsService.Client.exceptions.ResourceNotFoundExceptionCleanRoomsService.Client.exceptions.InternalServerExceptionCleanRoomsService.Client.exceptions.ValidationExceptionCleanRoomsService.Client.exceptions.ThrottlingExceptionCleanRoomsService.Client.exceptions.AccessDeniedException