Data Destination - Azure Synapse Analytics
    • PDF

    Data Destination - Azure Synapse Analytics

    • PDF

    Article summary

    summary

    Help Page for ETL Configuration of Data Setting to Azure Synapse Analytics.

    constraints

    • none in particular

    Setting items

    STEP1 Basic settings

    item nameindispensabledefault valueContents
    Azure Synapse Analytics Connection ConfigurationYes-Select the preregistered Azure Synapse Analytics Connection Configuration that has the necessary permissions for this ETL Configuration.
    databaseYes-Select the Database Name for Data Destination.
    schemaYes-Select the name of the Data Destination schema.
    tableYes-Select the name of the Data Destination table.
    If the target table does not exist in the Data Destination database schema, it will be created automatically.
    transfer modeYesAppend (INSERT)Select the transfer mode.
    For more information, see About Transfer Mode below.
    merge keyNo-Can be entered when UPSERT (MERGE) is selected in transfer mode.
    Enter the name of the column that will be treated as a merge key (primary key) if the primary key does not exist in the Data Destination table.
    The merge key should be populated with columns that have no duplicate values and no null values.
    Table StructureYesHeap (HEAP )Select the table structure.
    For more information, see Table Structure below.
    Index Key RowYes-Enter when CLUSTERED COLUMNSTORE INDEX is selected for the table structure.
    Specify the input and order of key column names for the index.
    Table DistributionYesRound Robin Variance (ROUND_ROBIN )Select table variance.
    For more information, see Table Distribution below.
    distribution columnYes-Enter when hashdistribution(HASH ) is selected for table distribution.
    Specify the name of the distribution column that will be used as the hash key.

    About transfer mode

    transfer modeContents
    Append (INSERT)First, a temporary table is created and data is transferred.
    After all temporary tables have been created, insert data into the target tables.
    Postscript (INSERT DIRECT)Inserts rows directly into the target table.
    If the transfer fails midway, some data may have been inserted into the target table.
    TRUNCATE INSERTFirst, a temporary table is created and data is transferred to the temporary table.
    After all temporary tables have been created, delete the contents of the target table and then insert data into the target table.
    All cases washed (REPLACE)First, a temporary table is created and data is transferred.
    Once the temporary table has been created, delete the target table and rename the temporary table to the target name.
    If the transfer fails during the process, the target table may be deleted.
    UPSERT (MERGE)First, a temporary table is created and data is transferred.
    Once all temporary tables have been created
    For the target table, rows that match the merge key and value in the temporary table are updated, and rows that do not match are inserted.

    About Table Structure

    For more information on table structures, please refer to the official Azure Synapse Analytics documentation - Table Structure Options.

    Table StructureContents
    Heap (HEAP )Stores tables as a heap.
    CLUSTERED COLUMNSTORE INDEXStore the table as a clustered column store index.
    CLUSTERED INDEXStore tables as clustered indexes containing one or more key columns.

    About Table Distributions

    For more information on table distribution, see Azure Synapse Analytics Official Documentation - Table Distribution Options.

    Table DistributionContents
    Hash Dispersion (HASH )Distribute rows based on hash values of up to 8 columns.
    Round Robin Variance (ROUND_ROBIN )Distribute rows evenly across all distributions in a round-robin fashion.
    REPLICATEOne copy of the table is stored on each computing node.

    STEP1 Detailed settings

    item namedefault valueContents
    Maximum number of retries12Specifies the maximum number of retries.
    Retry wait time (ms)1000Specifies the waiting time for retries.
    Maximum retry wait time (ms)1800000Specifies the maximum time to wait for retries.
    Batch size (MB)16Specify batch size.
    Default time zone-Specify the default time zone.
    Connection timeout (sec)300Specify the timeout in seconds before the driver connects.
    If 0 is specified, the driver's default value of 30 seconds is adopted.
    Socket timeout (seconds)1800Specify the timeout in seconds before executing the query.
    If 0 is specified, there is no timeout.

    Was this article helpful?