Data Destination - Databricks
    • PDF

    Data Destination - Databricks

    • PDF

    Article summary

    Overview

    Help page for setting up data transfer to Databricks.

    Available Cloud Environments

    Data Source and Data Destination Databricks is available in any of the following cloud environments.

    TROCCO uses Unity Catalog Volume as temporary file storage during data transfer, so it does not depend on storage services or interfaces specific to each cloud service.

    Limitations

    Unavailable Data Types

    Settings

    Step 1: Basic settings

    Field NameRequiredDefault ValueDetails
    Databricks Connection ConfigurationYes-Select from a pre-set Databricks Connection Configuration that has the necessary permissions for this ETL Configuration.
    CatalogYes-Select the name of the destination catalog.
    SchemaYes-Select the schema name of the destination.
    TableYes-Select the destination table name.
    If the target table does not exist in the destination database schema, it will be created automatically.
    Transfer ModeYesAPPEND (INSERT)Select Transfer Mode.
    For details, see [About Transfer Modes](#About Transfer Modes) below.
    Merge KeyNo-Can be entered when UPSERT (MERGE) is selected in the Transfer Mode.
    If the primary key does not exist in the destination table, enter the name of the column to be treated as a merge key (primary key).
    For merge keys, enter columns that have no duplicate values and no null values.

    Step 1: Advanced Settings

    Field NameDefault ValueDetails
    Batch Size (MB)50Specify the batch size.
    Default Time ZoneEtc/UTCSpecify the default time zone.

    Additional Information

    About Transfer Modes

    Transfer ModeDetails
    APPEND (INSERT)Creates a temporary table before transferring the data.
    After all temporary tables have been created, inserts the data into the target table.
    APPEND (INSERT DIRECT)Insert rows directly into the target table.
    If the transfer fails, some data may have been inserted into the target table.
    REPLACE (TRUNCATE INSERTCreates a temporary table before transferring the data.
    After all temporary tables have been created, deletes the content of the target table and then insert data into the target table.
    REPLACECreates a temporary table before transferring the data.
    Once the temporary table has been created, deletes the target table and rename the temporary table to the target name.
    If the transfer fails, the target table may be deleted.
    UPSERT (MERGE)Creates a temporary table before transferring the data.
    After all temporary tables have been created, the following process will be occur:
    For the target table, rows that match the merge key and value in the temporary table are updated, and rows that do not match are inserted.

    Was this article helpful?