Data Destination - Amazon S3 Parquet
    • PDF

    Data Destination - Amazon S3 Parquet

    • PDF

    Article summary

    summary

    Help page for ETL Configuration of Data Settings for transferring data in Apache Parquet (.parquet) format to Amazon Web Services' S3.

    constraints

    • none in particular

    Setting items

    STEP1 Basic settings

    item nameindispensabledefault valueContents
    S3 Connection ConfigurationYes-Select the previously registered Connection Configuration that has the necessary permissions for this ETL Configuration.
    Please refer to another page for Connection Configuration.
    regionYesap-northeast-1Enter the region you specified when creating the bucket to be used.
    Please refer to the official AWS page for an explanation of regions.
    bucketYes-Specify the name of the Data Destination bucket.
    path prefixYes-Specify the Data Destination path prefix.
    TROCCO outputs multiple files in the Data Destination bucket, beginning with the path prefix.
    Custom Variables can also be used to dynamically determine the value of a setting during ETL Configuration of TROCCO's data.
    compressed formatYesuncompressedSpecifies the file compression method.
    ・umcompressed
    ・snappy
    ・gzip
    ・lzo
    ・broti
    ・lz4
    ・zstd
    You can choose from

    STEP2 Detailed settings

    item namedefault valueContents
    Naming Conventions for Multiple File Output.%03d.%02dYou can set the rules for naming files when outputting multiple files.
    Output file extensionparquet-
    Default timestamp format%Y-%m-%d %H:%M:%S.%6N %z-
    Default time zoneUTC-
    Block size (byte)134217728-
    Page size (byte)1048576-
    Maximum padding size (byte)8388608-
    Data Catalog Settinglocal-
    Column Setting--

    Was this article helpful?