Data Destination - S3 Parquet
  • 07 Dec 2022
  • Dark
    Light
  • PDF

Data Destination - S3 Parquet

  • Dark
    Light
  • PDF

Article Summary

Note

This is a machine-translated version of the original Japanese article.
Please understand that some of the information contained on this page may be inaccurate.

summary

This is a help page for setting up transferring data to Amazon Web Services S3 in Apache Parquet (.parquet) format.

Supported Protocols

constraint

  • Nothing in particular

Setting items

STEP1 Basic settings

Itemnamerequireddefault
valuecontent
S3 connection informationYes-From the connection information registered in advance, select the one that has the necessary permissions for this transfer setting.
Please refer to the separate page for how to set the connection information.
RegionYesap-northeast-1Enter the region you specified when creating the bucket to use.
For an explanation of regions, please refer to the official AWS page.
bucketYes-Specify the name of the bucket to which you want to transfer data.
Path prefixYes-Specify the path prefix to which the data is transferred.
trocco outputs multiple files to the destination bucket, starting with the path prefix.
Custom variables can also be used to dynamically determine the setting value during trocco data transfer.
Compression formatYesuncompressedSpecifies the compression method for the file.
・umcompressed
・Snappy
・gzip
・LZO
・broti
・LZ4
・zstd
You can choose from:

STEP2 Advanced settings

Item namedefaultvalue content
Naming conventions for multi-file output.%03d.%02dYou can set rules for the name of multiple files when outputting them.
Output file extensionparquet-
Default timestamp format%Y-%m-%d %H:%M:%S.%6N %z-
Default time zoneUTC-
Block Size (byte)134217728-
Page Size (byte)1048576-
Maximum padding size (byte)8388608-
Set up your data catalognot-
Column settings--

Was this article helpful?