Slide 29
Slide 29 text
Performance
EC2: m5ad.large (2 core, 8GB)
transfer: local_file -> BigQuery
embulk: 0.9.23
embulk-output-bigquery(v0.6.4)
embulk-output-bigquery_java (0.0.14)
speedometer を使ったスループットを計測
in:
type: file
path_prefix: /home/ec2-user/bq_rb/data.csv
parser:
charset: UTF-8
newline: LF
type: csv
delimiter: ','
quote: '"'
escape: '"'
trim_if_not_quoted: false
skip_header_lines: 1
allow_extra_columns: false
allow_optional_columns: false
columns:
- {name: c1, type: long}
- {name: c2, type: string}
- {name: c3, type: long}
- {name: c4, type: double}
- {name: c5, type: string}
out:
type: bigquery_java
auth_method: service_account
json_keyfile: ***
dataset: ****
table: bq_performance_java
auto_create_dataset: false
auto_create_table: true
mode: replace
location: US
open_timeout_sec: 300
timeout_sec: 300
send_timeout_sec: 300
read_timeout_sec: 300
retries: 5
allow_quoted_newlines: true
source_format: NEWLINE_DELIMITED_JSON
compression: GZIP
path_prefix: "/home/ec2-user/bq_java/"