Slide 23
Slide 23 text
© 2022 Treasure Data, Inc. Public
© 2022 Treasure Data, Inc. Public
query_fetcher:
handler: com.treasudata.hadoop.hive.simulator.executor.fetch.SqlQueryFetcherImpl
input_params:
fetch_query_sql: |
SELECT time, database, query, elapsed, job_id, success, ...
FROM hive
WHERE TD_TIME_RANGE(time, '2022-11-01', '2022-11-02')
AND elapsed >= 600
query_executor:
query_execution_params:
- id: 1
hive_docker_version: worker8-hive:0.1.109-8478
hive_configurations:
td.hive.jfr.enabled: "true"
mapreduce.map.java.opts: "-XX:-Inline"
Query Simulator
23
Queries to be tested
TD Hive version
Enable sampling
We can try any parameters in an isolated environment
- With Query Simulator, we collect stack traces with the given configurations
- Query Simulator is a tool to replay real queries in a secure environment
- Described in 『Journey of Migrating Millions of Queries on The Cloud』
- Our paper: https://arxiv.org/abs/2205.08664