This keynote talk was delivered with Li Shen, VP of Engineering at PingCAP, at Percona Live Europe 2018, to provide a technical overview of TiDB, new features in version 2.1, and multi-cloud deployment using a Kubernetes Operator.
(stateless SQL layer) • TiKV (distributed transactional key-value store on RocksDB) • TiSpark (Apache Spark plug-in to TiKV) • Open Sourced from Day 1 • GA 2.0: April 2018
to TiDB • Full data migration + binlog replication • Features: • Data migration job management • Monitoring system • Sharded table merge migration • Data transformation
dump -> TiDB cluster • Fast: • Can import > 100 GB / hour on production grade hardware • Full Import: • Import the entire database once from scratch; not incremental import • Suitable for migrating an existing database to a new TiDB cluster • TiDB Cluster: • Lightning interacts with TiDB’s internal structures • Only works with TiDB cluster
• Synchronizing downstream data platform in real-time to: • Hot backup • MySQL slave • OLAP data warehouses • Writing incremental backup files • Event subscription & trigger
Improve cross-DC data availability Hotspot scheduling • Automatically distribute hotspots to more idle instances Batch region-splits • Avoid large region formation under heavy write workloads 2X read performance improvement !24