Staying in Sync: From Transactions to Streams

Slides from a talk given at QCon London on 7 March 2016.
http://martin.kleppmann.com/2016/03/07/qcon-london.html

Abstract:

For the very simplest applications, a single database is sufficient, and then life is pretty good. But as your application needs to do more, you often find that no single technology can do everything you need to do with your data. And so you end up having to combine several databases, caches, search indexes, message queues, analytics tools, machine learning systems, and so on, into a heterogeneous infrastructure…

Now you have a new problem: your data is stored in several different places, and if it changes in one place, you have to keep it in sync in the other places, too. It’s not too bad if all your systems are up and running smoothly, but what if some parts of your systems have failed, some are running slow, and some are running buggy code that was deployed by accident?

It’s not easy to keep data in sync across different systems in the face of failure. Distributed transactions and 2-phase commit have long been seen as the “correct” solution, but they are slow and have operational problems, and so many systems can’t afford to use them.

In this talk we’ll explore using event streams and Kafka for keeping data in sync across heterogeneous systems, and compare this approach to distributed transactions: what consistency guarantees can it offer, and how does it fare in the face of failure?

References:

1. Mahesh Balakrishnan, Dahlia Malkhi, Ted Wobber, et al.: “Tango: Distributed Data Structures over a Shared Log,” at 24th ACM Symposium on Operating Systems Principles (SOSP), pages 325–340, November 2013. http://research.microsoft.com/pubs/199947/Tango.pdf

2. Molly Bartlett Dishman and Martin Fowler: “Agile Architecture,” at O'Reilly Software Architecture Conference, March 2015. http://conferences.oreilly.com/software-architecture/sa2015/public/schedule/detail/40388

3. Shirshanka Das, Chavdar Botev, Kapil Surlaker, et al.: “All Aboard the Databus!,” at ACM Symposium on Cloud Computing (SoCC), October 2012. http://www.socc2012.org/s18-das.pdf

4. Pat Helland: “Life beyond Distributed Transactions: an Apostate’s Opinion,” at 3rd Biennial Conference on Innovative Data Systems Research (CIDR), pages 132–141, January 2007. http://www-db.cs.wisc.edu/cidr/cidr2007/papers/cidr07p15.pdf

5. Pat Helland: “Immutability Changes Everything,” at 7th Biennial Conference on Innovative Data Systems Research (CIDR), January 2015. http://www.cidrdb.org/cidr2015/Papers/CIDR15_Paper16.pdf

6. Martin Kleppmann: “Designing Data-Intensive Applications.” O’Reilly Media, to appear. http://dataintensive.net/

7. Jay Kreps: “I ♥︎ Logs.” O'Reilly Media, September 2014. http://shop.oreilly.com/product/0636920034339.do

8. Jay Kreps: “Putting Apache Kafka to use: A practical guide to building a stream data platform.” 25 February 2015. http://blog.confluent.io/2015/02/25/stream-data-platform-1/

9. Leslie Lamport: “Time, Clocks, and the Ordering of Events in a Distributed System,” Communications of the ACM, volume 21, number 7, pages 558–565, July 1978. http://research.microsoft.com/en-US/um/people/Lamport/pubs/time-clocks.pdf

10. Neha Narkhede: “Announcing Kafka Connect: Building large-scale low-latency data pipelines.” 18 February 2016. http://www.confluent.io/blog/announcing-kafka-connect-building-large-scale-low-latency-data-pipelines

11. Fred B Schneider: “Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial,” ACM Computing Surveys, volume 22, number 4, pages 299–319, December 1990. http://www.cs.cornell.edu/fbs/publications/smsurvey.pdf

12. Yogeshwer Sharma, Philippe Ajoux, Petchean Ang, et al.: “Wormhole: Reliable Pub-Sub to Support Geo-replicated Internet Services,” at 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI), May 2015. https://www.usenix.org/system/files/conference/nsdi15/nsdi15-paper-sharma.pdf

13. Martin Thompson: “Single Writer Principle.” 22 September 2011. http://mechanical-sympathy.blogspot.co.uk/2011/09/single-writer-principle.html

14. Vaughn Vernon: Implementing Domain-Driven Design. Addison-Wesley Professional, February 2013.

Martin Kleppmann

March 07, 2016

More Decks by Martin Kleppmann

See All by Martin Kleppmann

Mitigating geopolitical risks with local-first software and atproto

ept

510

Local-first software and geopolitical risk

ept

590

Collaborative text editing with Eg-walker: Better, faster, smaller

ept

1.8k

Byzantine Eventual Consistency and Local-first Access Control

ept

1.8k

The past, present, and future of local-first

ept

3.9k

Where local-first came from and where it's going

ept

4.9k

Byzantine fault tolerance for peer-to-peer collaboration

ept

1.6k

New algorithms for collaborative text editing

ept

1.9k

Creating local-first collaboration software with Automerge

ept

3.8k

Other Decks in Programming

See All in Programming

霧の中の代数的エフェクト

funnyycat

400

AIキャラアプリkaiwaの低遅延音声通話基盤をどう作ったか - AWS Gravitonで支える低遅延・低コストAI Agent基盤

mogamit

170

継続モナドとリアクティブプログラミング

yukikurage

610

「正の参照」と「負の導出」で組むハーネスエンジニアリング

cottpan

140

20260623_Loop Engineeringで自分の分身の問い合わせBotを作る

ryugen04

220

【やさしく解説設計編・中級 #1】一つの車に、運転手は一人　～ある倉庫システムの事例から～

panda728

PRO

180

型も通る、synthも通る、それでも危ない〜AIのCDKの権限とコストを機械で検証する〜 / It Passes Type Checks, It Passes Synth Checks, but It’s Still Risky — Automatically Verifying Permissions and Costs in AI’s CDK —

seike460

PRO

350

Embedded SREと共に達成した会員管理システムのAWS移行 - SRE NEXT 2026 ランチスポンサーセッション

niftycorp

PRO

2.6k

Terraform標準の組織で AWS CDKをどう使うか

mu7889yoon

280

Laravel Boostに学ぶ、AIにPHPを書かせる技術〜OSSの実装から蒸留するエージェント制御の王道〜

kentaroutakeda

460

Snowflake Summitでの新機能 CoCo / CoWork / snowflake-summit-2026-overall-what-new-coco

tatsuhiro

230

エンジニア向け会社紹介/Findy Company Profile

findyinc

360k

Featured

See All Featured

Optimising Largest Contentful Paint

csswizardry

3.8k

Navigating the moral maze — ethical principles for Al-driven product design

skipperchong

420

SEO in 2025: How to Prepare for the Future of Search

ipullrank

3.6k

The Impact of AI in SEO - AI Overviews June 2024 Edition

aleyda

1.1k

WCS-LA-2024

lcolladotor

710

Sam Torres - BigQuery for SEOs

techseoconnect

PRO

300

Conquering PDFs: document understanding beyond plain text

inesmontani

PRO

2.9k

Ethics towards AI in product and experience design

skipperchong

330

It's Worth the Effort

188

29k

Amusing Abliteration

ianozsvald

230

Digital Projects Gone Horribly Wrong (And the UX Pros Who Still Save the Day) - Dean Schuster

uxyall

StorybookのUI Testing Handbookを読んだ

zakiyama

6.8k

Transcript

None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
None
References (1) 1.  Mahesh Balakrishnan, Dahlia Malkhi, Ted Wobber, et
al.: “Tango: Distributed Data Structures over a Shared Log,” at 24th ACM Symposium on Operating Systems Principles (SOSP), pages 325–340, November 2013. http://research.microsoft.com/pubs/199947/Tango.pdf 2.  Molly Bartlett Dishman and Martin Fowler: “Agile Architecture,” at O'Reilly Software Architecture Conference, March 2015. http://conferences.oreilly.com/software-architecture/ sa2015/public/schedule/detail/40388 3.  Shirshanka Das, Chavdar Botev, Kapil Surlaker, et al.: “All Aboard the Databus!,” at ACM Symposium on Cloud Computing (SoCC), October 2012. http://www.socc2012.org/s18- das.pdf 4.  Pat Helland: “Life beyond Distributed Transactions: an Apostate’s Opinion,” at 3rd Biennial Conference on Innovative Data Systems Research (CIDR), pages 132–141, January 2007. http:// www-db.cs.wisc.edu/cidr/cidr2007/papers/cidr07p15.pdf 5.  Pat Helland: “Immutability Changes Everything,” at 7th Biennial Conference on Innovative Data Systems Research (CIDR), January 2015. http://www.cidrdb.org/cidr2015/Papers/ CIDR15_Paper16.pdf 6.  Martin Kleppmann: “Designing Data-Intensive Applications.” O’Reilly Media, to appear. http://dataintensive.net/ 7.  Jay Kreps: “I ♥︎ Logs.” O'Reilly Media, September 2014. http://shop.oreilly.com/product/ 0636920034339.do 8.  Jay Kreps: “Putting Apache Kafka to use: A practical guide to building a stream data platform.” 25 February 2015. http://blog.conﬂuent.io/2015/02/25/stream-data-platform-1/
References (2) 9.  Leslie Lamport: “Time, Clocks, and the Ordering
of Events in a Distributed System,” Communications of the ACM, volume 21, number 7, pages 558–565, July 1978. http:// research.microsoft.com/en-US/um/people/Lamport/pubs/time-clocks.pdf 10. Neha Narkhede: “Announcing Kafka Connect: Building large-scale low-latency data pipelines.” 18 February 2016. http://www.conﬂuent.io/blog/announcing-kafka-connect- building-large-scale-low-latency-data-pipelines 11. Fred B Schneider: “Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial,” ACM Computing Surveys, volume 22, number 4, pages 299–319, December 1990. http://www.cs.cornell.edu/fbs/publications/smsurvey.pdf 12. Yogeshwer Sharma, Philippe Ajoux, Petchean Ang, et al.: “Wormhole: Reliable Pub-Sub to Support Geo-replicated Internet Services,” at 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI), May 2015. https://www.usenix.org/system/ﬁles/ conference/nsdi15/nsdi15-paper-sharma.pdf 13. Martin Thompson: “Single Writer Principle.” 22 September 2011. http://mechanical- sympathy.blogspot.co.uk/2011/09/single-writer-principle.html 14. Vaughn Vernon: Implementing Domain-Driven Design. Addison-Wesley Professional, February 2013.
None

Staying in Sync: From Transactions to Streams

Staying in Sync: From Transactions to Streams

Martin Kleppmann

More Decks by Martin Kleppmann

Other Decks in Programming

Featured

Transcript

References (1) 1. Mahesh Balakrishnan, Dahlia Malkhi, Ted Wobber, et

References (2) 9. Leslie Lamport: “Time, Clocks, and the Ordering

References (1) 1.  Mahesh Balakrishnan, Dahlia Malkhi, Ted Wobber, et

References (2) 9.  Leslie Lamport: “Time, Clocks, and the Ordering