years of industry experience • Specialties ✓ Virtualization ✓ Data Storage ✓ Disaster Recovery • Personal ✓ Live on a houseboat ✓ Love to travel ✓ Excited to be in Paris! About Me.
listings • More than 174 thousand destinations • Over 220 countries and territories • Website and support in 45 languages • 100k+ servers, VMs, and containers • 3 Availability Zones in Central/Western Europe
servers • In production since July 2025 • Replacing Bare Metal servers • Replacing OpenStack virtual machines • One of the fastest growing infrastructure services in Booking's history What is BC2?
on AWS EC2 • Jan 2026: Exceeded count on Bare Metal • Mar 2026: Exceeded count on OpenStack • EOY 2026: 100% BC2 On-Prem ✓ ~10,000 Instances ✓ ~1,000 Hypervisors
on-prem services • How should disaster recovery work? • Vanilla Linux + KVM offers little guidance here • Off the shelf data protection solutions are still VMware focused • What to do??? The Problem - Disaster Recovery.
VM images on NFS ✓ Snapshot NFS every 10 minutes ✓ Make snapshots immutable • Hypervisors ✓ Leverage LVM snapshots ✓ Image the boot drive to NFS ✓ Make the image immutable
Version 3 (Jun 1995) is 30 years old • Version 4 (Dec 2000) is merely 25 • Version 4.1 (Jan 2010) just turned 16 • Version 4.2 (Nov 2016) is almost 10 "If I'd known I was going to live this long, I'd have taken better care of myself.” - Eubie Blake
appliances • Broad hypervisor support (including KVM) • Compliance grade immutability • Snapshots • Deduplication • Malware Scanning • Ransomware Detection "Had I known what you were going to do with it I would never have invented it!" - Sir Robert Watson-Watt (RADAR Pioneer)
every 10 minutes • A sparse image identical to the boot drive is created on NFS • Partitions and volumes are cloned with ~1 second granularity • A map of the resulting sparse file is generated for fast restore • Compliance mode immutability is set on the output files Hypervisor Boot Drive ✓ Partition Table ✓ Boot Partition ✓ LVM Volumes NFS Appliance DR Image ✓ Image File ✓ Map File ✓ Log File NFS v3 NFS v4.2
Boot partitions are small ✓ Copies take < 1 second • LVM snapshots are fast ✓ Snaps take < 1 second for all volumes ✓ Snaps are deleted once copied • Point-in-time granularity of ~1 second for each boot drive snapshot efi partition 200M boot partition 1000M / (root_snap) 64G /home (home_snap) 64G /var (var_snap) 64G /var/log (log_snap) 64G /tmp (tmp_snap) 16G Boot Drive
lseek() ✓ SEEK_HOLE ✓ SEEK_DATA • Without these parameters, generating a sparse map is not possible (holes and data look the same). • Hypervisor images contain thousands of holes (~99% of a 1.9T boot drive).
is still a workhorse • NFSv4 is powerful but needs care • POSIX APIs and tooling still solve problems that have not been solved elsewhere • Disaster recovery without expensive proprietary software is possible! Lessons Learned.
✓ Per-file invocation (atime) ✓ Compliance mode is not optional Call to Action! NFS Vendors Virtualization Vendors / Cloud Providers • Support open source backup and restore • Many large gaps vs. commercial tooling
- Dependency The Problem - Disaster Recovery Wikimedia Commons - EU, Copernicus Sentinel-2 imagery The Solution - Immutable Snapshots Wikimedia Commons - 1930s Leica advert in Time magazine About NFS Wikimedia Commons - U.S. Library of Congress Why NFS in 2026? Imperial War Museum - Catalogue number CH 15337 Why NFS 4.2? Wikimedia Commons - Cat Chapman Lessons Learned Booking.com Media Library - Hufton+Crow / UNStudio Call to Action Wikimedia Commons - Gallard No Attribution Peter Buschman - Personal Library Sidebar Image Credits.