Offline-First: Architektur-Patterns & Implementierung für APIs & Backends

Thomas Hilzendegen @hilzendegent Consultant Manuel Rauber @ManuelRauber Consultant Architektur-Patterns &
Implementierung Ofﬂine First APIs & Backends

Thomas Hilzendegen Speakers Manuel Rauber [email protected] @hilzendegent [email protected] @manuelrauber https://manuel-rauber.com
Microsoft MVP

Timetable Time Doing 09:00 - 10:30 Part I 10:30 -
11:00 ☕ 11:00 - 12:30 Part II

• Progressive Web App • Ofﬂine availability • Ofﬂine synchronization
• Change tracking • Security considerations • Pitfalls Agenda

Ofﬂine… but why?

• Just because the phone has a connection, e.g. Edge,
it does not mean we are online • Depends on the use case which connection quality is required to determine, if the app is reliable online • Connection quality could be measured by the time an exclusive request takes • Duration < 150 ms: online • Duration !>=150 ms: bad connection What does “ofﬂine” mean?

• A lot of reasons to be ofﬂine • Traveling,
Train, Flights (bad or no signal) • Server is not available • Routing problems • Roaming • Costs Motivation

Demo Application

• A little application to manage boardgames • Angular, .NET
Core, MS SQL Server • Brownﬁeld application, started as a pure online application • As typical for any good demo: no security • Available on Azure: https://tt-boardist.azurewebsites.net • GitHub: https://github.com/thinktecture/thinktecture-boardist Demo Application “Thinktecture Boardist”

• Backend • .NET Core 3.0 • SQL Server •
Frontend • Angular 8.2.13 Demo Application Specs

Demo Application Database Scheme

Live Demo

Now let’s go ofﬂine!

Does it work in ﬂight mode? NOPE!

Progressive Web App

Live Demo

Does it work ofﬂine now? STILL NOPE!

Real ofﬂine availability

• ServiceWorker is only able to take data offline which
has been requested by the application • If all the URLs are known beforehand, the ServiceWorker could cache them all • Data which was not requested, is not available offline (no real offline synchronisation) • But what about … the Background Sync API? What’s the problem?

• Name is misleading • Does not offer any data
synchronisation possibilities • Just raises an event periodically or after a delay (optionally based on the network type) • It’s totally up to the developer what to do in the onsync event • Since the sync is done in the ServiceWorker, the page can be closed, the sync will be fulﬁlled anyway • It does not help anything with syncing your actual data! Background Sync API

Background Sync API

• Cookies (not meant for large data or binaries) •
Web Storage like session storage or local storage (not meant for large data or binaries) • IndexedDB • Cache Storage (based on request/response) Storage capabilities

User can clear anything at any time by “Remove temporary
internet ﬁles” or “Clear website data”

• Key-value database of the browser • Stores data permanently
• ServiceWorker and Web App share access to the same IndexedDB • Possibility of scenarios, where the ServiceWorker (or Web App) stores synchronised data in the IndexedDB and the Web App reads the data • Like everything based on the origin IndexedDB

IndexedDB Availability

• The standard API of IndexedDB is inconvenient to use
(lots of callback) • Dexie.js is a minimalistic wrapper for IndexedDB • Operations are promise-based • Near native performance (even for bulk inserts) • Open Source @ GitHub: https://github.com/dfahlander/Dexie.js IndexedDB API

Ofﬂine Sync

• Offline Sync means to download all data available to
client into a persistent offline storage, without the user having to explicitly request the data • Depending on the scenario, client can do CRUD on the offline data • Data will be synced back to the server, whenever a connection is possible Offline Sync Basics

• Online/offline recognition • Conflict management • Binary data •
Local changes • Deleting data • Update interval (incoming new data) • Error handling • Primary key generation Offline Sync Challenges

• Having Edge may not mean that we are online
• Being online could depend on several factors • Connection quality (Edge, 3G, 4G, 5G) • Connection speed (latency to your backend) • Reachability of backend systems (database, 3rd party system, storages) Ofﬂine Sync Challenges - Online/ofﬂine recognition

• Client needs to be online for write operations •
Locks the data, so no other client can overwrite it • Data stays locked, until the client either saves or discards changes • Last One (Write) Wins • Visual conflict management (diffing like in Git, SVN, etc.) Offline Sync Challenges - Conflict management

• Downloading large binary data may not be possible on
all platforms • A browser does not have a big enough storage for storing the data • Native apps (Cordova, Electron) could be a solution to directly access the platform’s file system • Uploading large binary data • Server needs to support chunk based upload with connection interruptions • Client needs permanent access to the binary file being uploaded, which could be hard in a browser-only scenario Offline Sync Challenges - Binary data

• User opens an edit form and makes changes to
data • Incoming sync from server would change the data the user is editing • Depending on your use-case, decide what to do • Inform the user about the data change? • Show a visual diff? • Override the local changed data? • “Do nothing” and override the server data on next sync Ofﬂine Sync Challenges - Local changes

• Mark “deleted” data with a ﬂag only • Use
a trigger to keep the deleted ID in a separate table • Client needs some info about “deleted” data • Simple list of IDs “deleted” since the last sync Ofﬂine Sync Challenges - Deleting data

• Decide when you need to sync which data •
Not all data needs to be synced in the same interval • An additional real time connection (WebSocket) could be established • The server can send a signal when some data needs to be synchronised Ofﬂine Sync Challenges - Update interval

• Show it, don’t hide them! • The user needs
to know that something has occurred which shouldn’t • If you can’t sync to the client anymore, you may need to re-sync everything • Depending on your use case, you may want to write all data in one transaction on the server to not have partial data written into the database Ofﬂine Sync Challenges - Error handling

Primary key generation

• Inserted ofﬂine data needs to be available in the
client immediately (e.g. the users reloads the application) • Behaves like “real synced data”, even if it was not written to server yet • Client is able to route to ofﬂine inserted data • Therefor it needs some kind of primary key/id to route to • Primary key can be generated on server-side and on client-side Primary key generation - challenge

• Server is responsible for generating the primary key •
Client will generate a temporary ofﬂine primary key • After syncing, the client needs to replace the temporary ofﬂine key with the server generated primary key • Allows to use sequential GUIDs on the server side (MSSQL) Primary key generation - server-side

• Client is responsible for generating the primary key (GUIDs
only) • Server inserts the client generated key into the database as the primary key • Server additionally needs to have a clustered key • Otherwise inserting a client non-sequential GUID will lead to a reclustering of the database, every time a client syncs new data Primary key generation - client-side

Change Tracking

• Possibilities to determine the client’s current state • Use
highest rowversion within result • Determine by result set or additional query • Result of @@DBTS • Returns the last used rowversion of the database • Result of MIN_ACTIVE_ROWVERSION() • Returns the lowest (active) rowversion of the database Change Tracking - MS SQL Server’s ROWVERSION

• If you do a million updates a second, every
second, the timestamp will wrap around in about 585000 years • Could be used for concurrency checks as well (see Conflict Management) • Before writing data into a row, you could check if it still as the client’s saved row version • If yes: data was not modified, saving is easily possible • If no: data was modified, decide per use case what to do! Change Tracking - MS SQL Server’s ROWVERSION

Change Tracking - Highest ROWVERSION within result Transaction #1 ID
ROWVERSION 1 0x01 2 0x02 … … 41 0x41 Transaction #2 ID ROWVERSION 1 0x01 2 0x02 … … 41 0x41 42 0x42 43 0x43 SELECT MAX(ROWVERSION) FROM TABLE Client’s state Last known rowversion 41 43 NOT RELIABLE!

• A later started transaction but committed ﬁrst results in
a queryable rowversion higher than the pending ones • Data will be lost (never seen by the client’s delta request) Change Tracking - Highest ROWVERSION within result

Change Tracking - Query @@DBTS Transaction #1 ID ROWVERSION 1
0x01 2 0x02 … … 41 0x41 Transaction #2 ID ROWVERSION 1 0x01 2 0x02 … … 41 0x41 42 0x42 43 0x43 SELECT @@DBTS Client’s state Last known rowversion 41 42 43 NOT RELIABLE!

• Value is database global • Running transaction updates the
value instantly (before commit or rollback) • Data will be lost (never seen by the client’s delta request) Change Tracking - Query @@DBTS

Change Tracking - Query MIN_ACTIVE_ROWVERSION() Transaction #1 ID ROWVERSION 1
0x01 2 0x02 … … 41 0x41 Transaction #2 ID ROWVERSION 1 0x01 2 0x02 … … 41 0x41 42 0x42 43 0x43 SELECT MIN_ACTIVE_ROWVERSION() Client’s state Last known rowversion 41 42 44 RELIABLE!

• Value is database global • Running transaction updates the
value instantly (before commit or rollback) • Data will be safe • Needs to be compared with greater or equal • May query data multiple times Change Tracking - Query MIN_ACTIVE_ROWVERSION()

Backend Preparation

• All syncable entities need to have rowversion column •
rowversion is updated by MS SQL Server automatically whenever the row is changed (created & updated) • For deleted entities • Either set a IsDeleted ﬂag to true (never delete any rows physically) • Or save the deleted IDs of the entities somewhere else (by trigger) Backend Preparation - MS SQL Server

• Use equivalents of rowversion and triggers • Manual implement
mechanism in business logic (error-prone!) • Update tracking column manually by incrementing a database global number (during one transaction!) • Will be very hard für multi-row updates/inserts • Manual implement mechanism in triggers (if available) Backend Preparation - Other Database Systems

Frontend Preparation

• Choose storage area for data (e.g. IndexedDB) • Write
all the code • Periodic data synchronization • Binary synchronization when data changes • Tracking of timestamps Frontend Preparation

Security

Typical token-based security Security - API level Browser Identity Provider
Web API

• User is only able to see data based on
his security level • Permissions • Roles • Policies • What happens, if the user permission change is leading to different data visible to the user? • What about data, which he does not see due to rights, but is connected to other data? • Remove frontend data after logout Security - Data Level

Pitfalls

• De-normalize the relational data (document style) • Results in
multiple rowversion for one entry (use most recent one) • Multiplies the data (more trafﬁc) • Consistent data for one entry • Keep the relations up to the frontend • Explicit rowversion for each entry • Partial consistency (related data may not be synced yet) Relational Data

• Keep some relational data and de-normalize some of it
• De-normalize many-to-many relations • Needs trigger or business logic to change main entry’s rowversion when relation changes • Partial consistency (related data may not be synced yet) Relational Data

• PWA helps taking offline the application, but not the
data • Online != Edge is available • Offline data may be “temporary” (“Remove temporary internet files”) • Use MIN_ACTIVE_ROWVERSION() and “greater or equal than” operator • Think about data level security (permissions, roles, etc.) • Think about conflict management • Depends heavily on your use case Summary

And… now? • Slides: https://speakerdeck.com/manuelrauber • Repository: https://github.com/thinktecture/thinktecture-boardist

Offline-First: Architektur-Patterns & Implement...

Offline-First: Architektur-Patterns & Implementierung für APIs & Backends

More Decks by Manuel Rauber

Other Decks in Programming

Featured

Transcript