Bulutta Yüksek Performanslı ve Verimli Sistem Tasarlama

May 18, 2020 Bulutta Yüksek Performanslı ve Verimli Sistem Tasarlama
Serkan ÖZAL

WHO AM I? • Founder & CTO @ Thundra •
Co-organizer of ◦ Cloud and Serverless Turkey • In serverless era since 4 years • PhD candidate @METU @serkan_ozal serkan-ozal

AGENDA • Metrics • Caching • I/O • Event Driven
(Async) Architecture • Replication • Recovery

METRICS

Metrics to Keep an Eye on [1] - Latency (avg,
p50, p90, p99, max) - Errors (HTTP 4XX, 5XX) - CPU - Memory - IO

Metrics to Keep an Eye on [2] - AWS Lambda
(Errors, Throttles, Conc. Exec, Iterator age) - AWS SQS (# Msg Visible, Age Of Oldest Msg) - AWS Kinesis (R/W Provisioned Throughput Exceeded, Iterator age) - AWS DynamoDB (Throttled Reqs, Consumed RW Capacity Units) - Redis (# Connections, Hit/Miss ratio, # Evictions, CPU, Memory)

CACHING

How to Cache? - Local cache - Invalidation - TTL
(Soft/Hard) & Expiration - Eviction - Versioning - Negative vs Positive Cache - Inline vs Side Caches - Thundering Herd & Request coalescing

How to I/O? - Async IO vs Non-Blocking IO (select/poll,
epoll, kqueue) - Connection reuse / Keep-alive - TCP No-Delay - HTTP/2 vs HTTP/1 - Rest vs gRPC - CDN

Redis - TCP-KeepAlive - Pipelining - MGET on cluster (custom
hashing by {}) - Disable RDB Snapshot / AOF - Disable disk swap - Be aware of command complexity

Elasticsearch - Evenly distributed sharding - Be careful when using
custom id and routing - Reindex / Force merge - Increase refresh interval - Use Bulk requests - Use optimum data types (text vs keyword) - Disable disk swap - Monitor by “profiling” API

EVENT DRIVEN ARCHITECTURE

How to be Event Driven? - Authenticate & validate (pre-process?)
immediately - Process later - Resilient to upstream service failures - Loosely coupled systems - Batch processing - Fallback instead of retry

REPLICATION

Why Replicate? - Data locality - Disaster recovery

RECOVERY

How to Recover? - Periodic (hourly, daily etc ...) snapshots
- Event logs - Replay events - Replicate

Thank you !

Bulutta Yüksek Performanslı ve Verimli Sistem Tasarlama

Bulutta Yüksek Performanslı ve Verimli Sistem Tasarlama

Serkan ÖZAL

Other Decks in Programming

Featured

Transcript

May 18, 2020 Bulutta Yüksek Performanslı ve Verimli Sistem Tasarlama

WHO AM I? • Founder & CTO @ Thundra •

AGENDA • Metrics • Caching • I/O • Event Driven

METRICS

Metrics to Keep an Eye on [1] - Latency (avg,

Metrics to Keep an Eye on [2] - AWS Lambda

CACHING

How to Cache? - Local cache - Invalidation - TTL

I/O

How to I/O? - Async IO vs Non-Blocking IO (select/poll,

Redis - TCP-KeepAlive - Pipelining - MGET on cluster (custom

Elasticsearch - Evenly distributed sharding - Be careful when using

EVENT DRIVEN ARCHITECTURE

How to be Event Driven? - Authenticate & validate (pre-process?)

REPLICATION

Why Replicate? - Data locality - Disaster recovery

RECOVERY

How to Recover? - Periodic (hourly, daily etc ...) snapshots

Thank you !