Slide 1

How to Scale in the Cloud
Chargeback is Back, Baby!

Neil J. Gunther @DrQz
Performance Dynamics

Rocky Mountain CMG, Denver, Colorado
December 5, 2019

© 2019 Performance Dynamics | How to Scale in the Cloud | December 6, 2019 | 1 / 38

Slide 2

Everything old is new again

Abstract: The need for system administrators—especially Linux sys admins—to do performance management has returned with a vengeance. Why? The cloud. Resource consumption in the cloud is all about run now, pay later¹ (AKA chargeback² in mainframe-ese). This talk will show you how performance models can help to find the most cost-effective deployment of your applications on Amazon Web Services (AWS). The same technique should be transferable to other cloud services.

1. Chargeback disappeared with the arrival of the PC revolution and the advent of distributed client-server architectures.
2. Chargeback underpins the cloud business model, especially when it comes to the development of hot applications, e.g., "Microsoft wants every developer to be an AI developer, which would help its already booming Azure Cloud business do better still: AI demands data, which requires cloud processing power and generates bills." —The Register, May 2018

Slide 3

Previous work
1. Joint work with Mohit Chawla, Senior Systems Engineer, Hamburg, Germany
   - First foray into modeling cloud applications with PDQ
   - Period from June 2016 to April 2018
   - First validated cloud queueing model (AFAIK)
2. Presented jointly at CMG cloudXchange, July 2018
3. Published in Linux Magazin, February 2019 (in German); English manuscript available on arXiv.org

Slide 4

AWS cloud environment

Outline
1. AWS cloud environment
2. Production data acquisition
3. Initial scaling model
4. Corrected scaling model
5. AWS auto-scaling costs
6. Cloudy economics

Slide 5

AWS cloud environment: AWS Cloud Application Platform

- Entire application runs in the Amazon cloud
- Elastic Load Balancer (ELB)
- AWS Elastic Compute Cloud (EC2) instance type m4.10xlarge: 20 CPUs = 40 vCPUs
- Auto Scaling group (A/S)
- Mobile users make requests to Apache HTTP server³ via ELB on EC2
- Tomcat thread server⁴ on EC2 calls external services (i.e., 3rd-party web servers)
- A/S controls the number of active EC2 instances based on incoming traffic and configured policies
- ELB balances incoming traffic across all active EC2 nodes in the AWS cluster

3. Versions 2.2 and 2.4
4. Versions 7 and 8

Slide 6

AWS cloud environment: Request Processing Workflow

On a single EC2 instance:
1. Incoming HTTP Request from a mobile user is processed by Apache + Tomcat
2. Tomcat then sends multiple requests to External Services based on the original request
3. External services respond and Tomcat computes business logic based on all those Responses
4. Tomcat sends the final Response back to the originating mobile user

Slide 7

Production data acquisition

Outline
1. AWS cloud environment
2. Production data acquisition
3. Initial scaling model
4. Corrected scaling model
5. AWS auto-scaling costs
6. Cloudy economics

Slide 8

Production data acquisition: Data Tools and Scripts

- JMX (Java Management Extensions) data from the JVM: jmxterm, VisualVM, Java Mission Control, Datadog dd-agent
- Datadog — also integrates with AWS CloudWatch metrics
- Collectd — Linux performance data collection
- Graphite and statsd — application metrics collection & storage
- Grafana — time-series data plotting
- Custom data collection scripts by M. Chawla
- R statistical libs and RStudio IDE
- PDQ performance modeling tool by NJG

Slide 9

Production data acquisition: Distilled EC2 Instance Data

These few perf metrics are sufficient to parameterize our PDQ model:

Timestamp, Xdat, Nest, Sest, Rdat, Udat
1486771200000, 502.171674, 170.266663, 0.000912, 0.336740, 0.458120
1486771500000, 494.403035, 175.375000, 0.001043, 0.355975, 0.515420
1486771800000, 509.541751, 188.866669, 0.000885, 0.360924, 0.450980
1486772100000, 507.089094, 188.437500, 0.000910, 0.367479, 0.461700
1486772400000, 532.803039, 191.466660, 0.000880, 0.362905, 0.468860
1486772700000, 528.587722, 201.187500, 0.000914, 0.366283, 0.483160
1486773000000, 533.439054, 202.600006, 0.000892, 0.378207, 0.476080
1486773300000, 531.708059, 208.187500, 0.000909, 0.392556, 0.483160
1486773600000, 532.693783, 203.266663, 0.000894, 0.379749, 0.476020
1486773900000, 519.748550, 200.937500, 0.000895, 0.381078, 0.465260
...

- Interval between Unix Timestamp rows is 300 seconds
- Little's law (LL) gives relationships between the above metrics:
  1. Nest = Xdat × Rdat: macroscopic LL ⟹ thread concurrency
  2. Udat = Xdat × Sest: microscopic LL ⟹ resource service times
- LL provides a consistency check of the data
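The Little's law consistency check is one line of arithmetic per row; a minimal Python sketch over the first data row above:

```python
# Little's law consistency check on one distilled EC2 data row.
# Column meanings follow the table: Xdat (req/s), Nest (threads),
# Sest (s), Rdat (s), Udat (fractional CPU busy).
row = dict(Xdat=502.171674, Nest=170.266663, Sest=0.000912,
           Rdat=0.336740, Udat=0.458120)

# Macroscopic Little's law: N = X * R  (thread concurrency)
n_ll = row["Xdat"] * row["Rdat"]

# Microscopic Little's law: U = X * S  (resource service time)
u_ll = row["Xdat"] * row["Sest"]

print(f"N = X*R = {n_ll:.2f}   vs measured Nest = {row['Nest']:.2f}")
print(f"U = X*S = {u_ll:.4f} vs measured Udat = {row['Udat']:.4f}")
```

Both derived values land within about 1% of the measured columns, which is the sanity check the slide describes.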

Slide 10

Initial scaling model

Outline
1. AWS cloud environment
2. Production data acquisition
3. Initial scaling model
4. Corrected scaling model
5. AWS auto-scaling costs
6. Cloudy economics

Slide 11

Initial scaling model: Usual Time Series (Monitoring) View

- Our brains are not built for this
- Want the best impedance match for cognitive processing⁵

5. "Seeing It All at Once with Barry," Gunther and Jauvin, CMG 2007 (PDF)

Slide 12

Initial scaling model: Time-Independent (Steady State) View

[Figures: canonical closed-queue throughput X(N) and canonical closed-queue latency R(N)]

Queueing theory with finite requests tells us what to expect:
- Relationships between metrics, e.g., X and N
- Number of requests seen in our daily data: N ≈ 500
- Throughput X approaches a saturation ceiling as N → 500 (concave)
- Response time R grows linearly as a "hockey stick handle" (convex)
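The canonical concave X(N) and hockey-stick R(N) shapes can be generated with exact mean-value analysis (MVA) for a closed queue plus a delay stage. The S and Z values below are illustrative only; a nonzero think time Z is used purely so the ramp-up region is visible (the production model has Z = 0):

```python
# Exact MVA for a closed network: one queueing station (service time S)
# plus a delay (think) stage Z. Shows the concave throughput ceiling
# and the linear response-time "hockey stick" past the knee.
def mva(S, Z, N):
    X, R = 0.0, 0.0
    for n in range(1, N + 1):
        R = S * (1.0 + X * R)   # arrival theorem: X*R = mean queue at N-1
        X = n / (R + Z)         # Little's law over the whole circuit
    return X, R

S, Z = 0.001, 0.1               # 1 ms service, 100 ms think (illustrative)
for n in (1, 50, 101, 500):
    X, R = mva(S, Z, n)
    print(f"N={n:4d}  X={X:7.1f} req/s  R={R*1000:8.2f} ms")
```

The knee sits at N* = (S + Z)/S = 101 here: below it X grows almost linearly, above it X flattens at 1/S = 1000 req/s while R grows linearly with N.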

Slide 13

Initial scaling model: Production X-N Data, July 2016

[Plot: Production Data July 2016 — throughput (req/s) vs. concurrent users]

Slide 14

Initial scaling model: PDQ Model of Throughput X(N)

[Plot: PDQ Model of Production Data July 2016 — throughput (req/s) vs. concurrent users; Data vs. PDQ; annotations: Nopt = 174.5367, thrds = 250.00]

Slide 15

Initial scaling model: PDQ Model of Response Time R(N)

[Plot: PDQ Model of Production Data July 2016 — response time (s) vs. concurrent users; Data vs. PDQ; annotations: Nopt = 174.5367, thrds = 250.00]

Slide 16

Initial scaling model: PDQ (Closed) Queueing Model

[Diagram: closed queue with population N, think time Z, service time S, response time R(N), throughput X(N), arrival rate λ(N)]

- Finite N mobile user-requests; think time Z = 0 for mobile
- Only 1 service time measured: Tomcat on CPU, S = 0.8 ms
- Only 1 queue is definable: it represents the Tomcat server on an EC2 instance
- λ(N): mean request rate; R(N): Tomcat response time; X(N): Tomcat throughput
- X(N) = λ(N) in steady state

Slide 17

Initial scaling model: PDQ (Closed) Queueing Model (cont.)

(Same model and bullet points as the previous slide.)

Erm ... except there's a small problem ...

Slide 18

Initial scaling model: Dummy Queues

A single request (N = 1) takes about 1 ms. But the first data point in the plot occurs at Nest = 133.8338 (see Slide 15):

Nest      Xdat      Sest     Rdat     Udat
133.8338  416.4605  0.00088  0.32136  0.36642

for which R = 321.36 ms. The precise service time comes from LL: Sest = Udat / Xdat

> 0.36642 / 416.4605
[1] 0.0008798433

Sest = 0.0008798433 s = 0.8798433 ms ≈ 1 ms. Using the linear hockey-handle characteristic (Z = 0):

> 321.36 - (0.8798433 * 133.8338)
[1] 203.6072

the model underestimates the measured 321.36 ms by 203.6 ms. Compensate with ≈ 200 dummy queues, each with S ≈ 1 ms.
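The slide's R-console arithmetic can be reproduced directly; a short Python sketch using the measured values quoted above:

```python
# Reproduce the dummy-queue sizing arithmetic from the first
# measured data point (values from the table above).
Nest, Xdat, Rdat, Udat = 133.8338, 416.4605, 0.32136, 0.36642

# Precise service time from the microscopic Little's law U = X*S
Sest = Udat / Xdat                    # ~0.00088 s

# Linear hockey-handle prediction with Z = 0: R(N) ~ N * S,
# so the residual is the latency the single-queue model cannot explain.
residual = Rdat - Sest * Nest

print(f"Sest     = {Sest*1000:.4f} ms")
print(f"residual = {residual*1000:.1f} ms")  # ~200 ms -> ~200 dummy queues of ~1 ms
```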

Slide 19

Initial scaling model: 3-Tier Model (CMG 2001)

Similar model in Chap. 12 of my Perl::PDQ book.

[Diagram: N clients (Z = 0 ms) sending requests through Web Server (Dws), App Server (Das), and DBMS Server (Ddb), plus dummy servers; responses return to the clients]

- Based on CMG 2001 Best Paper by Buch & Pentkovski (Intel Corp.)
- Tricky: dummy queues cannot have S > Sbottleneck
- (The meaning of these dummy latencies was never resolved)

Slide 20

Initial scaling model: Outstanding Issues

PDQ Tomcat model fits the data visually, but ...
- Need ~200 dummy queues to get the correct Rmin. What do they represent in the actual Tomcat server?
- Service time Sest ≈ 0.001 s = 1 ms (from the table on Slide 9); CPU time derived from Udat = ρCPU in /proc
- Oh yeah, and what about those external service times?

Hypotheses:
(a) Successive polling (visits) to external services? (MC)
(b) Some kind of hidden parallelism? (NJG) ... see Slide 23

Slide 21

Initial scaling model: Outstanding Issues (cont.)

(Same issues as the previous slide.)

All this remained unresolved until ...

Slide 22

Initial scaling model: New Data Breaks PDQ Model

Guerrilla mantra 1.16: Data comes from the Devil, only models come from God.

(except when they don't)

Slide 23

Corrected scaling model

Outline
1. AWS cloud environment
2. Production data acquisition
3. Initial scaling model
4. Corrected scaling model
5. AWS auto-scaling costs
6. Cloudy economics

Slide 24

Corrected scaling model: Surprisingly ... Less (Data) Is Better

Too much initial data clouded⁶ the actual scaling behavior.

6. See what I did there?

Slide 25

Corrected scaling model: Hypothesis (b) ... Backwards⁷

Data vs. corrected PDQ model:

Date          Pricing  Nknee†  NA/S  Rmin (ms)  Rmin PDQ‡ (ms)  Xmax (req/s)  Xmax PDQ‡ (req/s)  Savings
October 2016  Sched    300     300   444.41     411.62 ± 7.36   675.05        651.54 ± 3.66      Approx. 10%
March 2018    Spot     254     254   203.60     199.36 ± 1.48   1247.54       1192.03 ± 8.54     Approx. 90%

† Nknee is an input parameter to the PDQ model
‡ Corrected PDQ model

Parallel Is Just Fast Serial

From the standpoint of queueing theory, parallel processing can be regarded as a form of fast serial processing. The left side of the diagram shows a pair of parallel queues, where requests arriving from outside at rate λ are split equally, so each queue sees a reduced arrival rate λ/2. Assume λ = 0.5 requests/second and S = 1 second. When a request joins the tail of one of the parallel waiting lines, its expected time to get through that queue (waiting + service) is given by equation (1) in Berechenbare Performance [9], namely:

    Tpara = S / (1 − (λ/2) S) = 1.33 seconds    (1)

The right side of the diagram shows two queues in tandem, each twice as fast (S/2) as a parallel queue. Since the arrival flow is not split, the expected time to get through both queues is the sum of the times spent in each queue:

    Tserial = (S/2) / (1 − λ(S/2)) + (S/2) / (1 − λ(S/2)) = S / (1 − λ(S/2)) = 1.33 seconds    (2)

Tserial in equation (2) is identical to Tpara in equation (1). Conversely, multi-stage serial processing can be transformed into an equivalent form of parallel processing [6, 8]. This insight helped identify the "hidden parallelism" in the July and October 2016 performance data that led to the correction of the initial PDQ Tomcat model.

7. Inspired by a CMG 1993 paper, I developed an algorithm to solve parallel queues in the PDQ analyzer circa 1994, based on my observation above, and used it in my 1998 book The Practical Performance Analyst.
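Equations (1) and (2) can be verified numerically with the example values λ = 0.5 req/s and S = 1 s; a quick Python check:

```python
# Two parallel M/M/1 queues (each fed lam/2) vs. two tandem M/M/1
# queues that are each twice as fast (S/2). Values from the slide.
lam, S = 0.5, 1.0

T_para = S / (1.0 - (lam / 2.0) * S)                  # equation (1)
T_serial = 2.0 * (S / 2.0) / (1.0 - lam * (S / 2.0))  # equation (2)

print(f"T_para   = {T_para:.4f} s")    # 1.3333 s
print(f"T_serial = {T_serial:.4f} s")  # 1.3333 s
```

Both come out to 4/3 s, confirming that the split-flow parallel pair and the unsplit fast-serial tandem are latency-equivalent.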

Slide 26

Corrected scaling model: Parallel PDQ Model of Throughput X(N)

[Plot: PDQ Model of Oct 2016 Data — throughput (req/s) vs. concurrent users; corrected PDQ model (blue dots)]

Slide 27

Corrected scaling model: Parallel PDQ Model of Response Time R(N)

[Plot: PDQ Model of Oct 2016 Data — response time (s) vs. concurrent users; Data vs. PDQ; corrected PDQ model (blue dots)]

Slide 28

Corrected scaling model: Parallel PDQ Model

[Diagram: closed network with population N, think time Z, a single waiting line W feeding m parallel servers, each with service time S; throughput X(N), response time R(N), arrival rate λ(N)]

Key differences:
- Rmin dominated by time inside external services
- True service time is Rmin: S = 444.4 ms (not CPU)
- Tomcat threads are now parallel service facilities
- Single waiting line (W) produces the hockey handle
- Like every Fry's customer waiting in a single line for their own cashier
- But where is W located in the EC2 system? (still unresolved)
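The corrected model is a closed PDQ network, but its structure (one waiting line feeding m parallel servers) behaves like the textbook M/M/m queue. A rough open-queue sketch using the Erlang-C formula; the arrival rate is an assumed illustration, with S taken as the 444.4 ms external-service time and m = 300 threads:

```python
def erlang_c(a, m):
    """P(wait) for an M/M/m queue with offered load a = lam*S Erlangs.
    Uses the iterative Erlang-B recursion to avoid factorial overflow."""
    b = 1.0
    for k in range(1, m + 1):
        b = a * b / (k + a * b)
    rho = a / m
    return b / (1.0 - rho + rho * b)

def mmm_response(lam, S, m):
    """Mean response time (wait + service) of an M/M/m queue."""
    a = lam * S
    rho = a / m
    assert rho < 1.0, "queue is unstable"
    Wq = erlang_c(a, m) * S / (m * (1.0 - rho))  # wait in the single line
    return S + Wq

# Illustrative numbers only: lam is made up, S and m echo the slide.
print(mmm_response(lam=600.0, S=0.4444, m=300))
```

With m large, the waiting-line term stays tiny until utilization approaches 1, which is why Rmin is dominated by the service (external-call) time, exactly the "hockey handle" behavior described above.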

Slide 29

Corrected scaling model: PDQ Numerical Validation 2016

[Plots: PDQ Model of Oct 2016 Data — throughput (req/s) and response time (s) vs. concurrent users; Data vs. PDQ]

Slide 30

Corrected scaling model: Auto-Scaling Knee and Pseudo-Saturation

[Plot: PDQ Model of Oct 2016 Data — throughput (req/s) vs. concurrent users, with a vertical line at the knee]

- A/S policy triggered when instance CPU busy > 75%
- Induces pseudo-saturation at Nknee = 300 threads (vertical line)
- No additional Tomcat threads invoked above Nknee in this instance
- A/S spins up additional new EC2 instances (elastic capacity)

Slide 31

AWS auto-scaling costs

Outline
1. AWS cloud environment
2. Production data acquisition
3. Initial scaling model
4. Corrected scaling model
5. AWS auto-scaling costs
6. Cloudy economics

Slide 32

AWS auto-scaling costs: AWS Scheduled Scaling

- A/S policy threshold: CPU > 75%
- Additional EC2 instances require up to 10 minutes to spin up
- Based on the PDQ model, considered pre-emptive (clock-based) scheduling of EC2s
- Cheaper than A/S, but only 10% savings
- Use N service threads to size the number of EC2 instances required for incoming traffic
- Removes expected spikes in latency and traffic (seen in time series analysis)
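Sizing by concurrency N, as the bullet above suggests, can be sketched as follows. N_KNEE echoes the PDQ model's knee; the forecast values and the 10% headroom are hypothetical choices for illustration:

```python
from math import ceil

N_KNEE = 300   # threads per EC2 instance before pseudo-saturation

def instances_needed(n_forecast, headroom=0.10):
    """EC2 instances required for a forecast concurrency level,
    with a safety headroom above the auto-scaling knee."""
    return ceil(n_forecast * (1.0 + headroom) / N_KNEE)

# Hypothetical scheduled-scaling plan for three forecast levels:
for n in (150, 450, 900):
    print(f"N = {n:4d} concurrent requests -> {instances_needed(n)} instance(s)")
```

Driving the schedule from forecast N rather than reactive CPU avoids waiting out the ~10-minute spin-up lag after a traffic spike has already arrived.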

Slide 33

AWS auto-scaling costs: AWS Spot Pricing

- Spot instances offer up to a 90% discount on on-demand pricing
- Challenging to diversify instance types and sizes across the same group, e.g.:
  - Default instance type is m4.10xlarge
  - Spot market only has the smaller m4.2xlarge type
  - This forces manual reconfiguration of the application
- CPU (ρ), latency (R), traffic (λ) are no longer useful metrics for the A/S policy
- Instead, use concurrency N as the primary metric in the A/S policy

Slide 34

AWS auto-scaling costs: PDQ Numerical Validation 2018 (cf. Slide 27)

[Plots: PDQ Model of Prod Data Mar 2018 — throughput (req/sec) and response time (s) vs. concurrent users; annotations: Rmin = 0.2236, Xknee = 1137.65, Nknee = 254.35]

Slide 35

AWS auto-scaling costs: Performance Improvements 2016–2018

[Plots: 2016 and 2018 daily user profiles — user requests (N) vs. UTC time (hours)]

- Typical numero uno traffic profile discussed in my GCAP performance class
- Increasing cost-effective performance:

  Date      Rmin (ms)  Xmax (RPS)  NA/S
  Jul 2016  394.1      761.23      350
  Oct 2016  444.4      675.07      300
  ...       ...        ...         ...
  Mar 2018  223.6      1135.96     254

- Less variation in X and R due to improved traffic design

Slide 36

Cloudy economics

Outline
1. AWS cloud environment
2. Production data acquisition
3. Initial scaling model
4. Corrected scaling model
5. AWS auto-scaling costs
6. Cloudy economics

Slide 37

Cloudy economics: EC2 Instance Pricing

[Diagram: instance capacity lines⁸ over time — reserved instances (lower-risk capex), on-demand instances, spot instances (higher-risk capex), a max capacity line, and the "missed revenue?" gap]

This is how AWS sees their own infrastructure capacity.

8. J.D. Mills, "Amazon Lambda and the Transition to Cloud 2.0", SF Bay ACM meetup, May 16, 2018

Slide 38

Cloudy economics: Name of the Game is Chargeback

Google Compute Engine also offers reserved and spot pricing.

Table 1: Google VM per-hour pricing⁹

Machine      vCPUs  RAM (GB)  Price ($)  Preempt ($)
n1-umem-40   40     938       6.3039     1.3311
n1-umem-80   80     1922      12.6078    2.6622
n1-umem-96   96     1433      10.6740    2.2600
n1-umem-160  160    3844      25.2156    5.3244

Capacity planning has not gone away because of the cloud:
- "Capacity Planning For The Cloud: A New Way Of Thinking Needed" (DZone, April 25, 2018)
- "Cloud Capacity Management" (DZone, July 10, 2018)

9. TechCrunch, May 2018
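The chargeback arithmetic behind Table 1 is easy to check in a short script. The row values are copied from the table; the per-vCPU rates and preemptible discounts are derived here, not quoted prices:

```python
# Per-hour pricing rows from Table 1:
# (machine, vCPUs, on-demand $, preemptible $)
rows = [
    ("n1-umem-40",   40,  6.3039, 1.3311),
    ("n1-umem-80",   80, 12.6078, 2.6622),
    ("n1-umem-96",   96, 10.6740, 2.2600),
    ("n1-umem-160", 160, 25.2156, 5.3244),
]

for name, vcpus, od, pre in rows:
    per_vcpu = od / vcpus          # on-demand cost per vCPU-hour
    discount = 1.0 - pre / od      # preemptible discount vs. on-demand
    print(f"{name:12s} ${per_vcpu:.4f}/vCPU-hr  preemptible discount {discount:.0%}")
```

The derived discount is roughly 79% across the listed machine types, which is the kind of spot/preemptible economics the talk exploits on AWS.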

Slide 39

Cloudy economics: Summary

- Cloud services are more about economic benefit for the hosting company than they are about technological innovation for the consumer
- Old-fashioned chargeback is back! Incumbent on you, the customer, to minimize your own cloud services costs
- Evolving services: containers, microservices, "serverless" (e.g., AWS Lambda)
- Performance and capacity management have not gone away
- The PDQ Tomcat model is a relatively simple yet insightful example of a cloud sizing tool
- EC2 instance scalability was not a significant issue for this application
- You can pay LESS for MORE cloud performance!

Slide 40

Cloudy economics: Questions?

www.perfdynamics.com
Castro Valley, California

- Training — note the PDQ Workshop
- Blog, Twitter, Facebook
- [email protected] — outstanding questions
- +1-510-537-5758