Tech Exeter Conference: Scaling clusters to tho...

Jacob Tomlinson

September 21, 2017

83

Tech Exeter Conference: Scaling clusters to thousands of servers in the cloud

In order to analyse the petabytes of data we have at the Met Office we need very large clusters of servers. However procuring these pieces of infrastructure takes months or even years of planning and large up-front capital expense.

In the Informatics Lab we have been exploring using scalable cloud infrastructure to create next generation data analysis clusters. In our latest prototype we used scalable resources from AWS along with a Python computation scheduler called Dask to create clusters with thousands of CPU cores on-demand. The cluster only exists for the time that we need it and then we can shut it down again, so we only pay for what we use.

Scaling to these levels takes a lot of thinking about. In order for everything to scale linearly you need to also scale your data access, monitoring, system configuration and everything else to avoid bottlenecks.

This talk will cover the practicalities of building these things, the pitfalls we found when crossing certain thresholds and the new challenges we face when working in this new paradigm.

Jacob Tomlinson

September 21, 2017

Tweet

More Decks by Jacob Tomlinson

See All by Jacob Tomlinson

Tech Exeter - Intro to Kubernetes 10 Year Update

0

25

Who Builds the PyData Ecosystem?

0

35

The Art of Wrangling Your GPU Python Environments

0

47

Getting science done with accelerated Python computing platforms

0

44

Dask on HPC in 2024 - Lightning Talk

0

58

GPU Acceleration in the PyData community

0

53

Dask on HPC in 2024

0

30

GPU Acceleration in the PyData community

0

34

When to rebuild things that already exist

0

36

Featured

See All Featured

GitHub's CSS Performance

1031

460k

Six Lessons from altMBA

28

3.9k

Building Flexible Design Systems

yeseniaperezcruz

328

39k

Keith and Marios Guide to Fast Websites

411

22k

Typedesign – Prime Four

42

2.7k

sergeychernyshev

32

1.1k

Helping Users Find Their Own Way: Creating Modern Search Experiences

29

2.8k

Documentation Writing (for coders)

73

5k

Agile that works and the tools we love

329

21k

The Invisible Side of Design

301

51k

個人開発の失敗を避けるイケてる考え方 / tips for indie hackers

110

19k

Mobile First: as difficult as doing things right

223

9.9k

Transcript