Cassandra at Gowalla - Speaker Deck

Slide 1

Slide 1 text

Cassandra at Gowalla, A Retrospective Adam Keys Austin on Rails, November 2011 @therealadam, http://therealadam.com, http://github.com/therealadam Tuesday, November 29, 11

Slide 2

Slide 2 text

How and why does Gowalla use Cassandra? Tuesday, November 29, 11

Slide 3

Slide 3 text

Why Cassandra? Tuesday, November 29, 11 Applications that have stable access patterns. High-velocity data growth. Time-oriented data access. Dynamo-style operation.

Slide 4

Slide 4 text

Why not Cassandra? Tuesday, November 29, 11 Prototypes, getting things off the ground. Applications that change their query patterns often. Applications that don’t grow data quickly.

Slide 5

Slide 5 text

Audit https://github.com/therealadam/audit Tuesday, November 29, 11 Store AR change data to Cassandra. Our training-wheels trial project. Incrementally deployed using rollout and degrade. Worked well, so we proceeded.

Slide 6

Slide 6 text

Chronologic https://github.com/gowalla/chronologic/ Tuesday, November 29, 11 Activity feeds stored in Cassandra. Started off as a secondary index cache, but became a system of record. Works pretty well, but the query/access model didn’t always jive with how web developers expected to access data.

Slide 7

Slide 7 text

Active stories Tuesday, November 29, 11 Store “joinability” data for users at a spot so we can pre-merge stories. Built and integrated in one pull request a few weeks before launch. Has worked pretty well.

Slide 8

Slide 8 text

Social graph caches Tuesday, November 29, 11 Store friends from other systems so we can quickly list/suggest friends. This started life on Redis, but the data was growing too quickly. We decoupled it from Redis and wrote a Cassandra backend. We incrementally deployed it and got Redis out of the picture within two weeks. That was cool.

Slide 9

Slide 9 text

What worked? Tuesday, November 29, 11

Slide 10

Slide 10 text

Stable on launch Tuesday, November 29, 11 A couple weeks before launch, I switched to “devlops” mode. Along with Adam McManus, our ops guy, we focused on tuning Cassandra for better read performance and to resolve stability problems. We ended up bringing in a DataStax consultant to help us verify we were doing the right things with Cassandra. The result of this was that, at launch, our cluster held up well and we didn’t have any Cassandra-related problems.

Slide 11

Slide 11 text

Easy to tune Tuesday, November 29, 11 I found Cassandra interesting and easy to tune. There is a little bit of upfront research in ﬁguring out exactly what the knobs mean and what the reporting tools are saying. Once I ﬁgured that out, it was easy to iteratively tweak things and see if they were having a positive effect on the performance of our cluster.

Slide 12

Slide 12 text

Time-series or semi-granular data Tuesday, November 29, 11 Of the databases I’ve tinkered with, Cassandra stands out in terms of modeling time-related data. If an application is going to pull data in time-order most of the time, Cassandra is a really great place to start. I also like the column-oriented data model. It’s great if you mostly need a key-value store, but occasionally need a key-key-value store.

Slide 13

Slide 13 text

What didn’t work Tuesday, November 29, 11

Slide 14

Slide 14 text

Developer localhost setups Tuesday, November 29, 11 We started using Cassandra in the 0.6 release, when it was a giant pain to set up locally (XML conﬁgs). It’s better now, but I should have put more energy into helping the other developers on our team getting Cassandra up and working properly. If I were to do it again, I’d probably look into leaning on the install scripts the cassandra gem includes, rather than Homebrew and a myriad of scripts to hack the Cassandra conﬁg.

Slide 15

Slide 15 text

Eventual consistency, magic database voodoo Tuesday, November 29, 11 Cassandra does not work like MySQL or Redis. It has different design constraints and a relatively unique approach to those constraints. In advocating and explaining Cassandra, I think I pitched too much as a database nerd and not enough as “here’s a great tool that can help us solve some problems”. I hope that CQL makes it easier to put Cassandra in front of non-database nerds in terms that they can easily relate to and immediately ﬁnd productivity.

Slide 16

Slide 16 text

Rigid query model Tuesday, November 29, 11 Once we got several million rows of data into Cassandra, we found it difficult to quickly change how we represented that data. It became a game of “how can we incrementally rejigger this data structure to have these other properties we just ﬁgured out we want?” I’m not sure that’s a game you can easily win at with Cassandra. I’d love to read more about building evolvable data structures in Cassandra and see how people are dealing with high- volume, evolving data.

Slide 17

Slide 17 text

Things to try Tuesday, November 29, 11

Slide 18

Slide 18 text

More like a hash, less like a database Tuesday, November 29, 11 Having developed a database-like thing, I have come to the conclusion that developers really don’t like them very much. AR was hugely successful because it was so much more effective than anything previous to it that tried to make databases just go away. The closer a database is to one of the native data structures in the host language, the better. If it’s not a native data structure, it should be something they can create in a REPL and then say “magically save this for me!”

Slide 19

Slide 19 text

Better tools and automation Tuesday, November 29, 11 That said, every abstraction leaks. Once it does, developers want simple and useful tools that let them ﬁgure out what’s going on, what the data really looks like, tinker with it, and get back to their abstracted world as quickly as possible. This starts with tools for setting up the database, continues through interacting with it (database REPL), and for operating it (logging, introspection, etc.) Cassandra does pretty well with these tools, but they’re still a bit nerdy.

Slide 20

Slide 20 text

Moar indexes Tuesday, November 29, 11 We didn’t design our applications to use secondary indexes (a great feature) because they didn’t exist just yet. I should have spent more time integrating this into the design of our services. We got bit a lot towards the end of our release cycle because we were building all of our indexes in the application and hadn’t designed for reverse indexes. We also designed a rather coarse schema, which further complicated ad-hoc querying, which is another thing non-database-nerds love.

Slide 21

Slide 21 text

Thanks! http://speakerdeck.com/u/therealadam http://weblog.therealadam.com/ Tuesday, November 29, 11