- Stream processing
- Continuous computation
- Distributed Remote Procedure Calls

Just Works
- Great scripts
- Storm-deploy project for EC2

10 Thursday 24 May 12
submit Jar to Nimbus
- Nimbus distributes the code around the cluster

Use `storm` client to communicate
- only for remote clusters
- deploy new topologies
- kill topologies
- ...
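
As a rough sketch of what deploying and killing a topology with the `storm` client looks like, the helpers below only build the command lines (`storm jar <jar> <main class> [args]` and `storm kill <topology name>`) without running them. The jar path, class name, and topology name are hypothetical placeholders, and the helper function names are made up for this illustration.

```python
from typing import List


def storm_jar_command(jar_path: str, main_class: str, *topology_args: str) -> List[str]:
    """Build the argv for submitting a topology jar to Nimbus."""
    return ["storm", "jar", jar_path, main_class, *topology_args]


def storm_kill_command(topology_name: str) -> List[str]:
    """Build the argv for killing a running topology by name."""
    return ["storm", "kill", topology_name]


# Hypothetical jar, class, and topology name, for illustration only.
deploy = storm_jar_command("target/my-topology.jar", "com.example.MyTopology", "my-topology")
kill = storm_kill_command("my-topology")

print(" ".join(deploy))
print(" ".join(kill))
```

On a machine where the `storm` client is configured to reach the remote Nimbus, either list could be handed to `subprocess.run` to actually execute the command.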
for message passing
Single node Quorum sufficient for most cases

Watch out!
- Fails fast
  • Use monitoring software
- Keeps growing
  • Cron job to compact data and logs
Tasks are spread evenly across workers

Every worker uses a port
- Starts at 6700
- configurable

Multiple workers per machine
- defaults to 4
- configurable
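
The port numbering and even spreading described above can be sketched in a few lines. This is a simplified model, not Storm's actual scheduler: it assumes consecutive ports starting at 6700, four workers on one machine, and a plain round-robin assignment. The function names are invented for this illustration.

```python
from typing import Dict, List


def worker_ports(base_port: int = 6700, workers_per_machine: int = 4) -> List[int]:
    """One port per worker slot, counting up from the base port."""
    return [base_port + i for i in range(workers_per_machine)]


def spread_tasks(task_ids: List[int], workers: List[int]) -> Dict[int, List[int]]:
    """Assign tasks to workers round-robin so counts differ by at most one."""
    assignment: Dict[int, List[int]] = {worker: [] for worker in workers}
    for index, task in enumerate(task_ids):
        assignment[workers[index % len(workers)]].append(task)
    return assignment


ports = worker_ports()
assignment = spread_tasks(list(range(1, 11)), ports)

for port, tasks in assignment.items():
    print(port, tasks)
```

With ten tasks and four workers, two workers end up with three tasks and two with two, which is as even as the split can get.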
Typed

Needs to know how to serialize each value type
- Extendable with custom serializers
- Java serialization by default (Slow!)
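
To illustrate the idea of per-type serializers with a generic fallback, here is a small Python analogue. It is not Storm's API: `SerializerRegistry`, `Point`, and the tag numbers are hypothetical, and `pickle` merely stands in for Java serialization as the generic default.

```python
import pickle
import struct
from typing import Any, Callable, Dict, Tuple

GENERIC_TAG = 0  # marks payloads written by the generic fallback


class Point:
    """Example value type carried in a tuple (hypothetical)."""

    def __init__(self, x: int, y: int) -> None:
        self.x, self.y = x, y

    def __eq__(self, other: object) -> bool:
        return isinstance(other, Point) and (self.x, self.y) == (other.x, other.y)


class SerializerRegistry:
    def __init__(self) -> None:
        self._encoders: Dict[type, Tuple[int, Callable[[Any], bytes]]] = {}
        self._decoders: Dict[int, Callable[[bytes], Any]] = {}

    def register(self, value_type: type, tag: int,
                 encode: Callable[[Any], bytes],
                 decode: Callable[[bytes], Any]) -> None:
        """Register a custom serializer under a one-byte tag (1..255)."""
        if not 0 < tag < 256 or tag in self._decoders:
            raise ValueError("tag must be unused and in 1..255")
        self._encoders[value_type] = (tag, encode)
        self._decoders[tag] = decode

    def serialize(self, value: Any) -> bytes:
        entry = self._encoders.get(type(value))
        if entry is None:
            # No custom serializer: fall back to the generic mechanism.
            return bytes([GENERIC_TAG]) + pickle.dumps(value)
        tag, encode = entry
        return bytes([tag]) + encode(value)

    def deserialize(self, data: bytes) -> Any:
        tag, payload = data[0], data[1:]
        if tag == GENERIC_TAG:
            return pickle.loads(payload)
        return self._decoders[tag](payload)


registry = SerializerRegistry()
registry.register(
    Point, 1,
    lambda p: struct.pack(">ii", p.x, p.y),      # two big-endian 32-bit ints
    lambda b: Point(*struct.unpack(">ii", b)),
)

custom = registry.serialize(Point(3, 4))          # compact custom encoding
generic = registry.serialize({"word": "storm"})   # generic fallback path
```

The custom path writes a fixed eight-byte payload plus the tag, while the fallback carries whatever overhead the generic format adds, which is the usual reason to register serializers for types that appear in hot paths.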