This talk gives a brief overview of the BigData tooling landscape and where Scoobi, a distributed collection Scala library for Hadoop, stands. Then it shows what are the challenges in translating Scoobi abstractions to Hadoop constructs and how Scala, as programming language, and Kiama (http://code.google.com/kiama), as a graph-processing library, can be leveraged to support this translation. In particular:
- How rewriting rules based on partial functions are a very succinct way to pre-process the computation graph and to optimise it.
- How attribute grammars can be used to implement general graph traversal algorithms