Let's play Flink - Fun with streaming applications at InnoGames
Presented at: code.talks 2018
Chocolate, ice cream and games are perhaps three of the most popular, universally understood words, and they can bring joy to anyone between 5 and 60 years of age!
At InnoGames we not only have all three of those things, we have also built a powerful data infrastructure, because running a business blind is expensive. Evaluating key performance indicators quickly to make good decisions, and delivering personalized, relevant content to each and every gamer, is essential to success. It is how a customer becomes a fan.
With revenue of 130 million euros in 2016, InnoGames is one of the world's leading developers and providers of online games. InnoGames has more than 200 million registered players and has scored major successes with games such as Tribal Wars, Forge of Empires and Elvenar.
Our data infrastructure consists mainly of a data pipeline that covers the streaming part and a data platform that performs batch processing. The latter is based on the Hadoop ecosystem and uses technologies such as Hive, Spark, Hue, R and more to give our data scientists a high degree of flexibility. The data pipeline has gone through several iterations, starting with Kestrel and custom streaming applications. Later on we switched the base technologies to Apache Kafka and Apache Storm. Last year we rebuilt our streaming infrastructure on Apache Flink, an open-source stream processing framework for distributed, high-performing, always-available, and accurate data streaming applications.