I? • Igor Khrol • Team Lead / QA Engineer at Toptal Analytics department • >10 years in IT • Engineer, Team Lead, Manager, Architect, Trainer, Consultant • Python, Scala, Ruby, Java, SQL etc • www.khroliz.com 2
historical data • Used by ETL to reconstruct changes history • Stores current data • Analytics “UI” • Used by stakeholders as self-service analytics • Tomorrow at 14:50 by Márton Kodok
A remote procedure call and data serialization framework developed within Apache's Hadoop project. • Serialized data in a compact binary format • Data types
way... • Luigi has tasks • Tasks have targets and requirements • If target is absent task is executed • Before task run required tasks should be completed ETL Task Developers Countries Country Statisctics
GAE • Quick and easy to start • Seamless integration with BigQuery ◦ Data should be cached for quick access • Examples: ◦ Machine Learning services ◦ Monitoring dashboards
Questions? www.toptal.com Hire the top 3% of freelance talent Igor Khrol [email protected][email protected] skype: igor.khrol https://github.com/Khrol/luigi_google_demo