> 4000 nodes – > 40k concurrent tasks • Problems with resource utilization • Slots only for Map or Reduce • Single NameNode, single point of failure • Clients and Cluster must be at same version
– Long life • Node Manager – One per data server – Monitors resources on node • Application Master – One per application – Short life – Manages task / scheduling
www.semtech-solutions.co.nz – [email protected] • We offer IT project consultancy • We are happy to hear about your problems • You can just pay for those hours that you need • To solve your problems