This talk was presented at the inaugural Elastic{ON} conference, http://elasticon.com
Session Abstract:
Over the past two years, GitHub’s source code search product has grown from a small research project into a very large index containing nearly 4 billion documents. This ever-changing and continuously growing data set has presented us with some interesting scaling problems. This talk will cover how we have tackled these scaling problems, from monitoring and alerting to application changes, growing clusters, and tuning Lucene parameters.
Presented by Tim Pease, GitHub