Slide 1

Slide 1 text

Take a look @ KMKLabs Tech Talk Michael Himawan

Slide 2

Slide 2 text

Information Retrieval Information retrieval (IR) is finding material (usually documents) of an unstructured nature (usually text) that satisfies an information need from within large collections (usually stored on computers) (2009, An Introduction to Information Retrieval)

Slide 3

Slide 3 text

What is Solr ? - an Information Retrieval Machine - is a Car (same with ElasticSearch) - an enterprise search platform How Solr works ? - Solr up (as a service) - Solr index new document or load the current indexed document - Solr returned the Query (search text) from user

Slide 4

Slide 4 text

Peeking to Inverted Index Doc 1: Brad work at Jakarta Doc 2: People come to Jakarta for work Term Doc ID Brad 1 work 1,2 at 1 Jakarta 1,2 People 2 come 2 to 2 for 2

Slide 5

Slide 5 text

Who used Solr ?

Slide 6

Slide 6 text

When we use Solr ? If you want to build a good search engine, use Solr (or others which behave like it)

Slide 7

Slide 7 text

Queries per Second (in exp using 10 queries) - Postgre with multiple connection will be slower, (true fact) - but if using the same connection to do multiple queries Postgre can do 29.6% more queries than Solr per second Solr vs Postgresql (by Charles Nagy)

Slide 8

Slide 8 text

Query time - Postgre with multiple connection will be slower, (true fact) - Postgresql is faster with 27% than Solr in terms of query time. Solr vs Postgresql (by Charles Nagy)

Slide 9

Slide 9 text

Thank you

Slide 10

Slide 10 text

Resource - An Introduction to Information Retrieval - http://www.sitepoint.com/using-solarium-solr-search-setup/ - http://stackoverflow.com/questions/10053050/why-is-solr-so-much-faster-than-postgres - http://db-engines.com/en/system/PostgreSQL%3BSolr - http://charlesnagy.info/it/postgresql/postgresql-full-text-search-vs-solr