Upgrade to Pro — share decks privately, control downloads, hide ads and more …

How Blackboard Curbs Cheating with Elasticsearch

Elastic Co
March 08, 2017

How Blackboard Curbs Cheating with Elasticsearch

Blackboard uses Elasticsearch to power SafeAssign, its plagiarism detection software. Learn about their internal web search engine, how they transitioned from a Solr deployment to a multi-terabyte, full-text search engine powered by Elasticsearch, and their experiences running their infrastructure on AWS.

Slavomir Tecl l Development Manager l Blackboard

Elastic Co

March 08, 2017
Tweet

More Decks by Elastic Co

Other Decks in Technology

Transcript

  1. High level architecture SafeAssign Engine DB Services Logstash ES Logs

    ES Institutional & global database ES Internet User data
  2. Web Institutional & global database Logging, ELK stack • ~3B

    docs • ~200ms query time • ~10M queries/day • Hundreds of millions docs • Ever growing • ~6M queries/day • 2 months of logging data • ~2B records • Kibana frontend Our clusters
  3. Web Institutional & global database Logging, ELK stack • Spark

    jobs parsing big data • Storm continuous jobs • User data • Logstash agents • Redis queueing Feeding clusters
  4. How we built the Web cluster Start small(ish) 1 2

    3 4 5 Load with data Performance test Tune up and repeat previous step Enjoy
  5. USE MORE CLIENT NODES = OFFLOAD CPU, SEARCH FASTER* *

    https://www.elastic.co/guide/en/elasticsearch/reference/current/modules-node.html#coordinating-node
  6. RESCORE MATCHES = RUN MORE EXPENSIVE SECONDARY QUERY LATER* *

    https://www.elastic.co/guide/en/elasticsearch/reference/current/search-request-rescore.html
  7. { } Why doesn’t ES route requests based on search

    queue size? * https://github.com/elastic/elasticsearch/issues/12496
  8. Except where otherwise noted, this work is licensed under http://creativecommons.org/licenses/by-nd/4.0/

    Creative Commons and the double C in a circle are registered trademarks of Creative Commons in the United States and other countries. Third party marks and brands are the property of their respective holders. Please attribute Elastic with a link to elastic.co