Advancing Affordable Energy With Advanced Energy Economy

Elastic Co
February 18, 2016

Learn how the Advanced Energy Economy utilizes Elastic’s hosted Elasticsearch offering to power a scalable, secure, full-text search platform that makes more than 40M pages of energy policy data accessible to organizations looking to make energy cleaner, more affordable, and increasingly secure.



Transcript

  1. Overview:
    •  Who We Are
    •  What We Do and Why It Matters
    •  How We Served Big Data and Lessons Learned
    •  Questions
  2. National association of businesses making the global energy system more secure, clean, and affordable.
    Mission: Transform public policy to enable rapid growth of advanced energy companies.
  3. The Advanced Energy Economy includes many different types of companies and technologies.
    •  Building Efficiency
    •  Energy Efficiency
    •  Demand Response
    •  Electric Generation
    •  Solar
    •  Wind
    •  Grid Technologies
    •  Smart Grid
    •  EV Charging
    •  Transportation
    •  Plug-in / Hybrid Vehicles
  4. We have three distinct customer segments ranging from Fortune 100 companies to academic institutions.
    1) Commercial
    2) Non-profit
    3) Free
  5. We are a small team within a small non-profit.
    Eric Fitz, Charlie Forcey, Bradley Sheehan
  6. PowerSuite: We make energy policy documents accessible and actionable across 50 states.
    •  Identify and engage in policy issues
    •  Grow the advanced energy industry
    Staying on top of energy policy is hard… PowerSuite makes it easy.
  7. Our platform has three layers.
    1) Application
    2) Middleware
    3) Data Pipeline
    Diagram labels: 50 states (CA, MA, IL, TX, CT), Collection, Processing, S3
  8. PowerSuite allows users to search, track, and collaborate on regulatory proceedings (“dockets”) from across the country.
    Staying on top of energy policy is hard… PowerSuite makes it easy.
    •  Search
    •  Track
    •  Collaborate
  12. A “docket” is a collection of hundreds (or thousands) of documents submitted over many months.
    •  270K Dockets
    •  4M Documents
    •  2TB of PDFs
    •  45M Pages
    •  90GB of text
  13. We have built the first national database of dockets and have more content than the English Wikipedia.
    •  270K Dockets
    •  4M Documents
    •  2TB of PDFs
    •  45M Pages
    •  90GB of text
    How did we get here?
  14. At launch in 2014, our MVP full-text search and primary database were both powered by PostgreSQL.
    Diagram labels: 50 states (CA, MA, IL, TX, CT), Collection, Processing, S3; 1) Application, 2) Middleware, 3) Data Pipeline
  15. Our user and data growth trajectory indicated that we were going to scale out of PostgreSQL.
    •  Index bloat
    •  Search performance
    •  Memory
    •  Cost
    Timeline: 2014 Summer, 2015! Summer
  16. How do we scale our platform with limited resources, a looming cliff, and a need to keep delivering new features?
    •  Index bloat
    •  Search performance
    •  Flexibility
    •  Cost
    Open questions:
    •  Cluster Management?
    •  Search Performance?
    •  Flexibility?
    •  Cost?
    Timeline: 2014 Summer, 2015! Summer, 2015 Fall
  17. With 45M documents, our sweet spot configuration was a cluster with 32GB memory, two nodes, and a tie-breaker.
    Cluster Configuration:
    •  One node per instance
    •  16 Shards
    •  32GB Memory
    •  SSD Disk
    Diagram labels: N0, N1, N2, (tiebreaker), (Found)
  18. Elastic Cloud allowed us to keep the lights on and significantly reduce our monthly costs.
    Cluster Management:
    •  Scale on demand
    •  Push button upgrades (1.x 2.x)
    •  Simple plugin integration
    Search Performance & Flexibility:
    •  Millisecond queries
    •  Flexible tuning
    •  Real-time indexing
    •  Easy A/B testing
    Cost:
    •  $1,000/month reduction
    •  Opportunity Cost
    1) No cluster management worries
    2) All of Elastic that you know and love
  19. Elastic Cloud allowed us to keep the lights on and significantly reduce our monthly costs and…
    Cluster Management:
    •  Scale on demand
    •  Push button upgrades (1.x 2.x)
    •  Simple plugin integration
    Search Performance & Flexibility:
    •  Millisecond queries
    •  Flexible tuning
    •  Real-time indexing
    •  Easy A/B testing
    Cost:
    •  $1,000/month reduction
    •  Opportunity Cost
  20. Elastic is now our primary database.
    Diagram labels: 50 states (CA, MA, IL, TX, CT), Collection, Processing, S3; 1) Application, 2) Middleware, 3) Data Pipeline
  21. Not only was the transition easy, but it has also unlocked a whole new class of features for PowerSuite.
    •  Index bloat
    •  Search performance
    •  Memory
    •  Cost
    New features:
    •  Advanced queries
    •  Faceted search
    •  Highlighting
    •  Visualizations
    Timeline: 2014 Summer, 2015! Summer, 2015 Fall, 2016 Winter
  22. Questions?
    www.aee.net / @aeenet / powersuite.aee.net
    Washington DC, San Francisco, Boston
    http://powersuite.aee.net
    Eric Fitz, Senior Director, Engineering and Product Development
    [email protected] @ehfitz
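
The slides mention a 16-shard cluster and name faceted search and highlighting among the features unlocked by the move to Elasticsearch. As a rough illustration only, the sketch below builds the JSON bodies such a setup might send to Elasticsearch. It does not reflect PowerSuite's actual schema or queries: the field names `body` and `state`, the replica count, and both helper function names are hypothetical. The body shapes follow the Elasticsearch query DSL (`bool` query with `filter`, `terms` aggregation, `highlight`), and no client library is needed to build them.

```python
import json


def build_index_settings(shards=16, replicas=1):
    """Index-creation body. 16 shards comes from the slides;
    the replica count is an assumption for illustration."""
    return {
        "settings": {
            "number_of_shards": shards,
            "number_of_replicas": replicas,
        }
    }


def build_docket_search(text, state=None, size=10):
    """Full-text search body with a facet and highlighting.

    `body` and `state` are hypothetical field names.
    """
    # Optional exact-match filter, e.g. restrict results to one state.
    filters = [{"term": {"state": state}}] if state else []
    return {
        "size": size,
        "query": {
            "bool": {
                "must": [{"match": {"body": text}}],
                "filter": filters,
            }
        },
        # Faceted search: count matching documents per state.
        "aggs": {"by_state": {"terms": {"field": "state"}}},
        # Highlighting: return matched snippets from the text field.
        "highlight": {"fields": {"body": {}}},
    }


if __name__ == "__main__":
    print(json.dumps(build_index_settings(), indent=2))
    print(json.dumps(build_docket_search("net metering", state="CA"), indent=2))
```

The resulting dictionaries could be passed as request bodies to an index-creation call and a search call respectively. Keeping them as plain data makes it easy to compare query variants, which is one way the "Easy A/B testing" point on the slides could be approached.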