Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Jennifer Reif

Jennifer Reif

Learn about what a graph database is, how the property graph model is constructed, how to write statements in the Cypher query language, and what Neo4j is. Session also includes information on the Paradise Papers data set and provides queries and steps to accessing and exploring the data for yourself to discover your own interesting connections!

Jennifer Reif

May 01, 2018
Tweet

More Decks by Jennifer Reif

Other Decks in Technology

Transcript

  1. Who Am I? • Developer Relations Engineer for Neo4j •

    Continuous learner • Conference speaker • Hobbies: cat, coffee, traveling
  2. Database - specifically graph • Database: a structured set of

    data held in a computer, especially one that is accessible in various ways. • Relational? NoSQL? Graph? • Graph database: uses graph structures for semantic queries with nodes, edges and properties to represent and store data.
  3. The Matrix Cloud Atlas Tom Hanks ACTED_IN Lana Wachowski DIRECTED

    DIRECTED Hugo Weaving ACTED_IN ACTED_IN Whiteboard friendliness
  4. name: Tom Hanks born: 1956 title: Cloud Atlas released: 2012

    title: The Matrix released: 1999 name: Lana Wachowski born: 1965 ACTED_IN roles: Zachry ACTED_IN roles: Bill Smoke DIRECTED DIRECTED ACTED_IN roles: Agent Smith name: Hugo Weaving born: 1960 Person Movie Movie Person Director Actor Person Actor Whiteboard friendliness
  5. Car Property Graph Model Components Nodes • Represent the objects

    in the graph • Can be labeled Person Person
  6. Car DRIVES Property Graph Model Components Nodes • Represent the

    objects in the graph • Can be labeled Relationships • Relate nodes by type and direction LOVES LOVES LIVES WITH OW NS Person Person
  7. Car DRIVES name: “Dan” born: May 29, 1970 twitter: “@dan”

    name: “Ann” born: Dec 5, 1975 since: Jan 10, 2011 brand: “Volvo” model: “V70” Property Graph Model Components Nodes • Represent the objects in the graph • Can be labeled Relationships • Relate nodes by type and direction Properties • Name-value pairs that can go on nodes and relationships. LOVES LOVES LIVES WITH OW NS Person Person
  8. Neo4j Fast Reliable No size limit Binary & HTTP protocol

    ACID transactions 2-4 M
 ops/s per core Clustering scale & HA Official Drivers Neo4j is a database
  9. Neo4j Native GraphDB Cypher Query Language Developer Workbench Extensible Procedures

    & Functions Schema Free Property Graph Model Graph Visualization Graph Storage Neo4j is a graph database
  10. Cypher: Powerful and Expressive CREATE (:Person { name:“Dan”}) -[:LOVES]-> (:Person

    { name:“Ann”}) LOVES Dan Ann LABEL PROPERTY NODE NODE LABEL PROPERTY
  11. Almost 200 journalists Based in 65 countries “Our aim is

    to bring journalists from different countries together in teams - eliminating rivalry and promoting collaboration. Together, we aim to be the world’s best cross-border investigative team.” — icij.org/about
  12. Common Use Case • Real time recommendations • Fraud Detection

    • Network & IT Management • Social Networks • Bill of Materials • Knowledge Graphs • Master Data Management • Access Management • Microservices Analysis • IoT • ...
  13. SELECT l.name, c.jurisdiction FROM legislators l LEFT JOIN committee c

    ON c.member_ID=l.thomasID WHERE c.thomasID = “HSAP”
  14. Relational Versus Graph Models Relational Model Graph Model KNOWS KNOWS

    KNOWS ANDREAS TOBIAS MICA DELIA Person Friend Person-Friend ANDREAS DELIA TOBIAS MICA
  15. ICIJ’s Extract Tool • Content extraction from documents • Apache

    Tika • Redis job queue • Parallel, distributed • Writes to Solr, CSV https://github.com/ICIJ/extract
  16. Paradise Papers Leak • 1.4TB (13.4 million documents) • Appleby

    law firm • 19 corporate registries "This leak is important because it's the high end of town. People may have dismissed the Mossack Fonseca leaks as they were rogue players who would take any client. Most of the offshore world is not like that at all. Here you have the gold- plated company.” — Gerard Ryle, ICIJ http://www.bbc.com/news/world-41880153