Delegate, Automate, Dominate: Putting Graph Tech to Work for You to Unlock Hidden Insights and Opportunities

Jennifer Reif Email: [email protected] Twitter: @JMHReif LinkedIn: linkedin.com/in/jmhreif Github: github.com/JMHReif
Website: jmhreif.com Delegate, Automate, Dominate Putting Graph Tech to Work for You to Unlock Hidden Insights and Opportunities Mark Heckler Email: [email protected] Twitter: @mkheck LinkedIn: linkedin.com/in/markheckler Github: github.com/mkheck Website: thehecklers.com

Who Am I? • Developer + Advocate • Continuous learner
• Technical content writer • Conference speaker • Other: geek

Who Am I? • Author • Architect & Developer •
Developer Advocate, Java/JVM Languages • Java Champion, Rockstar • Kotlin Developer Expert • Pilot bit.ly/springbootbook

What makes a good graph?

Connected data! • Mixed entity types with queries spanning multiple
• Analyzing connections between entities • Changing data models and needs • Impacts/Dependencies layers deep

Data set?

Kaggle Netflix • Relationship context is important • Multiple types
of entities connected • Kaggle Net fl ix set + Wikipedia country names

Data model

Import - tips and tricks

Load Productions LOAD CSV WITH HEADERS FROM "https://raw.githubusercontent.com/JMHReif/graph-demo-datasets/main/kaggle-net fl ix/titles.csv"
as row CALL apoc.merge.node(["Production",apoc.text.capitalize(toLower(row.type))], {productionId: row.id}, {title: row.title, …}, {}) YIELD node as p WITH row, p CALL { … MERGE (g:Genre {name: apoc.text.capitalize(genre)}) MERGE (p)-[r:CATEGORIZED_BY]->(g) } WITH row, p CALL { … MERGE (c:Country {iso2Code: country}) MERGE (p)-[r2:PRODUCED_IN]->(c) } RETURN count(row); https://github.com/JMHReif/graph-demo-datasets/blob/main/kaggle-net fl ix/load-data.cypher

Load Production People :auto LOAD CSV WITH HEADERS FROM "https://raw.githubusercontent.com/JMHReif/graph-demo-datasets/main/kaggle-net
fl ix/credits.csv" as row WITH row CALL { WITH row MERGE (p:Person {personId: row.person_id}) SET p.name = row.name WITH row, p CALL apoc.create.addLabels(p,[apoc.text.capitalize(toLower(row.role))]) YIELD node as person RETURN person } IN TRANSACTIONS OF 20000 ROWS RETURN count(row); https://github.com/JMHReif/graph-demo-datasets/blob/main/kaggle-net fl ix/load-data.cypher

Top to Bottom • 2 CSV fi les • Add
country name with Wikipedia • Use Cypher + APOC magic • Start in small pieces • :auto IN TRANSACTIONS to batch • Multiple statements to conserve memory • Can be scripted • Also can schedule (using APOC)

Let’s build an API! Demo time!

Automate • Platforming • Con fi guration • Deployment •
Idempotent • Monitoring • Management

Actionable insights Multiple levels, multiple perspectives • Data • System
of systems • Platform

Resources • Source code: github.com/HecklerReifCollab/person-service • Data set: github.com/JMHReif/graph-demo-datasets/tree/main/kaggle-net fl
ix • Neo4j AuraDB: dev.neo4j.com/aura • Azure Spring Apps: aka.ms/azurespringapps Jennifer Reif Email: [email protected] Twitter: @JMHReif LinkedIn: linkedin.com/in/jmhreif Github: GitHub.com/JMHReif Website: jmhreif.com Mark Heckler Email: [email protected] Twitter: @mkheck LinkedIn: linkedin.com/in/markheckler Github: github.com/mkheck Website: thehecklers.com

Delegate, Automate, Dominate: Putting Graph Tec...

Delegate, Automate, Dominate: Putting Graph Tech to Work for You to Unlock Hidden Insights and Opportunities

Jennifer Reif

More Decks by Jennifer Reif

Other Decks in Technology

Featured

Transcript

Jennifer Reif Email: [email protected] Twitter: @JMHReif LinkedIn: linkedin.com/in/jmhreif Github: github.com/JMHReif

Who Am I? • Developer + Advocate • Continuous learner

Who Am I? • Author • Architect & Developer •

What makes a good graph?

Connected data! • Mixed entity types with queries spanning multiple

Data set?

Kaggle Netflix • Relationship context is important • Multiple types

Data model

Import - tips and tricks

Load Productions LOAD CSV WITH HEADERS FROM "https://raw.githubusercontent.com/JMHReif/graph-demo-datasets/main/kaggle-net fl ix/titles.csv"

Load Production People :auto LOAD CSV WITH HEADERS FROM "https://raw.githubusercontent.com/JMHReif/graph-demo-datasets/main/kaggle-net

Top to Bottom • 2 CSV fi les • Add

Let’s build an API! Demo time!

Automate • Platforming • Con fi guration • Deployment •

Actionable insights Multiple levels, multiple perspectives • Data • System

Resources • Source code: github.com/HecklerReifCollab/person-service • Data set: github.com/JMHReif/graph-demo-datasets/tree/main/kaggle-net fl