Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Cypher Sleuthing: Tips and Tricks for Querying a Graph

Cypher Sleuthing: Tips and Tricks for Querying a Graph

Searching for ways to improve your knowledge and skill in writing queries with graph data? While many marketing and sales messages tell us graphs are easier to understand and relatable for other users, writing and structuring graph queries is often different than the skills we currently possess. So how can we up-skill without the pain and time of trial-and-error? Join us as we dive into the structure of a graph database and understand how queries operate. To help us understand the query logic, we will explore tools and tricks to finding out where things went wrong or where we can optimize our queries. We will cover tips and tricks, as well as pitfalls, to help us avoid future road blocks or performance hits. Come to this session to see how to navigate graph data with confidence to retrieve accurate and performant results!

Jennifer Reif

November 02, 2021
Tweet

More Decks by Jennifer Reif

Other Decks in Technology

Transcript

  1. Who Am I? • Developer • Continuous learner • Conference

    speaker • Blogger • Geek Jennifer Reif Email: [email protected] Twitter: @JMHReif LinkedIn: linkedin.com/in/jmhreif Github: GitHub.com/JMHReif Website: jmhreif.com
  2. Cypher, graph, query languages? • What does it have to

    do with you? • Documenting my journey learning Cypher/graph • Lessons for anything: • Adding skills • Graph/NoSQL not going away • Understanding data structures • Query language logic Photo by Matt Howard on Unsplash
  3. Tackling a solution… • Trial-and-error • At the mercy of

    the language • Whack-a-mole fi xes • What if…. Photo by Michal Vrba on Unsplash
  4. Understanding is the key • How to wield the tool

    brings mastery • Knowing when and how —> • E ffi ciency • Better results Credit: unsplash.com
  5. • Graph query language • Created by Neo4j ~10yrs ago

    • Open sourced • Widely supported • Part of GQL standardization • opencypher.org What is Cypher? Photo by Eugene Chystiakov on Unsplash
  6. • Follow path of connected data • Help understand data

    patterns • Surface hidden connections • Traverse graph data in a database What does it do? Photo by ConvertKit on Unsplash
  7. Functional and visual • Designed to store and retrieve data

    in a graph • Based on ASCII-art • Declarative query language • Focus on what to retrieve, not how A B LIKES MATCH ( A ) - [ : LIKES] - > ( B )
  8. Cypher in 20 seconds… • Nodes look like this: •

    (var:Label) OR (var:Label {propKey: propValue}) • Relationships look like this: • -[var:REL_TYPE]-> OR -[var:REL_TYPE {propKey: propValue}]-> • Using Cypher is looking for patterns of those nodes/relationships: • (var1:Label)-[var2:REL_TYPE]->(var3:Label)
  9. Cypher create/insert Jennifer Neo4j WORKS_FOR CREATE (:Person { name: ‘Jennifer’})

    -[:WORKS_FOR]-> (:Company { name: ‘Neo4j’}) PROPERTY LABEL NODE NODE PROPERTY LABEL
  10. Pitfalls (mine, at least) • Programming language logic (e.g. Java)

    • Variable-setting • Complex conditionals Photo by Wander Fleur on Unsplash
  11. Examples //Bad syntax LOAD CSV…AS row MERGE (c:Company {companyId: row.Id})

    WITH row, c, CASE row.type WHEN 'P' THEN row.type = 'Public' WHEN 'R' THEN row.type = 'Private' WHEN 'G' THEN row.type = 'Government' ELSE row.type = 'Other' END SET c.businessType = row.type RETURN * //Bad syntax 2 LOAD CSV…AS row WITH row, CASE row.BusinessType WHEN 'P' THEN type = 'Public' WHEN 'R' THEN type = 'Private' WHEN 'G' THEN type = 'Government' ELSE type = 'Other' END RETURN row.CompanyId, row.CompanyName, type
  12. Correct Logic • 1. Expression comparison against multiple values MATCH

    (p:Person)-[r:IS_MANAGED_BY]->(m:Manager)-[r2:OVERSEES]->(d:Department) RETURN p.name, CASE p.role WHEN 'management' THEN d.departmentPhone WHEN 'business' THEN p.businessPhone WHEN 'technical' THEN p.emailAddress ELSE d.departmentEmail END as personContact • 2. Multiple conditional statements expressed MATCH (p:Person) RETURN p.name, CASE WHEN dateHired is null THEN 'candidate' WHEN dateHired > date(‘2020–09–17') THEN 'newHire' ELSE 'employee' END as personStatus, CASE WHEN dateFired is null THEN dateHired WHEN dateHired is null THEN entryDate ELSE 'n/a' END as leadDate
  13. Avoid pitfalls with CASE • Logic: • Compare values, conditionals

    • Not a programming language • No variable-setting, manipulation • Hint: • Logic occurs in WHEN clause • THEN clause contains single value Photo by JESHOOTS.COM on Unsplash
  14. Pitfalls • Date format • ISO 8601 • Pesky literal

    'T' • Durations • Precision • Groups Credit: unsplash.com
  15. Date format • ISO 8601 standard • Date: 2021-09-17 •

    Datetime: 2021-09-17T09:00:15Z • Literal 'T' between date and time values Credit: unsplash.com
  16. Truncate temporals • Ghosting any smaller units (defaults) • Helpful

    for generic searches or defaulting chunks of time • Date search for current year • Activity in speci fi c month • Any temporal type accepts .truncate() • Example: WITH date() as toda y RETURN date.truncate('month',today);
  17. Translating to ISO 8601 • Cypher accepts: • ISO 8601

    • Strings in ISO 8601 format • What if you have ANYTHING else? • Error!
  18. What is APOC? • Utility library for Neo4j • Widely

    used, broadly applicable • 550+ procedures and functions Photo by Alexis Fauvet on Unsplash
  19. Epoch time • Unix system date/time • Processes, logs, etc.

    • Seconds since 1970-01-01T00:00:00Z • Example: 1635866400 (end time for this session) • Handled with apoc.date.format() or apoc.date.toISO8601() https://www.epochconverter.com/
  20. Date Strings • Process: • Temporal string -> ISO 8601

    string -> Neo4j temporal • Any temporal string with speci fi ed format • Handled with apoc.date.convertFormat() or apoc.temporal.toZonedTemporal()
  21. Durations • Distance in time measurements • Literal ‘P’ and

    ’T’ in syntax • Precision - months, days, seconds • P1M3D60000S • Duration functions (between, inMonths, inDays, inSeconds) Photo by Randy Fath on Unsplash
  22. Duration conversions • Duration functions: inMonths, inDays, inSeconds • Most

    components -> whole values only, no remainders • Convert between units: functions + components • Component groups
  23. Avoid pitfalls with temporals • Think through format: • Temporal/string

    in ISO 8601 • Other must be translated (Cypher or APOC) • Process duration calculations: • Know start->end formats • Determine groups (months, days, seconds) • Outline translation steps (group-to-group, components, etc) Photo by JESHOOTS.COM on Unsplash
  24. Examples WITH datetime(‘2021-09-30T23:59:59') as eo m RETURN apoc.date. fi elds(apoc.temporal.format(eom,

    'yyyy-MM-dd HH:mm:ss'),'yyyy- MM-dd HH:mm:ss'); WITH datetime() as no w RETURN apoc.date.toYears(now.epochMillis), duration.inMonths(date('1970-01-01'), localDatetime(now)).years;
  25. What is Eager? • Eager loading • Consistency and con

    fl icts • Operations occur to all rows before continuing • Avoid read/write con fl icts Credit: unsplash.com
  26. Does it matter? • Low memory/heap • Large dataset •

    Query performance Credit: unsplash.com
  27. Avoid pitfalls with EAGER • Know your data: • Size

    • Heavy processing (i.e. cleaning, operations, etc) • Monitor your queries: • Degrading performance • Fixing or when in doubt: • Separate operations (avoid con fl ict) • PROFILE/EXPLAIN is your friend! Photo by JESHOOTS.COM on Unsplash
  28. What we learned • Cypher syntax • CASE logic •

    Cypher temporals/duration calculations • EAGER operator
  29. Resources • Blog posts: CASE, temporals, eager, and more! •

    jmhreif.com/blog/ • Repository: • github.com/JMHReif/cypher-sleuthing • Ask for help! • dev.neo4j.com/forum -> Cypher channel Credit: unsplash.com Jennifer Reif Email: [email protected] Twitter: @JMHReif LinkedIn: linkedin.com/in/jmhreif Github: GitHub.com/JMHReif Website: jmhreif.com