The Ubiquitous Graph: Two Use Cases from the Real World

The Ubiquitous Graph Two Use Cases from the Real World
Tareq Abedrabbo - Data Science London December 2013

About me • CTO at OpenCredo • Working with Neo4j
for (almost) 3 years on a number of different projects • Co-author of Neo4j in Action (Manning)

“If I'm to believe Twitter, half of the earth's population
are importing Wikipedia into Neo4j, for very obscure reasons.”

Agenda • Graph applications • Use cases • Best practices

What type of applications can be built with a graph
database?

Domain-centric applications

• Well-deﬁned data model • Data changes through user interactions
• Flexible but predictable data structure(s) • Recommendation engines, social networks, etc… • Top-down design

Data-centric applications

• Complex connected data that typically models real world networks
• Integrated from a variety of different sources • Data can be unpredictable • Telco networks, utility networks, etc… • bottom-up design

Typically applications fall somewhere between these 2 types

How can I use the information available in my graph?

• Search and pattern-matching • Find a recommendation based on
behaviour • Graph algorithms • Shortest path, disconnected components • Optimisation • Maximise oil ﬂow while minimising water

Graphs are naturally data-driven

Use case 1: Network Impact Analysis

Domain: a telco network. Millions of connected network components, services
and customers

Requirement: Identify the impact of failing components

Requirement: Identify interesting patterns, such as single points of failure

The network is “semi- structured”

Labelled property graph is a natural ﬁt for the model

Additional “dimensions” can be added to capture abstract concepts: network
redundancy, load-balancing

Cypher queries are a natural solution to delivering the different
requirements

• Other requirements • Multiple starting points • Impact on
quality of service • Abstraction of repeatable patterns

Use case 2: Oil ﬂow optimisation

Domain: an oil extraction network. Hundreds of connected components with
complex conﬁguration options

Requirement: Identify candidate conﬁgurations to maximise ﬂow

Interlude: Genetic Algorithms

“Search heuristic that mimics the process of natural selection” -
Wikipedia

1. Start from an initial population of candidate solutions 2.
Assess each solution using a ﬁtness function 3. Apply genetic operators to derive a new and potentially ﬁtter generation 4. Rinse and repeat!

The Ubiquitous Graph: Two Use Cases from the Re...

The Ubiquitous Graph: Two Use Cases from the Real World

More Decks by Tareq Abedrabbo

Other Decks in Technology

Featured

Transcript