ArangoDB at Monsters on Rails

1 Lucas Dohmen @moonbeamlabs ! the multi-purpose NoSQL Database !
www.arangodb.org

Lucas Dohmen ‣ ArangoDB Core Team ‣ ArangoDB Foxx &
Ruby Adapter ‣ Student on the master branch ‣ hacken.in & nerdkun.de 2 /\ (~( ) ) /\_/\ ( _-----_(@ @) ( \ / /|/--\|\ V " " " "

Why did we start ArangoDB? How should an ideal multi-purpose
database look like? Is it already out there? ! ‣ Second Generation NoSQL DB ‣ Unique feature set ‣ Solves some problems of other NoSQL DBs ‣ Greenﬁeld project ‣ Experienced team building NoSQL DBs for more than 10 years 3

Main Features 4 ‣ Open source and free ‣ Multi
model database ‣ Convenient querying ‣ Extendable through JS & MRuby ‣ High performance & space efﬁciency ‣ Easy to use ‣ Started in Sep 2011 ‣ Current Version: 1.4

Free and Open Source ‣ Apache 2 License ‣ On
Github ‣ Do what you want with it ‣ ... and don‘t pay a dime!

Multi model database 6 Key/Value Store Document Store Graph Database
Source: Andrew Carol Polyglot Persistence

Key-Value Store ‣ Map value data to unique string keys
(identiﬁers) ‣ Treat data as opaque (data has no structure) ‣ Can implement scaling and partitioning easily due to simplistic data model ‣ Key-value can be seen as a special case of documents. For many applications this is sufﬁcient, but not for all cases. ! ArangoDB ‣ It‘s currently supported as a key-value document. ‣ In the near future it supports special key-value collection. ‣ One of the optimization will be the elimination of JSON in this case, so the value need not be parsed. ‣ Sharding capabilities of Key-Value Collections will differ from Document Collections

Document Store ‣ Normally based on key-value stores (each document
still has a unique key) ‣ Allow to save documents with logical similarity in „collections“ ‣ Treat data records as attribute-structured documents (data is no longer opaque) ‣ Often allows querying and indexing document attributes ! ArangoDB ‣ It supports both. A database can contain collections from different types. ‣ For efﬁcient memory handling we have an automatic schema recognition. ‣ It has different ways to retrieve data. CRUD via RESTful Interface, QueryByExample, JS for graph traversals and AQL.

‣ Example: Computer Science Bibliography ! ! ! ! !
ArangoDB ‣ Supports Property Graphs ‣ Vertices and edges are documents ‣ Query them using geo-index, full-text, SQL-like queries ‣ Edges are directed relations between vertices ‣ Custom traversals and built-in graph algorithms Graph Store

Analytic Processing DBs Transaction Processing DBs Managing the evolving state
of an IT system Complex Queries Map/Reduce Graphs Extensibility Key/Value Column-  Stores Documents Massively Distributed Structured Data NoSQL Map

*) Source: Martin Fowler, http://martinfowler.com/articles/nosql-intro.pdf Reporting RDBMS User activity log
Cassandra Product Catalog MongoDB Analytics Cassandra Shopping Cart Riak Recommendations Neo4J Financial Data RDBMS User Sessions Redis Polyglot Persistence Example*  Polyglot Persistence with ArangoDB Reporting RDBMS User activity log Cassandra Product Catalog ArangoDB Analytics Cassandra Shopping Cart ArangoDB Recommendations ArangoDB Financial Data ArangoDB User Sessions ArangoDB

Convenient querying Different scenarios require different access methods: ‣ Query
a document by its unique id / key: GET /_api/document/users/12345 ‣ Query by providing an example document: PUT /_api/simple/by-example { "name": "Jan", "age": 38 } ‣ Query via AQL: FOR user IN users FILTER user.active == true RETURN { name: user.name } ‣ Graph Traversals und JS for your own traversals ‣ JS Actions for “intelligent” DB request

Why another query language?

Other Document Stores ‣ MongoDB uses JSON/BSON as its “query
language” ‣ Limited ‣ Hard to read & write for more complex queries ‣ Complex queries, joins and transactions not possible ‣ CouchDB uses Map/Reduces ‣ It‘s not a relational algebra, and therefore hard to generate ‣ Not easy to learn ‣ Complex queries, joins and transactions not possible

ArangoDB Query Language (AQL)

Example for Aggregation ‣ Retrieve cities with the number of
users: FOR u IN users COLLECT city = u.city INTO g RETURN { "city" : city, "numUsersInCity": LENGTH(g) }

Example for Graph Query ‣ Paths: FOR u IN users
LET userRelations = ( FOR p IN PATHS( users, relations, "OUTBOUND" ) FILTER p._from == u._id RETURN p ) RETURN { "user" : u, "relations" : userRelations }

Extendable through JS & MRuby ‣ Scripting-Languages enrich ArangoDB ‣
Multi Collection Transactions ‣ Building small and efﬁcient Apps - Foxx App Framework ‣ Individually Graph Traversals ‣ Cascading deletes/updates ‣ Assign permissions to actions ‣ Aggregate data from multiple queries into a single response ‣ Carry out data-intensive operations ‣ Help to create efﬁcient Push Services - in the near Future ! ‣ Currently supported ‣ Javascript (Google V8) ‣ Mruby (experimental, not fully integrated yet)

Action Server - kind of Application Server ‣ ArangoDB can
answer arbitrary HTTP requests directly ‣ You can write your own JavaScript functions (“actions”) that will be executed server-side ‣ Includes a permission system ! ➡ You can use it as a database or as a combined database/app server

‣ Single Page Web Applications ‣ Native Mobile Applications ‣
ext. Developer APIs APIs - will become more & more important

ArangoDB Foxx ‣ What if you could talk to the
database directly? ‣ It would only need an API. ‣ What if we could deﬁne this API in JavaScript? ! ! ! ! ! ! ‣ ArangoDB Foxx is streamlined for API creation – not a jack of all trades ‣ It is designed for front end developers: Use JavaScript, which you already know (without running into callback hell)

Foxx - Simple Example

Foxx - More features ‣ Full access to ArangoDB‘s internal
APIs: ‣ Simple Queries ‣ AQL ‣ Traversals ‣ Automatic generation of interactive documentation ‣ Models and Repositories ‣ Central repository of Foxx apps for re-use and inspiration ‣ Authentication Module

High performance & space efficiency RAM is cheap, but it's
still not free and data volume is growing fast. Requests volumes are also growing. So performance and space efficiency are key features of a multi-purpose database. ! ‣ ArangoDB supports automatic schema recognition, so it is one of the most space efficient document stores. ‣ It offers a performance oriented architecture with a C database core, a C++ communication layer, JS and C++ for additional functionalities. ‣ Performance critical points can be transformed to C oder C++. ‣ Although ArangoDB has a wide range of functions, such as MVCC real ACID, schema recognition, etc., it can compete with popular stores documents.

Space Efﬁciency ‣ Measure the space on disk of different
data sets ‣ First in the standard conﬁg, then with some optimization ‣ We measured a bunch of different tasks

Store 50,000 Wiki Articles

3,459,421 AOL Search Queries

Performance: Disclaimer ‣ Always take performance tests with a grain
of salt ‣ Performance is very dependent on a lot of factors including the speciﬁc task at hand ‣ This is just to give you a glimpse at the performance ‣ Always do your own performance tests (and if you do, report back to us :) ) ‣ But now: Let‘s see some numbers

Execution Time: Bulk Insert of 10,000,000 documents

Conclusion from Tests ‣ ArangoDB is really space efﬁcient ‣
ArangoDB is “fast enough” ‣ Please test it for your own use case

Easy to use ‣ Easy to use admin interface ‣
Simple Queries for simple queries, AQL for complex queries ‣ Simplify your setup: ArangoDB only – no Application Server etc. – on a single server is sufﬁcient for some use cases ‣ You need graph queries or key value storage? You don't need to add another component to the mix. ‣ No external dependencies like the JVM – just install ArangoDB ‣ HTTP interface – use your load balancer

Admin Frontend Dashboard

Admin Frontend Collections & Documents

Admin Frontend AQL development

Admin Frontend complete V8 access

ArangoShell

Join the growing community

ArangoDB.explain() { "type": "2nd generation NoSQL database", "model": [ "document",
"graph", "key-value" ], "openSource": true, "license“: "apache 2", "version": [ "1.3 stable", "1.4 alpha" ], "builtWith": [ "C", "C++", "JS" ], "uses": [ "Google V8" ], "mainFeatures": [ "Multi-Collection-Transaction", "Foxx API Framework", "ArangoDB Query Language", "Various Indexes", "API Server", "Automatic Schema Recognition" ] }

ArangoDB at Monsters on Rails

ArangoDB at Monsters on Rails

More Decks by Lucas Dohmen

Other Decks in Technology

Featured

Transcript