Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Geoindexing with MongoDB
Search
Leszek Krupiński
May 17, 2012
Programming
61
0
Share
Geoindexing with MongoDB
Leszek Krupiński
May 17, 2012
More Decks by Leszek Krupiński
See All by Leszek Krupiński
So that the daemon won’t die
leafnode
2
410
Practical PHP7
leafnode
2
490
Dobrze posól swoje hasło
leafnode
0
120
Dobrze posól swoje hasło (z notatkami)
leafnode
0
110
PHPNG kontra HHVM
leafnode
0
120
PHPNG kontra HHVM (z notatkami)
leafnode
0
86
Ewolucja PHP: PHP 5.6, NG, PHP 7, HHVM
leafnode
2
310
Sculpin - Generowanie statycznych stron w PHP
leafnode
2
77
Skalowanie aplikacji PHP
leafnode
1
430
Other Decks in Programming
See All in Programming
AIエージェントと協働するCLI開発 — BunとOpenClawで学んだこと
yoshikouki
1
230
タクシーアプリ『GO』の バックエンド開発のおける AI利活用と若者のすべて
pyama86
3
1.8k
tsserverとは何だったのか、これからどうなるのか
nowaki28
1
430
Oxcを導入して開発体験が向上した話
yug1224
4
280
ふつうのFeature Flag実践入門
irof
7
3.5k
Old Dog, New Tricks: The Java 25 Reinvention - JNation
bazlur_rahman
0
140
Copilot CLI の継戦能力を高める コンテキスト管理
nozomutu
1
1.2k
LLM Plugin for Node-REDの利用方法と開発について
404background
0
150
Spec-Driven Development with AI-Agents: From High-Level Requirements to Working Software
antonarhipov
2
440
Oxlintのカスタムルールの現況
syumai
5
970
TAKTでAI駆動開発の品質を設計する
j5ik2o
4
260
プラグインで拡張される Context をtype-safe にする難しさと設計判断
kazupon
2
550
Featured
See All Featured
JAMstack: Web Apps at Ludicrous Speed - All Things Open 2022
reverentgeek
1
460
Believing is Seeing
oripsolob
1
140
Agile Leadership in an Agile Organization
kimpetersen
PRO
0
160
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
254
22k
Stewardship and Sustainability of Urban and Community Forests
pwiseman
0
220
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
659
62k
My Coaching Mixtape
mlcsv
0
140
A Tale of Four Properties
chriscoyier
163
24k
Building Better People: How to give real-time feedback that sticks.
wjessup
370
20k
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
28
3.5k
DevOps and Value Stream Thinking: Enabling flow, efficiency and business value
helenjbeal
1
210
The Curse of the Amulet
leimatthew05
1
13k
Transcript
Geoindexing with MongoDB Leszek Krupiński WebClusters 2012
About me
On-line since 1997
Funny times
1 hr of internet for 1 USD
None
None
First social site: geocities
My first web page
What do I do now
Day-time job Managing team of developers for Polish Air Force
Side: consulting, optimizing, desiging
Buzzwords incoming!
The Internet 2008
Web 2.0
http://en.wikipedia.org/wiki/File:Web_2.0_Map.svg CC-BY-SA-2.5
Be social in your bedroom
alone.
The Internet 2012
Web 3.0
None
Why geospatial?
Needs shifted
Why? Because they could.
None
None
None
How to implement?
Database. Duh.
Keep, but also query
Is there a person at 53.438522,14.52198? Nope. Is there a
person at 53.438522,14.52199? Nope. Is there a person at 53.438522,14.52199? Yeah, here’s Johnny!
Not too useful.
Give me nearby homies. Within the range of 1 km
there is: • Al Gore (53.438625,14.52103) • Bill Clinton (53.432531,14.55127) • Johnny Bravo (53.438286,14.52363)
Now that’s better.
Geoindexing. Nothing new.
Oracle, PostreSQL, Lucene/Solr, even MySQL (via extensions)
SELECT c.holding_company, c.location FROM competitor c, bank b WHERE b.site_id
= 1604 AND SDO_WITHIN_DISTANCE(c.location, b.location, ’distance=2 unit=mile’) = ’TRUE’ ORACLE
SQL is so last year
Let’s use something cool
MongoDB. Because all the cool kids use NoSQL now
None
Why MongoDB?
Choose your NoSQL wise.
NoSQL in MongoDB • Document –based • Queries (JS-like syntax)
• JSON-like storage
Why MongoDB? Use Cases • Archiving • Event logging •
Document and CMS • Gaming • High volume sites • Mobile • Operational datastore • Agile development • Real-time stats Features • Ad hoc queries • Indexing • Replication • Load Balancing • File Storage • Aggregation • Server-side JavaScript • Capped collections http://en.wikipedia.org/wiki/Mongodb
Back to geo.
{ loc: [ 52.0, 21.0 ], name: ”Warsaw”, type: ”City”
}
db.nodes.ensureIndex({loc: '2d'})
That’s it.
Query • Exact o db.places.find( { loc : [50,50] }
) • Near o db.places.find( { loc : { $near : [50,50] } } ) • Limit o db.places.find( { loc : { $near : [50,50] } } ).limit(20) • Distance o db.places.find( { loc : { $near : [50,50] , $maxDistance : 5 } } ).limit(20)
Compound index • db.places.ensureIndex( { location : "2d" , category
: 1 } ); • db.places.find( { location : { $near : [50,50] }, category : 'coffee‚ } );
Bound queries • box = [ [40.73083, -73.99756], [40.741404, -73.988135]
] • db.places.find( {"loc" : {"$within" : {"$box" : box }} } )
Problems
Units
Coordinates in arc units Distance in kilometers
In query
earthRadius = 6378 // km multi = earthRadius * PI
/ 180.0 range = 3000 // km … maxDistance : range * multi…
In results
pointDistance = distances[0].dis / multi
Earth is not flat.
Problem: can’t use linear distance
Earth isn’t flat too.
Solution? Use approximation.
MongoDB has it built-in distances = db.runCommand( { geoNear :
"points", near : [0, 0], spherical : true, maxDistance : range / earthRadius /* to radians */ } ).results
Focus: runCommand distances = db.runCommand({ geoNear : "points" …
Sort by distance Only with runCommand
Automatically sorted • db.runCommand( { geoNear : "places" , near
: [50,50], num : 10 } ); • { "ns" : "test.places", "results" : [ { "dis" : 69.29646421910687, "obj" : … }, { "dis" : 69.29646421910687, "obj" : … }, … ], … }
Demo
OpenStreetMaps database of Poland imported into MongoDB
14.411.552 nodes
3GB of raw XML data
PHP in virtual machine
Imported about 100.000 nodes every 10s.
Pretty cool, eh?
Kudos to Derick Rethans Part of this talk was inspired
by his talk
Questions?
Thanks! Rate me at https://joind.in/talk/view/6475
Geoindexing with MongoDB supplement Leszek Krupiński WebClusters 2012
Why MongoDB?
Evaluate.
PostGIS is cool too. (but it’s SQL, meh)
Why MongoDB? Use Cases • Archiving • Event logging •
Document and CMS • Gaming • High volume sites • Mobile • Operational datastore • Agile development • Real-time stats Features • Ad hoc queries • Indexing • Replication • Load Balancing • File Storage • Aggregation • Server-side JavaScript • Capped collections http://en.wikipedia.org/wiki/Mongodb
If you need other features of MongoDB, use it
If you don’t, evaluate.
Evaluate.
Demo (hopefully)
Questions?
Please leave feedback! Rate me at https://joind.in/6475