Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Geoindexing with MongoDB
Search
Leszek Krupiński
May 17, 2012
Programming
0
59
Geoindexing with MongoDB
Leszek Krupiński
May 17, 2012
Tweet
Share
More Decks by Leszek Krupiński
See All by Leszek Krupiński
So that the daemon won’t die
leafnode
2
400
Practical PHP7
leafnode
2
460
Dobrze posól swoje hasło
leafnode
0
110
Dobrze posól swoje hasło (z notatkami)
leafnode
0
95
PHPNG kontra HHVM
leafnode
0
100
PHPNG kontra HHVM (z notatkami)
leafnode
0
70
Ewolucja PHP: PHP 5.6, NG, PHP 7, HHVM
leafnode
2
300
Sculpin - Generowanie statycznych stron w PHP
leafnode
2
63
Skalowanie aplikacji PHP
leafnode
1
410
Other Decks in Programming
See All in Programming
『毎日の移動』を支えるGoバックエンド内製開発
yutautsugi
2
250
Flutterで分数(Fraction)を表示する方法
koukimiura
0
130
After go func(): Goroutines Through a Beginner’s Eye
97vaibhav
0
390
TFLintカスタムプラグインで始める Terraformコード品質管理
bells17
2
170
私達はmodernize packageに夢を見るか feat. go/analysis, go/ast / Go Conference 2025
kaorumuta
2
560
bootcamp2025_バックエンド研修_WebAPIサーバ作成.pdf
geniee_inc
0
110
10年もののAPIサーバーにおけるCI/CDの改善の奮闘
mbook
0
820
overlayPreferenceValue で実現する ピュア SwiftUI な AdMob ネイティブ広告
uhucream
0
180
Devoxx BE - Local Development in the AI Era
kdubois
0
130
monorepo の Go テストをはやくした〜い!~最小の依存解決への道のり~ / faster-testing-of-monorepos
convto
2
490
デミカツ切り抜きで面倒くさいことはPythonにやらせよう
aokswork3
0
230
3年ぶりにコードを書いた元CTOが Claude Codeと30分でMVPを作った話
maikokojima
0
130
Featured
See All Featured
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
359
30k
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
285
14k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
35
3.2k
Performance Is Good for Brains [We Love Speed 2024]
tammyeverts
12
1.2k
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
46
7.7k
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
26
3.1k
It's Worth the Effort
3n
187
28k
Unsuck your backbone
ammeep
671
58k
What’s in a name? Adding method to the madness
productmarketing
PRO
24
3.7k
Producing Creativity
orderedlist
PRO
347
40k
Chrome DevTools: State of the Union 2024 - Debugging React & Beyond
addyosmani
8
910
The Power of CSS Pseudo Elements
geoffreycrofte
79
6k
Transcript
Geoindexing with MongoDB Leszek Krupiński WebClusters 2012
About me
On-line since 1997
Funny times
1 hr of internet for 1 USD
None
None
First social site: geocities
My first web page
What do I do now
Day-time job Managing team of developers for Polish Air Force
Side: consulting, optimizing, desiging
Buzzwords incoming!
The Internet 2008
Web 2.0
http://en.wikipedia.org/wiki/File:Web_2.0_Map.svg CC-BY-SA-2.5
Be social in your bedroom
alone.
The Internet 2012
Web 3.0
None
Why geospatial?
Needs shifted
Why? Because they could.
None
None
None
How to implement?
Database. Duh.
Keep, but also query
Is there a person at 53.438522,14.52198? Nope. Is there a
person at 53.438522,14.52199? Nope. Is there a person at 53.438522,14.52199? Yeah, here’s Johnny!
Not too useful.
Give me nearby homies. Within the range of 1 km
there is: • Al Gore (53.438625,14.52103) • Bill Clinton (53.432531,14.55127) • Johnny Bravo (53.438286,14.52363)
Now that’s better.
Geoindexing. Nothing new.
Oracle, PostreSQL, Lucene/Solr, even MySQL (via extensions)
SELECT c.holding_company, c.location FROM competitor c, bank b WHERE b.site_id
= 1604 AND SDO_WITHIN_DISTANCE(c.location, b.location, ’distance=2 unit=mile’) = ’TRUE’ ORACLE
SQL is so last year
Let’s use something cool
MongoDB. Because all the cool kids use NoSQL now
None
Why MongoDB?
Choose your NoSQL wise.
NoSQL in MongoDB • Document –based • Queries (JS-like syntax)
• JSON-like storage
Why MongoDB? Use Cases • Archiving • Event logging •
Document and CMS • Gaming • High volume sites • Mobile • Operational datastore • Agile development • Real-time stats Features • Ad hoc queries • Indexing • Replication • Load Balancing • File Storage • Aggregation • Server-side JavaScript • Capped collections http://en.wikipedia.org/wiki/Mongodb
Back to geo.
{ loc: [ 52.0, 21.0 ], name: ”Warsaw”, type: ”City”
}
db.nodes.ensureIndex({loc: '2d'})
That’s it.
Query • Exact o db.places.find( { loc : [50,50] }
) • Near o db.places.find( { loc : { $near : [50,50] } } ) • Limit o db.places.find( { loc : { $near : [50,50] } } ).limit(20) • Distance o db.places.find( { loc : { $near : [50,50] , $maxDistance : 5 } } ).limit(20)
Compound index • db.places.ensureIndex( { location : "2d" , category
: 1 } ); • db.places.find( { location : { $near : [50,50] }, category : 'coffee‚ } );
Bound queries • box = [ [40.73083, -73.99756], [40.741404, -73.988135]
] • db.places.find( {"loc" : {"$within" : {"$box" : box }} } )
Problems
Units
Coordinates in arc units Distance in kilometers
In query
earthRadius = 6378 // km multi = earthRadius * PI
/ 180.0 range = 3000 // km … maxDistance : range * multi…
In results
pointDistance = distances[0].dis / multi
Earth is not flat.
Problem: can’t use linear distance
Earth isn’t flat too.
Solution? Use approximation.
MongoDB has it built-in distances = db.runCommand( { geoNear :
"points", near : [0, 0], spherical : true, maxDistance : range / earthRadius /* to radians */ } ).results
Focus: runCommand distances = db.runCommand({ geoNear : "points" …
Sort by distance Only with runCommand
Automatically sorted • db.runCommand( { geoNear : "places" , near
: [50,50], num : 10 } ); • { "ns" : "test.places", "results" : [ { "dis" : 69.29646421910687, "obj" : … }, { "dis" : 69.29646421910687, "obj" : … }, … ], … }
Demo
OpenStreetMaps database of Poland imported into MongoDB
14.411.552 nodes
3GB of raw XML data
PHP in virtual machine
Imported about 100.000 nodes every 10s.
Pretty cool, eh?
Kudos to Derick Rethans Part of this talk was inspired
by his talk
Questions?
Thanks! Rate me at https://joind.in/talk/view/6475
Geoindexing with MongoDB supplement Leszek Krupiński WebClusters 2012
Why MongoDB?
Evaluate.
PostGIS is cool too. (but it’s SQL, meh)
Why MongoDB? Use Cases • Archiving • Event logging •
Document and CMS • Gaming • High volume sites • Mobile • Operational datastore • Agile development • Real-time stats Features • Ad hoc queries • Indexing • Replication • Load Balancing • File Storage • Aggregation • Server-side JavaScript • Capped collections http://en.wikipedia.org/wiki/Mongodb
If you need other features of MongoDB, use it
If you don’t, evaluate.
Evaluate.
Demo (hopefully)
Questions?
Please leave feedback! Rate me at https://joind.in/6475