Geospatial Analysis Made Easy with meza

GEOSPATIAL ANALYSIS MADE EASY WITH MEZA GeoPython — Basel, Switzerland
— May 10, 2017 by Reuben Cummings @reubano

WHO AM I? Managing Director, Nerevu Development Founder of Arusha
Coders Author of several popular Python packages

ME ZA ( GI TH UB .C OM / R
E UB A NO /M E ZA )

readers converters MEZA OVERVIEW records input output

MEZA INPUT/OUTPUT Input Formats Output Formats Array CSV GeoJSON JSON
GeoJSON MDB CSV/TSV SQLITE DBF XLS(X) JSON YAML HTML

MT. K I L IMA NJ AR O (M OS
HI , TAN ZA N I A ) Photo Credit: Reuben Cummings

{ "type": "FeatureCollection", "features": [ { "type": "Feature", "properties": {
"peak": "uhuru", "id": 10 }, "geometry": { UHURU_PEAK.GEOJSON

"type": "Point", "coordinates": [ 37.350666, -3.066465 ] } } ]
} UHURU_PEAK.GEOJSON

{ "type": "FeatureCollection", "features": [ { "type": "Feature", "properties": {
"peak": "kibo", "id": 11 }, "geometry": { KIBO_PEAK.GEOJSON

"type": "Point", "coordinates": [ 37.353333, -3.075833 ] } } ]
} KIBO_PEAK.GEOJSON

MEZA DEMO

>>> from meza import io >>> >>> records = io.read('kibo_peak.geojson')
>>> next(records) {'id': 11, 'lat': Decimal('-3.075833'), 'lon': Decimal('37.353333'), 'peak': 'kibo', 'type': 'Point'} MEZA DEMO (READERS)

CHALLENGE #1 MERGING

MEZA DEMO (MERGING) >>> from meza import convert as cv
>>> >>> paths = ( ... 'uhuru_peak.geojson', ... 'kibo_peak.geojson') >>> >>> records = io.join(*paths) >>> geojson = cv.records2geojson(records) >>> io.write('meza_peaks.geojson', geojson)

{ "type": "FeatureCollection", "bbox": [ 37.350666, -3.075833, 37.353333, -3.066465 ],
"features": [ { MEZA_PEAKS.GEOJSON

"type": "Feature", "id": 10, "geometry": { "type": "Point", "coordinates": [
37.350666, -3.066465 ] }, "properties": { MEZA_PEAKS.GEOJSON

"id": 10, "peak": "uhuru" } }, { "type": "Feature", "id":
11, "geometry": { "type": "Point", MEZA_PEAKS.GEOJSON

"coordinates": [ 37.353333, -3.075833 ] }, "properties": { "id": 11,
"peak": "kibo" } } MEZA_PEAKS.GEOJSON

], "crs": { "type": "name", "properties": { "name": "urn:ogc:def:crs:OGC:1.3:CRS84" }
} } MEZA_PEAKS.GEOJSON

>>> records = io.read('meza_peaks.geojson') >>> csv = cv.records2csv(records) >>> io.write('meza_peaks.csv',
csv) MEZA DEMO (MERGING) $ pip install --user csvkit $ csvlook meza_peaks.csv | id | type | lat | lon | peak | | -- | ----- | ------- | ------- | ----- | | 10 | Point | -3.066… | 37.350… | uhuru | | 11 | Point | -3.075… | 37.353… | kibo |

CHALLENGE #2 SPLIT BY ID

>>> for _id, _records in groups: ... f = cv.records2geojson(_records)
... io.write(name.format(_id), f) >>> from meza import process as pr >>> >>> records = io.read('meza_peaks.geojson') >>> groups = pr.group(records, 'id') >>> name = 'peak_{}.geojson' >>> MEZA DEMO (SPLIT BY ID)

$ ls peak_* peak_10.geojson peak_11.geojson MEZA DEMO (SPLIT BY ID)

CHALLENGE #3 EXTRACT BY ID

>>> records = io.read('peaks.geojson') >>> groups = pr.group(records, 'id') >>>
group = next( ... g for g in groups if g[0] == 11) >>> MEZA DEMO (EXTRACT BY ID)

>>> geojson = cv.records2csv(group[1]) >>> io.write('id_11_peaks.csv', geojson) >>> records =
io.read('peaks.geojson') >>> groups = pr.group(records, 'id') >>> group = next( ... g for g in groups if g[0] == 11) >>> MEZA DEMO (EXTRACT BY ID)

$ csvlook id_11_peaks.csv | id | type | lat |
lon | peak | | -- | ----- | ------- | ------- | ---- | | 11 | Point | -3.076… | 37.353… | kibo | MEZA DEMO (EXTRACT BY ID)

BUT WAIT, THERE'S MORE! ME ZA D E MO

CHALLENGE #4 EXTRACT BY ID V2

MEZA DEMO (EXTRACT BY ID V2) >>> from urllib.request import
urlopen >>> >>> BASE = 'https://raw.githubusercontent.com' >>> REPO = 'drei01/geojson-world-cities' >>> path = '{}/{}/master/cities.geojson' >>> url = path.format(BASE, REPO) >>> f = urlopen(url) >>> records = io.read_geojson(f)

MEZA DEMO (EXTRACT BY ID V2) >>> next(records) {'NAME': 'TORSHAVN',
'id': None, 'lat': Decimal('62.015167236328125'), 'lon': Decimal('-6.758638858795166'), 'pos': 0, 'type': 'Polygon'}

MEZA DEMO (EXTRACT BY ID V2) >>> clean = (
... r for r in records if r.get('NAME')) >>> >>> splits = pr.split( ... clean, 'NAME', chunksize=1024) >>> >>> b_splits = ( ... s for s in splits if 'BASE' in s[1]) >>> >>> name = 'base_cities.csv'

MEZA DEMO (EXTRACT BY ID V2) >>> for pos, split
in enumerate(b_splits): ... f = cv.records2csv( ... split[0], skip_header=pos) ... ... io.write(name, f, mode='ab+')

$ csvstat base_cities.csv | tail -n12 6. "NAME" Unique values:
4 Most common values: BASEL (102x) KABASELE-PANIA (23x) MATSUBASE (17x) WABBASEKA (10x) Row count: 152 MEZA DEMO (EXTRACT BY ID)

MEZA DEMO (EXTRACT BY ID) $ csvcut -c NAME,lon,lat base_cities.csv
\ | csvlook --max-rows 3 | NAME | lon | lat | | ----- | ------ | ------- | | BASEL | 7.549… | 47.544… | | BASEL | 7.544… | 47.545… | | BASEL | 7.539… | 47.547… | | ... | ... | ... |

Reuben Cummings @reubano THANKS!

Geospatial Analysis Made Easy with meza

Geospatial Analysis Made Easy with meza

More Decks by Reuben Cummings

Other Decks in Programming

Featured

Transcript