Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
CKAN
Search
Rae Knowler
February 12, 2015
Programming
0
620
CKAN
An introduction to the open data management system, CKAN.
Rae Knowler
February 12, 2015
Tweet
Share
More Decks by Rae Knowler
See All by Rae Knowler
How do vampires use the internet? An exploration of fandom and technology
bellisk
0
390
Liiptalk__Mental_Health_First_Aid.pdf
bellisk
0
74
Hypothesis: Property-based testing for Python
bellisk
0
73
Python, Locales and Writing Systems - PyCon US, 12th May 2018
bellisk
0
110
Unsafe at Any Speed (PyDays Vienna, 5th May 2018)
bellisk
0
84
Unsafe at Any Speed (PyCon UK, 26th October 2017)
bellisk
1
650
Python, Locales and Writing Systems (PyCon Poland, 18th August 2017)
bellisk
1
120
Python, Locales and Writing Systems (PyCon Italia, 7th April 2017)
bellisk
1
100
Python, Locales and Writing Systems (Swiss Python Summit 2017)
bellisk
0
110
Other Decks in Programming
See All in Programming
Spring gRPC で始める gRPC 入門 / Introduction to gRPC with Spring gRPC
mackey0225
2
510
事業戦略を理解してソフトウェアを設計する
masuda220
PRO
22
6.2k
Haskell でアルゴリズムを抽象化する / 関数型言語で競技プログラミング
naoya
17
4.7k
AWS CDKの推しポイント 〜CloudFormationと比較してみた〜
akihisaikeda
3
280
XSLTで作るBrainfuck処理系
makki_d
0
210
カクヨムAndroidアプリのリブート
numeroanddev
0
430
Team topologies and the microservice architecture: a synergistic relationship
cer
PRO
0
840
Claude Codeの使い方
ttnyt8701
1
120
A2A プロトコルを試してみる
azukiazusa1
2
530
UPDATEがシステムを複雑にする? イミュータブルデータモデルのすすめ
shimomura
1
550
コードの90%をAIが書く世界で何が待っているのか / What awaits us in a world where 90% of the code is written by AI
rkaga
39
26k
Cursor Meetup Tokyo ゲノミクスとCursor: 進化と制約のあいだ
koido
2
1k
Featured
See All Featured
A Modern Web Designer's Workflow
chriscoyier
693
190k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
PRO
20
1.3k
Git: the NoSQL Database
bkeepers
PRO
430
65k
YesSQL, Process and Tooling at Scale
rocio
172
14k
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
48
5.4k
Large-scale JavaScript Application Architecture
addyosmani
512
110k
Docker and Python
trallard
44
3.4k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
43
2.4k
Thoughts on Productivity
jonyablonski
69
4.7k
RailsConf 2023
tenderlove
30
1.1k
Bootstrapping a Software Product
garrettdimon
PRO
307
110k
Testing 201, or: Great Expectations
jmmastey
42
7.5k
Transcript
Rae Knowler Liip Techtalk, 12th February 2015 1
What is CKAN? 2
CKAN is a powerful data management system that makes data
accessible – by providing tools to streamline publishing, sharing, finding and using data. ckan.org/ 3
OGD@Bund (http://opendata.admin.ch) 4
Python backend, JavaScript frontend - Framework: Pylons - ORM: SQLAlchemy
- Database engine: PostgreSQL - Search: Solr - Templates: Jinja2 5 Technical basis
What is Open Data? 6
Data ... 7 - any kind - from governments -
from research and science - from businesses - ...
… that's Open 8 - accessible - free - reusable
/ remixable
The Five Stars of Open Data 9 Tim Berners-Lee (image
source)
10 Andreas Amsler (image source)
Why use CKAN? 11
Well, what happens when you don't? 12 - "If only
we could get the statistics for this…" - "Not another scanned-in PDF!" - "Do you have the hard drive with our results on it? … No, the other hard drive?"
CKAN lets you ... - upload any kind of file
- from anywhere - with a sensible metadata system - track metadata history - publish data, easily - preview it - share it 13
Who uses CKAN? 14
The Natural History Museum, UK (http://www.nhm.ac.uk/research- curation/research/informatics/creating-digital-infrastructure-science/data-portal/index.html) 15
data.gov.uk 16
CKAN features 17
Open Source - No tool lock-in - No need to
rely on the core devs for features - All the other advantages of Open Source! 18
Federation of data sources - Federation of data sources -
Can store data within CKAN itself - Search all the databases! 19
Super searching - Solr for full-text search - Facetted search
- Tags and groups for datasets - Related material 20
Groups and organisations - Organise datasets by source and topic
- Privilege control for users 21
API - RESTful JSON API - Well-documented - All features
of web interface - Access dataset statistics - Get RDF version of the catalogue - CSV / JSON dump of catalogue - ... 22
Plus ... - Responsive design - Data preview - Geospatial
features - Customisable - Extensible 23
CKAN extensions 24
Add comments 25
Google Analytics 26
QA checker (including the 5 stars) 27
Harvesting https://github.com/ckan/ckanext-harvest - Harvest datasets from any CKAN instance -
Basis for further extensions to harvest data from anywhere 28
Harvesting to a local and a central CKAN instance 29
http://extensions.ckan.org/ 30
CKAN at Liip 31
OGD@Bund (http://opendata.admin.ch) 32
Open Data Zürich (https://data.integ.stadt-zuerich.ch/) 33
ORD@CH (http://openresearchdata.ch) 34
The CKAN community 35
The Open Knowledge Foundation - UK-based non-profit organisation - Manages
CKAN development - Employs full-time dev team 36
Everyone else! - Other contributors are welcome - Useful mailing
lists: - https://lists.okfn.org/mailman/listinfo/ckan-dev - https://groups.google.com/forum/#!forum/ckan- global-user-group - Good issues for contribution: - https://github.com/ckan/ckan/labels/Good%20for% 20Contribution 37
CKANCon 2014, Berlin 38
Useful links 39
- http://ckan.org - http://docs.ckan.org/en/latest/ - https://github.com/liip/ckan-vagrant - Vagrant box for
CKAN development 40 Useful Links
Thanks! 41