Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
CKAN
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
Rae Knowler
February 12, 2015
Programming
0
670
CKAN
An introduction to the open data management system, CKAN.
Rae Knowler
February 12, 2015
Tweet
Share
More Decks by Rae Knowler
See All by Rae Knowler
How do vampires use the internet? An exploration of fandom and technology
bellisk
0
430
Liiptalk__Mental_Health_First_Aid.pdf
bellisk
0
78
Hypothesis: Property-based testing for Python
bellisk
0
87
Python, Locales and Writing Systems - PyCon US, 12th May 2018
bellisk
0
120
Unsafe at Any Speed (PyDays Vienna, 5th May 2018)
bellisk
0
97
Unsafe at Any Speed (PyCon UK, 26th October 2017)
bellisk
1
700
Python, Locales and Writing Systems (PyCon Poland, 18th August 2017)
bellisk
1
130
Python, Locales and Writing Systems (PyCon Italia, 7th April 2017)
bellisk
1
120
Python, Locales and Writing Systems (Swiss Python Summit 2017)
bellisk
0
110
Other Decks in Programming
See All in Programming
humanlayerのブログから学ぶ、良いCLAUDE.mdの書き方
tsukamoto1783
0
200
Oxlint JS plugins
kazupon
1
1k
AI時代のキャリアプラン「技術の引力」からの脱出と「問い」へのいざない / tech-gravity
minodriven
22
7.5k
Data-Centric Kaggle
isax1015
2
790
そのAIレビュー、レビューしてますか? / Are you reviewing those AI reviews?
rkaga
6
4.6k
Amazon Bedrockを活用したRAGの品質管理パイプライン構築
tosuri13
5
810
KIKI_MBSD Cybersecurity Challenges 2025
ikema
0
1.3k
16年目のピクシブ百科事典を支える最新の技術基盤 / The Modern Tech Stack Powering Pixiv Encyclopedia in its 16th Year
ahuglajbclajep
5
1.1k
LLM Observabilityによる 対話型音声AIアプリケーションの安定運用
gekko0114
2
440
AWS re:Invent 2025参加 直前 Seattle-Tacoma Airport(SEA)におけるハードウェア紛失インシデントLT
tetutetu214
2
120
並行開発のためのコードレビュー
miyukiw
2
1.4k
dchart: charts from deck markup
ajstarks
3
1k
Featured
See All Featured
How STYLIGHT went responsive
nonsquared
100
6k
Git: the NoSQL Database
bkeepers
PRO
432
66k
Facilitating Awesome Meetings
lara
57
6.8k
Reality Check: Gamification 10 Years Later
codingconduct
0
2k
Highjacked: Video Game Concept Design
rkendrick25
PRO
1
290
Prompt Engineering for Job Search
mfonobong
0
160
Optimising Largest Contentful Paint
csswizardry
37
3.6k
Docker and Python
trallard
47
3.7k
Noah Learner - AI + Me: how we built a GSC Bulk Export data pipeline
techseoconnect
PRO
0
110
Stop Working from a Prison Cell
hatefulcrawdad
273
21k
How to build an LLM SEO readiness audit: a practical framework
nmsamuel
1
650
What Being in a Rock Band Can Teach Us About Real World SEO
427marketing
0
180
Transcript
Rae Knowler Liip Techtalk, 12th February 2015 1
What is CKAN? 2
CKAN is a powerful data management system that makes data
accessible – by providing tools to streamline publishing, sharing, finding and using data. ckan.org/ 3
OGD@Bund (http://opendata.admin.ch) 4
Python backend, JavaScript frontend - Framework: Pylons - ORM: SQLAlchemy
- Database engine: PostgreSQL - Search: Solr - Templates: Jinja2 5 Technical basis
What is Open Data? 6
Data ... 7 - any kind - from governments -
from research and science - from businesses - ...
… that's Open 8 - accessible - free - reusable
/ remixable
The Five Stars of Open Data 9 Tim Berners-Lee (image
source)
10 Andreas Amsler (image source)
Why use CKAN? 11
Well, what happens when you don't? 12 - "If only
we could get the statistics for this…" - "Not another scanned-in PDF!" - "Do you have the hard drive with our results on it? … No, the other hard drive?"
CKAN lets you ... - upload any kind of file
- from anywhere - with a sensible metadata system - track metadata history - publish data, easily - preview it - share it 13
Who uses CKAN? 14
The Natural History Museum, UK (http://www.nhm.ac.uk/research- curation/research/informatics/creating-digital-infrastructure-science/data-portal/index.html) 15
data.gov.uk 16
CKAN features 17
Open Source - No tool lock-in - No need to
rely on the core devs for features - All the other advantages of Open Source! 18
Federation of data sources - Federation of data sources -
Can store data within CKAN itself - Search all the databases! 19
Super searching - Solr for full-text search - Facetted search
- Tags and groups for datasets - Related material 20
Groups and organisations - Organise datasets by source and topic
- Privilege control for users 21
API - RESTful JSON API - Well-documented - All features
of web interface - Access dataset statistics - Get RDF version of the catalogue - CSV / JSON dump of catalogue - ... 22
Plus ... - Responsive design - Data preview - Geospatial
features - Customisable - Extensible 23
CKAN extensions 24
Add comments 25
Google Analytics 26
QA checker (including the 5 stars) 27
Harvesting https://github.com/ckan/ckanext-harvest - Harvest datasets from any CKAN instance -
Basis for further extensions to harvest data from anywhere 28
Harvesting to a local and a central CKAN instance 29
http://extensions.ckan.org/ 30
CKAN at Liip 31
OGD@Bund (http://opendata.admin.ch) 32
Open Data Zürich (https://data.integ.stadt-zuerich.ch/) 33
ORD@CH (http://openresearchdata.ch) 34
The CKAN community 35
The Open Knowledge Foundation - UK-based non-profit organisation - Manages
CKAN development - Employs full-time dev team 36
Everyone else! - Other contributors are welcome - Useful mailing
lists: - https://lists.okfn.org/mailman/listinfo/ckan-dev - https://groups.google.com/forum/#!forum/ckan- global-user-group - Good issues for contribution: - https://github.com/ckan/ckan/labels/Good%20for% 20Contribution 37
CKANCon 2014, Berlin 38
Useful links 39
- http://ckan.org - http://docs.ckan.org/en/latest/ - https://github.com/liip/ckan-vagrant - Vagrant box for
CKAN development 40 Useful Links
Thanks! 41