Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Intro a Google Refine
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
dcabo
May 25, 2013
0
640
Intro a Google Refine
dcabo
May 25, 2013
Tweet
Share
More Decks by dcabo
See All by dcabo
Open Data y Transparencia
dcabo
0
72
Mejorando el periodismo con Ruby
dcabo
0
590
Reutilización de datos y transparencia
dcabo
3
340
Preparando datos para su análisis
dcabo
0
620
Beyond FOIA (FOIA and Technology)
dcabo
1
87
Open Data y Transparencia
dcabo
0
200
¿Dónde van mis impuestos?
dcabo
3
240
Casos prácticos de la reutilización de datos públicos
dcabo
2
130
Against the Spanish odds (the techie side)
dcabo
3
270
Featured
See All Featured
The Power of CSS Pseudo Elements
geoffreycrofte
80
6.2k
Bridging the Design Gap: How Collaborative Modelling removes blockers to flow between stakeholders and teams @FastFlow conf
baasie
0
450
The Impact of AI in SEO - AI Overviews June 2024 Edition
aleyda
5
730
Reflections from 52 weeks, 52 projects
jeffersonlam
356
21k
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
47
7.9k
Paper Plane (Part 1)
katiecoart
PRO
0
4.2k
Writing Fast Ruby
sferik
630
62k
Dominate Local Search Results - an insider guide to GBP, reviews, and Local SEO
greggifford
PRO
0
77
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
11
830
Utilizing Notion as your number one productivity tool
mfonobong
3
220
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
31
2.7k
Testing 201, or: Great Expectations
jmmastey
46
8k
Transcript
Limpiando datos con Google Refine David Cabo (@dcabo)
[email protected]
Limpiando datos • Refine: Herramienta de exploración y limpieza de
datos • Proceso • 1. Obtener los datos • 2. Limpiarlos con Refine • 3. Analizarlos: Excel, Open Office, R...
¿Qué puede hacer? • Filtrar y agrupar datos por distintos
criterios • Aplicar transformaciones a los datos • Unir/partir columnas • Verificar con bases de datos externas:FreeBase, Open Corporates... • Clustering: limpieza basada en similitudes: corrección de erratas • ...