Computing & Data: From academia to open source

February 26, 2013


Slides for a lightning talk for the UC Berkeley workshop "Supporting Data Science - A Campus-Wide Workshop": http://vcresearch.berkeley.edu/datascience/workshop

Fernando Perez


Transcript

Computing & Data: From academia to open source
Fernando Pérez
http://fperez.org, @fperez_org, [email protected]
Henry H. Wheeler Jr. Brain Imaging Center, UC Berkeley
Supporting Data Science, Feb 23, 2013

Computing and data: now part of the DNA of science
Much more than "the third/fourth branch" of science.
Computing and data are everybody's problem... Therefore they are nobody's problem.

An educational problem: the computer as a research tool
All scientists need to own their computational processes.
This means literacy in statistics, linear algebra, algorithms, ...
But also in "software carpentry" skills: version control, software design, testing, documentation, ...
NOT yet another department on campus (ask Dave Culler)...

Open Source: skills, tools and practices we need!
The culture where these things get done.
Wildly collaborative.
Reproducible by necessity.
Version control, testing, documentation, public peer review, etc.

Reward structure in academia: we punish all of the above
Departmental boundaries: interdisciplinary work is a great buzzword, not such a great career path.
Computational heritage is built on code, not on citations of prior literature.
Continuous evolution vs. publication milestones.
Authorship in collaborative works vs. the first-author paper.
Scholarship and intellectual effort embedded in the code.