Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Meet Rosie
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
Jessica Temporal
March 27, 2017
Programming
86
0
Share
Meet Rosie
Machine Intelligence against corruption
Jessica Temporal
March 27, 2017
More Decks by Jessica Temporal
See All by Jessica Temporal
Tips for Writing Great Technical Content
jtemporal
0
320
Traveling Through a Secure API with Python and Auth0
jtemporal
0
180
Rosie, Robot
jtemporal
0
450
Serenata de Amor's data science
jtemporal
1
170
Final Graduation Project
jtemporal
0
260
Other Decks in Programming
See All in Programming
要はバランスからの卒業 #yumemi_grow
kajitack
0
200
Inspired By RubyKaigi (EN)
atzzcokek
0
440
Moments When Things Go Wrong
aurimas
3
130
密結合なバックエンドから TypeScript のコードを生成する
kemuridama
1
390
関係性から理解する"同一性"の型用語たち
pvcresin
2
620
1人1案件のプロダクトエンジニア時代に、"プロセス監督"としてチャレンジしたこと
non0113
0
350
代数的データ型って何が嬉しいの? #frontend_phpcon_do
kajitack
1
470
さぁV100、メモリをお食べ・・・
nilpe
0
110
Oxlintのカスタムルールの現況
syumai
5
900
OSもどきOS
arkw
0
330
Oxlintはいかにしてtsgolintのlint ruleを呼び出しているのか
syumai
2
1k
CLIであることを活かしたGitHub Copilot CLI活用術 / GitHub Copilot CLI Pro Tips & Tricks
nao_mk2
1
1.1k
Featured
See All Featured
What’s in a name? Adding method to the madness
productmarketing
PRO
24
4.1k
The Limits of Empathy - UXLibs8
cassininazir
1
340
How GitHub (no longer) Works
holman
316
150k
AI: The stuff that nobody shows you
jnunemaker
PRO
7
670
The untapped power of vector embeddings
frankvandijk
2
1.7k
The Director’s Chair: Orchestrating AI for Truly Effective Learning
tmiket
1
180
Designing Experiences People Love
moore
143
24k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.8k
Conquering PDFs: document understanding beyond plain text
inesmontani
PRO
4
2.8k
HDC tutorial
michielstock
2
680
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
254
22k
Unsuck your backbone
ammeep
672
58k
Transcript
1 Rosie
So far 2 • 5 implemented classifiers • ~3k suspicious
reimbursements found • 629 reports made • 216 congresspeople reported
How we get the data? 3 • Scrapping • APIs
◦ ReceitaWS ◦ Camara
I have the data, what do I do now? •
Develop a hypothesis • Test it out • Implement a classifier • Report 4
Jupyter Notebooks 5
GitHub 6
Server 7
The irregular companies classifier
import data 9
data.format() 10
data.head(5) 11
12 pd.merge
13 data.query()
Fixtures • Sample data 14
Tests • Test first, code later 15
Show me the code • The classifier 16
What about Rosie? • import Classifier 17
5222 18
github.com/datasciencebr @jesstemporal apoia.se/serenata 19