Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Meet Rosie
Search
Jessica Temporal
March 27, 2017
Programming
0
59
Meet Rosie
Machine Intelligence against corruption
Jessica Temporal
March 27, 2017
Tweet
Share
More Decks by Jessica Temporal
See All by Jessica Temporal
Tips for Writing Great Technical Content
jtemporal
0
180
Traveling Through a Secure API with Python and Auth0
jtemporal
0
99
Rosie, Robot
jtemporal
0
430
Serenata de Amor's data science
jtemporal
1
170
Final Graduation Project
jtemporal
0
200
Other Decks in Programming
See All in Programming
どうして手を動かすよりもチーム内のコードレビューを優先するべきなのか
okashoi
3
940
カンファレンス動画鑑賞会のススメ / Osaka.swift #1
hironytic
0
190
AWS re:Invent 2024個人的まとめ
satoshi256kbyte
0
140
SwiftUIで単方向アーキテクチャを導入して得られた成果
takuyaosawa
0
110
ESLintプラグインを使用してCDKのセオリーを適用する
yamanashi_ren01
2
270
AHC041解説
terryu16
0
500
Compose でデザインと実装の差異を減らすための取り組み
oidy
1
220
[JAWS-UG横浜 #80] うわっ…今年のServerless アップデート、少なすぎ…?
maroon1st
0
130
.NETでOBS Studio操作してみたけど…… / Operating OBS Studio by .NET
skasweb
0
130
テストコード書いてみませんか?
onopon
2
360
ペアーズでの、Langfuseを中心とした評価ドリブンなリリースサイクルのご紹介
fukubaka0825
1
160
Beyond ORM
77web
11
1.6k
Featured
See All Featured
Rails Girls Zürich Keynote
gr2m
94
13k
Bootstrapping a Software Product
garrettdimon
PRO
305
110k
YesSQL, Process and Tooling at Scale
rocio
170
14k
Visualization
eitanlees
146
15k
Imperfection Machines: The Place of Print at Facebook
scottboms
267
13k
Agile that works and the tools we love
rasmusluckow
328
21k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
28
2.2k
Building Applications with DynamoDB
mza
93
6.2k
ピンチをチャンスに:未来をつくるプロダクトロードマップ #pmconf2020
aki_iinuma
113
50k
Fashionably flexible responsive web design (full day workshop)
malarkey
406
66k
The Language of Interfaces
destraynor
156
24k
Chrome DevTools: State of the Union 2024 - Debugging React & Beyond
addyosmani
3
260
Transcript
1 Rosie
So far 2 • 5 implemented classifiers • ~3k suspicious
reimbursements found • 629 reports made • 216 congresspeople reported
How we get the data? 3 • Scrapping • APIs
◦ ReceitaWS ◦ Camara
I have the data, what do I do now? •
Develop a hypothesis • Test it out • Implement a classifier • Report 4
Jupyter Notebooks 5
GitHub 6
Server 7
The irregular companies classifier
import data 9
data.format() 10
data.head(5) 11
12 pd.merge
13 data.query()
Fixtures • Sample data 14
Tests • Test first, code later 15
Show me the code • The classifier 16
What about Rosie? • import Classifier 17
5222 18
github.com/datasciencebr @jesstemporal apoia.se/serenata 19