Meet Rosie
Jessica Temporal
March 27, 2017
Programming
Machine Intelligence against corruption
Transcript
Rosie
So far
• 5 implemented classifiers
• ~3k suspicious reimbursements found
• 629 reports made
• 216 congresspeople reported
How do we get the data?
• Scraping
• APIs
  ◦ ReceitaWS
  ◦ Camara
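The API route on this slide could be sketched roughly as below. It assumes that ReceitaWS serves one company record per CNPJ at the URL pattern shown in the code; that pattern and the helper names `normalize_cnpj`, `receitaws_url`, and `fetch_company` are assumptions for illustration and are not taken from the deck.

```python
import json
import re
from urllib.request import urlopen

# Assumed base URL of the ReceitaWS endpoint; check the service's own docs.
RECEITAWS_BASE = "https://www.receitaws.com.br/v1/cnpj/"


def normalize_cnpj(raw):
    """Strip punctuation and require the 14 digits a CNPJ has."""
    digits = re.sub(r"\D", "", raw)
    if len(digits) != 14:
        raise ValueError(f"expected 14 digits, got {len(digits)}")
    return digits


def receitaws_url(raw_cnpj):
    """Build the (assumed) lookup URL for one company."""
    return RECEITAWS_BASE + normalize_cnpj(raw_cnpj)


def fetch_company(raw_cnpj, timeout=10):
    """Download and decode one company record (performs a network call)."""
    with urlopen(receitaws_url(raw_cnpj), timeout=timeout) as response:
        return json.load(response)
```

Keeping URL construction separate from the network call makes the formatting logic checkable offline; `fetch_company` is the only function that touches the network.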
I have the data, what do I do now?
• Develop a hypothesis
• Test it out
• Implement a classifier
• Report
Jupyter Notebooks
GitHub
Server
The irregular companies classifier
import data
data.head(5)
data.format()
pd.merge
data.query()
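The pandas calls named on these slides (`head`, `pd.merge`, `query`) might fit together as in the sketch below. The two tables, their column names, and the rule itself (a company that was no longer active when the receipt was issued) are assumptions made up for illustration; the deck does not show the real data or the real rule.

```python
import pandas as pd

# Hypothetical reimbursements; column names are illustrative only.
reimbursements = pd.DataFrame({
    "document_id": [1, 2, 3],
    "cnpj": ["11111111000111", "22222222000122", "11111111000111"],
    "issue_date": pd.to_datetime(["2016-01-10", "2016-03-05", "2015-06-01"]),
})

# Hypothetical company registry, e.g. as gathered from a company API.
companies = pd.DataFrame({
    "cnpj": ["11111111000111", "22222222000122"],
    "situation": ["CLOSED", "ACTIVE"],
    "situation_date": pd.to_datetime(["2015-12-31", "2010-01-01"]),
})

# Peek at the first rows before doing anything else.
print(reimbursements.head(5))

# Attach each supplier's registry record to its reimbursements.
data = pd.merge(reimbursements, companies, on="cnpj", how="left")

# Assumed rule: the company stopped being active before the receipt date.
suspicious = data.query("situation != 'ACTIVE' and situation_date < issue_date")
print(suspicious["document_id"].tolist())
```

With this sample data only document 1 is flagged: document 2 belongs to an active company, and document 3 was issued before its supplier closed.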
Fixtures
• Sample data
Tests
• Test first, code later
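A test-first workflow over a small sample-data fixture could look like the following sketch. The inline CSV stands in for a fixture file, and `load_fixture`, `flag_irregular`, and the column names are hypothetical; the rule under test is an assumed one, not the project's confirmed logic.

```python
import io
import unittest

import pandas as pd

# Hypothetical fixture: a few hand-written rows standing in for a sample file.
FIXTURE_CSV = """document_id,situation,situation_date,issue_date
1,CLOSED,2015-12-31,2016-01-10
2,ACTIVE,2010-01-01,2016-03-05
3,CLOSED,2015-12-31,2015-06-01
"""


def load_fixture():
    """Read the sample rows, parsing both date columns."""
    return pd.read_csv(io.StringIO(FIXTURE_CSV),
                       parse_dates=["situation_date", "issue_date"])


def flag_irregular(data):
    """Assumed rule: company inactive before the receipt's issue date."""
    return ((data["situation"] != "ACTIVE")
            & (data["situation_date"] < data["issue_date"]))


class TestFlagIrregular(unittest.TestCase):
    def setUp(self):
        self.data = load_fixture()

    def test_flags_only_receipts_issued_after_closure(self):
        flags = flag_irregular(self.data)
        self.assertEqual(flags.tolist(), [True, False, False])
```

In a test-first flow the test class and fixture would be written before `flag_irregular` exists, then run with `python -m unittest` until the implementation makes it pass.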
Show me the code
• The classifier
What about Rosie?
• import Classifier
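One way a classifier could be packaged so that a runner imports and applies it is sketched below. The fit/predict interface, the class name `IrregularCompanySketch`, the runner `run_classifiers`, and the column names are all assumptions for illustration; the deck does not show Rosie's actual interface.

```python
import pandas as pd


class IrregularCompanySketch:
    """Hypothetical rule-based classifier with a fit/predict interface."""

    def fit(self, data):
        # Rule-based: there is nothing to learn from the data.
        return self

    def predict(self, data):
        inactive = data["situation"] != "ACTIVE"
        issued_after = data["situation_date"] < data["issue_date"]
        return (inactive & issued_after).to_numpy()


def run_classifiers(data, classifiers):
    """Return a copy of data with one boolean column per classifier."""
    results = data.copy()
    for name, classifier in classifiers.items():
        results[name] = classifier.fit(data).predict(data)
    return results


data = pd.DataFrame({
    "situation": ["CLOSED", "ACTIVE"],
    "situation_date": pd.to_datetime(["2015-12-31", "2010-01-01"]),
    "issue_date": pd.to_datetime(["2016-01-10", "2016-03-05"]),
})
report = run_classifiers(data, {"irregular_company": IrregularCompanySketch()})
print(report["irregular_company"].tolist())
```

A shared interface like this means adding a new hypothesis is a matter of writing one more class and registering it in the dictionary passed to the runner.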
5222
github.com/datasciencebr
@jesstemporal
apoia.se/serenata