$30 off During Our Annual Pro Sale. View Details »
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Meet Rosie
Search
Jessica Temporal
March 27, 2017
Programming
0
71
Meet Rosie
Machine Intelligence against corruption
Jessica Temporal
March 27, 2017
Tweet
Share
More Decks by Jessica Temporal
See All by Jessica Temporal
Tips for Writing Great Technical Content
jtemporal
0
280
Traveling Through a Secure API with Python and Auth0
jtemporal
0
160
Rosie, Robot
jtemporal
0
450
Serenata de Amor's data science
jtemporal
1
170
Final Graduation Project
jtemporal
0
240
Other Decks in Programming
See All in Programming
sbt 2
xuwei_k
0
130
テストやOSS開発に役立つSetup PHP Action
matsuo_atsushi
0
130
S3 VectorsとStrands Agentsを利用したAgentic RAGシステムの構築
tosuri13
4
240
ローターアクトEクラブ アメリカンナイト:川端 柚菜 氏(Japan O.K. ローターアクトEクラブ 会長):2720 Japan O.K. ロータリーEクラブ2025年12月1日卓話
2720japanoke
0
240
モデル駆動設計をやってみよう Modeling Forum2025ワークショップ/Let’s Try Model-Driven Design
haru860
0
210
Why Kotlin? 電子カルテを Kotlin で開発する理由 / Why Kotlin? at Henry
agatan
2
2k
Media Capture and Streams: W3C仕様と現場での知見
nowaki28
0
120
社内オペレーション改善のためのTypeScript / TSKaigi Hokuriku 2025
dachi023
1
290
20251127_ぼっちのための懇親会対策会議
kokamoto01_metaps
2
370
AIコードレビューがチームの"文脈"を 読めるようになるまで
marutaku
0
250
AIエージェントでのJava開発がはかどるMCPをAIを使って開発してみた / java mcp for jjug
kishida
4
840
JJUG CCC 2025 Fall Javaコミュニティの歩き方 〜参加から貢献まで、すべて教えます〜
takasyou
0
730
Featured
See All Featured
Unsuck your backbone
ammeep
671
58k
Build your cross-platform service in a week with App Engine
jlugia
234
18k
Making the Leap to Tech Lead
cromwellryan
135
9.6k
Keith and Marios Guide to Fast Websites
keithpitt
413
23k
Docker and Python
trallard
46
3.7k
Producing Creativity
orderedlist
PRO
348
40k
It's Worth the Effort
3n
187
29k
Optimizing for Happiness
mojombo
379
70k
Testing 201, or: Great Expectations
jmmastey
46
7.8k
Being A Developer After 40
akosma
91
590k
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
16
1.8k
Music & Morning Musume
bryan
46
7k
Transcript
1 Rosie
So far 2 • 5 implemented classifiers • ~3k suspicious
reimbursements found • 629 reports made • 216 congresspeople reported
How we get the data? 3 • Scrapping • APIs
◦ ReceitaWS ◦ Camara
I have the data, what do I do now? •
Develop a hypothesis • Test it out • Implement a classifier • Report 4
Jupyter Notebooks 5
GitHub 6
Server 7
The irregular companies classifier
import data 9
data.format() 10
data.head(5) 11
12 pd.merge
13 data.query()
Fixtures • Sample data 14
Tests • Test first, code later 15
Show me the code • The classifier 16
What about Rosie? • import Classifier 17
5222 18
github.com/datasciencebr @jesstemporal apoia.se/serenata 19