Meet Rosie
Machine Intelligence against corruption
Jessica Temporal
March 27, 2017
Transcript
Rosie
So far
• 5 implemented classifiers
• ~3k suspicious reimbursements found
• 629 reports made
• 216 congresspeople reported
How do we get the data?
• Scraping
• APIs
  ◦ ReceitaWS
  ◦ Camara
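The API route can be sketched as follows. This is a minimal illustration, not the project's actual fetching code: the endpoint pattern in `RECEITAWS_URL` is an assumption about the public ReceitaWS service, and the helper names (`normalize_cnpj`, `company_url`, `fetch_company`) are hypothetical.

```python
import json
import re
from urllib.request import urlopen

# Assumed endpoint pattern for ReceitaWS; check the service's current docs.
RECEITAWS_URL = "https://www.receitaws.com.br/v1/cnpj/{cnpj}"


def normalize_cnpj(raw):
    """Strip punctuation from a CNPJ and check that 14 digits remain."""
    digits = re.sub(r"\D", "", raw)
    if len(digits) != 14:
        raise ValueError(f"expected 14 digits, got {len(digits)}: {raw!r}")
    return digits


def company_url(raw_cnpj):
    """Build the lookup URL for one company (hypothetical helper)."""
    return RECEITAWS_URL.format(cnpj=normalize_cnpj(raw_cnpj))


def fetch_company(raw_cnpj, opener=urlopen):
    """Download and decode one company record. Performs a network call
    unless a different `opener` is injected, for example in tests."""
    with opener(company_url(raw_cnpj), timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))
```

Passing the opener as a parameter keeps the URL and parsing logic testable without touching the network, which also helps when the remote service applies rate limits.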
I have the data, what do I do now?
• Develop a hypothesis
• Test it out
• Implement a classifier
• Report
Jupyter Notebooks
GitHub
Server
The irregular companies classifier
import data
data.format()
data.head(5)
pd.merge
data.query()
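The `pd.merge` and `data.query()` steps could look roughly like the sketch below. The column names, the sample rows, and the list of irregular statuses are all assumptions made for illustration; the slides do not show the real schema or rule.

```python
import pandas as pd

# Hypothetical sample rows; the real datasets have many more columns.
reimbursements = pd.DataFrame({
    "document_id": [1, 2, 3],
    "cnpj": ["11111111000111", "22222222000122", "33333333000133"],
    "issue_date": pd.to_datetime(["2016-05-10", "2016-05-11", "2016-05-12"]),
})
companies = pd.DataFrame({
    "cnpj": ["11111111000111", "22222222000122", "33333333000133"],
    "situation": ["ATIVA", "BAIXADA", "BAIXADA"],
    "situation_date": pd.to_datetime(["2005-01-01", "2015-01-01", "2016-12-01"]),
})

# Attach each company's registration status to the expenses it issued.
data = pd.merge(reimbursements, companies, how="left", on="cnpj")

# Assumed rule: flag an expense when the company was already in an
# irregular status on the day the receipt was issued.
irregular_statuses = ["BAIXADA", "NULA", "SUSPENSA", "INAPTA"]
suspicious = data.query(
    "situation in @irregular_statuses and situation_date < issue_date"
)
print(suspicious[["document_id", "cnpj", "situation"]])
```

A left merge keeps every reimbursement even when no company record matches, so missing company data shows up as empty values rather than silently dropping rows.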
Fixtures
• Sample data
Tests
• Test first, code later
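A fixture plus a test written against it might look like this sketch, using only the standard library. The fixture rows, the `is_irregular` rule, and all names are hypothetical; the point is that the expected outcome for each sample row is decided before the rule is implemented.

```python
import csv
import io
import unittest

# Hypothetical fixture: a few hand-written rows with known outcomes.
FIXTURE_CSV = """cnpj,situation,situation_date,issue_date
11111111000111,ATIVA,2005-01-01,2016-05-10
22222222000122,BAIXADA,2015-01-01,2016-05-11
33333333000133,BAIXADA,2016-12-01,2016-05-12
"""


def load_fixture(text=FIXTURE_CSV):
    """Parse the sample data into a list of dicts, one per row."""
    return list(csv.DictReader(io.StringIO(text)))


def is_irregular(row):
    """Assumed rule under test. ISO dates compare correctly as strings."""
    return (row["situation"] != "ATIVA"
            and row["situation_date"] < row["issue_date"])


class TestIsIrregular(unittest.TestCase):
    def setUp(self):
        self.rows = load_fixture()

    def test_active_company_is_not_flagged(self):
        self.assertFalse(is_irregular(self.rows[0]))

    def test_company_closed_before_expense_is_flagged(self):
        self.assertTrue(is_irregular(self.rows[1]))

    def test_company_closed_after_expense_is_not_flagged(self):
        self.assertFalse(is_irregular(self.rows[2]))
```

Saved in a test module, these cases can be run with `python -m unittest`.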
Show me the code
• The classifier
What about Rosie?
• import Classifier
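One way to picture how a classifier plugs into Rosie is the sketch below. It assumes a scikit-learn-like `fit`/`predict` interface and a simple registry of classifiers; the class, the `CLASSIFIERS` dict, and `run_classifiers` are hypothetical stand-ins, not Rosie's actual code.

```python
class IrregularCompanyClassifier:
    """Hypothetical rule-based classifier with a fit/predict interface."""

    IRREGULAR = {"BAIXADA", "NULA", "SUSPENSA", "INAPTA"}

    def fit(self, rows):
        # The rule is fixed, so there is nothing to learn from the data.
        return self

    def predict(self, rows):
        # One boolean per row: True means the expense looks suspicious.
        return [
            row["situation"] in self.IRREGULAR
            and row["situation_date"] < row["issue_date"]
            for row in rows
        ]


# Assumed registry: output column name mapped to classifier class.
CLASSIFIERS = {"irregular_company": IrregularCompanyClassifier}


def run_classifiers(rows, classifiers=CLASSIFIERS):
    """Apply every registered classifier and collect its flags."""
    return {
        name: cls().fit(rows).predict(rows)
        for name, cls in classifiers.items()
    }


rows = [
    {"situation": "ATIVA", "situation_date": "2005-01-01",
     "issue_date": "2016-05-10"},
    {"situation": "BAIXADA", "situation_date": "2015-01-01",
     "issue_date": "2016-05-11"},
]
flags = run_classifiers(rows)
print(flags)
```

With a shared interface, adding a new classifier means importing it and registering one more entry; the loop that applies classifiers does not change.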
5222
github.com/datasciencebr
@jesstemporal
apoia.se/serenata