Meet Rosie
Jessica Temporal
March 27, 2017
Programming
Machine Intelligence against corruption
Transcript
Rosie

So far
• 5 implemented classifiers
• ~3k suspicious reimbursements found
• 629 reports made
• 216 congresspeople reported

How do we get the data?
• Scraping
• APIs
  ◦ ReceitaWS
  ◦ Camara

I have the data, what do I do now?
• Develop a hypothesis
• Test it out
• Implement a classifier
• Report

Jupyter Notebooks
GitHub
Server

The irregular companies classifier
import data
data.format()
data.head(5)
pd.merge
data.query()
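
The slides name pd.merge and data.query() without showing the code, so the following is only a minimal sketch of how those two steps might combine reimbursements with company records. Every column name (document_id, cnpj, issue_date, situation, situation_date), the status labels, and the sample rows are assumptions made for illustration, not the project's real schema.

```python
import pandas as pd

# Hypothetical reimbursement records; every column name here is an assumption.
reimbursements = pd.DataFrame({
    "document_id": [1, 2, 3],
    "cnpj": ["111", "222", "333"],
    "issue_date": pd.to_datetime(["2016-05-10", "2016-06-01", "2016-07-15"]),
})

# Hypothetical company registry data, e.g. collected from a public API.
companies = pd.DataFrame({
    "cnpj": ["111", "222"],
    "situation": ["ABERTA", "BAIXADA"],  # assumed labels: open / closed
    "situation_date": pd.to_datetime(["2005-11-03", "2015-12-31"]),
})

# A left join keeps every reimbursement, even when no company record matches.
merged = pd.merge(reimbursements, companies, on="cnpj", how="left")

# Keep expenses issued after the company stopped being open.
# engine="python" avoids depending on the optional numexpr backend.
suspicious = merged.query(
    "situation != 'ABERTA' and situation_date < issue_date",
    engine="python",
)
print(suspicious[["document_id", "cnpj", "situation"]])
```

The left join is a deliberate choice in this sketch: a reimbursement with no matching company stays in the table with missing values, and the date comparison then leaves it out of the suspicious set instead of flagging it.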

Fixtures
• Sample data

Tests
• Test first, code later
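
One way to read "sample data" plus "test first, code later" is a small fixture file and a test that pins down the expected flags before the rule is written. The sketch below assumes an inline CSV fixture and a hypothetical function named flag_irregular; neither the names nor the columns come from the slides.

```python
import io
import unittest

import pandas as pd

# Hypothetical fixture: a few hand-written rows with known expected outcomes.
FIXTURE_CSV = """document_id,situation,situation_date,issue_date
1,ABERTA,2005-11-03,2016-05-10
2,BAIXADA,2015-12-31,2016-06-01
3,BAIXADA,2016-08-01,2016-07-15
"""


def load_fixture():
    """Load the sample data, parsing both date columns."""
    return pd.read_csv(
        io.StringIO(FIXTURE_CSV),
        parse_dates=["situation_date", "issue_date"],
    )


def flag_irregular(data):
    """Flag expenses issued after the company stopped being open (assumed rule)."""
    closed = data["situation"] != "ABERTA"
    changed_before_expense = data["situation_date"] < data["issue_date"]
    return closed & changed_before_expense


class TestFlagIrregular(unittest.TestCase):
    def setUp(self):
        self.data = load_fixture()

    def test_flags_only_expenses_after_closure(self):
        flags = flag_irregular(self.data).tolist()
        self.assertEqual(flags, [False, True, False])
```

Saved as a module, this could be run with `python -m unittest`. In a test-first workflow the test class would be written before flag_irregular and would fail until the rule is implemented.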

Show me the code
• The classifier

What about Rosie?
• import Classifier
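
The transcript does not include the classifier or the code that imports it. As a rough sketch only, a rule-based classifier could expose fit and predict methods, and a runner could import such classes and add one boolean column per classifier. The class name IrregularCompanySketch, the function run_classifiers, and all column names are hypothetical.

```python
import pandas as pd


class IrregularCompanySketch:
    """Hypothetical rule-based classifier with fit/predict methods."""

    def fit(self, data):
        # Nothing to learn for a fixed rule; return self so calls can be chained.
        return self

    def predict(self, data):
        closed = data["situation"] != "ABERTA"
        changed_before_expense = data["situation_date"] < data["issue_date"]
        return (closed & changed_before_expense).to_numpy()


def run_classifiers(data, classifiers):
    """Apply each classifier and store its flags in a column named after it."""
    results = data.copy()
    for name, classifier_class in classifiers.items():
        results[name] = classifier_class().fit(data).predict(data)
    return results


# Hypothetical sample rows; the column names are assumptions.
data = pd.DataFrame({
    "situation": ["ABERTA", "BAIXADA"],
    "situation_date": pd.to_datetime(["2005-11-03", "2015-12-31"]),
    "issue_date": pd.to_datetime(["2016-05-10", "2016-06-01"]),
})
results = run_classifiers(data, {"irregular_company": IrregularCompanySketch})
print(results)
```

Keeping every classifier behind the same two methods means the runner only needs a mapping from names to classes, so adding a new classifier would not change the runner itself.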

5222

github.com/datasciencebr
@jesstemporal
apoia.se/serenata