Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Meet Rosie
Search
Jessica Temporal
March 27, 2017
Programming
0
58
Meet Rosie
Machine Intelligence against corruption
Jessica Temporal
March 27, 2017
Tweet
Share
More Decks by Jessica Temporal
See All by Jessica Temporal
Tips for Writing Great Technical Content
jtemporal
0
170
Traveling Through a Secure API with Python and Auth0
jtemporal
0
88
Rosie, Robot
jtemporal
0
420
Serenata de Amor's data science
jtemporal
1
170
Final Graduation Project
jtemporal
0
190
Other Decks in Programming
See All in Programming
Mercari AI/LLM Hackathon TeamBの発表資料
imaikosuke
0
180
Vertical Architectures for Scalable Angular Applications
manfredsteyer
PRO
0
190
レガシーな Android アプリのリアーキテクチャ戦略
oidy
1
140
デバッグの話 / Debugging for Beginners
kaityo256
PRO
8
730
20241004 モノタロウ式~ドメインモデリングとリアーキテクチャ
monotaro
PRO
2
660
.NET Aspireのクラウド対応検証: Azureと他環境での実践
ymd65536
1
660
tsconfig.jsonの最近の新機能 ファイルパス編
uhyo
7
1.9k
ポケモンで考えるコミュニケーション / Communication Lessons from Pokémon
mackey0225
5
220
学生の時に開催したPerl入学式をきっかけにエンジニアが組織に馴染むために勉強会を主催や仲間と参加して職能間の境界を越えていく
ohmori_yusuke
2
330
ML-прайсинг_на_Lamoda__вошли_и_вышли__приключение_на_20_минут__Слава_Цыганков.pdf
lamodatech
0
390
Quarto Clean Theme
nicetak
0
220
2024-10-02 dev2next - Application Observability like you've never heard before
jonatan_ivanov
0
200
Featured
See All Featured
Raft: Consensus for Rubyists
vanstee
136
6.6k
Product Roadmaps are Hard
iamctodd
PRO
48
10k
Automating Front-end Workflow
addyosmani
1365
200k
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
355
29k
Easily Structure & Communicate Ideas using Wireframe
afnizarnur
191
16k
Rails Girls Zürich Keynote
gr2m
93
13k
Agile that works and the tools we love
rasmusluckow
327
21k
How to train your dragon (web standard)
notwaldorf
87
5.6k
Thoughts on Productivity
jonyablonski
67
4.3k
Done Done
chrislema
181
16k
Code Reviewing Like a Champion
maltzj
519
39k
jQuery: Nuts, Bolts and Bling
dougneiner
61
7.5k
Transcript
1 Rosie
So far 2 • 5 implemented classifiers • ~3k suspicious
reimbursements found • 629 reports made • 216 congresspeople reported
How we get the data? 3 • Scrapping • APIs
◦ ReceitaWS ◦ Camara
I have the data, what do I do now? •
Develop a hypothesis • Test it out • Implement a classifier • Report 4
Jupyter Notebooks 5
GitHub 6
Server 7
The irregular companies classifier
import data 9
data.format() 10
data.head(5) 11
12 pd.merge
13 data.query()
Fixtures • Sample data 14
Tests • Test first, code later 15
Show me the code • The classifier 16
What about Rosie? • import Classifier 17
5222 18
github.com/datasciencebr @jesstemporal apoia.se/serenata 19