Open Source in Real Life

Ana Schwendler

September 24, 2019

Transcript

  1. OPEN SOURCE IN REAL LIFE

  2. WHAT IS SERENATA?

  3. WHAT IS SERENATA?
    ● The main goal: use artificial intelligence for social control of public
    administration
    ● We learnt how to work with data science using open data (CSVs that show
    reimbursements)
    ● Multidisciplinary team: scientists, programmers, marketers and journalists
    ● Open source: more than 700 members in the Telegram group

  4. WHY?
    ● Advantages: Bringing citizens and government closer, suggesting
    technology solutions
    ● For the developer: tool choice flexibility

  5. HOW DID WE GET HERE?
    ● We ran a crowdfunding campaign that would pay for 3 months of development
    ● Data science projects usually take 6 months to a year. What can we do in 3
    months?
    ● Techniques: hypothesis-driven development and timeboxing

  6. TECHNIQUES

  7. HDD: HYPOTHESES
    ● Hypothesis-Driven Development
    ● A survey of hypotheses that might solve a problem
    ● Multidisciplinary team as a way to expand knowledge

  8. TIMEBOXING
    ● List of hypotheses to explore
    ● Assign a time window to each hypothesis, and if it doesn't work out, switch
    to another hypothesis
    ● Return to earlier hypotheses as time goes by
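The timeboxing loop on this slide could be sketched as a small simulation. This is only an illustration: the hypothesis names, the `days_needed` estimates, and the function `run_timeboxes` are hypothetical and do not come from the talk.

```python
from collections import deque


def run_timeboxes(hypotheses, box_days, total_days):
    """Simulate timeboxed exploration of a list of hypotheses.

    hypotheses: list of (name, days_needed) pairs, where days_needed is a
    hypothetical stand-in for how long validating the idea really takes.
    Returns (finished_names, unfinished_names).
    """
    queue = deque(hypotheses)
    finished, day = [], 0
    while queue and day < total_days:
        name, remaining = queue.popleft()
        # Spend at most one time box, and never more than the time left.
        spent = min(box_days, remaining, total_days - day)
        day += spent
        remaining -= spent
        if remaining == 0:
            finished.append(name)
        else:
            # Box expired: switch to the next hypothesis, revisit this one later.
            queue.append((name, remaining))
    return finished, [name for name, _ in queue]


# Illustrative numbers only.
ideas = [("meal prices", 5), ("distance", 15), ("tax id", 8)]
finished, unfinished = run_timeboxes(ideas, box_days=10, total_days=30)
```

With these made-up numbers, the hypothesis that overruns its box is set aside and picked up again after the others have had their turn.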

  9. DEVELOPED HYPOTHESES
    ● We studied the available dataset and, based on it, defined some hypotheses
    to test:
    ○ Non-standard prices on food
    ○ Traveled distance and spending
    ○ Invalid tax identification number
    ○ Monthly maximums (taxi, fuel, ...)
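As an illustration of how one of these hypotheses might become a check, here is a minimal sketch for the monthly-maximums idea. The field names (`person`, `category`, `month`, `amount`), the cap values, and the function `months_over_cap` are all assumptions made for the example; the slides do not give the dataset schema or the real limits.

```python
from collections import defaultdict

# Hypothetical monthly caps per category; the real limits are not in the slides.
MONTHLY_CAPS = {"taxi": 1000.0, "fuel": 5000.0}


def months_over_cap(reimbursements, caps=MONTHLY_CAPS):
    """Return {(person, category, month): total} for totals above the cap."""
    totals = defaultdict(float)
    for row in reimbursements:
        key = (row["person"], row["category"], row["month"])
        totals[key] += row["amount"]
    # Keep only capped categories whose monthly total exceeds the cap.
    return {
        key: total
        for key, total in totals.items()
        if key[1] in caps and total > caps[key[1]]
    }


# Made-up rows, standing in for lines of the reimbursement CSVs.
sample = [
    {"person": "A", "category": "fuel", "month": "2019-01", "amount": 4000.0},
    {"person": "A", "category": "fuel", "month": "2019-01", "amount": 2500.0},
    {"person": "B", "category": "taxi", "month": "2019-01", "amount": 300.0},
]
flagged = months_over_cap(sample)
```

Here only person A's fuel spending for the month is flagged, because the two claims together exceed the assumed fuel cap.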

  10. DEVELOPMENT CYCLE
    ● Jupyter notebook with initial analysis
    ● Script for parsing the entire database
    ● Training an initial model
    ● Retraining after a time period
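The parse, train, and retrain steps of this cycle might look roughly like the sketch below. It is not the project's actual pipeline: the slides do not say which model was used, so a simple mean and standard deviation outlier rule stands in for it, and the `amount` column name and all function names are hypothetical.

```python
import csv
import io
import statistics


def parse_amounts(csv_text, column="amount"):
    """Read one numeric column from CSV text (column name is an assumption)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [float(row[column]) for row in reader]


def train(amounts):
    """'Train' a stand-in model: the mean and sample standard deviation."""
    return {"mean": statistics.mean(amounts), "stdev": statistics.stdev(amounts)}


def is_suspicious(model, amount, threshold=3.0):
    """Flag amounts more than `threshold` standard deviations from the mean."""
    if model["stdev"] == 0:
        return False
    return abs(amount - model["mean"]) / model["stdev"] > threshold


# Initial model from a first batch of (made-up) meal reimbursements.
first_batch = parse_amounts("amount\n20\n25\n30\n22\n28\n")
model = train(first_batch)

# After some time, retrain on the old data plus newly published rows.
later_batch = parse_amounts("amount\n24\n26\n")
retrained = train(first_batch + later_batch)
```

In a notebook, the same functions could be explored interactively first and then moved into a script once the analysis settles.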

  11. DEVELOPMENT CYCLE

  12. DEVELOPMENT CYCLE

  13. RESULTS

  14. MAIN REPO

  15. REIMBURSEMENTS DASHBOARD

  16. TWEETING ROBOT

  17. INSPIRATION FOR OTHER PROJECTS

  18. SERENATA.AI/EN
    @anaschwendler
