to Jnd out more about your political orientation. They might be useful in some cases, but there are some serious issues related to both Bias and Privacy Quizzers
new ways to interact with the Parliament • Allow citizens to simulate the Parliament in their smartphones • Encourage healthy political debate • Keep its usage anonymous Our goals
did; • Detailed information about each proposal: • Original full-text pdf; • MPs biographies; • Related news; • Share results with your friends. Results After voting 10 proposals you get to know:
pipeline framework AWS Chalice React Native MongoDB Storage About 3.000 proposals Web scrapping Refinement Text summarization Etc Endpoint: Provide 10 random proposals to vote Overview
React Navigation • Screen/Component composition • Notification Service • Updated version of React-Native • New proposal button • Extra context on proposal cards • A lot of bug fixing […a lot] Mobile APP - Version 2 The beta almost release candidate
provide a fair random method. It is not as easy as it sounds: • Number of proposals by political parties are imbalanced • We need to take into account majority and opposition in the Parliament for each proposal debate. Proposals aggregated by political party
• Download proposal’s pdf • Parse pdf extracting textual content • Summarize pdf content • Analyze text readability • Index into MongoDB collection • Why Luigi *? • Manages tasks dependences • Task execution monitor Data processing pipeline From plain python scripts to task management * https://github.com/spotify/luigi
Flask as python web framework • Data manipulation using Pandas • Load json file in memory • Use DataFrame data structure • In-memory SQL support • Support sampling select • Deployment on Heroku • Pros • Easy to deploy using GitHub integration • Cons • Free tier wasn’t reliable for production • Too expensive to scale
as python web framework • Data manipulation using Pandas • MongoDB as database • Code deployment on Heroku • Pros • It is a database :) • Better query support • Cons • Bottleneck. Heroku’s free tier • Too expensive to scale for our pockets
Pros • Minimal infrastructure management; • Load balance and fault tolerance by default; • Cheaper to scale if necessary; • Easier to deploy; • Support different stages (e.g. dev, test, prod, …) • Support api versioning • Better monitoring and metrics • Advanced log analytics • Alerts (e.g. define latency threshold) • Cons • Bottleneck is now MongoDB (up to 200 simultaneous connections) Backend API - Version 3 Serverless architecture AWS Lambda
can be expensive. • Back to pandas ? • Idea • Pipeline • Save results at Amazon S3 bucket • Backend • Load file from S3 bucket using pandas Open question Remove mongodb and go back to pandas? AWS Lambda
into an Apache Solr index • Free text search • The problem • Recall versus precision • Hard to find precise results • Solution • Provide links for the main news outlets Backend API - one last thing Related news