Building a Data-Driven Web App that Everyone Can Use

galuh.me | @galuhsahid

What is a data-driven web application?

Let's start with a problem

$ git clone
$ jupyter notebook
$ python mycoolmodel.py

Web applications are super cool

Myth: It takes a lot of people to build one

Myth: If you want to build one yourself, there are so many things that you have to learn

Fact: You can build one yourself... and it doesn't have to take forever

About our web app
• Allows users to input their own data
• Displays data from an outside source (e.g. a third-party API)
• Displays the prediction result of a model we've trained previously
• Displays a graph that is dynamic, based on the user input

Works as a prototype, demo, or simple minimum viable product (MVP). Making a scalable web app is a whole different story!

From something like this: To something like this:

Flask

app.py

    from flask import Flask

    app = Flask(__name__)

    @app.route('/')
    def index():
        return "Hello, world!"

Running it:

    $ export FLASK_APP=app.py
    $ flask run
     * Running on http://127.0.0.1:5000/ (Press CTRL+C to quit)
     * Restarting with stat

app.py (adding a second route)

    ...
        return "Hello, world!"

    @app.route('/result')
    def get_result():
        result = 100000
        return str(result)

app.py

    from flask import Flask, render_template

    app = Flask(__name__)

    @app.route('/')
    def index():
        return render_template("index.html")

    @app.route('/result')
    def get_result():
        result = 100000
        return str(result)

templates/index.html

    <h1>Hello, world!</h1>

app.py

    from flask import Flask, render_template

    app = Flask(__name__)

    @app.route('/')
    def index():
        return render_template("index.html")

    @app.route('/result')
    def get_result():
        result = 100000
        # Pass the value to the template as a variable named "result"
        return render_template("result.html", result=result)

templates/result.html

    <h1>{{ result }}</h1>

Getting user input

templates/index.html

    <h1>Campaign name:</h1>
    <form action="/result" method="GET">
      <input name="campaign" type="text" required>
    </form>

Resulting URL: 127.0.0.1:5000/result?campaign=some-campaign

Getting user input

app.py

    from flask import Flask, render_template, request
    ...
    @app.route('/result')
    def get_result():
        # Read the "campaign" value from the query string (None if absent)
        campaign = request.args.get('campaign', None)
        return render_template("result.html", campaign=campaign)

templates/result.html

    <h1>{{ campaign }}</h1>

The part after the "?" is the query string: 127.0.0.1:5000/result?campaign=some-campaign
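To make the query string less magical, here is a minimal sketch using only the standard library. It is not how Flask is implemented; get_query_arg is a hypothetical helper that only mimics what request.args.get returns for the URL shown on the slide.

```python
from urllib.parse import urlsplit, parse_qs


def get_query_arg(url, key, default=None):
    """Hypothetical stand-in for request.args.get: first value of `key`, else `default`."""
    query = urlsplit(url).query          # e.g. "campaign=some-campaign"
    values = parse_qs(query).get(key)    # parse_qs maps each key to a list of values
    return values[0] if values else default


url = "http://127.0.0.1:5000/result?campaign=some-campaign"
print(get_query_arg(url, "campaign"))
print(get_query_arg(url, "missing"))
```

The second call falls back to the default, which is the same behaviour the None argument asks for in the Flask route.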

Getting some data

Getting some data example.com/api/v1/campaigns?name=some-campaign

Getting some data

app.py

    import json
    import urllib.parse
    import urllib.request
    ...
    @app.route('/result')
    def get_result():
        campaign = request.args.get('campaign', '')
        base_url = 'http://example.com/api/v1/campaigns?id='
        # Escape the user input before appending it to the query string
        url = base_url + urllib.parse.quote(campaign)
        # In Python 3, urlopen lives in urllib.request
        response = urllib.request.urlopen(url)
        data = json.loads(response.read())
        return render_template("result.html", data=data)

Getting some data

templates/result.html

    {{ data }}
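The two steps in that route (build the URL, decode the JSON body) can be tried without any network access. This is a sketch under assumptions: the host is the example.com placeholder from the slides, the helper names are hypothetical, and sample_body is an invented payload that only borrows field names used later in the deck.

```python
import json
from urllib.parse import urlencode

BASE_URL = "http://example.com/api/v1/campaigns"  # placeholder host from the slides


def build_campaign_url(campaign):
    """Return the API URL for one campaign, escaping the user-supplied value."""
    return "{}?{}".format(BASE_URL, urlencode({"id": campaign}))


def parse_campaign(body):
    """Decode a JSON response body (bytes or str) into a dict."""
    if isinstance(body, bytes):
        body = body.decode("utf-8")
    return json.loads(body)


# Invented payload for illustration only; a real API decides its own fields.
sample_body = b'{"id": "some-campaign", "target_amount": 1000000}'

print(build_campaign_url("some-campaign"))
print(build_campaign_url("a&b"))  # the ampersand is escaped, not treated as a separator
print(parse_campaign(sample_body)["target_amount"])
```

Using urlencode rather than string concatenation keeps characters such as "&" or spaces in the user input from changing the meaning of the URL.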

What to do with our model?

What to do with our model?
• We only want to use one model with the best accuracy/smallest error
• We don't want to retrain our model every time a new request hits our web application

What to do with our model? Make it persistent!
• pickle: built-in Python module to serialize and de-serialize a Python object structure
• joblib: a library that provides utilities for pipelining Python jobs
• Or we can use each library's specific method (libsvm: svm_save_model and svm_load_model, TensorFlow: tf.train.Saver() class)
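As a minimal sketch of the pickle option, the round trip below saves an object once and loads it back later. The dict is only a hypothetical stand-in for a trained model, and save_model and load_model are illustrative helper names, not part of any library.

```python
import os
import pickle
import tempfile

# Hypothetical stand-in for a trained model: parameters of y = slope * x + intercept.
model = {"slope": 2.0, "intercept": 1.0}


def save_model(obj, path):
    """Serialize the object to disk once, after training."""
    with open(path, "wb") as f:
        pickle.dump(obj, f)


def load_model(path):
    """Load the object back. Only unpickle files you created yourself."""
    with open(path, "rb") as f:
        return pickle.load(f)


path = os.path.join(tempfile.mkdtemp(), "model.sav")
save_model(model, path)
restored = load_model(path)
print(restored == model)
```

The web application would only ever call the load step, so a request never triggers training.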

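Making our model persistent

Your original Python script/Jupyter Notebook

    from sklearn.ensemble import RandomForestRegressor
    # sklearn.externals.joblib was removed from newer scikit-learn releases;
    # import the standalone joblib package instead
    import joblib

    rf = RandomForestRegressor(n_estimators=300)
    rf.fit(X_train, y_train)
    # Save the trained model to disk
    joblib.dump(rf, 'model.sav')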

Making our model persistent

model.py

    import json

    import joblib  # standalone package; replaces sklearn.externals.joblib
    import pandas as pd

    def get_prediction(data):
        df_data = pd.DataFrame([data]).astype(float)
        # Load the saved model instead of retraining it
        model = joblib.load("model.sav")
        predicted_amount = model.predict(df_data)[0]
        target_amount = data["target_amount"]
        # Funded when the predicted amount reaches the target
        is_funded = bool(target_amount <= predicted_amount)
        prediction = {"amount": float(predicted_amount), "is_funded": is_funded}
        return json.dumps(prediction)
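The decision rule inside get_prediction can be checked on its own, without a saved model. A small sketch, assuming the rule is exactly the one in model.py (not funded only when the target exceeds the prediction); summarize_prediction is a hypothetical name and the amounts are made-up inputs.

```python
import json


def summarize_prediction(predicted_amount, target_amount):
    """Apply the rule from model.py: funded when the prediction reaches the target."""
    is_funded = bool(predicted_amount >= target_amount)
    # float() and bool() keep the values JSON-serializable even if they are NumPy scalars
    return json.dumps({"amount": float(predicted_amount), "is_funded": is_funded})


print(summarize_prediction(1200000.0, 1000000))
print(summarize_prediction(800000.0, 1000000))
```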

Making our model persistent

app.py

    from model import get_prediction
    ...
    @app.route('/result')
    def get_result():
        ...
        data = json.loads(response.read())
        prediction = json.loads(get_prediction(data))
        return render_template("result.html", data=data, prediction=prediction)

Making our model persistent

templates/result.html

    {{ data }}
    {{ prediction }}

Some pitfalls...

Some pitfalls
• Make sure you're loading trusted data
• Saving a model using a particular version of a library and loading it using another version might give unexpected results

So what to do? Keep your:
• Training data
• Source code that generates the model
• Version of the library used
• Dependencies used
• Cross validation score obtained
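One way to keep some of these items together is a small JSON record stored next to the saved model. This is only a sketch: describe_model_file and matches_record are hypothetical helpers, the version string and score are placeholders to fill in yourself, and a checksum only tells you the file is unchanged if the record itself is kept somewhere you trust.

```python
import hashlib
import json
import platform


def describe_model_file(model_bytes, library_versions, cv_score):
    """Build a JSON-serializable record to store next to a saved model."""
    return {
        "sha256": hashlib.sha256(model_bytes).hexdigest(),
        "python_version": platform.python_version(),
        "library_versions": library_versions,
        "cv_score": cv_score,
    }


def matches_record(model_bytes, record):
    """True only if the bytes are exactly the ones that were recorded."""
    return hashlib.sha256(model_bytes).hexdigest() == record["sha256"]


# Placeholder bytes and values; in practice read model.sav and fill in real versions.
model_bytes = b"pretend these are the bytes of model.sav"
record = describe_model_file(model_bytes, {"scikit-learn": "x.y.z"}, cv_score=None)

print(json.dumps(record, indent=2))
print(matches_record(model_bytes, record))
print(matches_record(model_bytes + b"tampered", record))
```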

Data visualization

Data visualization

Your original Python script/Jupyter Notebook

    import matplotlib.pyplot as plt

    y = [1, 2, 3, 4, 5]
    x = [0, 2, 1, 3, 4]
    plt.plot(x, y)

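Data visualization

graph.py

    import base64
    import io
    import json

    import matplotlib
    matplotlib.use('Agg')  # render without a display, as on a server
    import matplotlib.pyplot as plt

    def get_plot_url(fb_shares):
        y = [1, 2, 3, 4, fb_shares]
        x = [0, 2, 1, 3, 4]
        plt.figure()  # start a fresh figure for every request
        plt.plot(x, y)
        img = io.BytesIO()  # Python 3 replacement for StringIO.StringIO
        plt.savefig(img, format='png')
        plt.close()
        # b64encode returns bytes; decode so json.dumps can serialize it
        plot_url = base64.b64encode(img.getvalue()).decode('ascii')
        return json.dumps({'plot_url': plot_url})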

Data visualization

templates/result.html

    {{ data }}
    {{ prediction }}
    {{ graph }}

Putting everything together

Putting everything together

templates/result.html

    <head>
      <title>Campaign Success Estimator</title>
    </head>
    <body>
      <h1>{{ data["id"] }}</h1>
      <h2>Statistics</h2>
      <ul>
        <li><strong>Story word count:</strong> {{ data["story_word_count"] }}</li>
        <li><strong>Number of images:</strong> {{ data["number_of_images"] }}</li>
        <li><strong>Number of videos:</strong> {{ data["number_of_videos"] }}</li>
        <li><strong>Number of Facebook shares:</strong> {{ data["number_of_fb_shares"] }}</li>
        <li><strong>Target amount:</strong> {{ data["target_amount"] }}</li>
      </ul>
      <h2>Prediction</h2>
      <strong>Predicted amount:</strong> {{ prediction["amount"] }}
    </body>

Putting everything together

templates/result.html

    <head>
      <title>Campaign Success Estimator</title>
    </head>
    <body>
      ...
      <img src="data:image/png;base64,{{ graph['plot_url'] }}" />
    </body>
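The img tag works because a base64 string prefixed with data:image/png;base64, is a complete image source by itself, so no image file has to be served. A minimal sketch of that encoding step; to_data_uri is a hypothetical helper, and the 8-byte PNG signature stands in for a real plot.

```python
import base64

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"  # the first 8 bytes of every PNG file


def to_data_uri(png_bytes):
    """Encode raw PNG bytes as a data: URI usable in <img src="...">."""
    encoded = base64.b64encode(png_bytes).decode("ascii")
    return "data:image/png;base64," + encoded


uri = to_data_uri(PNG_SIGNATURE)
print(uri)
```

In the app, the bytes would come from the in-memory buffer that plt.savefig wrote to, and the template supplies the data:image/png;base64, prefix itself.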

Flask's templating engine: Jinja2

    <title>{% block title %}{% endblock %}</title>
    <ul>
      {% for user in users %}
        <li><a href="{{ user.url }}">{{ user.username }}</a></li>
      {% endfor %}
    </ul>

Jinja2 examples jinja.pocoo.org/docs/2.10/

Conditionals

templates/result.html

    <head>
      <title>Campaign Success Estimator</title>
      <style>
        html { font-family: "Arial" }
        .prediction { margin: 10px; }
        .prediction .funded { color: #2ecc71; }
        .prediction .not-funded { color: #e74c3c; }
      </style>
    </head>
    ...

Conditionals

templates/result.html

    ...
    <div class="prediction">
      <strong>Predicted amount: </strong>
      <span class="prediction {{'funded' if prediction['is_funded'] else 'not-funded'}}">
        {{ prediction["amount"] }}
      </span>
    </div>
    ...

Rendered as <span class="prediction funded"> if is_funded is True
Rendered as <span class="prediction not-funded"> if is_funded is False

Conditionals

Custom filters

app.py

    ...
    @app.template_filter('format_currency')
    def format_currency(value):
        value = int(value)
        return "Rp{:,}".format(value)
    ...

Custom filters

templates/result.html

    ...
      <li><strong>Target amount:</strong> {{ data["target_amount"]|format_currency }}</li>
    </ul>
    <h2>Prediction</h2>
    <div class="prediction">
      <strong>Predicted amount: </strong>
      <span class="prediction {{'funded' if prediction['is_funded'] else 'not-funded'}}">
        {{ prediction["amount"]|format_currency }}
      </span>
    </div>
    ...
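A template filter is just a Python function, so its formatting can be tried outside Flask. The sketch below repeats the body of format_currency without the decorator; the input values are made up.

```python
def format_currency(value):
    """Same body as the filter in app.py, without the Flask decorator."""
    value = int(value)               # accepts ints, floats, and numeric strings
    return "Rp{:,}".format(value)    # "," inserts thousands separators


print(format_currency(1500000))
print(format_currency("2500"))
print(format_currency(1234.9))
```

Note that int() truncates rather than rounds, so a prediction of 1234.9 is displayed as Rp1,234. If period separators are preferred for rupiah amounts, one option is to append .replace(",", ".") to the formatted string.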

Custom filters

What's next?
• Deploy it and share it with the world:
    • Heroku
    • Google App Engine
    • And many other options
• Add some more functionality:
    • Flask Admin
    • Flask Login
    • ... and so on

What's next?
• Make it more interactive:
    • react-flask
    • react-redux-flask
    • flask-vuejs
    • flask + d3.js
• Make it more scalable

Examples

Campaign Success Predictor (Flask)
Paper: https://ieeexplore.ieee.org/document/8355046/
Source code: https://github.com/galuhsahid/campaign-success-predictor

Indonesian Word Embedding (Flask + Vue.js)
Demo: http://indonesian-word-embedding.herokuapp.com
Source code: https://github.com/galuhsahid/indonesian-word-embedding

Resources
• Flask documentation
• Jinja2 documentation

That's it. Thanks!
