Trends in Real-world Recommender Systems

Trends in Real-world Recommender Systems
Your “fancy” algorithm doesn’t scale in production
Takuya Kitazawa (@takuti)

$ whoami
‣ Treasure Data, Inc.: Data Science Engineer
‣ Apache Hivemall Committer
* All content reflects the speaker's own thoughts and does NOT reflect the views of any of his previous or current affiliations.

takuti.me

Trend: Beyond rating, Realistic scenario
Me: Persistent cold-start, Online algorithm
Future: New application, Production scale

Messages
‣ Recommendation ≠ Machine Learning
‣ Keep Things Simple, Be Data-Driven
‣ Get Outside of Your Lab

Bachelor’s thesis (2014): User Modeling in Folksonomies (users, web pages + tags) [Batch]
Master’s thesis (2016): Persistently Cold-Starting Online Item Recommendation [Online]
Internship
Trend?

ACM RecSys Conference 2014-2017
[Figure: word clouds for 2014, 2015, 2016, and 2017] https://takuti.me/note/recsys-wordcloud/
Beyond collaborative filtering on rating

“Netflix never implemented that solution itself”
‣ Change from US DVDs to global streaming
‣ Did not scale with the dynamic growth of users and items
‣ Use a more blended technique
https://digit.hbs.org/submission/the-netflix-prize-crowdsourcing-to-improve-dvd-recommendations/
https://www.techdirt.com/blog/innovation/articles/20120409/03412518422/why-netflix-never-implemented-algorithm-that-won-netflix-1-million-challenge.shtml

https://www.slideshare.net/optimaltransformation/a-collection-of-quotes-from-albert-einstein

“Practices”
System requirements: Scalability, Batch vs. streaming
Wide-ranging applications and data: Social networks, Product review (EC), Group recommendation

Recommendation is predicting users’ unforeseen behavior from data:
‣ Users’ history
‣ Item attributes
‣ Context
‣ …

Recommendation is predicting users’ unforeseen behavior from data. But,

Recommendation ≠ Machine Learning

Practice: Golf package recommendation at Rakuten
Package = Course + Price + Options (e.g., caddy, lunch)
‣ ML as a tool
‣ Interpretable
‣ Simple
R. Swezey and Y. Chung. Recommending Short-Lived Dynamic Packages for Golf Booking Services. CIKM 2015.

Theory: My new recommender

Factorization Machines
S. Rendle. Factorization Machines with libFM. ACM Transactions on Intelligent Systems and Technology, 3(3), 2012.
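As a reminder of what the model computes, here is a minimal sketch of a second-order Factorization Machine prediction in plain Python. It follows the standard formulation (global bias, linear weights, and pairwise interactions factorized through a matrix V), evaluated with the usual linear-time rewrite of the pairwise term. The function name `fm_predict` and the toy parameters are assumptions made for illustration, not code from the talk or from libFM.

```python
from typing import Sequence


def fm_predict(x: Sequence[float], w0: float, w: Sequence[float],
               V: Sequence[Sequence[float]]) -> float:
    """Second-order Factorization Machine prediction for one feature vector.

    x  : n feature values
    w0 : global bias
    w  : n linear weights
    V  : n x k factor matrix; <V[i], V[j]> weights the (i, j) interaction
    """
    n, k = len(x), len(V[0])
    linear = sum(w[i] * x[i] for i in range(n))

    # Pairwise term in O(k * n) using the identity
    #   sum_{i<j} <v_i, v_j> x_i x_j
    #     = 0.5 * sum_f [ (sum_i v_if x_i)^2 - sum_i (v_if x_i)^2 ]
    pairwise = 0.0
    for f in range(k):
        s = sum(V[i][f] * x[i] for i in range(n))
        s_sq = sum((V[i][f] * x[i]) ** 2 for i in range(n))
        pairwise += 0.5 * (s * s - s_sq)

    return w0 + linear + pairwise


# Toy example (made-up numbers): features 0 and 2 are active.
x = [1.0, 0.0, 1.0]
w0 = 0.1
w = [0.2, 0.3, 0.4]
V = [[1.0, 2.0], [0.5, 0.5], [3.0, -1.0]]
prediction = fm_predict(x, w0, w, V)
```

Every extra feature column adds a row to V and more hyper-parameter sensitivity, which is one way the "many hyper-params" problem shows up in practice.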

Practice: My new “fancy” recommender on real data
‣ Poor accuracy
‣ Many hyper-params
‣ Inefficient
‣ Worse than Matrix Factorization
Don’t squeeze everything into a single method

Keep Things Simple, Be Data-Driven

# of data = # of solutions
“Whew! My new algorithm beats well-known methods!”
[Figure: accuracy (low to high) of always recommending “most popular” items vs. ML-ish techniques]

Simplest: Non-personalized recommendation
‣ Most Popular
‣ Average rating
‣ Random
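The three baselines above fit in a few lines, which is part of their appeal. The sketch below assumes the data is a list of `(user, item, rating)` tuples. The function names, the `min_count` filter, and the sample events are illustrative assumptions.

```python
import random
from collections import Counter, defaultdict


def most_popular(events, n):
    """Top-n items by number of interactions."""
    counts = Counter(item for _, item, _ in events)
    return [item for item, _ in counts.most_common(n)]


def top_average_rating(events, n, min_count=1):
    """Top-n items by mean rating, ignoring items with too few ratings."""
    sums, counts = defaultdict(float), Counter()
    for _, item, rating in events:
        sums[item] += rating
        counts[item] += 1
    means = {i: sums[i] / counts[i] for i in counts if counts[i] >= min_count}
    # Sort by descending mean; break ties by item ID for determinism.
    return sorted(means, key=lambda i: (-means[i], i))[:n]


def random_items(events, n, seed=None):
    """n distinct items drawn uniformly at random."""
    items = sorted({item for _, item, _ in events})
    return random.Random(seed).sample(items, min(n, len(items)))


# Made-up sample data.
events = [
    ("u1", "A", 5), ("u2", "A", 3), ("u3", "A", 4),
    ("u1", "B", 5), ("u2", "B", 5),
    ("u3", "C", 2),
]
popular = most_popular(events, 2)
best_rated = top_average_rating(events, 2)
```

Running any new algorithm against these three first gives a cheap sanity check on whether the extra complexity pays for itself.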

Do the “minimum” math https://takuti.me/note/the-amazon-way-on-iot/

Q. Which technique should I use?
A. It depends on your data and application

Persistent cold-start problem at Rakuten Institute of Technology

Golf package recommendation at Rakuten
Package = Course + Price + Options (caddy, lunch, …)
Q. What happens under dynamic trends (e.g., changing prices and/or users’ tastes)?

Persistent cold-start on ad data (Yahoo! Lab; 2013)

Persistent cold-start on real web service (Booking.com; 2015)

Problem: Persistent cold-start
Effective approach: Online update, Rich auxiliary data
‣ Incremental Factorization Machines (RecProfile 2016)
‣ Persistently Cold-Starting Online Item Recommendation (Master’s thesis)
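To make "online update" concrete, here is a minimal sketch of an incrementally updated model. It is deliberately NOT Incremental Factorization Machines. It swaps in a plain matrix factorization with one SGD step per observed event, and it ignores auxiliary data entirely. The class name `OnlineMF` and all hyper-parameter defaults are assumptions chosen for illustration.

```python
import random


class OnlineMF:
    """Matrix factorization updated one (user, item) event at a time."""

    def __init__(self, k=8, lr=0.05, reg=0.01, seed=0):
        self.k, self.lr, self.reg = k, lr, reg
        self.rng = random.Random(seed)
        self.P = {}  # user -> latent factor vector
        self.Q = {}  # item -> latent factor vector

    def _init_vec(self):
        return [self.rng.gauss(0.0, 0.1) for _ in range(self.k)]

    def score(self, user, item):
        """Predicted preference; 0.0 for a user or item never seen before."""
        p, q = self.P.get(user), self.Q.get(item)
        if p is None or q is None:
            return 0.0
        return sum(a * b for a, b in zip(p, q))

    def update(self, user, item, target=1.0):
        """One SGD step on a single event; unseen users/items are added on the fly."""
        if user not in self.P:
            self.P[user] = self._init_vec()
        if item not in self.Q:
            self.Q[item] = self._init_vec()
        p, q = self.P[user], self.Q[item]
        err = target - sum(a * b for a, b in zip(p, q))
        for f in range(self.k):
            pf, qf = p[f], q[f]  # use the old values for both updates
            p[f] += self.lr * (err * qf - self.reg * pf)
            q[f] += self.lr * (err * pf - self.reg * qf)
        return err


# Stream of made-up positive-only events, processed in arrival order.
model = OnlineMF()
for user, item in [("u1", "i1"), ("u2", "i1"), ("u1", "i2")]:
    model.update(user, item)
```

New users and items enter the model the moment they first appear, so there is no retraining window during which they are invisible. That is the property a persistently cold-starting service needs.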

Production-level algorithm should be “usable” at Treasure Data

‣ Implement anomaly detection algorithms
‣ Test on real system metrics
https://takuti.me/note/td-intern-2016/

Outlier and change-point in time-series data
Time-series data (e.g., syslog):

time        value
…           …
1508966854  290
1508966853  294
1508966852  38
1508966852  290
1508966851  294
1508958753  301
1508955307  38
1508954422  38
1508948503  38
…           …

‣ Outlier: Spiky “local” data point
‣ Change-Point: Wide-scale “global” change
STEP 1: Find patterns from past observations
STEP 2: Compute score at each point in time (“How far from past pattern”)
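The two steps can be illustrated with about the simplest possible scorer: summarize the recent past by its mean and standard deviation (STEP 1), then score each new point by how many standard deviations it lies from that mean (STEP 2). This rolling z-score is only a stand-in for illustration and is not one of the algorithms discussed in the talk. The function name, window size, `min_std` guard, and sample values are all assumptions.

```python
import math
from collections import deque


def outlier_scores(values, window=5, min_std=1e-9):
    """Score each point by its distance from the mean of the previous
    `window` points, measured in standard deviations of that window."""
    past = deque(maxlen=window)
    scores = []
    for v in values:
        if len(past) < window:
            scores.append(0.0)  # not enough history yet
        else:
            mean = sum(past) / window
            std = math.sqrt(sum((p - mean) ** 2 for p in past) / window)
            # min_std avoids division by zero on a perfectly flat history
            scores.append(abs(v - mean) / max(std, min_std))
        past.append(v)
    return scores


# Made-up series with one spike at index 6.
series = [10, 11, 9, 10, 11, 10, 50, 10, 9, 11]
scores = outlier_scores(series)
spike_index = max(range(len(scores)), key=scores.__getitem__)
```

A scorer like this reacts to spiky "local" points but adapts quickly to a sustained shift, which is why change-point detection needs a different notion of "pattern".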

ChangeFinder
‣ Probabilistic approach
‣ Many hyper-parameters and sensitive result
Singular Spectrum Transformation
‣ Mathematically tractable, numerical algebraic approach
‣ Minimum # of hyper-params with robust result
‣ Efficient approximation scheme
Easy-to-use, Interpretable
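For readers unfamiliar with Singular Spectrum Transformation, the following is a minimal sketch of the basic idea using NumPy. It builds a trajectory (Hankel) matrix from recent windows, extracts its dominant subspace with an SVD, and scores a point by how far the dominant direction of a slightly later matrix falls outside that subspace. It uses a full SVD rather than any efficient approximation scheme, and the function names, parameter defaults, and test signal are assumptions for illustration, not the implementation referred to in the talk.

```python
import numpy as np


def trajectory_matrix(x, start, w, n):
    """Stack n consecutive length-w windows, beginning at `start`, as columns."""
    return np.column_stack([x[start + j:start + j + w] for j in range(n)])


def sst_scores(x, w=8, n=8, lag=4, r=2):
    """Change-point score in [0, 1] for each time index (0 where undefined).

    w   : window length
    n   : number of windows per trajectory matrix
    lag : how far the test matrix is shifted ahead of the past matrix
    r   : number of left singular vectors kept as the "past pattern"
    """
    x = np.asarray(x, dtype=float)
    span = w + n - 1  # number of samples covered by one trajectory matrix
    scores = np.zeros(len(x))
    for t in range(span, len(x) - lag + 1):
        past = trajectory_matrix(x, t - span, w, n)        # ends just before t
        test = trajectory_matrix(x, t - span + lag, w, n)  # shifted `lag` ahead
        U = np.linalg.svd(past, full_matrices=False)[0][:, :r]
        beta = np.linalg.svd(test, full_matrices=False)[0][:, 0]
        # 1 - squared length of beta's projection onto the past subspace
        scores[t] = 1.0 - float(np.sum((U.T @ beta) ** 2))
    return np.clip(scores, 0.0, 1.0)


# Made-up signal: a sine wave whose frequency changes at t = 100.
t = np.arange(200)
signal = np.where(t < 100, np.sin(0.3 * t), np.sin(0.9 * t))
scores = sst_scores(signal)
```

The only knobs are the window sizes and the subspace rank, which is what makes this family of methods comparatively easy to tune.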

Similarities between anomaly detection and recommendation
‣ Feature-expressiveness: Rich vector representation
‣ Online-updating: Finding similar/dissimilar samples in real-time
‣ Usability: Simple hyper-params, interpretable result
‣ Scalability: Production-level efficient back-end system
‣ Implicit feedback: Binary feedback (buy or not, anomaly or not)

Apply “usable” anomaly detection method for recommendation

RecSys 2016 tutorial by Quora
Implicit >>> Explicit
https://www.slideshare.net/xamat/recsys-2016-tutorial-lessons-learned-from-building-reallife-recommender-systems

Don’t be algorithm-driven at Silver Egg Technology

Started from algorithm
Real e-commerce data
‣ 1M+ purchase log
‣ Attributes
  - Customer’s session ID
  - Item ID
  - Timestamp
Lack of features

Understanding data: Small amount of daily purchases
[Figure: customers × items purchase matrix, 0.0086% nonzero]
Need to take advantage of sparsity in terms of both algorithm and implementation
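On the implementation side, the first step is simply never materializing the empty cells. The sketch below stores only the observed (customer, item) pairs and measures the nonzero ratio from that structure. The function names and the sample log are assumptions for illustration. A real system would more likely use a dedicated sparse-matrix library.

```python
from collections import defaultdict


def build_sparse(purchases):
    """Map each customer to the set of items they bought.
    Only nonzero cells of the customers x items matrix are stored."""
    matrix = defaultdict(set)
    for customer, item in purchases:
        matrix[customer].add(item)
    return matrix


def nonzero_ratio(matrix):
    """Fraction of customers x items cells that contain a purchase."""
    items = set()
    for bought in matrix.values():
        items |= bought
    n_cells = len(matrix) * len(items)
    nnz = sum(len(bought) for bought in matrix.values())
    return nnz / n_cells if n_cells else 0.0


# Made-up purchase log of (customer, item) pairs; repeats are collapsed.
log = [("c1", "i1"), ("c1", "i2"), ("c2", "i1"), ("c3", "i3"), ("c1", "i1")]
ratio = nonzero_ratio(build_sparse(log))  # 4 nonzero cells out of 3 x 3
```

Memory then grows with the number of purchases rather than with customers times items, which matters when the nonzero ratio is a small fraction of a percent.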

Understanding data: Rapidly increasing # of customers and items
[Figure axes: Customers, Items]
High dimensionality

Understanding data: Small % of customers/items contribute many purchases
[Figure axes: Customers, Items]
Massive “useless” customers and items

Understanding data: Timestamp represents seasonality

Assumption: My algorithm might NOT be effective on this data…

Anyway, let me try as much as I can…
‣ Dimensionality reduction by hashing
‣ Store item candidates with time window
  - Only use most-recently observed 100 items for recommendation
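Both workarounds are small in code. The sketch below shows one plausible reading of them: a hashing trick that maps arbitrary IDs into a fixed number of dimensions, and a bounded store that keeps only the most recently observed items as recommendation candidates. The names `hashed_index` and `RecentItems`, the bucket count, and the choice of CRC32 are assumptions for illustration. The talk does not specify how either was implemented.

```python
import zlib
from collections import OrderedDict


def hashed_index(feature: str, n_buckets: int = 2 ** 10) -> int:
    """Map an arbitrary ID to one of n_buckets dimensions (hashing trick).
    CRC32 is used because, unlike built-in hash(), it is stable across runs."""
    return zlib.crc32(feature.encode("utf-8")) % n_buckets


class RecentItems:
    """Keep only the `capacity` most recently observed distinct items.
    Assumes events are observed in time order."""

    def __init__(self, capacity=100):
        self.capacity = capacity
        self._items = OrderedDict()  # item -> last-seen timestamp

    def observe(self, item, timestamp):
        self._items.pop(item, None)   # re-inserting moves the item to the newest end
        self._items[item] = timestamp
        while len(self._items) > self.capacity:
            self._items.popitem(last=False)  # evict the least recently observed

    def candidates(self):
        """Items eligible for recommendation, newest first."""
        return list(reversed(self._items))


# Made-up stream with a tiny capacity so that eviction is visible.
store = RecentItems(capacity=3)
for ts, item in enumerate(["a", "b", "c", "a", "d"]):
    store.observe(item, ts)
```

Hash collisions merge unrelated IDs into one dimension, so the bucket count trades memory against accuracy. The window size makes a similar trade between freshness and coverage.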

Lessons
‣ Start from data
‣ Understanding data leads to an appropriate algorithm
‣ Think of a hybrid approach

Messages
‣ Recommendation ≠ Machine Learning
‣ Keep Things Simple, Be Data-Driven

Future: Scaling recommender in production
Personalization is everywhere in various ways; as Netflix said, “Everything is recommendation”
https://www.slideshare.net/justinbasilico/past-present-future-of-recommender-systems-an-industry-perspective

Listen to the podcast episode with Dr. Joseph Konstan
‣ “I hate Amazon’s first page”
‣ Recommendation for education
‣ Context-aware recommender
‣ Cross validation is NOT realistic
‣ Serendipity ≠ Just “BAD”
  - = like & didn’t know
‣ …

First step: Online course
https://takuti.me/note/coursera-recommender-systems/

First step: Pre-programmed (mostly static) algorithms and metrics
‣ Surprise (Python) http://surpriselib.com/
‣ fastFM (Python) http://ibayer.github.io/fastFM/
‣ Implicit (Python) http://implicit.readthedocs.io/en/latest/
‣ MyMediaLite (C#) http://www.mymedialite.net/
‣ LibRec (Java) https://www.librec.net/
‣ LensKit (Java) http://lenskit.org/
On Apache Hadoop, Hive, Spark:
‣ Apache Mahout http://mahout.apache.org/
‣ Apache Hivemall https://hivemall.incubator.apache.org/
‣ Spark MLlib https://spark.apache.org/mllib/

And, FluRS :)
