Machine Learning for HCI @ NTU CSIE, 2013/7/21

Machine Learning for HCI Johnson @ NTU Mobile HCI Lab

ML in Projects
iRotateGrasp by Xman 龍哥
SenSleep by Jimmyken
RingTune (final project for the course Data Mining)

iRotateGrasp

SenSleep

SenSleep Basic Flow

Ringtune

Ringtune [Diagram labels: incoming-call ringer volume adjustment, shake, sound, proximity sensing, light sensing, SQLite, Wi-Fi, training data]

What’s Common in Those Projects? They collect input data.
iRotateGrasp: 44 capacitive sensor values.
SenSleep: mobile & PC activities.
Ringtune: ambient sound, acceleration, light.

What’s Common in Those Projects? (2) The data is used
to determine an output. iRotateGrasp: screen orientation. SenSleep: if the user is sleeping in a time slot. Ringtune: the desired ringer volume (0~7)

The Core of Decision Making: Classifier
Given: <input, output> pairs.
Goal: given any input, predict the output.

Training & Testing a Classifier
Training: learns from data: <input, output>, <input, output>, <input, output>, ......
Testing: ask for the output: <input, ?>
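The train / test split above can be sketched with a very small classifier. This is only an illustration, assuming a nearest-centroid rule (not the method used in any of the projects); the function names `train` and `predict` and the sample data are hypothetical.

```python
from collections import defaultdict

def train(pairs):
    """Training: learn one centroid (mean input vector) per output label."""
    sums = {}
    counts = defaultdict(int)
    for x, y in pairs:
        if y not in sums:
            sums[y] = [0.0] * len(x)
        sums[y] = [s + v for s, v in zip(sums[y], x)]
        counts[y] += 1
    return {y: [s / counts[y] for s in sums[y]] for y in sums}

def predict(model, x):
    """Testing: return the label whose centroid is closest to x."""
    def squared_distance(centroid):
        return sum((a - b) ** 2 for a, b in zip(centroid, x))
    return min(model, key=lambda y: squared_distance(model[y]))

# Hypothetical <input, output> pairs (made up for illustration).
training_pairs = [
    ([0.0, 0.1], "portrait"),
    ([0.2, 0.0], "portrait"),
    ([1.0, 0.9], "landscape"),
    ([0.8, 1.0], "landscape"),
]
model = train(training_pairs)
prediction = predict(model, [0.9, 0.8])  # the <input, ?> query
```

Any real classifier (SVM, Naive Bayes, KNN) fits the same two-step shape: learn a model from labeled pairs, then ask it for the output of an unseen input.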

How & What to Train?

How & What to Train? <input, output>
What data should we collect? How many samples?
Where does the "correct answer" (ground truth) come from?

iRotateGrasp's Approach

iRotateGrasp Prototype

44 sensors

44 sensors: < s1, s2, ......, s44 > & output
44-value input + output, fed to a LIBSVM classifier
Chih-Chung Chang and Chih-Jen Lin, LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1--27:27, 2011

Data Collection Session (training data for LIBSVM)
Postures: Stand, Sit, Lie down, Lie down (side)
Hands: Left hand, Right hand, Both hands
Total: 162,000 samples

SenSleep's Approach

SenSleep Data Collection
Mobile: screen lock/unlock events, accelerometer values, battery charging events, light sensor values, current location change events, system-defined broadcast events
Desktop PC: idle intervals
12 participants, 7 days

SenSleep Ground Truth
IP cameras in participants’ bedrooms......
Participants review the pictures taken & report their actual sleeping time.

That’s A Lot to Learn From!

That’s A Lot to Learn From!
Too much raw data: it takes lots of training data to conclude anything!
What really matters? “Features”: features should describe our data better.

SenSleep Features
Screen on/off (0 or 1)
Elapsed time since screen on/off
Battery charging on/off
Elapsed time since last battery event
Current coordinate (location)
Offset in location, compared to 15 min before
Accelerometer average values
Accelerometer median values
Elapsed time since last PC keyboard / mouse activity
<f1, ..., f9>: 9-dimensional feature vector

Training SenSleep Classifier
Input: <f1, ..., f9>, the 9-dimensional feature vector
Output: is_sleeping label

History-Related Feature
Current time slot only: < f1, ..., f9 >
Plus the previous slot's label: < last_is_sleeping, f1, ..., f9 >
Plus the previous slot's features: < last_f1, ..., last_f9, f1, ..., f9 >
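A minimal sketch of how the history-related vectors could be assembled from consecutive time slots. The helper name `add_history` and the sample values are hypothetical, and skipping the first slot (which has no predecessor) is an assumption rather than what SenSleep necessarily does.

```python
def add_history(feature_vectors, labels=None):
    """Prepend information from the previous time slot to each feature vector.

    With labels:    <last_is_sleeping, f1, ..., f9>
    Without labels: <last_f1, ..., last_f9, f1, ..., f9>
    The first slot has no predecessor, so it is skipped (an assumption).
    """
    extended = []
    for i in range(1, len(feature_vectors)):
        if labels is not None:
            prefix = [labels[i - 1]]
        else:
            prefix = list(feature_vectors[i - 1])
        extended.append(prefix + list(feature_vectors[i]))
    return extended

# Hypothetical 9-dimensional vectors for three consecutive time slots.
slots = [[1] * 9, [2] * 9, [3] * 9]
is_sleeping = [0, 1, 1]

with_last_label = add_history(slots, is_sleeping)  # 10 values per vector
with_last_features = add_history(slots)            # 18 values per vector
```

At prediction time the previous label is itself a prediction, so errors can propagate from one slot to the next; using the previous features instead avoids that dependency.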

RingTune's Approach

RingTune Implementation [Diagram labels: incoming-call ringer volume adjustment, shake, sound, proximity sensing, light sensing, SQLite, Wi-Fi, training data]
Stages highlighted in turn:
Ringer Volume Adj.
Incoming Call
Collection Pending
Turn On Sensors

Classifier Implementation
avg_x, avg_y, avg_z, var_x, var_y, var_z, avg_dx, avg_dy, avg_dz, light, close
11D feature vector & Volume
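A sketch of how an 11-dimensional vector with these names might be computed from a short window of sensor readings. The slides do not define `avg_dx`, `avg_dy`, `avg_dz`; here they are assumed to be the mean absolute change between consecutive accelerometer samples. The function name and sample values are hypothetical.

```python
from statistics import mean, pvariance

def ringtune_features(accel, light, close):
    """Build <avg_x..z, var_x..z, avg_dx..dz, light, close>.

    accel: list of (x, y, z) accelerometer samples (at least two)
    light: light sensor reading
    close: proximity flag (1 if something is near, else 0)
    """
    axes = list(zip(*accel))                      # (xs, ys, zs)
    averages = [mean(axis) for axis in axes]
    variances = [pvariance(axis) for axis in axes]
    # ASSUMPTION: avg_d* is the mean absolute change between consecutive samples.
    deltas = [mean([abs(b - a) for a, b in zip(axis, axis[1:])]) for axis in axes]
    return averages + variances + deltas + [light, close]

# Hypothetical readings: the phone moves along x only.
samples = [(0, 0, 9), (2, 0, 9), (4, 0, 9)]
vector = ringtune_features(samples, light=120, close=1)
```

Each such vector, paired with the ringer volume the user actually chose, would form one <input, output> training pair.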

Feature Extraction in Other Work

Phone Localization
Martin et al. MobiCom '09, Duke University. SurroundSense: mobile phone localization via ambience fingerprinting
Ambience used for the fingerprint: Sound, Color of Light, Motion

Sound Feature
[Figure: waveform, and the waveform zoomed in to individual samples; amplitude axis from -1 to 1]
Martin et al. MobiCom '09, Duke University. SurroundSense: mobile phone localization via ambience fingerprinting

Color Feature
Martin et al. MobiCom '09, Duke University. SurroundSense: mobile phone localization via ambience fingerprinting

Motion Feature: Moving / Static
Feature: moving average & variance of instantaneous acceleration
Martin et al. MobiCom '09, Duke University. SurroundSense: mobile phone localization via ambience fingerprinting
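A rough sketch of the moving average and variance of instantaneous acceleration, computed over a sliding window. The window size, the use of acceleration magnitude, and the variance threshold in `is_moving` are assumptions made for illustration, not values or methods taken from the SurroundSense paper.

```python
from collections import deque
from math import sqrt

def moving_stats(samples, window=4):
    """Return (moving average, moving variance) of acceleration magnitude
    for each sample, computed over the most recent `window` samples."""
    recent = deque(maxlen=window)
    stats = []
    for x, y, z in samples:
        recent.append(sqrt(x * x + y * y + z * z))
        average = sum(recent) / len(recent)
        variance = sum((m - average) ** 2 for m in recent) / len(recent)
        stats.append((average, variance))
    return stats

def is_moving(variance, threshold=0.5):
    """HYPOTHETICAL rule: treat high variance as 'moving'."""
    return variance > threshold

# Hypothetical accelerometer traces.
static_trace = [(0, 0, 1)] * 5
moving_trace = [(0, 0, 1), (0, 0, 3), (0, 0, 1), (0, 0, 3)]

static_avg, static_var = moving_stats(static_trace)[-1]
moving_avg, moving_var = moving_stats(moving_trace)[-1]
```

In practice the (average, variance) pairs would be fed to a trained classifier rather than a fixed threshold.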

Sensing Grip Pattern
Determine: On Table / In Hand; Thumb / Index Finger; Left / Right Thumb; Pressure
Mayank et al. UIST '12, University of Washington. GripSense: using built-in sensors to detect hand posture and pressure on commodity mobile phones

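Actual Application in Use
Hailpern et al. DIS '10, University of Illinois at Urbana-Champaign. The CLOTHO Project: Predicting Application Utility
Input: the current System Snapshot: application launch / exit, window switching, login / logout, power on / shutdown, per-application CPU usage, per-application RAM usage, window z-buffer, desktop size, window size, window coordinates, window visible area, focused app, mouse position, timestamp
Output: the High-utilization Application, i.e. the "important application" that is actually being used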

Document Classification: Bag of words
Document 1: John likes to watch movies. Mary likes too.
Document 2: John also likes to watch football games.
Dictionary: John, likes, to, watch, movies, also, football, games, Mary, too
Multinomial (counts occurrences): <1, 2, 1, 1, 1, 0, 0, 0, 1, 1> <1, 1, 1, 1, 0, 1, 1, 1, 0, 0>
Bernoulli (present or not): <1, 1, 1, 1, 1, 0, 0, 0, 1, 1> <1, 1, 1, 1, 0, 1, 1, 1, 0, 0>
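The two bag-of-words vectors can be reproduced with a few lines of Python. This is a minimal sketch: the tokenizer (a simple letters-only regular expression, case-sensitive) and the function names are assumptions; the dictionary and sentences are the ones from the slide.

```python
import re
from collections import Counter

DICTIONARY = ["John", "likes", "to", "watch", "movies",
              "also", "football", "games", "Mary", "too"]

def tokenize(text):
    """Split text into words, dropping punctuation (simplistic)."""
    return re.findall(r"[A-Za-z]+", text)

def multinomial_vector(text):
    """Count how many times each dictionary word occurs."""
    counts = Counter(tokenize(text))
    return [counts[word] for word in DICTIONARY]

def bernoulli_vector(text):
    """Record only whether each dictionary word is present."""
    return [1 if count > 0 else 0 for count in multinomial_vector(text)]

doc1 = "John likes to watch movies. Mary likes too."
doc2 = "John also likes to watch football games."
```

The only difference between the two representations here is the second entry of the first document, where "likes" occurs twice.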

Chinese "words" require word segmentation before a feature vector can be built.
Dictionary-based: Academia Sinica word segmentation system (中研院斷詞系統), Stanford Word Segmenter, MMSeg
n-gram
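The n-gram option avoids a segmenter altogether by treating every run of n consecutive characters as a "word". A minimal sketch, with a hypothetical helper name and whitespace simply dropped:

```python
def char_ngrams(text, n=2):
    """Return all runs of n consecutive non-whitespace characters."""
    chars = [c for c in text if not c.isspace()]
    return ["".join(chars[i:i + n]) for i in range(len(chars) - n + 1)]

bigrams = char_ngrams("機器學習")  # "machine learning" in Chinese
```

The resulting n-grams can then be counted exactly like dictionary words in a bag-of-words vector.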

Evaluating a Classifier

Confusion Matrix (Binary)

Confusion Matrix

Precision, Recall PJ Cheng, Text Categorization, 2013 Web IR Class

Accuracy
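The metrics named on these slides all derive from the four cells of a binary confusion matrix. The sketch below uses the standard definitions (precision = TP / (TP + FP), recall = TP / (TP + FN), accuracy = (TP + TN) / total); the function names and the sample labels are hypothetical.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary problem."""
    tp = fp = fn = tn = 0
    for truth, guess in zip(y_true, y_pred):
        if guess == positive and truth == positive:
            tp += 1
        elif guess == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn

def precision(tp, fp):
    return tp / (tp + fp) if tp + fp else 0.0

def recall(tp, fn):
    return tp / (tp + fn) if tp + fn else 0.0

def accuracy(tp, fp, fn, tn):
    return (tp + tn) / (tp + fp + fn + tn)

# Hypothetical ground truth and predictions.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 0, 0, 1]
tp, fp, fn, tn = confusion_counts(y_true, y_pred)
```

Accuracy alone can mislead when one class dominates, which is why precision and recall are reported alongside it.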

Which data should we use? It has to be labeled.

PJ Cheng, Text Categorization, 2013 Web IR Slides

iRotateGrasp Cross Validation Result
Within-subject cross validation: average 90.4%
Leave-one-subject-out cross validation: average 80.9%
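The difference between the two schemes lies in how the data is split. In leave-one-subject-out cross validation, every sample from one participant is held out for testing while the model is trained on everyone else. A minimal sketch, assuming each sample is tagged with a subject ID; the function name and sample data are hypothetical.

```python
def leave_one_subject_out(samples):
    """Yield (held_out_subject, train, test) splits.

    samples: list of (subject_id, features, label) tuples.
    """
    subjects = sorted({subject for subject, _, _ in samples})
    for held_out in subjects:
        train = [s for s in samples if s[0] != held_out]
        test = [s for s in samples if s[0] == held_out]
        yield held_out, train, test

# Hypothetical samples from three participants.
data = [
    ("p1", [0.1], "portrait"), ("p1", [0.9], "landscape"),
    ("p2", [0.2], "portrait"), ("p2", [0.8], "landscape"),
    ("p3", [0.3], "portrait"),
]
splits = list(leave_one_subject_out(data))
```

Within-subject cross validation would instead split each participant's own samples into folds, so the model always sees some data from the person it is tested on.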

Learning Curve Mayank et al. UIST '12, University of Washington
GripSense: using built-in sensors to detect hand posture and pressure on commodity mobile phones

Classifying Algorithms

Routine
Choose algorithms
Tune parameters
Compare the results of different algorithms / parameters

Numerical Function: numeric input & numeric output.
Categorical output: discrete “type” labels
Continuous output: real values

Artificial Neural Network

MS Cheng, Neuro Net & SVM, 2012 Data Mining Slides

Support Vector Machines

Probability (Graphical) Models

Naive Bayes

Compare P(buys_comp=y|X) with P(buys_comp=n|X) and predict the class with the larger posterior.
MS Cheng, Decision Tree & Naive Bayes, 2012 Data Mining Slides
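A small sketch of the comparison between P(buys_comp=y|X) and P(buys_comp=n|X) for categorical features. Only the label name `buys_comp` comes from the slide; the features (age, student), the training rows, and the use of add-one (Laplace) smoothing are assumptions made for illustration.

```python
from collections import Counter, defaultdict

def train_nb(rows, labels):
    """Count classes and per-class feature values."""
    class_counts = Counter(labels)
    value_counts = defaultdict(Counter)   # (feature index, label) -> value counts
    values = defaultdict(set)             # feature index -> distinct values seen
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            value_counts[(i, label)][value] += 1
            values[i].add(value)
    return class_counts, value_counts, values

def posterior(model, row):
    """Return P(label | row) for every label, assuming independent features."""
    class_counts, value_counts, values = model
    total = sum(class_counts.values())
    scores = {}
    for label, n in class_counts.items():
        score = n / total                 # prior P(label)
        for i, value in enumerate(row):
            # Add-one smoothing keeps unseen values from zeroing the product.
            score *= (value_counts[(i, label)][value] + 1) / (n + len(values[i]))
        scores[label] = score
    norm = sum(scores.values())
    return {label: s / norm for label, s in scores.items()}

# HYPOTHETICAL training data: (age, student) -> buys_comp
rows = [("youth", "no"), ("youth", "yes"), ("senior", "no"),
        ("senior", "yes"), ("youth", "yes")]
buys_comp = ["n", "y", "n", "y", "y"]

nb_model = train_nb(rows, buys_comp)
p = posterior(nb_model, ("youth", "yes"))
predicted = max(p, key=p.get)
```

The "naive" part is the product over features: each feature is treated as independent of the others once the class is known.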

Hidden Markov Model (HMM)

HMM [Diagram: observations o1, o2; hidden states s1, s2 repeated across time steps]

Other Algorithms

Instance Based: K-Nearest Neighbor (KNN)

PJ Cheng, Text Categorization, 2013 Web IR Slides Choose a
distance metric to calculate the distances between feature vectors
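A minimal K-Nearest Neighbor sketch showing where the distance metric plugs in. The function names and sample points are hypothetical; Euclidean distance (`math.dist`, Python 3.8+) is used as the default and Manhattan distance is shown as an alternative.

```python
from collections import Counter
from math import dist  # Euclidean distance, available since Python 3.8

def manhattan(a, b):
    """Alternative metric: sum of absolute coordinate differences."""
    return sum(abs(p - q) for p, q in zip(a, b))

def knn_predict(training_pairs, x, k=3, metric=dist):
    """Vote among the k training vectors closest to x under `metric`."""
    neighbors = sorted(training_pairs, key=lambda pair: metric(pair[0], x))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]

# Hypothetical 2-D feature vectors with labels.
points = [
    ([0, 0], "a"), ([0, 1], "a"), ([1, 0], "a"),
    ([5, 5], "b"), ([5, 6], "b"), ([6, 5], "b"),
]
near_origin = knn_predict(points, [0.5, 0.5])
near_far_cluster = knn_predict(points, [5.5, 5.5], metric=manhattan)
```

KNN has no training step beyond storing the data, so the choice of metric and of k is where all the tuning happens.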

Trees: J48 (C4.5) Decision Tree

The algorithms we just covered

The model learns whatever the teacher teaches

Supervised Learning

One tree, two branches: supervised and unsupervised ("classifier" / "cluster")
SD Lin, Final Mark on Machine Learning, 2013 PGM Slides

[Annotations on the diagram from the 2013 PGM Slides: "Almost everything we just covered is here"; "HMM is here"; "Taught in the AI course"; "Clustering"]

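Skill Tree
Starter class, required (?): Pu-Jen Cheng (鄭卜壬), Web Information Retrieval and Mining, spring semester
Relatively light once the required course is done: Ming-Syan Chen (陳銘憲, @EE), Data Mining, fall semester; Hsin-Hsi Chen (陳信希), Natural Language Processing, fall semester; Lin-shan Lee (李琳山), Introduction to Digital Speech Processing, spring semester
Useful but not closely related: Tian-Li Yu (于天立, EE) / Yung-jen Hsu (許永真), Artificial Intelligence, fall semester; Winston Hsu (徐宏民), Multimedia Information Analysis and Retrieval, fall semester
The final bosses to defeat on the way to the truth: Hsuan-Tien Lin (林軒田), Machine Learning, fall semester; Shou-De Lin (林守德), Probabilistic Graphical Models, fall semester
"I'm hardcore and want to compete": Chih-Jen Lin (林智仁), Machine Learning: Theory and Practice, spring semester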

Weka
The University of Waikato. The WEKA Data Mining Software: An Update, 2009
Written in Java; can be put in Android
GUI!
Multiple algorithms implemented
Unified input / output format

Weka Explorer: try algorithms, create model files

Weka Explorer - Preprocess: open file, then switch to Classify

Weka Explorer - Classify: specify label, choose classifier, set parameters, set test options, start

LIBSVM

LIBSVM
Prof. Chih-Jen Lin (林智仁). LIBSVM: A library for support vector machines, 2011
Available in multiple languages
Can be put in Android & iOS!
Simple install (just make!)
Simple input / output format
Tutorial: http://www.csie.ntu.edu.tw/~piaip/docs/svm/

LIBSVM Input Format http://www.csie.ntu.edu.tw/~piaip/docs/svm/#
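LIBSVM's documented input format is one sample per line: the label, followed by `index:value` pairs with 1-based, ascending feature indices; zero-valued features may be omitted. Below is a small sketch of a writer for that format. The helper name `to_libsvm_line` and the sample data are hypothetical, and the `:g` formatting (which keeps about six significant digits) is a convenience choice.

```python
def to_libsvm_line(label, features):
    """Format one sample as '<label> <index>:<value> ...' (indices start at 1)."""
    parts = [str(label)]
    for index, value in enumerate(features, start=1):
        if value != 0:                      # sparse format: zeros may be skipped
            parts.append(f"{index}:{value:g}")
    return " ".join(parts)

# Hypothetical samples: (label, feature vector)
samples = [(1, [0.5, 0, 3]), (0, [0, 0.25, 0])]
lines = [to_libsvm_line(label, features) for label, features in samples]
training_file_content = "\n".join(lines) + "\n"
```

Writing `training_file_content` to a text file gives something the LIBSVM training tool can read directly.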

LIBSVM Output Format
Model file
Prediction file: one label per line; confidence attached if a flag is set.

LIBSVM Binaries: svm-train, svm-predict
Read http://www.csie.ntu.edu.tw/~piaip/docs/svm/#

217 Workstations
Hardware: http://wslab.csie.ntu.edu.tw/hardware/
Home directories are on NFS; /tmp2 is on each host's local disk
System status: http://mrtg.csie.ntu.edu.tw/

217 Train Stations
Usually running grid.py: finding the optimal c (cost) and g (gamma)
SSH authorized_keys setup (search for "SSH passwordless login")
ssh_workers & nr_local_worker
Read tools/README in the svm folder
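The idea behind the search for c and g can be sketched as a plain grid search over exponentially spaced values. This is not grid.py itself: the function names are hypothetical, the default ranges (log2 c from -5 to 15, log2 g from 3 to -15, step 2) are assumed to mirror grid.py's defaults, and `toy_cv_accuracy` is a made-up stand-in for a real cross-validation run.

```python
from itertools import product
from math import log2

def grid_search(cv_accuracy, log2c=range(-5, 16, 2), log2g=range(3, -16, -2)):
    """Try every (c, g) pair on the grid and keep the most accurate one.

    cv_accuracy: callable (c, g) -> cross-validation accuracy.
    Returns (best_accuracy, best_c, best_g).
    """
    best = None
    for lc, lg in product(log2c, log2g):
        c, g = 2.0 ** lc, 2.0 ** lg
        acc = cv_accuracy(c, g)
        if best is None or acc > best[0]:
            best = (acc, c, g)
    return best

def toy_cv_accuracy(c, g):
    """HYPOTHETICAL stand-in that peaks at c = 8, g = 0.125."""
    return 1.0 / (1.0 + abs(log2(c) - 3) + abs(log2(g) + 3))

best_accuracy, best_c, best_g = grid_search(toy_cv_accuracy)
```

Every grid point is an independent training run, which is why the search parallelizes well across several workstations.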

Deck by Johnson Liang
