[CS Foundation] AIML - 2 - Regression
x-village
August 14, 2018
Transcript
AI/ML - Regression. Lo Pang-Yun Ting, X-Village
Outline
• Introduction to regression
• Linear regression
• Gradient descent
• Ordinary least squares
Machine Learning is split into Supervised Learning (監督式學習) and Unsupervised Learning (非監督式學習); the task families shown in the diagram are clustering (聚類), classification (分類), regression (迴歸), and dimensionality reduction (降維). (The diagram is repeated, with regression, the topic of this lecture, as the focus.)
Machine Learning • Classification (分類) vs. Regression (迴歸)
Q1: Can an Ultraman (Man!) defeat a Lv. 1 monster? A yes/no question: classification.
Q2: How many Ultramen (Seven! Neos! Man! Taro! Tiga!) are needed to defeat a Lv. 1 monster? A how-many question: regression.
Regression • What is 'regression' analysis (迴歸分析)?
A statistical method for analyzing data, whose purpose is to determine whether two or more variables are related, and the direction and strength of any relationship, and to build a mathematical model so that specific variables can be observed to predict the variable the researcher is interested in. (from Wikipedia)
Regression • What is 'regression' analysis?
Monster level (怪物等級):                              1  2  3  4  5  6  7   8   9   10
Ultramen needed to defeat it (打敗怪物所需超人數量):    1  1  2  3  6  7  11  13  13  15
Regression • What is 'regression' analysis? Find a curve or line that fits the data (找出曲線/直線來擬合數據).
Regression
• Features: x^(i) = [x_1, …, x_d]
• Outputs: y^(i)
• Model choices: linear regression (線性迴歸) vs. polynomial regression (多項式迴歸)
Linear Regression • Model representation
• Hypothesis (假說): a function h_θ(x) that maps from X to Y; for one feature, h_θ(x) = θ_0 + θ_1·x, where θ_0 and θ_1 are the weights.
• Choose θ so that h_θ(x) is close to y for the training examples.
Linear Regression • How to choose θ? Find lines/hyperplanes with small error.
Linear Regression • Definition of the cost function
• Error (誤差) = prediction (預測) minus ground truth (真實): h_θ(x^(i)) - y^(i), where h_θ(x) is the hypothesis.
• Cost function, the mean square error (MSE):
  J(θ_0, θ_1) = (1/2m) · Σ_{i=1}^{m} (h_θ(x^(i)) - y^(i))²
• Goal: minimize J(θ_0, θ_1).
Linear Regression • A look into the cost function
Simplified: fix θ_0 = 0, so the hypothesis is h_θ(x) = θ_1·x, the weights reduce to the single θ_1, the cost function becomes J(θ_1), and the goal is to minimize J(θ_1).
Training data: (x, y) = (1, 1), (2, 2), (3, 3).
Linear Regression • A look into the cost function
Plotting the hypothesis h_θ(x) = θ_1·x against the data for θ_1 = 0.5, 1, 1.5, with the cost curve J(θ_1) alongside:
J(1)   = (1/(2·3)) · (0² + 0² + 0²) = 0
J(0.5) = (1/(2·3)) · ((0.5 - 1)² + (1 - 2)² + (1.5 - 3)²) ≈ 0.58
J(1.5) = (1/(2·3)) · ((1.5 - 1)² + (3 - 2)² + (4.5 - 3)²) ≈ 0.58
(Plot: the three hypothesis lines and the cost curve J(θ_1), a parabola with its minimum at θ_1 = 1.)
Linear Regression • A look into the cost function
(Plots: with both θ_0 and θ_1 free, J(θ_0, θ_1) is a bowl-shaped surface over the (θ_0, θ_1) plane, shown from two viewpoints.)
Minimize The Cost Function
Linear Regression • Optimizing linear regression: gradient descent (next) and ordinary least squares.
Gradient Descent • Gradient descent (梯度下降法)
Cost function: J(θ_0, θ_1). Goal: minimize J(θ_0, θ_1).
Outline:
• Start with some θ_0, θ_1.
• Keep changing θ_0, θ_1 to reduce J(θ_0, θ_1) until we hopefully end up at a minimum.

Gradient Descent • Gradient descent algorithm
repeat until convergence {
    θ_j := θ_j - α · (∂/∂θ_j) J(θ_0, θ_1)    (simultaneously for j = 0, 1)
}
α is the learning rate, and ':=' assigns the value on the right side to the left side.
Gradient Descent • Gradient descent algorithm, with the derivatives expanded for linear regression:
repeat until convergence {
    θ_0 := θ_0 - α · (1/m) Σ_{i=1}^{m} (h_θ(x^(i)) - y^(i))
    θ_1 := θ_1 - α · (1/m) Σ_{i=1}^{m} (h_θ(x^(i)) - y^(i)) · x^(i)
}
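To make the loop concrete, here is a minimal sketch in Python/NumPy on the toy data from the earlier slides; the learning rate, iteration count, and variable names are illustrative choices, not from the deck:

```python
import numpy as np

# Toy data from the slides: y = x exactly, so the optimum is (theta_0, theta_1) = (0, 1).
x = np.array([1.0, 2.0, 3.0])
y = np.array([1.0, 2.0, 3.0])

theta0, theta1 = 0.0, 0.0  # start with some theta_0, theta_1
alpha = 0.1                # learning rate (illustrative value)

for _ in range(1000):      # "repeat until convergence" (fixed iteration count here)
    h = theta0 + theta1 * x          # hypothesis h_theta(x)
    grad0 = (h - y).mean()           # dJ/dtheta_0
    grad1 = ((h - y) * x).mean()     # dJ/dtheta_1
    # Simultaneous update: assign the right-hand side to the left-hand side.
    theta0, theta1 = theta0 - alpha * grad0, theta1 - alpha * grad1

print(theta0, theta1)  # approaches (0, 1)
```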
Gradient Descent • Gradient descent intuition (on the J(θ_1) curve)
• Positive slope: the slope of the tangent line at the current θ value (當前 θ 值所處點的切線斜率) is positive, so the update θ_1 := θ_1 - α·(positive value) makes θ_1 smaller, and the cost becomes smaller.
• Negative slope: the update θ_1 := θ_1 - α·(negative value) makes θ_1 bigger, and the cost becomes smaller.
• If the learning rate α is too big, gradient descent may fail to converge, or even diverge.
• If the learning rate α is too small, gradient descent can be slow.
Exercise - (1) • TASK: Implement linear regression (sample code provided).
Exercise - (1) • Requirements
1. Complete the hypothesis function and the cost function (a possible sketch follows the exercise).
2. Test (θ_0, θ_1) = (0, 0), (1, 1), and (10, -1), and print the computed cost for each.
3. Observe how the regression lines given by different θ values relate to their costs.
Exercise - (1) • Output (screenshot of the expected results).
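A possible sketch of the two functions this exercise asks for; the deck's actual sample code is not in the transcript, so the data and function names here are illustrative:

```python
import numpy as np

# Hypothesis: h_theta(x) = theta_0 + theta_1 * x
def hypothesis(theta0, theta1, x):
    return theta0 + theta1 * x

# Cost: J(theta_0, theta_1) = (1/2m) * sum((h_theta(x) - y)^2)
def cost(theta0, theta1, x, y):
    m = len(x)
    return np.sum((hypothesis(theta0, theta1, x) - y) ** 2) / (2 * m)

x = np.array([1.0, 2.0, 3.0])  # toy data from the slides
y = np.array([1.0, 2.0, 3.0])

for t0, t1 in [(0, 0), (1, 1), (10, -1)]:
    print(f"theta = ({t0}, {t1}), cost = {cost(t0, t1, x, y):.2f}")
```

The badly fitting (10, -1) should print a far larger cost than (1, 1), which is the relationship requirement 3 asks you to observe.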
Linear Regression • Optimizing linear regression: gradient descent (above) and ordinary least squares (next).
Ordinary Least Squares • Ordinary least squares (最小平方法/最小二乘法)
Where gradient descent repeats updates until convergence, OLS minimizes the cost function by solving for θ directly.
Ordinary Least Squares • OLS vs. gradient descent
• Gradient descent: starts from an initial value of θ and computes the optimum iteratively (迭代計算求最佳解).
• OLS: solves for the optimum directly (直接求最佳解).
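"Solving directly" refers to the closed-form least-squares solution; as a point of reference (the deck itself jumps straight to sklearn), a minimal NumPy sketch using the standard normal equation:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])
y = np.array([1.0, 2.0, 3.0])

# Design matrix with a column of ones for the intercept theta_0.
X = np.column_stack([np.ones_like(x), x])

# Normal equation: theta = (X^T X)^(-1) X^T y, solved without an explicit inverse.
theta = np.linalg.solve(X.T @ X, X.T @ y)
print(theta)  # [theta_0, theta_1], approximately [0, 1] for this data
```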
Example • sklearn - LinearRegression: uses OLS to optimize linear regression.
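The slide's code is not transcribed; a minimal example of this API on the toy data (the data choice is an assumption):

```python
import numpy as np
from sklearn.linear_model import LinearRegression

X = np.array([[1.0], [2.0], [3.0]])   # sklearn expects a 2-D feature array
y = np.array([1.0, 2.0, 3.0])

model = LinearRegression()            # fits by ordinary least squares
model.fit(X, y)
print(model.intercept_, model.coef_)  # theta_0 and theta_1
print(model.predict([[4.0]]))         # prediction for a new x
```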
Example • sklearn - SGDRegressor: uses gradient descent (stochastic gradient descent) to optimize linear regression.
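Likewise untranscribed; a hedged sketch, with hyperparameters that are illustrative rather than the deck's:

```python
import numpy as np
from sklearn.linear_model import SGDRegressor

X = np.array([[1.0], [2.0], [3.0]])
y = np.array([1.0, 2.0, 3.0])

# Stochastic gradient descent on the default squared-error loss.
model = SGDRegressor(max_iter=1000, eta0=0.01, random_state=0)
model.fit(X, y)
print(model.intercept_, model.coef_)  # learned theta_0 and theta_1
```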
Evaluation • Framework: after training, the model's predictions are compared with the ground truth to produce evaluation results.
Evaluation • Evaluation metrics for regression
These metrics compare predicted values (預測) with true values (真實); for the first three, smaller is better.
• Mean square error (MSE): (1/m) Σ_{i=1}^{m} (ŷ^(i) - y^(i))²
• Root mean square error (RMSE): √MSE
• Mean absolute error (MAE): (1/m) Σ_{i=1}^{m} |ŷ^(i) - y^(i)|
• R-squared score (R² score): 1 - Σ(y^(i) - ŷ^(i))² / Σ(y^(i) - ȳ)², which measures how well the predictions fit the true data (擬合程度); the best value is 1.
(ŷ^(i) is the predicted value, y^(i) the true value, and ȳ the mean of the true values.)
Example • sklearn - mean_squared_error, mean_absolute_error, r2_score
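The slide's code is not in the transcript; a minimal usage sketch with made-up prediction values:

```python
import numpy as np
from sklearn.metrics import mean_squared_error, mean_absolute_error, r2_score

y_true = np.array([1.0, 2.0, 3.0])
y_pred = np.array([1.1, 1.9, 3.2])  # hypothetical model outputs

mse = mean_squared_error(y_true, y_pred)
print("MSE :", mse)
print("RMSE:", np.sqrt(mse))        # RMSE is just the square root of MSE
print("MAE :", mean_absolute_error(y_true, y_pred))
print("R2  :", r2_score(y_true, y_pred))
```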
Exercise - (2) • TASK: Use sklearn to implement linear regression (sample code provided).
• Requirements
• Train LinearRegression() and SGDRegressor() on the data from Exercise - (1).
• Print the weight values (θ) each model ends up with after training.
• Compare the results of the two methods.
Exercise - (3) • TASK: Use sklearn.metrics to evaluate models.
• Requirements
• Print the RMSE of the two models from Exercise - (2) (use the training data in place of test data for now).
Exercise - (3) • Output (screenshot of the expected results).