Slide 1

Slide 1 text

Scoring probability model based on service landing location and ranking points in men’s professional tennis matches MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST Fumiya SHIMIZU, *Eiji KONAKA (Meijo University)

Slide 2

Slide 2 text

Today’s presentation Shimizu and Konaka. “Scoring probability model based on service landing location and ranking points in men’s professional tennis matches” MathSportInternational Conference 2023@Budapest ⚫Background ⚫Objective ⚫Data collection ⚫Hawk-Eye, Data in official website ⚫ATP Ranking ⚫Model construction ⚫Method ⚫Result and discussion ⚫Web implementation MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 3

Slide 3 text

Background: Service in tennis ⚫Service shot in tennis ⚫All plays begin with service ⚫Only shot without being influenced by opponent. ⚫Server has clear advantage. ⚫Served shot should land on service area MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST https://www.photo-ac.com/ Question: ”Where is advantageous service landing location?”

Slide 4

Slide 4 text

Rule of service in tennis ⚫Service is shot from outside the baseline. ⚫Directly land the service on the service area once ⚫Deuce side (green) ⚫Ad side (blue) ⚫Alternate Deuce/Ad sides. ⚫Second chance ⚫1st/2nd service MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST (Based on [ITF RULES OF TENNIS 2022], edited by the authors)

Slide 5

Slide 5 text

(Area definition in the following slides) ⚫In the following slides, the service area (red box) will be depicted. ⚫Left, green: Deuce side ⚫Right, blue: Ad side MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST (Based on [ITF RULES OF TENNIS 2022], edited by the authors)

Slide 6

Slide 6 text

Background: Mathematical modeling of service in tennis PREVIOUS STUDIES ⚫Visualization of the boundary line of the advantageous/disadvantageous area of the service landing location [2020] ⚫Separation line is calculated by using Support Vector Machine (SVM) MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST Near the sideline is advantageous

Slide 7

Slide 7 text

Background: Mathematical modeling of service in tennis PREVIOUS STUDIES AND THEIR LIMITATION ⚫Visualization of the boundary line of the advantageous/disadvantageous area of the service landing location [2020] ⚫Prediction model of ace by using k-nearest neighbor(NN) method. [Whiteside et al, 2017] ⚫Prediction model from service/stroke landing location sequence. Court is partitioned into large rectangles.[Born et al, 2021] MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 8

Slide 8 text

Background: Previous studies and their limitation (summary) Input Input resolution Objective Output type [2020] Service landing location Continuous (Hawk- Eye) Prediction Advantageous or not (Binary, SVM) [Whiteside et al, 2017] Service impact and landing location, speed, score, … Continuous (Hawk- Eye) Prediction Ace or not (Binary, k-NN) [Born et al, 2021] Service to 4th stroke landing locations Partitioned Analysis between player level (WTA/ITF) - MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 9

Slide 9 text

Background: Previous studies and their limitation (summary) Input Input resolution Objective Output type [2020] Service landing location Continuous (Hawk- Eye) Prediction Advantageous or not (Binary, SVM) [Whiteside et al, 2017] Service impact and landing location, speed, score, … Continuous (Hawk- Eye) Prediction Ace or not (Binary, k-NN) [Born et al, 2021] Service to 4th stroke landing locations Partitioned Analysis between player level (WTA/ITF) - MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST Red text: limitation of each method Blue text: advance of each method

Slide 10

Slide 10 text

Proposed method: Overcome limitations of previous studies Input Input resolution Objective Output type [Shimizu and Konaka, 2023] Service landing location, player strength Continuous (Hawk- Eye) Prediction Score probability (Continuous, GLM) [2020] Service landing location Continuous (Hawk- Eye) Prediction Advantageous or not (Binary, SVM) [Whiteside et al, 2017] Service impact and landing location, speed, score, … Continuous (Hawk- Eye) Prediction Ace or not (Binary, k-NN) [Born et al, 2021] Service to 4th stroke landing locations Partitioned Analysis between player level (WTA/ITF) - MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 11

Slide 11 text

Proposed method: Overcome limitations of previous studies Input Input resolution Objective Output type [Shimizu and Konaka, 2023] Service landing location, player strength Continuous (Hawk- Eye) Prediction Score probability (Continuous, GLM) [2020] Service landing location Continuous (Hawk- Eye) Prediction Advantageous or not (Binary, SVM) [Whiteside et al, 2017] Service impact and landing location, speed, score, … Continuous (Hawk- Eye) Prediction Ace or not (Binary, k-NN) [Born et al, 2021] Service to 4th stroke landing locations Partitioned Analysis between player level (WTA/ITF) - MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST Player strength is used High-resolution Hawk-Eye data is used Output continuous score probability

Slide 12

Slide 12 text

Objective of proposed method OBJECTIVE ⚫Construct a mathematical prediction model of scoring in tennis ⚫Input variables ⚫Service landing location ⚫Player strength metric ⚫Method: Generalized Linear Model Regression PROS OF PROPOSED METHOD MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Large amount, high-resolution Hawk- Eye data ⚫Over 100,000 services. ⚫Four models: Deuce/Ad sides, 1st /2nd services ⚫Continuous probability output ⚫Web-based implementation

Slide 13

Slide 13 text

Today’s presentation Shimizu and Konaka. “Scoring probability model based on service landing location and ranking points in men’s professional tennis matches” MathSportInternational Conference 2023@Budapest ⚫Background✓ ⚫Objective✓ ⚫Data collection ⚫Hawk-Eye, Data in official website ⚫ATP Ranking ⚫Model construction ⚫Method ⚫Result and discussion ⚫Web implementation MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 14

Slide 14 text

About ATP / Hawk-Eye system ATP ⚫Association of Tennis Professionals ⚫Official governing body of men’s professional tennis ⚫Management of data acquisition system (Hawk-Eye) and collected data publication ⚫Management of official ranking point system Hawk-Eye ⚫Measure three-dimensional trajectory of ball using multiple camera ⚫Support of umpires ⚫“Challenge” system ⚫Many ATP Tour tournaments accepts the system MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 15

Slide 15 text

About ATP / Hawk-Eye system Hawk-Eye ⚫Measure three-dimensional trajectory of ball using multiple camera ⚫Support of umpires ⚫“Challenge” system ⚫Many ATP Tour tournaments accepts the system MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST Y.Baodong, “Hawkeye technology using tennis match,” (2014)

Slide 16

Slide 16 text

About ATP / Hawk-Eye system Hawk-Eye ⚫Measure three-dimensional trajectory of ball using multiple camera ⚫Support of umpires ⚫“Challenge” system ⚫Many ATP Tour tournaments accepts the system MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST https://www.sony.com/ja/SonyInfo/technology/stories/Hawk-Eye/

Slide 17

Slide 17 text

Data publication on official website MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST https://www.atptour.com/en/stats/second-screen/archive/2019/339/MS003

Slide 18

Slide 18 text

Data publication on official website MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST https://www.atptour.com/en/stats/second-screen/archive/2019/339/MS003 Relation between service landing location and its point have been published (NOT all tournaments)

Slide 19

Slide 19 text

ATP Ranking ⚫Official world ranking of men’s professional tennis players ⚫Based on the tournament final standings of the latest 52 weeks ⚫One Win→Awarded point multiplies 5/3 to 2 times ⚫Top players should participate in mandatory tournaments MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST https://www.atptour.com/en/rankings/singles?ra nkRange=0-100&rankDate=2023-05-29

Slide 20

Slide 20 text

Strength difference measured by ATP Ranking ⚫Axes: Ranking point ratio (horizontal), Win probability (vertical) ⚫ATP ranking is good strength metric ⚫Predicted win probability=Logistic regression whose variable is logarithm of ranking point ratio. ⚫Ex. 6815/(6815+3100)=0.687 MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST Strength difference =Log of ranking point ratio

Slide 21

Slide 21 text

Today’s presentation Shimizu and Konaka. “Scoring probability model based on service landing location and ranking points in men’s professional tennis matches” MathSportInternational Conference 2023@Budapest ⚫Background✓ ⚫Objective✓ ⚫Data collection ✓ ⚫Hawk-Eye, Data in official website ✓ ⚫ATP Ranking ✓ ⚫Model construction ⚫Method ⚫Result and discussion ⚫Web implementation MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 22

Slide 22 text

Service scoring probability model: Data COLLECTED DATA ⚫ATP Tour, 2019, 30 Tournaments, 897 Matches, 111473 Services ⚫Deta definition ⚫位置 ⚫得点 ⚫サーバー,レシーバー名 ⚫ランキングポイント ⚫Data distribution (right figure) MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST Variable Meaning 𝑠, 𝑟 Index of server and receiver 𝑥, 𝑦 Service landing location 𝑃𝑠 , 𝑃𝑟 ATP Ranking point 𝑡 Score (1:Server, 0:Receiver)

Slide 23

Slide 23 text

Service scoring probability model: Data DATA ANALYSIS MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Service shots landed more around the sideline/centerline ⚫Different distribution between Deuce and Ad sides ⚫Biased dominant arm. Right:Left=167:21 ⚫Few shots landed on the front side (near the net)

Slide 24

Slide 24 text

Service scoring probability model: Data DATA ANALYSIS MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Service shots landed more around the sideline/centerline✓ ⚫Different distribution between Deuce and Ad sides ⚫Biased dominant arm. Right:Left=167:21 ⚫Few shots landed on the front side (near the net)

Slide 25

Slide 25 text

Service scoring probability model: Data DATA ANALYSIS MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Service shots landed more around the sideline/centerline ✓ ⚫Different distribution between Deuce and Ad sides ✓ ⚫Unbalanced dominant arm. Right:Left=167:21 ⚫Few shots landed on the front side (near the net)

Slide 26

Slide 26 text

Service scoring probability model: Data DATA ANALYSIS MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Service shots landed more around the sideline/centerline ✓ ⚫Different distribution between Deuce and Ad sides ✓ ⚫Unbalanced dominant arm. Right:Left=167:21 ⚫Few shots landed on the front side (near the net) ✓

Slide 27

Slide 27 text

Service scoring probability model: Stepwise regression ⚫Preprocess ⚫Each service shot→(𝑥, 𝑦, 𝑃𝑠 , 𝑃𝑟 , 𝑡) ⚫𝐿𝑠,𝑟 ≡ log 𝑃𝑠 𝑃𝑟 ⚫Predictor variables:(𝑥, 𝑦, 𝐿𝑠,𝑟 ) ⚫Response variable :𝑡 ∈ {0,1} ⚫Add dummy data on the front side ⚫Method: Stepwise regression ⚫Predictor terms: 4th polynomial ⚫Link function: logit ⚫Use stepwiseglm in MATLAB ⚫Four models (1st/2nd , Ad/Deuce) are constructed. MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST 𝑡~ 1 1 + exp(−𝑋) 𝑋: (Up to) 4th order polynomial of predictor variables

Slide 28

Slide 28 text

Today’s presentation Shimizu and Konaka. “Scoring probability model based on service landing location and ranking points in men’s professional tennis matches” MathSportInternational Conference 2023@Budapest ⚫Background✓ ⚫Objective✓ ⚫Data collection ✓ ⚫Hawk-Eye, Data in official website ✓ ⚫ATP Ranking ✓ ⚫Model construction ✓ ⚫Method ✓ ⚫Result and discussion ⚫Web implementation MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 29

Slide 29 text

Result 1ST/DEUCE SIDE ⚫ 𝑥1 , 𝑥2 , 𝑥3 = 𝑥, 𝑦, 𝐿𝑠,𝑟 ⚫Stepwise regression ⚫Add/remove term based on p-value MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 30

Slide 30 text

Result: 1st service/equal strength MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Asymmetric ⚫Advantageous on sidelines ⚫Advantageous area protrudes from outside to inside ⚫Disadvantageous on service line

Slide 31

Slide 31 text

Result: 1st service/equal strength MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Asymmetric✓ ⚫Advantageous on sidelines ⚫Advantageous area protrudes from outside to inside ⚫Disadvantageous on service line

Slide 32

Slide 32 text

Result: 1st service/equal strength MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Asymmetric ✓ ⚫Advantageous on sidelines ✓ ⚫Advantageous area protrudes from outside to inside ⚫Disadvantageous on service line

Slide 33

Slide 33 text

Result: 1st service/equal strength MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Asymmetric ✓ ⚫Advantageous on sidelines ✓ ⚫Advantageous area protrudes from outside to inside ✓ ⚫Disadvantageous on service line

Slide 34

Slide 34 text

Result: 2nd service/equal strength MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Almost symmetric ⚫Decreased scoring probability ← Reduced service speed ⚫Disadvantageous on service line

Slide 35

Slide 35 text

Result: 2nd service/equal strength MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Almost symmetric ✓ ⚫Decreased scoring probability ← Reduced service speed ⚫Disadvantageous on service line

Slide 36

Slide 36 text

Result: 2nd service/equal strength MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Almost symmetric ✓ ⚫Decreased scoring probability ← Reduced service speed ⚫Disadvantageous on service line

Slide 37

Slide 37 text

Result: 2nd service/equal strength MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST ⚫Almost symmetric ✓ ⚫Decreased scoring probability ← Reduced service speed ✓ ⚫Disadvantageous on service line

Slide 38

Slide 38 text

Result: 1st /Deuce side/Not equal strength ⚫In case Ranking Point Ratio =5 ⚫Server is higher-ranked ⚫On sidelines, less difference ⚫Large difference near center service line MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 39

Slide 39 text

Result: 1st /Deuce side/Not equal strength ⚫In case Ranking Point Ratio =5 ⚫Server is higher-ranked ⚫On sidelines, less difference ⚫Large difference near center service line MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 40

Slide 40 text

Result: 1st /Deuce side/Not equal strength ⚫In case Ranking Point Ratio =5 ⚫Server is higher-ranked ⚫On sidelines, less difference ⚫Large difference near center service line MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 41

Slide 41 text

Result: 1st /Deuce side/Not equal strength ⚫In case Ranking Point Ratio =5 ⚫Server is higher-ranked ⚫On sidelines, less difference ⚫Large difference near center service line ⚫Higher-ranked players can score even if his service land on easier location MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 42

Slide 42 text

Result: 1st /Deuce side/Not equal strength ⚫Higher-ranked players can score even if his service land on easier location ⚫Higher-ranked players can land his service on difficult location more frequently MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 43

Slide 43 text

Result: 1st /Deuce side/Not equal strength ⚫Higher-ranked players can score even if his service land on easier location ⚫Higher-ranked players can land his service on difficult location more frequently MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 44

Slide 44 text

Result: 1st /Deuce side/Not equal strength ⚫Higher-ranked players can score even if his service land on easier location ⚫Higher-ranked players can land his service on difficult location more frequently MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 45

Slide 45 text

Demonstration: Implementation on Website lhttps://www-ie.meijo- u.ac.jp/~konaka/tennisServiceProb_Eng.html MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 46

Slide 46 text

Summary WHAT WE COULD ACHIEVE ⚫Construct a mathematical prediction model of scoring in tennis ⚫Input variables ⚫Service landing location ⚫Player strength metric ⚫Output variable ⚫Scoring probability ⚫Method: Generalized Linear Model Regression FUTURE WORKS ⚫Include service speed ⚫(We wish we could have found service data with speed.) ⚫Player evaluation ⚫Match analysis ⚫Analysis by court surface MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 47

Slide 47 text

References Shimizu and Konaka. “Scoring probability model based on service landing location and ranking points in men’s professional tennis matches” MathSportInternational Conference 2023@Budapest ⚫[2020] https://logmi.jp/tech/articles/324033 (written in Japanese. Non-academic presentation) ⚫[Whiteside et al, 2017] Spatial characteristics of professional tennis serves with implications for serving aces: A machine learning approach. Journal of Sports Sciences, 35(7) ⚫[Born et al, 2021] Stroke placement in women's professional tennis: What's after the serve? Sport Science, 3 MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 48

Slide 48 text

MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST

Slide 49

Slide 49 text

Hawk-Eye system MATHSPORTINTERNATIONAL CONFERENCE 2023@BUDAPEST https://www.sony.com/ja/SonyInfo/technology/stories/Hawk-Eye/ Y.Baodong, “Hawkeye technology using tennis match,” (2014)