Mitigating the Latency-Accuracy Trade-off in Mobile Data Analytics Systems

Mitigating the Latency-Accuracy Trade- off in Mobile Data Analytics Systems
Anand Iyer ⋆, Li Erran Li⬩, Mosharaf Chowdhury✢, Ion Stoica ⋆ ⋆UC Berkeley ⬩Fudan University/Pony.ai ✢University of Michigan MobiCom, November 1, 2018

§ Many emerging domains Mobile Data Analytics Very Popular

§ Many emerging domains Common Goal: Understand user/entity behavior Mobile
Data Analytics Very Popular

Mobile Data Analytics

Uplink SINR > -11.75 RSRQ > -16.5 RSRQ Available? Success
Drop Uplink SINR > -5.86 CQI > 5.875 Drop Drop Yes No Yes No No Yes Success No No Yes Yes Success Mobile Data Analytics

Drop Uplink SINR > -5.86 CQI > 5.875 Drop Drop Yes No Yes No No Yes Success No No Yes Yes Success Mobile Data Analytics Tasks operate on data ingested in (near) real-time for low-latency decisions

Drop Uplink SINR > -5.86 CQI > 5.875 Drop Drop Yes No Yes No No Yes Success No No Yes Yes Success Mobile Data Analytics Tasks operate on data ingested in (near) real-time for low-latency decisions Model/predict per-user/per-entity behavior

Latency-Accuracy Trade-off Data collection latency Model Accuracy

Latency-Accuracy Trade-off Data collection latency Model Accuracy Statistically insignificant

Latency-Accuracy Trade-off Data collection latency Model Accuracy Statistically insignificant High
latency

latency 1 hour latency for 94% accuracy

latency 1 hour latency for 94% accuracy Staleness enforces short interval analyses

latency 1 hour latency for 94% accuracy Staleness enforces short interval analyses Highest achieved accuracy ~66%

latency 1 hour latency for 94% accuracy Staleness enforces short interval analyses Highest achieved accuracy ~66% Need to update models frequently

Mitigating Latency-Accuracy Trade-off Data collection latency Model Accuracy

Mitigating Latency-Accuracy Trade-off Data collection latency Model Accuracy Efficient Task
Formulations

Mitigating Latency-Accuracy Trade-off Data collection latency Model Accuracy Efficient Task
Formulations Intelligent Data Grouping

Hybrid Multi-Task Learning Mitigating Latency-Accuracy Trade-off Data collection latency Model
Accuracy Efficient Task Formulations Intelligent Data Grouping

PCA based partitioning Hybrid Multi-Task Learning Mitigating Latency-Accuracy Trade-off Data
collection latency Model Accuracy Efficient Task Formulations Intelligent Data Grouping

Cellular RAN Performance Diagnostics

Goal: Diagnose problems using data collected at base stations Cellular
RAN Performance Diagnostics

Base stations vary widely in data ��
��

Base stations vary widely in data ��
�� Many base stations do not collect enough data in small intervals

Latency-Accuracy Trade-off in RANs 0 20 40 60 80 100
0 1 2 3 4 5 6 7 8 Accuracy (%) Data Collection Latency (minutes) Random Forest Lasso Regression 10 60 (Call drops) (Throughput)

0 1 2 3 4 5 6 7 8 Accuracy (%) Data Collection Latency (minutes) Random Forest Lasso Regression 10 60 High latency incurred for good accuracy (Call drops) (Throughput)

0 1 2 3 4 5 6 7 8 Accuracy (%) Data Collection Latency (minutes) Random Forest Lasso Regression 10 60 High latency incurred for good accuracy Staleness causes huge variance and errors (Call drops) (Throughput)

Cellscope Architecture CellScope Domain-Speciﬁc MTL Gradient Boosted Trees RAN Performance
Analyzer ML Lib Bearer Level Trace Dashboards Self-Organizing Networks (SON) Throughput Drop Feature Engineering PCA-Based Similarity Grouping Streaming Hybrid MTL

Multi Task Learning (MTL)

Multi Task Learning (MTL) Jointly learn many tasks by exploiting
commonalities and differences

commonalities and differences ℎ " = $(&' ( , &* " , … , &, ("))

commonalities and differences Data Train Model Task 1 Data Train Model Task 2 Data Train Model Task N … ℎ " = $(&' ( , &* " , … , &, ("))

commonalities and differences Data Train Model Task 1 Data Train Model Task 2 Data Train Model Task N … ℎ " = $(&' ( , &* " , … , &, (")) Data Task 1 Data Task 2 Data Task N … Model Model Model … Train

commonalities and differences Data Train Model Task 1 Data Train Model Task 2 Data Train Model Task N … ℎ " = $(&' ( , &* " , … , &, (")) Data Task 1 Data Task 2 Data Task N … Model Model Model … Train ℎ " = $./ (&' ( , &* " , … , &, ("))

commonalities and differences Data Train Model Task 1 Data Train Model Task 2 Data Train Model Task N … ℎ " = $(&' ( , &* " , … , &, (")) Data Task 1 Data Task 2 Data Task N … Model Model Model … Train ℎ " = $./ (&' ( , &* " , … , &, (")) Assumes that all tasks are related

MTL in Cellscope

MTL in Cellscope Train Data Task 1 Data Task 2
Data Task N … Model Model Model … ℎ " = $%& (() * , (, " , … , (. ("))

MTL in Cellscope Train Data Task 1 Data Task 2
Data Task N … Model Model Model … ℎ " = $%& (() * , (, " , … , (. (")) … … Train Data Task 1 Task 2 Task K … Model … Group 1 Data Data Model Model Train Data Task 1 Task 2 Task K … Model … Group N Data Data Model Model ℎ " = $0(%&) (() * , (, " , … , (. ("))

MTL in Cellscope Problem: Scalable maintenance of large number of
models Train Data Task 1 Data Task 2 Data Task N … Model Model Model … ℎ " = $%& (() * , (, " , … , (. (")) … … Train Data Task 1 Task 2 Task K … Model … Group 1 Data Data Model Model Train Data Task 1 Task 2 Task K … Model … Group N Data Data Model Model ℎ " = $0(%&) (() * , (, " , … , (. ("))

min $ % ℎ ': )*+ , - + /||
1(': )*+ )|| Hybrid MTL Model estimation by L1 regularized loss minimization

min $ % ℎ ': )*+ , - + /||
1(': )*+ )|| Hybrid MTL Model estimation by L1 regularized loss minimization Prediction error

min $ % ℎ ': )*+ , - + /||
1(': )*+ )|| Hybrid MTL Model estimation by L1 regularized loss minimization Per base-station parameters Prediction error

min $ % ℎ ': )*+ , - + /||
1(': )*+ )|| Hybrid MTL Model estimation by L1 regularized loss minimization Per base-station parameters Regularization parameter Prediction error

min $ % ℎ ': )*+ , - + /||
1(': )*+ )|| Hybrid MTL Model estimation by L1 regularized loss minimization Per base-station parameters Regularization parameter Prediction error Decompose parameters into shared common set fc and base station specific set fs

min $ % ℎ ': )*+ , - + /||
1(': )*+ )|| Hybrid MTL Model estimation by L1 regularized loss minimization Per base-station parameters Regularization parameter Prediction error Decompose parameters into shared common set fc and base station specific set fs $( ∑% ℎ ': )+ , )5 , - + /||1(': )+ )||) + /||1(': )5 )||

min $ % ℎ ': )*+ , - + /||
1(': )*+ )|| Hybrid MTL Model estimation by L1 regularized loss minimization Per base-station parameters Regularization parameter Prediction error Decompose parameters into shared common set fc and base station specific set fs Base-station specific $( ∑% ℎ ': )+ , )5 , - + /||1(': )+ )||) + /||1(': )5 )||

Hybrid MTL Structure of determines efficient implementation ℎ : #$
, #&

Hybrid MTL Structure of determines efficient implementation Restrict models to
be of form w . x ℎ : #$ , #&

Hybrid MTL Structure of determines efficient implementation Restrict models to
be of form w . x Leverage ensemble methods ℎ : #$ , #&

Hybrid MTL Structure of determines efficient implementation Dataset Restrict models
to be of form w . x Leverage ensemble methods ℎ : #$ , #&

Hybrid MTL Structure of determines efficient implementation Dataset Model 2
Model 3 Model 4 Model N Model 1 … Restrict models to be of form w . x Leverage ensemble methods ℎ : #$ , #&

Hybrid MTL Structure of determines efficient implementation Dataset Ensemble Model
Model 2 Model 3 Model 4 Model N Model 1 … Restrict models to be of form w . x Leverage ensemble methods ℎ : #$ , #&

Model 2 Model 3 Model 4 Model N Model 1 … Restrict models to be of form w . x Leverage ensemble methods ℎ : #$ , #& f1 f2 f3 f4 fN

Model 2 Model 3 Model 4 Model N Model 1 … Restrict models to be of form w . x Leverage ensemble methods Gradient Boosted Trees ℎ : #$ , #& f1 f2 f3 f4 fN

Hybrid MTL More details in the paper Structure of determines
efficient implementation Dataset Ensemble Model Model 2 Model 3 Model 4 Model N Model 1 … Restrict models to be of form w . x Leverage ensemble methods Gradient Boosted Trees ℎ : #$ , #& f1 f2 f3 f4 fN

Data Grouping for MTL Key Idea: Use Principal Component Analysis
(PCA) to find normal behavior

Data Grouping for MTL Project large number of correlated dimensions
to a small set of orthogonal dimensions. Key Idea: Use Principal Component Analysis (PCA) to find normal behavior

to a small set of orthogonal dimensions. Key Idea: Use Principal Component Analysis (PCA) to find normal behavior …

to a small set of orthogonal dimensions. Key Idea: Use Principal Component Analysis (PCA) to find normal behavior … n

to a small set of orthogonal dimensions. Key Idea: Use Principal Component Analysis (PCA) to find normal behavior … … … … … … … … n m Measurement matrix

to a small set of orthogonal dimensions. Key Idea: Use Principal Component Analysis (PCA) to find normal behavior … … … … … … … … … … … … … … … … … … … … n m n k Measurement matrix

to a small set of orthogonal dimensions. Key Idea: Use Principal Component Analysis (PCA) to find normal behavior … … … … … … … … … … … … … … … … … … … … n m n k loadings Measurement matrix

PCA Similarity Find the similarity between the principal components

PCA Similarity Find the similarity between the principal components …
… … … … … … … mA n … … … … … … … … mB n

… … … … … … … mA n … … … … … … … … mB n … … … … … … … … … … … … n k … … … … … … … … … … … … n k

… … … … … … … mA n … … … … … … … … mB n … … … … … … … … … … … … n k … … … … … … … … … … … … n k S"#$%%&'()$ = + ,-. / + 0-. 1 |30, − 50, |

… … … … … … … mA n … … … … … … … … mB n … … … … … … … … … … … … n k … … … … … … … … … … … … n k S"#$%%&'()$ = + ,-. / + 0-. 1 |30, − 50, | × 7809:;1'$(=,?)

Implementation & Evaluation § Implemented on Apache Spark § Extends
Mllib and provides a simple API for grouping § Evaluated using data from a live RAN § Data over several months § Models for two metrics: drops and throughput prediction § Also analyzed several issues in the wild

Cellscope Reduces Latency 0 20 40 60 80 100 0
1 2 3 4 5 6 7 8 Accuracy (%) Data Collection Latency (min) Per Base Station Cellscope 10 60 > 90% accuracy with 3 minutes data (compared to 60 minutes) 3 x

Cellscope Improves Accuracy 0 20 40 60 80 100 0
1 2 3 4 5 6 7 8 Accuracy (%) Data Collection Latency (min) Per Base Station Cellscope 10 60 Achieves high accuracy in small timespans 1.4 x 4 x

Real-world Analysis with Cellscope § Cellscope can significantly reduce operator
efforts § Reduces the need for field trials, can build accurate models quickly § Up to 2 order of magnitudes (10s of hours → minutes) § Cellscope found new issues previously unknown § E.g., Grouping revealed high interference base station clusters § Cellscope can aid domain expert § Can reduce the troubleshooting search space significantly

Much more in the paper… § Extending the techniques to
other domains § Straightforward & effective § Comparison with strawman grouping techniques § Why they’re not sufficient § Implementation details of our hybrid MTL & API § Extensive evaluation § Real-world analysis & findings

Summary § Mobile data analytics popular § Need low-latency decisions
on live data § Latency-Accuracy Trade-off § Not enough data in small timespans, staleness determines bounds on data collection latencies § Intelligent grouping and efficient task formulations § Hybrid MTL and PCA based partitioning http://www.cs.berkeley.edu/~api [email protected]

Mitigating the Latency-Accuracy Trade-off in Mo...

Mitigating the Latency-Accuracy Trade-off in Mobile Data Analytics Systems

More Decks by Anand Iyer

Other Decks in Research

Featured

Transcript