Efficient Spatial Sampling of Large Geographical Tables

Efficient Spatial Sampling of Large Geographical Tables (SIGMOD '12 / TODS '13)
Anish Das Sarma, Hongrae Lee, Hector Gonzalez, Jayant Madhavan, Alon Halevy
Google Research
Presented by Emaad Ahmed Manzoor
March 10, 2014

Thinning

Constraints, Objectives, Challenges

Definitions

Constraints: Visibility, Zoom Consistency, Adjacency

The Thinning Problem

K = 1
M1 = { 4, 4, 4, 4, 4 }
M2 = { 1, 3, 4, 4, 4 }
M3 = { 2, 3, 4, 4, 4 }
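The example above can be made concrete with a small feasibility check. This is a minimal sketch under stated assumptions, not the authors' code: it reads each M as a list of per-record minimum zoom levels (a record is visible at every level at or beyond its minimum, and the value 4 means "never visible" when there are 3 levels), it uses point records only, and the coordinates in POINTS and the helpers cell and is_feasible are hypothetical, since the figure that accompanies the slide is not part of this transcript.

```python
from collections import Counter

Z = 3  # assumed number of zoom levels; a min-level of Z + 1 means "never visible"

# Hypothetical point records in the unit square (not taken from the slides).
POINTS = [(0.1, 0.1), (0.3, 0.1), (0.6, 0.6), (0.7, 0.7), (0.9, 0.2)]


def cell(point, z):
    """Return the grid cell holding `point` at zoom level z.

    Level 1 is a single cell; each further level halves the cell side.
    """
    side = 2 ** (z - 1)
    x, y = point
    return (min(int(x * side), side - 1), min(int(y * side), side - 1))


def is_feasible(points, min_levels, k):
    """Check the visibility constraint: at most k visible records per cell.

    Zoom consistency holds by construction, because a record with
    min-level m is treated as visible at every level z >= m.
    """
    for z in range(1, Z + 1):
        visible_per_cell = Counter(
            cell(p, z) for p, m in zip(points, min_levels) if m <= z
        )
        if any(count > k for count in visible_per_cell.values()):
            return False
    return True


M1 = [4, 4, 4, 4, 4]
M2 = [1, 3, 4, 4, 4]
M3 = [2, 3, 4, 4, 4]
```

On this toy data all three assignments satisfy K = 1, while an assignment such as [1, 2, 4, 4, 4] does not, because the first two points share a cell at level 2.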

Objectives: Maximality, Fairness, Importance

K = 1
M1 = { 4, 4, 4, 4, 4 }
M2 = { 1, 3, 4, 4, 4 }
M3 = { 2, 3, 4, 4, 4 }

Problem
Constraints: Visibility, Zoom Consistency, Adjacency
Objectives: Maximality, Fairness, Importance

Optimization

Integer Programming

Variables

Sampling Constraints

Zoom Consistency & Visibility Constraints

Thinning solution
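The deck's own variables and constraints are not reproduced in this transcript. As a rough guide to what such a program can look like, here is one generic integer-programming formulation of thinning. It is a sketch under assumptions and may differ from the formulation on the slides: the symbols R (the set of records), Z (the number of zoom levels) and the binary variables x_{r,z} (record r is visible at level z) are introduced here for illustration only.

```latex
\begin{align*}
\text{maximize}\quad
  & \sum_{r \in R} \sum_{z=1}^{Z} x_{r,z} \\
\text{subject to}\quad
  & \sum_{r \,:\, r \cap c \neq \emptyset} x_{r,z} \le K
    && \text{for every cell } c \text{ at level } z
    && \text{(visibility)} \\
  & x_{r,z} \le x_{r,z+1}
    && \text{for every } r \in R,\ 1 \le z < Z
    && \text{(zoom consistency)} \\
  & x_{r,z} \in \{0, 1\}
    && \text{for every } r \in R,\ 1 \le z \le Z
\end{align*}
```

In this sketch adjacency is implicit: a record has a single variable per level, so it is either shown in every cell it touches at that level or in none. The min-level of a record can be read back as M(r) = Z + 1 - \sum_z x_{r,z}, and replacing the last constraint with 0 \le x_{r,z} \le 1 gives the linear relaxation.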

Program Size

Critical nodes

Bounded Cover

Critical nodes

Program Size

Relaxing Integer Constraints

Objectives

Maximality

Strong Maximality
There does not exist M’ such that:

K = 1
M1 = { 4, 4, 4, 4, 4 }
M2 = { 1, 3, 4, 4, 4 }
M3 = { 2, 3, 4, 4, 4 }
M4 = { 1, 4, 4, 4, 3 }

Strong Maximality is NP-Hard

Weak Maximality
There does not exist M’ such that: for some 1 <= i <= n

K = 1
M1 = { 4, 4, 4, 4, 4 }
M2 = { 1, 3, 4, 4, 4 }
M3 = { 2, 3, 4, 4, 4 }
M4 = { 1, 4, 4, 4, 3 }

K = 1
M2 = { 1, 3, 4, 4, 4 }

Point-only Datasets

Experiments

2.67GHz quad-core
12GB (starting at 1GB, or 4GB for the scalability tests)
Java 1.6
Apache Simplex
K=500
“Some plots were too big, so we threw them out.”

Program Size

Integer Relaxation

Scalability

Objectives

Takeaways

Use DFS if you care only about maximality
Otherwise use the minimised LP
The randomized points-only algorithm consumes constant memory and scales arbitrarily (not shown)
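Since the randomized points-only algorithm is not shown in the deck, the following is an assumed stand-in rather than the authors' implementation. It illustrates one simple way to get a zoom-consistent random sample of points: give every point a random priority once, and in each cell show the K points with the smallest priority. A point that ranks in the top K of a coarse cell also ranks in the top K of the finer cell that contains it, so visibility never switches off when zooming in. The function names, the unit-square grid and the `levels` parameter are hypothetical, and this version holds all points in memory for brevity.

```python
import random
from collections import defaultdict


def cell(point, z):
    """Return the grid cell holding `point` at zoom level z (level 1 is one cell)."""
    side = 2 ** (z - 1)
    x, y = point
    return (min(int(x * side), side - 1), min(int(y * side), side - 1))


def random_thinning(points, k, levels, seed=0):
    """Return a min-level per point; levels + 1 means the point is never shown."""
    rng = random.Random(seed)
    # One random priority per point, reused at every zoom level.
    priority = [rng.random() for _ in points]
    min_level = [levels + 1] * len(points)

    # Walk from the finest level to the coarsest, so the last write for a
    # point is the coarsest level at which it is still among the top k.
    for z in range(levels, 0, -1):
        members = defaultdict(list)
        for i, p in enumerate(points):
            members[cell(p, z)].append(i)
        for indices in members.values():
            for i in sorted(indices, key=lambda j: priority[j])[:k]:
                min_level[i] = z
    return min_level
```

For example, `random_thinning([(0.1, 0.1), (0.3, 0.1), (0.6, 0.6), (0.7, 0.7), (0.9, 0.2)], k=1, levels=3)` assigns exactly one point to level 1, whichever point drew the smallest priority.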
