Efficient Spatial Sampling of Large
Geographical Tables
(SIGMOD ‘12 / TODS ‘13)
Anish Das Sarma, Hongrae Lee, Hector Gonzalez, Jayant Madhavan, Alon Halevy
Google Research
Presented by Emaad Ahmed Manzoor
March 10, 2014
2.67GHz quad-core
12GB (starting at 1GB, or 4GB for the scalability tests)
Java 1.6
Apache Simplex
K=500
“Some plots were too big, so we threw them out.”
Slide 51
Slide 51 text
Program Size
Slide 52
Slide 52 text
No content
Slide 53
Slide 53 text
Integer Relaxation
Slide 54
Slide 54 text
Scalability
Slide 55
Slide 55 text
No content
Slide 56
Slide 56 text
Objectives
Slide 57
Slide 57 text
No content
Slide 58
Slide 58 text
No content
Slide 59
Slide 59 text
Takeaways
Slide 60
Slide 60 text
Use DFS if you care only about maximality
Otherwise use the minimised LP
The randomized points-only algorithm consumes constant
memory and scales arbitrarily (not shown)