social media newspapers, e-commerce transactions PHYSICAL SENSORS satellite imagery nightlights data CROWDSOURCING Volunteered Geographic Information Structured Versus Unstructured Data Varying Levels of spatial and temporal variation Repurposed Data Sources Data Characteristics
Data (text, video, photos) Digitization of documents and content Sentiment/Preferences Crowdsourcing What’s New? Ripe for Travel Demand Analysis (And Transportation In General) What’s It For Us?
Dynamic Models and Dynamic Data Old Data with a New Twist GPS and Emerging Data in Travel Forecasting International Transport Forum 2014 Summit, Leipzig •Big Data in Transport: Applications, Implications, Limitations Exploratory Advanced Research •Video feature extraction automation •Real time data for connected highway-vehicle systems •FY 2014 Solicitation, Data for safety analysis and freight Source: Shanjiang Zhu, George Mason University …And of course Transportation Datapalooza!
and why do people travel? • What mode of transport do they use? • What routes do they take? • Do they change their travel behavior under certain circumstances? • Do they travel with others? • How do activities drive travel behavior? • How does travel behavior vary depending on characteristics of individuals? • What are the preferences of travelers? • Capacity Planning/Expansions • Transportation Demand Management • Strategies to Enhance Accessibility (Including Disadvantaged Segments of the population) • Livable/Sustainable Solutions • Democratization of Transport Planning/Policy Process
Lack of consistency across regions Lack information on socialization/social ties Small samples, margin of error Short-Distance Travel Information on Residents Costly, Timely to Collect Very fine-grained levels of resolution Available for many, if not all parts of the globe Social networks Large Datasets Long-Distance Information on Visitors (In addition to Residents) Costly, Timely to Process
mobility and resulting contacts between people have a high degree of predictability (Song et al., 2010)” Regular patterns across population, despite the different socio-economic characteristics Of Individuals and travel context Scaling in distances travelled, time spent at locations and popularity of places Artefact of Underlying Development or Do We Care? Example of Scaling Behavior Source: brenocom.com
Source: Cheng et al. (2011), Exploring Millions of Footprints in Location Sharing Services, Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media
other set represent individuals The full dataset contains 4,491,143 “check-ins” made by Brightkite users over the period April 2008-October 2010. The undirected social graph is comprised of 58,228 individuals and 214, 078 social ties (Cho et al. 2011). We extracted “check-ins for only t hose locations in the contiguous United States, and assigned “check-ins” to counties. There were almost 3 million “check- ins, 21,001 unique users and 2706 counties with at least one “check-in.” Schintler, L.A., Kulkarni, R. Haynes, K. and R. Stough (2014). Sensing “Socio-Spatio” Interaction and Accessibility from Location-Sharing Services Data. eds. Margerida, A. et al. Accessibility and Spatial Interaction
Observations not independent Issues related to temporal/spatial data Traditional Methods Do Not Apply Inequities Computational Complexity www.thesalesblog.com
the statistical accuracy of big data, what types of bias are present, how can they be corrected for, and what analytical methods are best-suited for the data!!