Feasibility of estimating travel demand using geolocations of social media data

Saved in:
Bibliographic Details
Title: Feasibility of estimating travel demand using geolocations of social media data
Authors: Liao, Yuan, 1991, Yeh, Sonia, 1973, Gil, Jorge, 1972
Source: Hållbara städer: användande av stora datamängder för att förstå och hantera rörelsemönster och trafikstockningar Transportation. 49(1):137-161
Subject Terms: gravity model, social media data, lateral data, origin-destination estimation, travel demand, longitudinal data
Description: Travel demand estimation, as represented by an origin–destination (OD) matrix, is essential for urban planning and management. Compared to data typically used in travel demand estimation, the key strengths of social media data are that they are low-cost, abundant, available in real-time, and free of geographical partition. However, the data also have significant limitations: population and behavioural biases, and lack of important information such as trip purpose and social demographics. This study systematically explores the feasibility of using geolocations of Twitter data for travel demand estimation by examining the effects of data sparsity, spatial scale, sampling methods, and sample size. We show that Twitter data are suitable for modelling the overall travel demand for an average weekday but not for commuting travel demand, due to the low reliability of identifying home and workplace. Collecting more detailed, long-term individual data from user timelines for a small number of individuals produces more accurate results than short-term data for a much larger population within a region. We developed a novel approach using geotagged tweets as attraction generators as opposed to the commonly adopted trip generators. This significantly increases usable data, resulting in better representation of travel demand. This study demonstrates that Twitter can be a viable option for estimating travel demand, though careful consideration must be given to sampling method, estimation model, and sample size.
File Description: electronic
Access URL: https://research.chalmers.se/publication/522089
https://research.chalmers.se/publication/523131
https://research.chalmers.se/publication/522358
https://research.chalmers.se/publication/523131/file/523131_Fulltext.pdf
Database: SwePub
Description
Abstract:Travel demand estimation, as represented by an origin–destination (OD) matrix, is essential for urban planning and management. Compared to data typically used in travel demand estimation, the key strengths of social media data are that they are low-cost, abundant, available in real-time, and free of geographical partition. However, the data also have significant limitations: population and behavioural biases, and lack of important information such as trip purpose and social demographics. This study systematically explores the feasibility of using geolocations of Twitter data for travel demand estimation by examining the effects of data sparsity, spatial scale, sampling methods, and sample size. We show that Twitter data are suitable for modelling the overall travel demand for an average weekday but not for commuting travel demand, due to the low reliability of identifying home and workplace. Collecting more detailed, long-term individual data from user timelines for a small number of individuals produces more accurate results than short-term data for a much larger population within a region. We developed a novel approach using geotagged tweets as attraction generators as opposed to the commonly adopted trip generators. This significantly increases usable data, resulting in better representation of travel demand. This study demonstrates that Twitter can be a viable option for estimating travel demand, though careful consideration must be given to sampling method, estimation model, and sample size.
ISSN:00494488
15729435
DOI:10.1007/s11116-021-10171-x