Skip to main content
Log in

Using Twitter data for transit performance assessment: a framework for evaluating transit riders’ opinions about quality of service

  • Original Paper
  • Published:
Public Transport Aims and scope Submit manuscript

Abstract

Social media platforms such as Facebook, Instagram, and Twitter have drastically altered the way information is generated and disseminated. These platforms allow their users to report events and express their opinions toward these events. The profusion of data generated through social media has proved to have the potential for improving the efficiency of existing traffic management systems and transportation analytics. This study complements existing literature by proposing a framework to evaluate transit riders’ opinion about quality of transit service using Twitter data. Although previous studies used keyword search to extract transit-related tweets, the extracted tweets can still be noisy and might not be relevant to transit quality of service at all. In this study, we leverage topic modeling, an unsupervised machine learning technique, to sift tweets that are relevant to the actual user experience of the transit system. Sentiment analysis is further performed based on the tweet-per-topic index we developed, to gauge transit riders’ feedback and explore the underlying reasons causing their dissatisfaction on the service. This framework can be potentially quite useful to transit agencies for user-oriented analysis and to assist with investment decision making.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  • Arias M, Arratia A, Xuriguera R (2013) Forecasting with Twitter data. ACM Trans Intell Syst Technol 5(1):1–24

    Article  Google Scholar 

  • Barreira N, Godinho P, Melo P (2013) Nowcasting unemployment rate and new car sales in South-western Europe with Google Trends. NETNOMICS Econ Res Electron Netw 14(3):129–165

    Article  Google Scholar 

  • Bernardo JM, Bayarri MJ, Berger JO, Dawid AP, Heckerman D, Smith AFM, West M (2003) The variational Bayesian EM algorithm for incomplete data: with application to scoring graphical model structures. Bayesian Stat 7:453–464

    Google Scholar 

  • Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022

    Google Scholar 

  • Bose S, Saha U, Kar D, Goswami S, Nayak AK, Chakrabarti S (2017) RSentiment: a tool to extract meaningful insights from textual reviews. In: Proceedings of the 5th international conference on frontiers in intelligent computing: theory and applications, Singapore

  • Bughin J (2015) Google searches and Twitter mood: nowcasting telecom sales performance. NETNOMICS Econ Res Electron Netw 16(1–2):87–105

    Article  Google Scholar 

  • Cheng Z, Caverlee J, Lee KD, Sui DZ (2011) Exploring millions of footprints in location sharing services. International AAAI conference on web and social media (ICWSM), pp 81-88

  • Cho E, Myers SA, Leskovec J (2011) Friendship and mobility: user movement in location-based social networks. In: Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining. ACM: 1082–1090

  • Collins C, Hasan S, Ukkusuri SV (2013) A novel transit rider satisfaction metric. J Public Transp 16(2):21–45

    Article  Google Scholar 

  • Farber S, Ritter B, Fu L (2016) Space–time mismatch between transit service and observed travel patterns in the Wasatch Front, Utah: a social equity perspective. Travel Behav Soc 4:40–48

    Article  Google Scholar 

  • Fayyaz SK, Liu XC, Porter RJ (2017) Dynamic transit accessibility and transit gap causality analysis. J Transp Geogr 59:27–39

    Article  Google Scholar 

  • Fu K, Nune R, Tao JX (2015) Social media data analysis for traffic incident detection and management. Transportation research board 94th annual meeting 15-4022, Washington, D.C

  • Gao H, Tang J, Liu H (2012) Exploring social-historical ties on location-based social networks. In: International AAAI conference on web and social media (ICWSM). The AAAI Press, California

  • Golder SA, Macy MW (2011) Diurnal and seasonal mood vary with work, sleep and daylength across diverse cultures. Science 333(6051):1878–1881

    Article  Google Scholar 

  • Goldsmith S (2017) L.A.’s testing ground for transportation efficiency, Mar. 2016. http://www.governing.com/blogs/bfc/gov-los-angeles-transportation-efficiency-mobility-management.html. Accessed 20 Jul 2017

  • Goodchild MF (2007) Citizens as sensors: the world of volunteered geography. GeoJournal 69(4):211–221

    Article  Google Scholar 

  • Hasan S, Zhan X, Ukkusuri SV (2013) Understanding urban human activity and mobility patterns using large-scale location-based data from online social media. In: Proceedings of the 2nd ACM international workshop on urban computing, pp 6:1–6:8

  • Hornik K, Grün B (2011) Topicmodels: an R package for fitting topic models. J Stat Softw 40(13):1–30

    Google Scholar 

  • Kaplan AM, Haenlein M (2010) Users of the world, unite! The challenges and opportunities of Social Media. Bus Horiz 53(1):59–68

    Article  Google Scholar 

  • Kosala R, Adi E (2012) Harvesting real time traffic information from Twitter. Procedia Eng 50:1–11

    Article  Google Scholar 

  • Lindsay BR (2011) Social media and disasters: current uses, future options, and policy considerations. Congress research service 41987

  • Liu B (2012) Sentiment analysis and opinion mining. Synthesis lectures on human language technologies, vol 5, no 1. Morgan & Claypool Publishers

  • Luong TT, Houston D (2015) Public opinions of light rail service in Los Angeles, an analysis using Twitter Data. iConference 2015 Proceedings, Philadelphia

    Google Scholar 

  • Maghrebi M, Abbasi A, Rashidi TH, Waller ST (2015) Complementing travel diary surveys with Twitter data: application of text mining techniques on activity location, type and time. 18th international conference on intelligent transportation systems (ITSC), Las Palmas, Spain

  • Mai E, Hranac R (2013) Twitter interactions as a data source for transportation incidents. Presented at Transportation Research Board 92nd annual meeting, Washington, D.C

  • Sakaki T, Okazaki M, Matsuo Y (2010) Earthquake shakes Twitter users: real-time event detection by social sensors. In: Proceedings of the 19th international conference on World Wide Web, Raleigh, North Carolina

  • Schweitzer L (2014) Planning and social media: a case study of public transit and stigma on Twitter. J Am Plan Assoc 80(3):218–238

    Article  Google Scholar 

  • Steiger E, Ellersiek T, Zipf A (2014) Explorative public transport flow analysis from uncertain social media data. In: Proceedings of the 3rd ACM SIGSPATIAL international workshop on crowd sourced and volunteered geographic information—GeoCrowd’14. New York, New York, ACM Press, pp 1–7

  • Steur RJ (2015) Twitter as a spatio-temporal source for incident management. Master’s Thesis, Utrecht University, Netherlands

  • Tasse D, Hong JI (2014) Using social media data to understand cities. In: Proceedings of NSF workshop on big data and urban informatics. Carnegie Mellon University, Pittsburg, Pennsylvania

  • Tian Y, Zmud M, Chiu YC, Carey D, Dale J, Smarda D, Lehr R, James R (2016) Quality assessment of social media traffic reports—a field study in Austin, Texas. Transportation Research Board 95th annual meeting, No. 16-6852, Washington, D.C

  • Transportation Research Board (2003) Transit capacity and quality of service manual. TCRP Report 100. National Academy Press, Washington, D.C.

  • Ukkusuri S, Zhan X, Sadri A, Ye Q (2014) Use of social media data to explore crisis informatics: study of 2013 Oklahoma Tornado. Transp Res Rec J Transp Res Board 2459:110–118

    Article  Google Scholar 

  • Vision Zero (2016) High injury network. http://visionzero.lacity.org/high-injury-network/. Accessed 20 Jul 2017

  • Wanichayapong N, Pruthipunyaskul W, Pattara-Atikom W, Chaovalit P (2011) Social-based traffic information extraction and classification. 2011 IEEE 11th international conference on ITS telecommunications, pp 107–112

  • Wei R, Liu X, Wang L, Golub A, Farber S (2017) Evaluating public transit services for operational efficiency and access equity. J Transp Geogr 65:70–79

    Article  Google Scholar 

  • Yin Z, Fabbri D, Rosenbloom ST, Malin B (2015) A scalable framework to detect personal health mentions on Twitter. J Med Internet Res 17(6):e138

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaoyue Cathy Liu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Haghighi, N.N., Liu, X.C., Wei, R. et al. Using Twitter data for transit performance assessment: a framework for evaluating transit riders’ opinions about quality of service. Public Transp 10, 363–377 (2018). https://doi.org/10.1007/s12469-018-0184-4

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12469-018-0184-4

Keywords

Navigation