"Can you believe [1:21]?!": Content and Time-Based Reference Patterns in Video Comments

Authors:
Matin Yarmand, Dongwook Yoon, Samuel Dodson, Ido Roll, Sidney S. Fels
(all University of British Columbia, Vancouver, BC, Canada)

CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, May 2019, Paper No. 489, Pages 1–12. https://doi.org/10.1145/3290605.3300719

Published: 02 May 2019

ABSTRACT

As videos become increasingly ubiquitous, so does video-based commenting. To contextualize comments, people often reference specific audio/visual content within a video. However, the literature falls short of explaining the types of video content people refer to, how they establish references and identify referents, how video characteristics (e.g., genre) impact referencing behaviors, and how references impact social engagement. We present a taxonomy for classifying video references by referent type and temporal specificity. Using our taxonomy, we analyzed 2.5K references with quotations and timestamps collected from public YouTube comments. We found: 1) people reference intervals of video more frequently than time-points, 2) visual entities are referenced more often than sounds, and 3) comments with quotes are more likely to receive replies but not more "likes". We discuss the need for in-situ dereferencing user interfaces, illustrate design concepts for typed referencing features, and provide a dataset for future studies.
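To make the notion of timestamp and quotation references concrete, the following is a minimal sketch of how such references might be spotted in raw comment text. It is not the authors' collection or coding procedure: the regular expressions and the function names `find_timestamps` and `find_quotes` are assumptions introduced here purely for illustration.

```python
import re
from typing import List

# Assumed pattern: m:ss, mm:ss, or h:mm:ss, not embedded in a longer digit/colon run.
TIMESTAMP_RE = re.compile(r"(?<![\d:])(?:(\d{1,2}):)?(\d{1,2}):(\d{2})(?![\d:])")

# Assumed pattern: text enclosed in straight or curly double quotes.
QUOTE_RE = re.compile(r'["\u201c]([^"\u201d]+)["\u201d]')


def find_timestamps(comment: str) -> List[int]:
    """Return every timestamp found in the comment, converted to seconds."""
    offsets = []
    for hours, minutes, seconds in TIMESTAMP_RE.findall(comment):
        if int(seconds) >= 60:
            continue  # skip strings like 1:75 that are not valid clock times
        offsets.append(int(hours or 0) * 3600 + int(minutes) * 60 + int(seconds))
    return offsets


def find_quotes(comment: str) -> List[str]:
    """Return the text of every double-quoted span in the comment."""
    return QUOTE_RE.findall(comment)
```

A comment such as `Can you believe [1:21]?!` would yield one time-point at 81 seconds, while a comment containing two timestamps (for example `from 0:45 to 1:10`) could be a candidate interval reference; deciding that still requires human judgment of the kind the taxonomy describes.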

Supplemental Material

paper489p.mp4 (mp4, 2 MB)

pn4375.mp4 (mp4, 28.4 MB)

Available for Download

paper489pvc.zip (1.2 KB)

Preview video captions

pn4375.zip (446.3 KB)

Our dataset consists of 2,994 potential references from 7 genres, collected using the YouTube Data API. Each entry includes a number of comment and video attributes (such as video link and comment ID), as well as 3 attributes that were assigned during our coding process (Referent Type, Expression, and Temporal Specificity).

pn4375vc.zip (2.9 KB)

Video figure captions
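
The dataset entries described above could be represented in code roughly as follows. This is a sketch under stated assumptions, not the actual file schema: the field names, the `ReferenceEntry` class, the `count_by` helper, and all sample values are hypothetical, chosen only to mirror the attributes named in the dataset description.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class ReferenceEntry:
    """One potential reference; field names are assumed, not taken from the file."""
    video_link: str
    comment_id: str
    referent_type: str          # coded attribute: Referent Type
    expression: str             # coded attribute: Expression
    temporal_specificity: str   # coded attribute: Temporal Specificity


def count_by(entries: Iterable[ReferenceEntry], attribute: str) -> Counter:
    """Tally entries by the value of one attribute."""
    return Counter(getattr(entry, attribute) for entry in entries)


# Placeholder rows for illustration only; none of these values come from the dataset.
SAMPLE = [
    ReferenceEntry("https://example.com/v1", "c1", "visual", "timestamp", "interval"),
    ReferenceEntry("https://example.com/v1", "c2", "sound", "quote", "interval"),
    ReferenceEntry("https://example.com/v2", "c3", "visual", "timestamp", "time-point"),
]

if __name__ == "__main__":
    print(count_by(SAMPLE, "temporal_specificity"))
```

Tallying by one coded attribute at a time is the simplest way to reproduce comparisons of the kind reported in the abstract, such as intervals versus time-points.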

Index Terms

1. Applied computing
  1. Document management and text processing
    1. Document preparation
      1. Hypertext / hypermedia creation
2. Human-centered computing
  1. Collaborative and social computing
    1. Empirical studies in collaborative and social computing
  2. Human computer interaction (HCI)
    1. Empirical studies in HCI

Published in
CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems
May 2019
9077 pages
ISBN: 9781450359702
DOI: 10.1145/3290605
General Chairs: Stephen Brewster (University of Glasgow, Scotland, UK), Geraldine Fitzpatrick (TU Wien, Austria)
Program Chairs: Anna Cox (University College London, UK), Vassilis Kostakos (University of Melbourne, Australia)
Copyright © 2019 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Publisher
Association for Computing Machinery
New York, NY, United States
Badges
- Honorable Mention
Author Tags
comment
engagement
reference
timestamp
video
youtube
Qualifiers
- research-article
Acceptance Rates
CHI '19 Paper Acceptance Rate: 703 of 2,958 submissions, 24%
Overall Acceptance Rate: 6,199 of 26,314 submissions, 24%