ABSTRACT
Hierarchy fundamentally shapes how we act at work. In this paper, we explore the relationship between the words people write in workplace email and the rank of the email's recipient. Using the Enron corpus as a dataset, we perform a close study of the words and phrases people send to those above them in the corporate hierarchy versus those at the same level or lower. We find that certain words and phrases are strong predictors of the recipient's relative rank. For example, "thought you would" strongly suggests that the recipient outranks the sender, while "let's discuss" implies the opposite. We also find that the phrases people write to their bosses do not demonstrate cognitive processes as often as the ones they write to others. We conclude this paper by interpreting our results and announcing the release of the predictive phrases as a public dataset, perhaps enabling a new class of status-aware applications.
Index Terms
- Phrases that signal workplace hierarchy