Skip to main content
Top

2020 | OriginalPaper | Chapter

4. Using Non-negative Tensor Decomposition for Unsupervised Textual Influence Modeling

Authors : Robert E. Lowe, Michael W. Berry

Published in: Supervised and Unsupervised Learning for Data Science

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Documents are seldom created in a vacuum. In all literature, there exists some influencing factor either in the form of cited documents, collaboration, or documents which authors have read. This influence can be seen within their works, and is present as a latent variable. This chapter demonstrates a novel method for quantifying these influences and representing them in a semantically understandable fashion. The model is constructed by representing documents as tensors, decomposing them into a set of factors, and then searching the corpus factors for similarity.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Antonia, A., Craig, H., & Elliott, J. (2014). Language chunking, data sparseness, and the value of a long marker list: Explorations with word n-grams and authorial attribution. Literary and Linguistic Computing, 29(2), 147–163.CrossRef Antonia, A., Craig, H., & Elliott, J. (2014). Language chunking, data sparseness, and the value of a long marker list: Explorations with word n-grams and authorial attribution. Literary and Linguistic Computing, 29(2), 147–163.CrossRef
3.
go back to reference Bader, B., Berry, M. W., & Browne, M. (2007). Discussion tracking in Enron email using PARAFAC. In M. W. Berry & M. Castellanos (Eds.), Survey of Text Mining, chapter 8 (pp. 147–163). Berlin: Springer. Bader, B., Berry, M. W., & Browne, M. (2007). Discussion tracking in Enron email using PARAFAC. In M. W. Berry & M. Castellanos (Eds.), Survey of Text Mining, chapter 8 (pp. 147–163). Berlin: Springer.
4.
go back to reference Bro, R. (1997). Parafac. Tutorial and applications. Chemometrics and Intelligent Laboratory Systems, 38(2), 149–171.CrossRef Bro, R. (1997). Parafac. Tutorial and applications. Chemometrics and Intelligent Laboratory Systems, 38(2), 149–171.CrossRef
5.
go back to reference Burrows, J. (2006). All the way through: Testing for authorship in different frequency strata. Literary and Linguistic Computing, 22(1), 27–47.CrossRef Burrows, J. (2006). All the way through: Testing for authorship in different frequency strata. Literary and Linguistic Computing, 22(1), 27–47.CrossRef
6.
go back to reference Burrows, J., & Craig, H. (2017). The joker in the pack?: Marlowe, Kyd, and the co-authorship of Henry VI, part 3. In G. Taylor & G. Egan (Eds.), The New Oxford Shakespeare Authorship Companion, chapter 11 (pp. 194–217). Oxford: Oxford University Press. Burrows, J., & Craig, H. (2017). The joker in the pack?: Marlowe, Kyd, and the co-authorship of Henry VI, part 3. In G. Taylor & G. Egan (Eds.), The New Oxford Shakespeare Authorship Companion, chapter 11 (pp. 194–217). Oxford: Oxford University Press.
7.
go back to reference Cantuariensis, A. (c. 1078). Proslogion. Ordo Sancti Benedicti. Cantuariensis, A. (c. 1078). Proslogion. Ordo Sancti Benedicti.
8.
go back to reference Carroll, J. D., & Chang, J.-J. (1970). Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition. Psychometrika, 35(3), 283–319.CrossRef Carroll, J. D., & Chang, J.-J. (1970). Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition. Psychometrika, 35(3), 283–319.CrossRef
9.
go back to reference Comon, P., Luciani, X., & De Almeida, A. L. (2009). Tensor decompositions, alternating least squares and other tales. Journal of Chemometrics, 23(7–8), 393–405.CrossRef Comon, P., Luciani, X., & De Almeida, A. L. (2009). Tensor decompositions, alternating least squares and other tales. Journal of Chemometrics, 23(7–8), 393–405.CrossRef
10.
go back to reference Craig, H., & Kinney, A. F. (2009). Sheakespeare, Computers, and the Mystery of Authorship. Cambridge: Cambridge University Press.CrossRef Craig, H., & Kinney, A. F. (2009). Sheakespeare, Computers, and the Mystery of Authorship. Cambridge: Cambridge University Press.CrossRef
11.
go back to reference Dietz, L., Bickel, S., & Scheffer, T. (2007). Unsupervised prediction of citation influences. In Proceedings of the 24th International Conference on Machine Learning (pp. 233–240). New York: ACM. Dietz, L., Bickel, S., & Scheffer, T. (2007). Unsupervised prediction of citation influences. In Proceedings of the 24th International Conference on Machine Learning (pp. 233–240). New York: ACM.
12.
go back to reference Harshman, R. A. (1970). Foundations of the parafac procedure: Models and conditions for an “explanatory” multi-modal factor analysis. UCLA Working Papers in Phonetics, 16. Harshman, R. A. (1970). Foundations of the parafac procedure: Models and conditions for an “explanatory” multi-modal factor analysis. UCLA Working Papers in Phonetics, 16.
13.
go back to reference Harshman, R. A., & Lundy, M. E. (1994). Parafac: Parallel factor analysis. Computational Statistics & Data Analysis, 18(1):39–72.CrossRef Harshman, R. A., & Lundy, M. E. (1994). Parafac: Parallel factor analysis. Computational Statistics & Data Analysis, 18(1):39–72.CrossRef
15.
go back to reference Hitchcock, F. L. (1927). The expression of a tensor or a polyadic as a sum of products. Journal of Mathematics and Physics, 6(1–4), 164–189.CrossRef Hitchcock, F. L. (1927). The expression of a tensor or a polyadic as a sum of products. Journal of Mathematics and Physics, 6(1–4), 164–189.CrossRef
16.
go back to reference Jiang, Z., Liu, X., & Gao, L. (2014). Dynamic topic/citation influence modeling for chronological citation recommendation. In Proceedings of the 5th International Workshop on Web-scale Knowledge Representation Retrieval & Reasoning (pp. 15–18). New York: ACM. Jiang, Z., Liu, X., & Gao, L. (2014). Dynamic topic/citation influence modeling for chronological citation recommendation. In Proceedings of the 5th International Workshop on Web-scale Knowledge Representation Retrieval & Reasoning (pp. 15–18). New York: ACM.
17.
go back to reference Kawamae, N. (2016). N-gram over context. In Proceedings of the 25th International Conference on World Wide Web (pp. 1045–1055). New York: International World Wide Web Conferences Steering Committee.CrossRef Kawamae, N. (2016). N-gram over context. In Proceedings of the 25th International Conference on World Wide Web (pp. 1045–1055). New York: International World Wide Web Conferences Steering Committee.CrossRef
19.
20.
go back to reference Lin, J., Keogh, E., Lonardi, S., & Chiu, B. (2003). A symbolic representation of time series, with implications for streaming algorithms. In Proceedings of the 8th ACM SIGMOD workshop on Research Issues in Data Mining and Knowledge Discovery (pp. 2–11). New York: ACM. Lin, J., Keogh, E., Lonardi, S., & Chiu, B. (2003). A symbolic representation of time series, with implications for streaming algorithms. In Proceedings of the 8th ACM SIGMOD workshop on Research Issues in Data Mining and Knowledge Discovery (pp. 2–11). New York: ACM.
21.
go back to reference Liu, J., Liu, J., Wonka, P., & Ye, J. (2012). Sparse non-negative tensor factorization using columnwise coordinate descent. Pattern Recognition, 45(1), 649–656.CrossRef Liu, J., Liu, J., Wonka, P., & Ye, J. (2012). Sparse non-negative tensor factorization using columnwise coordinate descent. Pattern Recognition, 45(1), 649–656.CrossRef
22.
go back to reference Lowe, R. E. (2018). Textual Influence Modeling Through Non-Negative Tensor Decomposition. PhD thesis, University of Tennessee, Knoxville. Lowe, R. E. (2018). Textual Influence Modeling Through Non-Negative Tensor Decomposition. PhD thesis, University of Tennessee, Knoxville.
23.
go back to reference Serfass, D. (2017). Dynamic biometric recognition of handwritten digits using symbolic aggregate approximation. In Proceedings of the SouthEast Conference (pp. 1–4). New York: ACM. Serfass, D. (2017). Dynamic biometric recognition of handwritten digits using symbolic aggregate approximation. In Proceedings of the SouthEast Conference (pp. 1–4). New York: ACM.
24.
go back to reference Stamatatos, E. (2011). Plagiarism detection based on structural information. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management (pp. 1221–1230). New York: ACM. Stamatatos, E. (2011). Plagiarism detection based on structural information. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management (pp. 1221–1230). New York: ACM.
Metadata
Title
Using Non-negative Tensor Decomposition for Unsupervised Textual Influence Modeling
Authors
Robert E. Lowe
Michael W. Berry
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-22475-2_4