Published in: Journal of Intelligent Information Systems 1/2019

22.10.2018

Deep recurrent convolutional networks for inferring user interests from social media

Authors: Jaeyong Kang, HongSeok Choi, Hyunju Lee


Abstract

Online social media services, such as Facebook and Twitter, have recently increased in popularity. Determining the subjects of individual posts is important for extracting users’ interests from social media, but the task is nontrivial because posts are highly contextualized, informal, and limited in length. To address this problem, we propose a deep-neural-network-based approach for predicting user interests in social media. In our framework, a word-embedding technique maps the words in social media content into vectors. These vectors are used as input to a bidirectional gated recurrent unit (biGRU). The output of the biGRU and the word-embedding vectors are then used to construct a sentence matrix, which serves as input to a convolutional neural network (CNN) model that predicts a user’s interests. Experimental results show that our proposed method, which combines biGRU and CNN models, outperforms existing methods for identifying users’ interests from social media. In addition, posts in social media are sensitive to trends and change with time. We therefore collected posts from two different social media platforms at different time intervals, trained the proposed model on one data set, and tested it on the other. The experimental results show that our proposed model can predict users’ interests on the independent data set with high accuracy.
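The pipeline described in the abstract (word embeddings, then a biGRU, then a sentence matrix, then a CNN) can be sketched as a forward pass in plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how the sentence matrix is built, so per-word concatenation of the forward state, the embedding, and the backward state is assumed here. The dimensions, the category labels, the example tokens, and all helper names (GRU, sentence_matrix, cnn_predict) are hypothetical, and the weights are random rather than trained.

```python
import math
import random

random.seed(0)

def rand_matrix(rows, cols):
    """Small random weights; a trained model would learn these instead."""
    return [[random.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

class GRU:
    """Single-direction GRU with a standard gated update; biases omitted."""
    def __init__(self, in_dim, hid_dim):
        self.hid_dim = hid_dim
        # One weight matrix per gate, applied to the concatenation [x; h].
        self.Wz = rand_matrix(hid_dim, in_dim + hid_dim)  # update gate
        self.Wr = rand_matrix(hid_dim, in_dim + hid_dim)  # reset gate
        self.Wh = rand_matrix(hid_dim, in_dim + hid_dim)  # candidate state

    def run(self, xs):
        h = [0.0] * self.hid_dim
        states = []
        for x in xs:
            z = [sigmoid(a) for a in matvec(self.Wz, x + h)]
            r = [sigmoid(a) for a in matvec(self.Wr, x + h)]
            gated = [ri * hi for ri, hi in zip(r, h)]
            cand = [math.tanh(a) for a in matvec(self.Wh, x + gated)]
            h = [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]
            states.append(h)
        return states

def sentence_matrix(tokens, embeddings, fwd, bwd):
    xs = [embeddings[t] for t in tokens]
    hf = fwd.run(xs)
    hb = bwd.run(xs[::-1])[::-1]  # run right-to-left, then realign to word order
    # ASSUMPTION: one row per word, [forward state; embedding; backward state].
    return [f + x + b for f, x, b in zip(hf, xs, hb)]

def cnn_predict(matrix, filters, width, w_out):
    """Convolution over word windows, ReLU, max-over-time pooling, softmax.
    Assumes the post has at least `width` tokens."""
    feats = []
    for filt in filters:
        acts = []
        for i in range(len(matrix) - width + 1):
            window = [v for row in matrix[i:i + width] for v in row]
            acts.append(max(0.0, sum(w * v for w, v in zip(filt, window))))
        feats.append(max(acts))
    logits = matvec(w_out, feats)
    top = max(logits)
    exps = [math.exp(l - top) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical sizes, labels, and example post.
EMB, HID, WIDTH, N_FILTERS = 4, 3, 2, 5
CATEGORIES = ["sports", "politics", "technology"]
tokens = ["new", "phone", "launch", "today"]
embeddings = {w: [random.uniform(-1, 1) for _ in range(EMB)] for w in tokens}

fwd, bwd = GRU(EMB, HID), GRU(EMB, HID)
row_dim = HID + EMB + HID
filters = rand_matrix(N_FILTERS, WIDTH * row_dim)  # each row is one flattened filter
w_out = rand_matrix(len(CATEGORIES), N_FILTERS)

matrix = sentence_matrix(tokens, embeddings, fwd, bwd)
probs = cnn_predict(matrix, filters, WIDTH, w_out)
print(dict(zip(CATEGORIES, probs)))
```

With untrained weights the printed probabilities carry no meaning; the sketch only shows how the shapes fit together. Each post becomes a matrix with one row per word, and max-over-time pooling reduces it to a fixed-length feature vector regardless of post length.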

Metadata
Title
Deep recurrent convolutional networks for inferring user interests from social media
Authors
Jaeyong Kang
HongSeok Choi
Hyunju Lee
Publication date
22.10.2018
Publisher
Springer US
Published in
Journal of Intelligent Information Systems / Issue 1/2019
Print ISSN: 0925-9902
Electronic ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-018-0534-3
