Journal of Intelligent Information Systems

Published: 22 October 2018

Deep recurrent convolutional networks for inferring user interests from social media

Authors: Jaeyong Kang, HongSeok Choi, Hyunju Lee

Published in: Journal of Intelligent Information Systems | Issue 1/2019

Abstract

Online social media services, such as Facebook and Twitter, have recently increased in popularity. Determining the subjects of individual posts is important for extracting users’ interests from social media, but the task is nontrivial because posts are highly contextualized, informal, and limited in length. To address this problem, we propose a deep-neural-network-based approach for predicting user interests in social media. In our framework, a word-embedding technique maps the words in social media content into vectors. These vectors are the input to a bidirectional gated recurrent unit (biGRU). The output of the biGRU and the word-embedding vectors are then used to construct a sentence matrix, which is the input to a convolutional neural network (CNN) model that predicts a user’s interests. Experimental results show that our proposed method, which combines the biGRU and CNN models, outperforms existing methods for identifying users’ interests from social media. In addition, posts in social media are sensitive to trends and change with time. We therefore collected posts from two different social media platforms at different time intervals, trained the proposed model on one set of social media data, and tested it on the other. The experimental results showed that our proposed model can predict users’ interests from the independent data set with high accuracy.
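The pipeline described in the abstract (word embeddings, then a biGRU, then a sentence matrix, then a CNN) can be illustrated with a minimal, dependency-free forward pass. This is only a sketch under stated assumptions, not the authors' implementation: the abstract does not say how the biGRU output and the embeddings are combined, so per-word concatenation is assumed here, and the ReLU activation, max-over-time pooling, softmax output, all layer sizes, and the untrained random weights are likewise illustrative choices. The names `GRUCell`, `sentence_matrix`, `conv_max_pool`, and `predict` are hypothetical.

```python
import math
import random

random.seed(0)

def rand_matrix(rows, cols):
    """Small random weights; stand-ins for trained parameters."""
    return [[random.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

class GRUCell:
    """Standard GRU cell, biases omitted for brevity."""
    def __init__(self, in_dim, hid_dim):
        # One input and one recurrent matrix per gate: z (update), r (reset), h (candidate).
        self.W = {g: rand_matrix(hid_dim, in_dim) for g in "zrh"}
        self.U = {g: rand_matrix(hid_dim, hid_dim) for g in "zrh"}

    def step(self, x, h):
        z = [sigmoid(a + b) for a, b in zip(matvec(self.W["z"], x), matvec(self.U["z"], h))]
        r = [sigmoid(a + b) for a, b in zip(matvec(self.W["r"], x), matvec(self.U["r"], h))]
        rh = [ri * hi for ri, hi in zip(r, h)]
        cand = [math.tanh(a + b) for a, b in zip(matvec(self.W["h"], x), matvec(self.U["h"], rh))]
        # One common convention: interpolate between old state and candidate.
        return [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]

def run_gru(cell, inputs, hid_dim):
    h, outputs = [0.0] * hid_dim, []
    for x in inputs:
        h = cell.step(x, h)
        outputs.append(h)
    return outputs

def sentence_matrix(embeddings, fwd, bwd, hid_dim):
    forward = run_gru(fwd, embeddings, hid_dim)
    # Run the second GRU right-to-left, then restore the original word order.
    backward = run_gru(bwd, embeddings[::-1], hid_dim)[::-1]
    # ASSUMPTION: each row is [forward state ; backward state ; word embedding].
    return [f + b + e for f, b, e in zip(forward, backward, embeddings)]

def conv_max_pool(matrix, filters, width):
    """Each filter spans `width` consecutive rows; ReLU, then max over positions."""
    windows = [sum(matrix[i:i + width], []) for i in range(len(matrix) - width + 1)]
    return [max(max(0.0, sum(w * x for w, x in zip(f, win))) for win in windows)
            for f in filters]

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def predict(token_ids, params):
    """Return a probability per interest category; needs at least `width` tokens."""
    emb = [params["embedding"][t] for t in token_ids]
    mat = sentence_matrix(emb, params["fwd"], params["bwd"], params["hid_dim"])
    feats = conv_max_pool(mat, params["filters"], params["width"])
    return softmax(matvec(params["out"], feats))

# Hypothetical sizes, chosen only to keep the example small.
VOCAB, EMB, HID, WIDTH, N_FILTERS, N_CLASSES = 50, 8, 6, 3, 4, 5
params = {
    "embedding": rand_matrix(VOCAB, EMB),
    "fwd": GRUCell(EMB, HID),
    "bwd": GRUCell(EMB, HID),
    "hid_dim": HID,
    "filters": rand_matrix(N_FILTERS, WIDTH * (2 * HID + EMB)),
    "width": WIDTH,
    "out": rand_matrix(N_CLASSES, N_FILTERS),
}
probs = predict([3, 17, 42, 8, 25], params)  # one post encoded as five token ids
```

In practice the weights would be learned from labeled posts, and a deep-learning framework would replace the hand-written loops. The sketch is meant only to show how each word's row in the sentence matrix carries both its context (the two GRU states) and its own embedding before the convolution is applied.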

Title: Deep recurrent convolutional networks for inferring user interests from social media
Authors: Jaeyong Kang, HongSeok Choi, Hyunju Lee
Publication date: 22 October 2018
Publisher: Springer US
Published in: Journal of Intelligent Information Systems / Issue 1/2019
Print ISSN: 0925-9902
Electronic ISSN: 1573-7675
DOI: https://doi.org/10.1007/s10844-018-0534-3
