Skip to main content

2017 | OriginalPaper | Buchkapitel

A Dynamic Individual Recommendation Method Based on Reinforcement Learning

verfasst von : Daojun Han, Xiajiong Shen, Tian Gan, Ruiqing Cai

Erschienen in: Parallel Architecture, Algorithm and Programming

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

As a widely used recommendation method, collaborative filtering can solve the problem of low level of resource utilization which caused by information overload. At present, in order to exhibiting and searching items, we need to use multipole attributes to describe items. Thus request to particularly distinguish every attribute and realize accurate recommendation. While the collaborative filtering method lose sight of the dynamic regulation of items attributes’ importance degree, and it cannot interpose the discrimination of attributes. Aiming at this problem, this paper come up with a dynamic individual recommendation method based on reinforcement learning. This method can dig user’s attribute tag preference from operant behavior. It can record user’s attributes operate path and recall path. Then we build the award-punishment model of attribute tag, and realize the tag weight dynamic regulation. According to the principle that reinforcement learning system always get max award, we make a tag recommend strategy and give user recommendations in accordance with the preferences. The experimental result show that this method can distinguish the validity of user’s click, and realize the tag weight dynamic regulation and give user recommendations in accordance with the preferences.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Goldberg, D., Nichols, D., Oki, B.M., Terry, D., et al.: Using collaborative filtering to weave an information tapestry. Commun. ACM, 61–70 (1992) Goldberg, D., Nichols, D., Oki, B.M., Terry, D., et al.: Using collaborative filtering to weave an information tapestry. Commun. ACM, 61–70 (1992)
2.
Zurück zum Zitat Cong, L.: Review of scalability problem in E-commerce collaborative filtering. New Technol. Libr. Inf. Serv. 11, 7–44 (2010) Cong, L.: Review of scalability problem in E-commerce collaborative filtering. New Technol. Libr. Inf. Serv. 11, 7–44 (2010)
3.
Zurück zum Zitat Li, H., He, K., Wang, J., Peng, Z., Tian, G.: A friends: recommendation algorithm based on formal concept analysis and random walk in social network. J. Sichuan Univ. (Eng. Sci. Edn.) 06, 131–138 (2015) Li, H., He, K., Wang, J., Peng, Z., Tian, G.: A friends: recommendation algorithm based on formal concept analysis and random walk in social network. J. Sichuan Univ. (Eng. Sci. Edn.) 06, 131–138 (2015)
4.
Zurück zum Zitat Chen, T.S., Chou, Y.S., Chen, T.C.: Mining user movement behavior patterns in a mobile service environment. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 42(1), 87–101 (2012) Chen, T.S., Chou, Y.S., Chen, T.C.: Mining user movement behavior patterns in a mobile service environment. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 42(1), 87–101 (2012)
5.
Zurück zum Zitat Yang, F., Yan, B.: Research on web usage mining. Microelectron. Comput. 11, 146–149 (2008) Yang, F., Yan, B.: Research on web usage mining. Microelectron. Comput. 11, 146–149 (2008)
6.
Zurück zum Zitat Li, Q., Shi, J., Qin, Z., Liu, T.: Mining user behavior patterns for event detection in email networks. Chin. J. Comput. 05, 1135–1146 (2014) Li, Q., Shi, J., Qin, Z., Liu, T.: Mining user behavior patterns for event detection in email networks. Chin. J. Comput. 05, 1135–1146 (2014)
7.
Zurück zum Zitat Zhang, R., Gu, G., Liu Z.: Reinforcement learning theory. Algorithm and its application. Control Theory Appl. 17(5), 637–642 (2005) Zhang, R., Gu, G., Liu Z.: Reinforcement learning theory. Algorithm and its application. Control Theory Appl. 17(5), 637–642 (2005)
8.
Zurück zum Zitat Gao, Y., Chen, S., Lu, X.: Reinforcement learning: survey of recent work. Acta Autom. Sin. 30(1), 86–100 (2004) Gao, Y., Chen, S., Lu, X.: Reinforcement learning: survey of recent work. Acta Autom. Sin. 30(1), 86–100 (2004)
9.
Zurück zum Zitat Bazzan, A.L.C.: Opportunities for multiagent systems and multiagent reinforcement learning in traffic control. Auton. Agents Multi-Agent Syst. 18(3), 342–375 (2009)CrossRef Bazzan, A.L.C.: Opportunities for multiagent systems and multiagent reinforcement learning in traffic control. Auton. Agents Multi-Agent Syst. 18(3), 342–375 (2009)CrossRef
10.
Zurück zum Zitat Kober, J., Peters, J.: Reinforcement learning in robotics: a survey. Int. J. Robot. Res. 32(11), 1238–1274 (2014)CrossRef Kober, J., Peters, J.: Reinforcement learning in robotics: a survey. Int. J. Robot. Res. 32(11), 1238–1274 (2014)CrossRef
11.
Zurück zum Zitat Liu, Z., Zeng, X., Liu, H., Chu, R.: A Heuristic two-layer reinforcement learning algorithm networks. Comput. Res. Dev. 03, 579–587 (2015) Liu, Z., Zeng, X., Liu, H., Chu, R.: A Heuristic two-layer reinforcement learning algorithm networks. Comput. Res. Dev. 03, 579–587 (2015)
12.
Zurück zum Zitat Jurczyk, P., Xiong, L., Sunderam, V.: Reinforcement learning strategies for A-team solving the resource-constrained project scheduling problem. Neurocomputing 146(1), 301–307 (2014) Jurczyk, P., Xiong, L., Sunderam, V.: Reinforcement learning strategies for A-team solving the resource-constrained project scheduling problem. Neurocomputing 146(1), 301–307 (2014)
Metadaten
Titel
A Dynamic Individual Recommendation Method Based on Reinforcement Learning
verfasst von
Daojun Han
Xiajiong Shen
Tian Gan
Ruiqing Cai
Copyright-Jahr
2017
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-6442-5_17

Neuer Inhalt