Published in: Mobile Networks and Applications 3/2015

01-06-2015

A Skip-gram-based Framework to Extract Knowledge from Chinese Reviews in Cloud Environment

Authors: Feng Zhao, Hang Zhu, Hai Jin, Weizhong Qiang

Abstract

With the development of cloud computing technologies, e-business systems and applications pay increasing attention to customer reviews, which cover topics such as commodities and customers' emotions. These review data contain a vast amount of valuable information. Extracting knowledge from these reviews in a cloud environment is challenging because they are massive, usually distributed, and constantly changing. In this paper, a novel framework for extracting knowledge from Chinese review data is proposed, which consists mainly of building the knowledge space, retrieving knowledge, and optimizing results. For Chinese reviews, a skip-gram-based model is trained on the review data to generate the knowledge space. To build the knowledge space quickly, an algorithm based on hierarchical softmax is proposed, which requires no feature extraction or modeling. This algorithm is applicable to massive data and can be conveniently extended in a cloud environment. When retrieving knowledge and optimizing results, our framework uses Euclidean distance to find the knowledge closely linked to the query, and uses a 2-gram algorithm to optimize the results. Experimental results show that our framework is practical and efficient.
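
The retrieval and optimization steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the 2-gram step is applied, so the re-ranking below (ordering candidates by how often they appear next to the query word) is only one plausible reading. The toy vectors, the corpus, and the function names `nearest`, `bigram_counts`, and `rerank` are all assumptions made for illustration. In a real setting the vectors would come from a skip-gram model trained on segmented Chinese reviews, and English tokens are used here only for readability.

```python
import math
from collections import Counter

# Hypothetical toy "knowledge space": word -> embedding vector.
# Real vectors would come from a trained skip-gram model.
SPACE = {
    "phone":   [0.9, 0.1, 0.0],
    "screen":  [0.8, 0.2, 0.1],
    "battery": [0.7, 0.0, 0.3],
    "happy":   [0.0, 0.9, 0.1],
    "slow":    [0.1, 0.1, 0.9],
}

# Hypothetical tokenized reviews used only to collect 2-gram counts.
CORPUS = [
    ["phone", "battery", "slow"],
    ["battery", "phone", "happy"],
    ["phone", "screen"],
]


def euclidean(u, v):
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def nearest(query, space, k=3):
    """Return the k words whose vectors lie closest to the query word."""
    q = space[query]
    scored = [(euclidean(q, vec), word)
              for word, vec in space.items() if word != query]
    scored.sort()
    return [word for _, word in scored[:k]]


def bigram_counts(sentences):
    """Count adjacent token pairs (2-grams) over tokenized sentences."""
    counts = Counter()
    for tokens in sentences:
        counts.update(zip(tokens, tokens[1:]))
    return counts


def rerank(query, candidates, counts):
    """Assumed re-ranking: prefer candidates that co-occur with the query
    as a 2-gram (in either order). The sort is stable, so ties keep the
    distance-based order of the input list."""
    return sorted(candidates,
                  key=lambda w: -(counts[(query, w)] + counts[(w, query)]))


candidates = nearest("phone", SPACE, k=2)
reranked = rerank("phone", candidates, bigram_counts(CORPUS))
print(candidates)  # ['screen', 'battery']
print(reranked)    # ['battery', 'screen']
```

In this toy example, "screen" is closest to "phone" in the vector space, but "battery" appears next to "phone" in two reviews and is promoted by the 2-gram step.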

Metadata
Title: A Skip-gram-based Framework to Extract Knowledge from Chinese Reviews in Cloud Environment
Authors: Feng Zhao, Hang Zhu, Hai Jin, Weizhong Qiang
Publication date: 01-06-2015
Publisher: Springer US
Published in: Mobile Networks and Applications, Issue 3/2015
Print ISSN: 1383-469X
Electronic ISSN: 1572-8153
DOI: https://doi.org/10.1007/s11036-015-0612-5
