Skip to main content

2015 | OriginalPaper | Buchkapitel

OpeNER: Open Tools to Perform Natural Language Processing on Accommodation Reviews

verfasst von : Aitor García-Pablos, Montse Cuadros, Maria Teresa Linaza

Erschienen in: Information and Communication Technologies in Tourism 2015

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Opinion mining is crucial for hoteliers and other tourism industries in order to improve their service from the analysis of services failures and recovery. The extensive use of the Internet and social networks has shifted the way tourism information is shared and spread. Travel agencies, hotels, restaurants, tourist destinations and other actors require the aid of new technologies to get an insight of the vast amount of customer generated reviews. Develop and integrate text analysis technologies is usually difficult and expensive, because it involves the use of Natural Language Processing techniques. This paper introduces the OpeNER European project, a set of free Open Source and ready-to-use text analysis tools to perform text processing tasks like Named Entity Recognition and Opinion detection. The paper also provides an example of a possible application of the OpeNER results in the geolocation of hotel reviews.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
KAF documents are XML files too verbose to be represented in this paper. More information about KAF format and examples can be found at the OpeNER website.
 
Literatur
Zurück zum Zitat Bagga, A., & Baldwin, B. (1999). Cross-document event coreference: Annotations, experiments, and observations. In Proceedings of the workshop on coreference and its applications. Bagga, A., & Baldwin, B. (1999). Cross-document event coreference: Annotations, experiments, and observations. In Proceedings of the workshop on coreference and its applications.
Zurück zum Zitat Bosma, W., Vossen, P., & Soroa, A. (2009). KAF: A generic semantic annotation format. In Proceedings of the GL2009 workshop on semantic annotation. Bosma, W., Vossen, P., & Soroa, A. (2009). KAF: A generic semantic annotation format. In Proceedings of the GL2009 workshop on semantic annotation.
Zurück zum Zitat Brants, T. (2000). TnT: A statistical part-of-speech tagger. In Proceedings of the sixth conference on applied natural language processing. Seattle, WA. Brants, T. (2000). TnT: A statistical part-of-speech tagger. In Proceedings of the sixth conference on applied natural language processing. Seattle, WA.
Zurück zum Zitat Brereton, R. G., & Lloyd, G. R. (2010). Support vector machines for classification and regression. The Analyst, 135, 230–267.CrossRef Brereton, R. G., & Lloyd, G. R. (2010). Support vector machines for classification and regression. The Analyst, 135, 230–267.CrossRef
Zurück zum Zitat Collins, M. (2002). Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms. In Proceedings of the ACL-02 conference on empirical methods in natural language processing (pp. 1–8). Philadelphia, PA: Association for Computational Linguistics. Collins, M. (2002). Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms. In Proceedings of the ACL-02 conference on empirical methods in natural language processing (pp. 1–8). Philadelphia, PA: Association for Computational Linguistics.
Zurück zum Zitat Ghose, A., Ipeirotis, P., & Li, B. (2009). The economic impact of user-generated content on the Internet: Combining text mining with demand estimation in the hotel industry. In Proceedings of the 20th workshop on information systems and economics (WISE). Ghose, A., Ipeirotis, P., & Li, B. (2009). The economic impact of user-generated content on the Internet: Combining text mining with demand estimation in the hotel industry. In Proceedings of the 20th workshop on information systems and economics (WISE).
Zurück zum Zitat Giesbrecht, E., & Evert, S. (2009). Is part-of-speech tagging a solved task? An evaluation of POS taggers for the German web as corpus. In Web as corpus workshop WAC5 (p. 27). Giesbrecht, E., & Evert, S. (2009). Is part-of-speech tagging a solved task? An evaluation of POS taggers for the German web as corpus. In Web as corpus workshop WAC5 (p. 27).
Zurück zum Zitat Gräbner, D., Zanker, M., Fliedl, G., & Fuchs, M. (2012). Classification of customer reviews based on sentiment analysis. In Proceedings of the 19th conference on information and communication technologies in tourism (ENTER) (pp. 460–470). Helsingborg, Sweden: Springer. Gräbner, D., Zanker, M., Fliedl, G., & Fuchs, M. (2012). Classification of customer reviews based on sentiment analysis. In Proceedings of the 19th conference on information and communication technologies in tourism (ENTER) (pp. 460–470). Helsingborg, Sweden: Springer.
Zurück zum Zitat Hu, M., & Liu, B. (2004). Mining opinion features in customer reviews. Association for the Advancement of Artificial Intelligence, 4(4), 755–760. Hu, M., & Liu, B. (2004). Mining opinion features in customer reviews. Association for the Advancement of Artificial Intelligence, 4(4), 755–760.
Zurück zum Zitat Kasper, W., & Vela, M. (2011). Sentiment analysis for hotel reviews. Computational Linguistics-Applications Conference, 231527, 45–52. Kasper, W., & Vela, M. (2011). Sentiment analysis for hotel reviews. Computational Linguistics-Applications Conference, 231527, 45–52.
Zurück zum Zitat Lau, K., Lee, K., & Ho, Y. (2005). Text mining for the hotel industry. Cornell Hotel and Restaurant Administration Quarterly, 46(3), 344–362.CrossRef Lau, K., Lee, K., & Ho, Y. (2005). Text mining for the hotel industry. Cornell Hotel and Restaurant Administration Quarterly, 46(3), 344–362.CrossRef
Zurück zum Zitat Lee, H., Peirsman, Y., Chang, A., Chambers, N., Surdeanu, M., & Jurafsky, D. (2011a). Stanford’ s multi-pass sieve coreference resolution system at the CoNLL-2011 shared task. In Proceedings of the fifteenth conference on computational natural language learning: Shared task (pp. 28–34). Portland, OR: Association for Computational Linguistics. Lee, H., Peirsman, Y., Chang, A., Chambers, N., Surdeanu, M., & Jurafsky, D. (2011a). Stanford’ s multi-pass sieve coreference resolution system at the CoNLL-2011 shared task. In Proceedings of the fifteenth conference on computational natural language learning: Shared task (pp. 28–34). Portland, OR: Association for Computational Linguistics.
Zurück zum Zitat Lee, M. J., Singh, N., & Chan, E. S. W. (2011b). Service failures and recovery actions in the hotel industry: A text-mining approach. Journal of Vacation Marketing, 17(3), 197–207.CrossRef Lee, M. J., Singh, N., & Chan, E. S. W. (2011b). Service failures and recovery actions in the hotel industry: A text-mining approach. Journal of Vacation Marketing, 17(3), 197–207.CrossRef
Zurück zum Zitat Liu, B. (2010). Sentiment analysis and subjectivity. In N. Indurkhya & F. J. Damerau (Eds.), Handbook of natural language processing (pp. 1–38). New York, NY: ACM press. Liu, B. (2010). Sentiment analysis and subjectivity. In N. Indurkhya & F. J. Damerau (Eds.), Handbook of natural language processing (pp. 1–38). New York, NY: ACM press.
Zurück zum Zitat Liu, S., Law, R., Rong, J., Li, G., & Hall, J. (2013). Analyzing changes in hotel customers’ expectations by trip mode. International Journal of Hospitality Management, 34, 359–371.CrossRef Liu, S., Law, R., Rong, J., Li, G., & Hall, J. (2013). Analyzing changes in hotel customers’ expectations by trip mode. International Journal of Hospitality Management, 34, 359–371.CrossRef
Zurück zum Zitat Marchetti, A., Tesconi, M., Abbate, S., Lo Duca, A., D’Errico, A., Frontini F., & Monachini, M. (2013). Tour-pedia: A web application for the analysis and visualization of opinions for tourism domain. In Z. Vetulani & H. Uszkoreit (Eds.), The 6th Language & Technology Conference on Human Language Technology (pp. 594–595). Poznan, Poland. Marchetti, A., Tesconi, M., Abbate, S., Lo Duca, A., D’Errico, A., Frontini F., & Monachini, M. (2013). Tour-pedia: A web application for the analysis and visualization of opinions for tourism domain. In Z. Vetulani & H. Uszkoreit (Eds.), The 6th Language & Technology Conference on Human Language Technology (pp. 594–595). Poznan, Poland.
Zurück zum Zitat Nadeau, D., & Sekine, S. (2007). A survey of named entity recognition and classification. Lingvisticae Investigationes, 30(1), 3–26.CrossRef Nadeau, D., & Sekine, S. (2007). A survey of named entity recognition and classification. Lingvisticae Investigationes, 30(1), 3–26.CrossRef
Zurück zum Zitat Pang, B., & Lee, L. (2008). Opinion mining and sentiment analysis. Foundations and Trends® in Information Retrieval, 2(1–2), 1–135.CrossRef Pang, B., & Lee, L. (2008). Opinion mining and sentiment analysis. Foundations and Trends® in Information Retrieval, 2(1–2), 1–135.CrossRef
Zurück zum Zitat Popescu, A.-M., & Etzioni, O. (2007). Extracting product features and opinions from reviews. In A. Kao & S. R. Poteet (Eds.), Natural language processing and text mining (pp. 9–28). London: Springer.CrossRef Popescu, A.-M., & Etzioni, O. (2007). Extracting product features and opinions from reviews. In A. Kao & S. R. Poteet (Eds.), Natural language processing and text mining (pp. 9–28). London: Springer.CrossRef
Zurück zum Zitat Rao, D., McNamee, P., & Dredze, M. (2013). Entity linking: Finding extracted entities in a knowledge base. In Multi-source, multilingual information extraction and summarization (pp. 93–115). Berlin: Springer. Rao, D., McNamee, P., & Dredze, M. (2013). Entity linking: Finding extracted entities in a knowledge base. In Multi-source, multilingual information extraction and summarization (pp. 93–115). Berlin: Springer.
Zurück zum Zitat Řehůřek, R., & Kolkus, M. (2009). Language identification on the web: Extending the dictionary method. Lecture Notes in Computer Science, 5449, 357–368.CrossRef Řehůřek, R., & Kolkus, M. (2009). Language identification on the web: Extending the dictionary method. Lecture Notes in Computer Science, 5449, 357–368.CrossRef
Zurück zum Zitat Sil, A., Cronin, E., Nie, P., Yang, Y., Popescu, A.-M., & Yates, A. (2012). Linking named entities to any database. In EMNLP-CoNLL 2012 (pp. 116–127). Stroudsburg, PA: Association for Computational Linguistics. Sil, A., Cronin, E., Nie, P., Yang, Y., Popescu, A.-M., & Yates, A. (2012). Linking named entities to any database. In EMNLP-CoNLL 2012 (pp. 116–127). Stroudsburg, PA: Association for Computational Linguistics.
Zurück zum Zitat Sutton, C., & McCallum, A. (2012). An introduction to conditional random fields. Foundations and Trends in Machine Learning, 4, 267–373.CrossRef Sutton, C., & McCallum, A. (2012). An introduction to conditional random fields. Foundations and Trends in Machine Learning, 4, 267–373.CrossRef
Zurück zum Zitat Webster, J. J., & Kit, C. (1992). Tokenization as the initial phase in NLP. In Proceedings of COLING-92 (pp. 1106–1110). Nantes, France: International Committee on Computational Linguistics. Webster, J. J., & Kit, C. (1992). Tokenization as the initial phase in NLP. In Proceedings of COLING-92 (pp. 1106–1110). Nantes, France: International Committee on Computational Linguistics.
Zurück zum Zitat Xiang, Z., & Gretzel, U. (2010). Role of social media in online travel information search. Tourism Management, 31(2), 179–188. Elsevier. Xiang, Z., & Gretzel, U. (2010). Role of social media in online travel information search. Tourism Management, 31(2), 179–188. Elsevier.
Zurück zum Zitat Ye, Q., Zhang, Z., & Law, R. (2009). Sentiment classification of online reviews to travel destinations by supervised machine learning approaches. Expert Systems with Applications, 36(3), 6527–6535. Elsevier. Ye, Q., Zhang, Z., & Law, R. (2009). Sentiment classification of online reviews to travel destinations by supervised machine learning approaches. Expert Systems with Applications, 36(3), 6527–6535. Elsevier.
Metadaten
Titel
OpeNER: Open Tools to Perform Natural Language Processing on Accommodation Reviews
verfasst von
Aitor García-Pablos
Montse Cuadros
Maria Teresa Linaza
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-14343-9_10