Published in: Soft Computing 1/2012

01.01.2012 | Original Paper

A new model for linguistic summarization of heterogeneous data: an application to tourism web data sources

Authors: Ramón A. Carrasco, Pedro Villar

Abstract

In this paper we address the problem of aggregating heterogeneous data from various websites containing opinions about high-end hotels into a single database. We present a fuzzy model based on semantic translation as a tool for obtaining a linguistic summarization. No existing linguistic model combines all the characteristics needed to solve this problem: the management of heterogeneous input data (natural language included); the production of linguistic results with high precision and good interpretability; and the use of unbalanced linguistic term sets, described by trapezoidal membership functions, to define the initial linguistic terms. We applied the model to aggregate data from several high-end hotel websites, and we show a case study using one year of data from those websites on the high-end hotels located in Granada (Spain). With this aggregated information, a data analyst can carry out several analyses that benefit from easy linguistic interpretability and high precision. The proposed solution can be applied to similar aggregation problems.
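As an illustration of the last characteristic only, the following minimal Python sketch shows what an unbalanced linguistic term set described by trapezoidal membership functions might look like. The labels, the breakpoints on a [0, 1] scale, and the helper names `trapezoid_membership` and `best_label` are assumptions made for this example; they are not taken from the paper and do not reproduce its model.

```python
from typing import Dict, Tuple

# A trapezoid (a, b, c, d) with a <= b <= c <= d: membership rises on [a, b],
# equals 1 on [b, c], and falls on [c, d].
Trapezoid = Tuple[float, float, float, float]


def trapezoid_membership(x: float, t: Trapezoid) -> float:
    """Degree in [0, 1] to which x belongs to the trapezoidal fuzzy set t."""
    a, b, c, d = t
    if x < a or x > d:
        return 0.0
    if b <= x <= c:
        return 1.0
    if x < b:
        # Rising edge: here a <= x < b, so b - a > 0.
        return (x - a) / (b - a)
    # Falling edge: here c < x <= d, so d - c > 0.
    return (d - x) / (d - c)


# Hypothetical unbalanced term set on a normalized [0, 1] rating scale:
# the labels are not evenly spread, with finer granularity at the high end.
TERMS: Dict[str, Trapezoid] = {
    "poor": (0.0, 0.0, 0.3, 0.5),
    "fair": (0.3, 0.5, 0.6, 0.7),
    "good": (0.6, 0.7, 0.8, 0.9),
    "excellent": (0.8, 0.9, 1.0, 1.0),
}


def best_label(x: float, terms: Dict[str, Trapezoid] = TERMS) -> str:
    """Return the label whose membership degree for x is highest."""
    return max(terms, key=lambda name: trapezoid_membership(x, terms[name]))


if __name__ == "__main__":
    for rating in (0.2, 0.65, 0.95):
        print(rating, best_label(rating))
```

Picking the single best label discards the membership degree, which is exactly the loss of precision that richer linguistic representations aim to avoid; the sketch is only meant to make the vocabulary of the abstract concrete.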

Metadata
Title
A new model for linguistic summarization of heterogeneous data: an application to tourism web data sources
Authors
Ramón A. Carrasco
Pedro Villar
Publication date
01.01.2012
Publisher
Springer-Verlag
Published in
Soft Computing / Issue 1/2012
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-011-0740-1
