Skip to main content

2018 | OriginalPaper | Buchkapitel

Product Matching to Determine the Energy Efficiency of Used Cars Available at Internet Marketplaces

verfasst von : Mario Rivas-Sánchez, Maria P. Guerrero-Lebrero, Elisa Guerrero, Guillermo Bárcena-Gonzalez, Jaime Martel, Pedro L. Galindo

Erschienen in: Soft Computing for Sustainability Science

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The growth of the Internet has fuelled the availability of e-commerce marketplaces and search engines must face with a huge amount of ambiguity and inconsistencies in the data. Product matching aims at disambiguating descriptions of products belonging to different websites in order to be able to recognize identical products and to merge the content from those identical items. In this work first we evaluate some similarity measures for string matching and then, we apply a complete product matching methodology to the retail market of used cars. We use a reference or master list of items and information about a wide variety of used cars offers. The resulting linkage allows energy efficiency assignment of the model identified.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agrawal, R., Ieong, S.: Aggregating web offers to determine product prices. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 435–443, ACM (2012) Agrawal, R., Ieong, S.: Aggregating web offers to determine product prices. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 435–443, ACM (2012)
2.
Zurück zum Zitat Aizawa, A.: An information-theoretic perspective of tf-idf measures. Inf. Process. Manag. 39(1), 45–65 (2003)CrossRefMATH Aizawa, A.: An information-theoretic perspective of tf-idf measures. Inf. Process. Manag. 39(1), 45–65 (2003)CrossRefMATH
3.
Zurück zum Zitat Bilenko, M., Basil, S., Sahami, M.: Adaptive product normalization: using online learning for record linkage in comparison shopping. In: Fifth IEEE International Conference on Data Mining, pp. 8-pp. IEEE (2005) Bilenko, M., Basil, S., Sahami, M.: Adaptive product normalization: using online learning for record linkage in comparison shopping. In: Fifth IEEE International Conference on Data Mining, pp. 8-pp. IEEE (2005)
4.
Zurück zum Zitat Christen, P.: Data Matching: Concepts and Techniques for Record Linkage, Entity Resolution, and Duplicate Detection. Springer Science & Business Media, Berlin (2012)CrossRef Christen, P.: Data Matching: Concepts and Techniques for Record Linkage, Entity Resolution, and Duplicate Detection. Springer Science & Business Media, Berlin (2012)CrossRef
5.
Zurück zum Zitat Cohen, W.W., Ravikumar, P.D., Fienberg, S.E.: A comparison of string distance metrics for name-matching tasks. In: IIWeb, Vol. 2003, pp. 73–78 (2003) Cohen, W.W., Ravikumar, P.D., Fienberg, S.E.: A comparison of string distance metrics for name-matching tasks. In: IIWeb, Vol. 2003, pp. 73–78 (2003)
6.
Zurück zum Zitat Eisenstein, J.: What to do about bad language on the internet. In: HLT-NAACL, pp. 359–369 (2013) Eisenstein, J.: What to do about bad language on the internet. In: HLT-NAACL, pp. 359–369 (2013)
7.
Zurück zum Zitat Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques. Elsevier, Amsterdam (2011) Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques. Elsevier, Amsterdam (2011)
8.
Zurück zum Zitat Hong, T.P., Lin, C.W., Yang, K.T., Wang, S.L.: Using TF-IDF to hide sensitive itemsets. Appl. Intell. 38(4), 502–510 (2013)CrossRef Hong, T.P., Lin, C.W., Yang, K.T., Wang, S.L.: Using TF-IDF to hide sensitive itemsets. Appl. Intell. 38(4), 502–510 (2013)CrossRef
10.
Zurück zum Zitat Jaro, M.A.: Probabilistic linkage of large public health data files. Stat. Med. 14(5:7), 491–498 (1995)CrossRef Jaro, M.A.: Probabilistic linkage of large public health data files. Stat. Med. 14(5:7), 491–498 (1995)CrossRef
11.
Zurück zum Zitat Jimenez, S., Becerra, C., Gelbukh, A., Gonzalez, F.: Generalized Mongue-Elkan method for approximate text string comparison. In: International Conference on Intelligent Text Processing and Computational Linguistics, pp. 559–570. Springer, Berlin, Heidelberg (2009) Jimenez, S., Becerra, C., Gelbukh, A., Gonzalez, F.: Generalized Mongue-Elkan method for approximate text string comparison. In: International Conference on Intelligent Text Processing and Computational Linguistics, pp. 559–570. Springer, Berlin, Heidelberg (2009)
12.
Zurück zum Zitat Kannan, A., Givoni, I.E., Agrawal, R., Fuxman, A.: Matching unstructured product offers to structured product specifications. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 404–412. ACM (2011) Kannan, A., Givoni, I.E., Agrawal, R., Fuxman, A.: Matching unstructured product offers to structured product specifications. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 404–412. ACM (2011)
13.
Zurück zum Zitat Keller, J.M., Gray, M.R., Givens, J.A.: A fuzzy k-nearest neighbor algorithm. IEEE Trans. Syst. Man Cybern. 4, 580–585 (1985)CrossRef Keller, J.M., Gray, M.R., Givens, J.A.: A fuzzy k-nearest neighbor algorithm. IEEE Trans. Syst. Man Cybern. 4, 580–585 (1985)CrossRef
14.
Zurück zum Zitat Köpcke, H., Thor, A., Thomas, S., Rahm, E.: Tailoring entity resolution for matching product offers. In: Proceedings of the 15th International Conference on Extending Database Technology, pp. 545–550. ACM (2012) Köpcke, H., Thor, A., Thomas, S., Rahm, E.: Tailoring entity resolution for matching product offers. In: Proceedings of the 15th International Conference on Extending Database Technology, pp. 545–550. ACM (2012)
15.
Zurück zum Zitat Leskovec, J., Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets. Cambridge University Press, Cambridge (2014) Leskovec, J., Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets. Cambridge University Press, Cambridge (2014)
16.
Zurück zum Zitat Monge, A., Elkan, C.: The field matching problem: Algorithms and applications. In: Proceedings of The Second International Conference on Knowledge Discovery and Data Mining, (KDD) (1996) Monge, A., Elkan, C.: The field matching problem: Algorithms and applications. In: Proceedings of The Second International Conference on Knowledge Discovery and Data Mining, (KDD) (1996)
17.
Zurück zum Zitat Paik, J.H.: A novel TF-IDF weighting scheme for effective ranking. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 343–352. ACM (2013) Paik, J.H.: A novel TF-IDF weighting scheme for effective ranking. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 343–352. ACM (2013)
18.
Zurück zum Zitat Ren, F., Sohrab, M.G.: Class-indexing-based term weighting for automatic text classification. Inf. Sci. 236, 109–125 (2013)CrossRef Ren, F., Sohrab, M.G.: Class-indexing-based term weighting for automatic text classification. Inf. Sci. 236, 109–125 (2013)CrossRef
19.
Zurück zum Zitat Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)CrossRef Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)CrossRef
20.
Zurück zum Zitat Singhal, A.: Modern information retrieval: a brief overview. IEEE Data Eng. Bull. 24(4), 35–43 (2001) Singhal, A.: Modern information retrieval: a brief overview. IEEE Data Eng. Bull. 24(4), 35–43 (2001)
21.
Zurück zum Zitat Thor, A. (2010). Toward an adaptive string similarity measure for matching product offers. In: GI Jahrestagung (1), pp. 702–710 Thor, A. (2010). Toward an adaptive string similarity measure for matching product offers. In: GI Jahrestagung (1), pp. 702–710
22.
Zurück zum Zitat Winkler, W.E.: String Comparator metrics and enhanced decision rules in the Fellegi–Sunter model of record linkage. In: Proceedings of the Section on Survey Research Methods, pp. 354–359. American Statistical Association (1990) Winkler, W.E.: String Comparator metrics and enhanced decision rules in the Fellegi–Sunter model of record linkage. In: Proceedings of the Section on Survey Research Methods, pp. 354–359. American Statistical Association (1990)
23.
Zurück zum Zitat Winkler, W.E.: Overview of Record Linkage and Current Research Directions. Research Report Series, RRS (2006) Winkler, W.E.: Overview of Record Linkage and Current Research Directions. Research Report Series, RRS (2006)
Metadaten
Titel
Product Matching to Determine the Energy Efficiency of Used Cars Available at Internet Marketplaces
verfasst von
Mario Rivas-Sánchez
Maria P. Guerrero-Lebrero
Elisa Guerrero
Guillermo Bárcena-Gonzalez
Jaime Martel
Pedro L. Galindo
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-62359-7_10

Premium Partner