2015 | OriginalPaper | Book chapter

Pentaho + R: An Integral View for Multidimensional Prediction Models

Written by: Adolfo Martínez-Usó, José Hernández-Orallo, M. José Ramírez-Quintana, Fernando Martínez Plumed

Published in: Advances in Artificial Intelligence

Publisher: Springer International Publishing

Abstract

The integration of multidimensional data and machine learning seems natural in the area of business intelligence. On-Line Analytical Processing (OLAP) tools are common in this area, where data are usually represented in multidimensional datamarts, and data mining tools are integrated into some of them. However, efforts towards a full integration of data mining and OLAP tools have not been as common as originally expected. Nowadays, this integration is mostly carried out in source code, implementing solutions that perform (i) all the operations on multidimensional data and (ii) the data mining algorithms that extract knowledge from these data. Hence, there is now an important distinction between implementation-based developments, where the entire solution is implemented in source code, and OLAP-tool-based developments, where (at least) the operations on multidimensional data are performed using an OLAP tool. This work analyses these two alternatives in terms of cost-effectiveness, performing an experimental analysis on a multidimensional problem and discussing when each approach seems to outperform the other.

Footnotes
2
We use the term data cube regardless of the number of dimensions, although the term hypercube is often used when more than three dimensions are involved in the hierarchy.
 
5
See https://github.com/overcoil/X4R/issues/ for a complete list of issues with this package.
 
9
Concretely, we used the biserver-ce-5.2.0.0-209 version of this software, which is not the latest version but is compatible. Visit http://community.pentaho.com/ for details.
 
10
This is only necessary if, as often happens, your attribute names are written with underscores.
 
Metadata
Title
Pentaho + R: An Integral View for Multidimensional Prediction Models
Written by
Adolfo Martínez-Usó
José Hernández-Orallo
M. José Ramírez-Quintana
Fernando Martínez Plumed
Copyright year
2015
DOI
https://doi.org/10.1007/978-3-319-24598-0_21