Skip to main content
Erschienen in: New Generation Computing 1/2017

12.12.2016 | Special Feature

Joint Analysis of Multiple Algorithms and Performance Measures

verfasst von: Cassio P. de Campos, Alessio Benavoli

Erschienen in: New Generation Computing | Ausgabe 1/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

There has been an increasing interest in the development of new methods using Pareto optimality to deal with multi-objective criteria (for example, accuracy and time complexity). Once one has developed an approach to a problem of interest, the problem is then how to compare it with the state of art. In machine learning, algorithms are typically evaluated by comparing their performance on different data sets by means of statistical tests. Standard tests used for this purpose are able to consider jointly neither performance measures nor multiple competitors at once. The aim of this paper is to resolve these issues by developing statistical procedures that are able to account for multiple competing measures at the same time and to compare multiple algorithms altogether. In particular, we develop two tests: a frequentist procedure based on the generalized likelihood ratio test and a Bayesian procedure based on a multinomial-Dirichlet conjugate model. We further extend them by discovering conditional independences among measures to reduce the number of parameters of such models, as usually the number of studied cases is very reduced in such comparisons. Data from a comparison among general purpose classifiers are used to show a practical application of our tests.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
If there are ties we treat a tie in a measure by a standard approach: we replicate the case with it into two and divide the weight of such case by two (this process might need to be performed multiple times until no ties are present in the data). Such approach preserves the sample size and fairly allocates ties between the algorithms being compared.
 
2
An algorithm is better (\(\succ \)) than another when it has higher accuracy and lower computational time.
 
3
To control the family-wise type I error of many pairwise comparisons, the significance level should be adjusted, as previously described.
 
Literatur
1.
Zurück zum Zitat Dehuri, S., Cho, S.-B.: Multi-criterion Pareto based particle swarm optimized polynomial neural network for classification: A review and state-of-the-art. Comput. Sci. Rev. 3(1), 19–40 (2009)CrossRefMATH Dehuri, S., Cho, S.-B.: Multi-criterion Pareto based particle swarm optimized polynomial neural network for classification: A review and state-of-the-art. Comput. Sci. Rev. 3(1), 19–40 (2009)CrossRefMATH
2.
Zurück zum Zitat Cai, W., Chen, S., Zhang, D.: A multiobjective simultaneous learning framework for clustering and classification. Neural Netw. IEEE Trans. 21(2), 185–200 (2010)CrossRef Cai, W., Chen, S., Zhang, D.: A multiobjective simultaneous learning framework for clustering and classification. Neural Netw. IEEE Trans. 21(2), 185–200 (2010)CrossRef
3.
Zurück zum Zitat Shi, C., Kong, X., Yu, P.S., Wang, B.: Multi-Objective Multi-Label Classification. In: Proceedings of the 2012 SIAM International Conference on Data Mining, pp. 355–366. SIAM (2012) Shi, C., Kong, X., Yu, P.S., Wang, B.: Multi-Objective Multi-Label Classification. In: Proceedings of the 2012 SIAM International Conference on Data Mining, pp. 355–366. SIAM (2012)
4.
Zurück zum Zitat Hsiao, K.J., Xu, K., Calder, J., Hero, A.O.: Multi-criteria anomaly detection using pareto depth analysis. In: Pereira, F., Burges, C., Bottou, L., Weinberger, K. (eds.) Advances in Neural Information Processing Systems 25, pp. 845–853. Curran Associates Inc, USA (2012) Hsiao, K.J., Xu, K., Calder, J., Hero, A.O.: Multi-criteria anomaly detection using pareto depth analysis. In: Pereira, F., Burges, C., Bottou, L., Weinberger, K. (eds.) Advances in Neural Information Processing Systems 25, pp. 845–853. Curran Associates Inc, USA (2012)
5.
Zurück zum Zitat Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetMATH Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetMATH
6.
Zurück zum Zitat Benavoli, A., de Campos, C.P.: Statistical Tests for Joint Analysis of Performance Measures, pp. 76–92. Springer International Publishing, Cham (2015) Benavoli, A., de Campos, C.P.: Statistical Tests for Joint Analysis of Performance Measures, pp. 76–92. Springer International Publishing, Cham (2015)
7.
Zurück zum Zitat Wilks, S.S.: The large-sample distribution of the likelihood ratio for testing composite hypotheses. Ann. Math. Stat. 9, 60–62 (1938)CrossRefMATH Wilks, S.S.: The large-sample distribution of the likelihood ratio for testing composite hypotheses. Ann. Math. Stat. 9, 60–62 (1938)CrossRefMATH
8.
Zurück zum Zitat Rice, J.: Mathematical statistics and data analysis. Cengage Learning, USA (2006) Rice, J.: Mathematical statistics and data analysis. Cengage Learning, USA (2006)
9.
Zurück zum Zitat de Campos, C.P., Tong, Y., Ji, Q.: Constrainted maximum likelihood bayesian network for facial expression recognition. In: European Conference on Computer Vision (ECCV). Lecture Notes in Computer Science, vol. 5304, pp. 168–181 (2008) de Campos, C.P., Tong, Y., Ji, Q.: Constrainted maximum likelihood bayesian network for facial expression recognition. In: European Conference on Computer Vision (ECCV). Lecture Notes in Computer Science, vol. 5304, pp. 168–181 (2008)
10.
Zurück zum Zitat DasGupta, A.: Asymptotic theory of statistics and probability. Springer, New York (2008)MATH DasGupta, A.: Asymptotic theory of statistics and probability. Springer, New York (2008)MATH
11.
Zurück zum Zitat Walley, P.: Inferences from multinomial data: learning about a bag of marbles. J. R. Stat. Soc. B 58(1), 3–57 (1996)MathSciNetMATH Walley, P.: Inferences from multinomial data: learning about a bag of marbles. J. R. Stat. Soc. B 58(1), 3–57 (1996)MathSciNetMATH
12.
Zurück zum Zitat Buntine, W.: Theory refinement on Bayesian networks. In: D’Ambrosio, B.D., Smets, P., Bonissone, P.P. (eds.) UAI-92, pp. 52–60. Morgan Kaufmann, San Francisco, CA (1991) Buntine, W.: Theory refinement on Bayesian networks. In: D’Ambrosio, B.D., Smets, P., Bonissone, P.P. (eds.) UAI-92, pp. 52–60. Morgan Kaufmann, San Francisco, CA (1991)
13.
Zurück zum Zitat Cooper, G.F., Herskovits, E.: A Bayesian method for the induction of probabilistic networks from data. Mach. Learn. 9, 309–347 (1992)MATH Cooper, G.F., Herskovits, E.: A Bayesian method for the induction of probabilistic networks from data. Mach. Learn. 9, 309–347 (1992)MATH
14.
Zurück zum Zitat Heckerman, D., Geiger, D., Chickering, D.M.: Learning Bayesian networks: the combination of knowledge and statistical data. Mach. Learn. 20, 197–243 (1995)MATH Heckerman, D., Geiger, D., Chickering, D.M.: Learning Bayesian networks: the combination of knowledge and statistical data. Mach. Learn. 20, 197–243 (1995)MATH
15.
Zurück zum Zitat de Campos, C.P., Ji, Q.: Properties of Bayesian Dirichlet scores to learn Bayesian network structures. In: AAAI Conference on Artificial Intelligence, pp. 431–436. AAAI Press, USA (2010) de Campos, C.P., Ji, Q.: Properties of Bayesian Dirichlet scores to learn Bayesian network structures. In: AAAI Conference on Artificial Intelligence, pp. 431–436. AAAI Press, USA (2010)
16.
Zurück zum Zitat Silander, T., Myllymaki, P.: A simple approach for finding the globally optimal Bayesian network structure. 22nd Conference on Uncertainty in Artificial Intelligence. Arlington, Virginia, pp. 445–452. AUAI Press, USA (2006) Silander, T., Myllymaki, P.: A simple approach for finding the globally optimal Bayesian network structure. 22nd Conference on Uncertainty in Artificial Intelligence. Arlington, Virginia, pp. 445–452. AUAI Press, USA (2006)
17.
Zurück zum Zitat Barlett, M., Cussens, J.: Advances in Bayesian network learning using integer programming. In: Proceedings of the 29th Conference on Uncertainty in Artificial Intelligence, UAI’13, pp. 182–191 (2013) Barlett, M., Cussens, J.: Advances in Bayesian network learning using integer programming. In: Proceedings of the 29th Conference on Uncertainty in Artificial Intelligence, UAI’13, pp. 182–191 (2013)
18.
Zurück zum Zitat de Campos, C.P., Ji, Q.: Efficient structure learning of Bayesian networks using constraints. J. Mach. Learn. Res. 12, 663–689 (2011)MathSciNetMATH de Campos, C.P., Ji, Q.: Efficient structure learning of Bayesian networks using constraints. J. Mach. Learn. Res. 12, 663–689 (2011)MathSciNetMATH
19.
Zurück zum Zitat Yuan, C., Malone, B.: Learning optimal Bayesian networks: A shortest path perspective. J. Artif. Intell. Res. 48, 23–65 (2013)MathSciNetMATH Yuan, C., Malone, B.: Learning optimal Bayesian networks: A shortest path perspective. J. Artif. Intell. Res. 48, 23–65 (2013)MathSciNetMATH
20.
Zurück zum Zitat Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. ACM SIGKDD Explor. Newsl. 11(1), 10–18 (2009)CrossRef Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. ACM SIGKDD Explor. Newsl. 11(1), 10–18 (2009)CrossRef
Metadaten
Titel
Joint Analysis of Multiple Algorithms and Performance Measures
verfasst von
Cassio P. de Campos
Alessio Benavoli
Publikationsdatum
12.12.2016
Verlag
Ohmsha
Erschienen in
New Generation Computing / Ausgabe 1/2017
Print ISSN: 0288-3635
Elektronische ISSN: 1882-7055
DOI
https://doi.org/10.1007/s00354-016-0005-8

Weitere Artikel der Ausgabe 1/2017

New Generation Computing 1/2017 Zur Ausgabe