Skip to main content

2018 | OriginalPaper | Buchkapitel

An Efficient Prototype Selection Algorithm Based on Dense Spatial Partitions

verfasst von : Joel Luís Carbonera, Mara Abel

Erschienen in: Artificial Intelligence and Soft Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In order to deal with big data, techniques for prototype selection have been applied for reducing the computational resources that are necessary to apply data mining approaches. However, most of the proposed approaches for prototype selection have a high time complexity and, due to this, they cannot be applied for dealing with big data. In this paper, we propose an efficient approach for prototype selection. It adopts the notion of spatial partition for efficiently dividing the dataset in sets of similar instances. In a second step, the algorithm extracts a prototype of each of the densest spatial partitions that were previously identified. The approach was evaluated on 15 well-known datasets used in a classification task, and its performance was compared to those of 6 state-of-the-art algorithms, considering two measures: accuracy and reduction. All the obtained results show that, in general, the proposed approach provides a good trade-off between accuracy and reduction, with a significantly lower running time, when compared with other approaches.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
Notice that we are assuming in this paper that the set of natural numbers \((\mathbb {N}\)) is the set of non-negative integers and, due to this, it includes zero. When we are referring to the set of natural numbers excluding zero, we use \(\mathbb {N}^{*}\).
 
3
All algorithms were implemented by the authors.
 
Literatur
1.
Zurück zum Zitat Anwar, I.M., Salama, K.M., Abdelbar, A.M.: Instance selection with ant colony optimization. Procedia Comput. Sci. 53, 248–256 (2015)CrossRef Anwar, I.M., Salama, K.M., Abdelbar, A.M.: Instance selection with ant colony optimization. Procedia Comput. Sci. 53, 248–256 (2015)CrossRef
2.
Zurück zum Zitat Brighton, H., Mellish, C.: Advances in instance selection for instance-based learning algorithms. Data Min. Knowl. Disc. 6(2), 153–172 (2002)MathSciNetCrossRef Brighton, H., Mellish, C.: Advances in instance selection for instance-based learning algorithms. Data Min. Knowl. Disc. 6(2), 153–172 (2002)MathSciNetCrossRef
4.
Zurück zum Zitat Carbonera, J.L., Abel, M.: A density-based approach for instance selection. In: 2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 768–774. IEEE (2015) Carbonera, J.L., Abel, M.: A density-based approach for instance selection. In: 2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 768–774. IEEE (2015)
5.
Zurück zum Zitat Carbonera, J.L., Abel, M.: A novel density-based approach for instance selection. In: 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 549–556. IEEE (2016) Carbonera, J.L., Abel, M.: A novel density-based approach for instance selection. In: 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 549–556. IEEE (2016)
6.
Zurück zum Zitat Carbonera, J.L., Abel, M.: Efficient prototype selection supported by subspace partitions. In: 2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 921–928. IEEE (2017) Carbonera, J.L., Abel, M.: Efficient prototype selection supported by subspace partitions. In: 2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 921–928. IEEE (2017)
7.
Zurück zum Zitat Chou, C.H., Kuo, B.H., Chang, F.: The generalized condensed nearest neighbor rule as a data reduction method. In: 2006 18th International Conference on Pattern Recognition, ICPR 2006, vol. 2, pp. 556–559. IEEE (2006) Chou, C.H., Kuo, B.H., Chang, F.: The generalized condensed nearest neighbor rule as a data reduction method. In: 2006 18th International Conference on Pattern Recognition, ICPR 2006, vol. 2, pp. 556–559. IEEE (2006)
9.
Zurück zum Zitat Gates, G.W.: Reduced nearest neighbor rule. IEEE Trans. Inf. Theory 18(3), 431–433 (1972)CrossRef Gates, G.W.: Reduced nearest neighbor rule. IEEE Trans. Inf. Theory 18(3), 431–433 (1972)CrossRef
10.
Zurück zum Zitat Hamidzadeh, J., Monsefi, R., Yazdi, H.S.: IRAHC: instance reduction algorithm using hyperrectangle clustering. Pattern Recogn. 48(5), 1878–1889 (2015)CrossRef Hamidzadeh, J., Monsefi, R., Yazdi, H.S.: IRAHC: instance reduction algorithm using hyperrectangle clustering. Pattern Recogn. 48(5), 1878–1889 (2015)CrossRef
11.
Zurück zum Zitat Hart, P.E.: The condensed nearest neighbor rule. IEEE Trans. Inf. Theory 14, 515–516 (1968)CrossRef Hart, P.E.: The condensed nearest neighbor rule. IEEE Trans. Inf. Theory 14, 515–516 (1968)CrossRef
12.
Zurück zum Zitat Leyva, E., González, A., Pérez, R.: Three new instance selection methods based on local sets: a comparative study with several approaches from a bi-objective perspective. Pattern Recogn. 48(4), 1523–1537 (2015)CrossRef Leyva, E., González, A., Pérez, R.: Three new instance selection methods based on local sets: a comparative study with several approaches from a bi-objective perspective. Pattern Recogn. 48(4), 1523–1537 (2015)CrossRef
13.
Zurück zum Zitat Lin, W.C., Tsai, C.F., Ke, S.W., Hung, C.W., Eberle, W.: Learning to detect representative data for large scale instance selection. J. Syst. Softw. 106, 1–8 (2015)CrossRef Lin, W.C., Tsai, C.F., Ke, S.W., Hung, C.W., Eberle, W.: Learning to detect representative data for large scale instance selection. J. Syst. Softw. 106, 1–8 (2015)CrossRef
14.
Zurück zum Zitat Nikolaidis, K., Goulermas, J.Y., Wu, Q.: A class boundary preserving algorithm for data condensation. Pattern Recogn. 44(3), 704–715 (2011)CrossRef Nikolaidis, K., Goulermas, J.Y., Wu, Q.: A class boundary preserving algorithm for data condensation. Pattern Recogn. 44(3), 704–715 (2011)CrossRef
15.
Zurück zum Zitat Wilson, D.R., Martinez, T.R.: Reduction techniques for instance-based learning algorithms. Mach. Learn. 38(3), 257–286 (2000)CrossRef Wilson, D.R., Martinez, T.R.: Reduction techniques for instance-based learning algorithms. Mach. Learn. 38(3), 257–286 (2000)CrossRef
16.
Zurück zum Zitat Wilson, D.L.: Asymptotic properties of nearest neighbor rules using edited data. IEEE Trans. Syst. Man Cybern. SMC 2(3), 408–421 (1972)MathSciNetCrossRef Wilson, D.L.: Asymptotic properties of nearest neighbor rules using edited data. IEEE Trans. Syst. Man Cybern. SMC 2(3), 408–421 (1972)MathSciNetCrossRef
Metadaten
Titel
An Efficient Prototype Selection Algorithm Based on Dense Spatial Partitions
verfasst von
Joel Luís Carbonera
Mara Abel
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-91262-2_26