Skip to main content
Top
Published in: Progress in Artificial Intelligence 3/2020

05-08-2020 | Regular Paper

Large-width machine learning algorithm

Authors: Martin Anthony, Joel Ratsaby

Published in: Progress in Artificial Intelligence | Issue 3/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We introduce an algorithm, called Large Width (LW), that produces a multi-category classifier (defined on a distance space) with the property that the classifier has a large ‘sample width.’ (Width is a notion similar to classification margin.) LW is an incremental instance-based (also known as ‘lazy’) learning algorithm. Given a sample of labeled and unlabeled examples, it iteratively picks the next unlabeled example and classifies it while maintaining a large distance between each labeled example and its nearest-unlike prototype. (A prototype is either a labeled example or an unlabeled example which has already been classified.) Thus, LW gives a higher priority to unlabeled points whose classification decision ‘interferes’ less with the labeled sample. On a collection UCI benchmark datasets, the LW algorithm ranks at the top when compared to 11 instance-based learning algorithms (or configurations). When compared to the best candidate from instance-based learners, MLP, SVM, decision tree learner (C4.5) and Naive Bayes, LW is ranked at second place after only MLP which comes at first place by a single extra win against LW. The LW algorithm can be implemented in parallel distributed processing to yield a high speedup factor and is suitable for any distance space, with a distance function which need not necessarily satisfy the conditions of a metric.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
1.
go back to reference Aha, D.W., Kibler, D., Albert, M.K.: Instance-based learning algorithms. Mach. Learn. 6(1), 37–66 (1991) Aha, D.W., Kibler, D., Albert, M.K.: Instance-based learning algorithms. Mach. Learn. 6(1), 37–66 (1991)
2.
go back to reference Anthony, M., Bartlett, P.L.: Neural Network Learning: Theoretical Foundations. Cambridge University Press, Cambridge (1999)MATHCrossRef Anthony, M., Bartlett, P.L.: Neural Network Learning: Theoretical Foundations. Cambridge University Press, Cambridge (1999)MATHCrossRef
5.
6.
go back to reference Anthony, M., Ratsaby, J.: Learning bounds via sample width for classifiers on finite metric spaces. Theoret. Comput. Sci. 529, 2–10 (2014)MathSciNetMATHCrossRef Anthony, M., Ratsaby, J.: Learning bounds via sample width for classifiers on finite metric spaces. Theoret. Comput. Sci. 529, 2–10 (2014)MathSciNetMATHCrossRef
9.
10.
11.
go back to reference Anthony, M., Ratsaby, J.: Large width nearest prototype classification on general distance spaces. Theoret. Comput. Sci. 738, 65–79 (2018)MathSciNetMATHCrossRef Anthony, M., Ratsaby, J.: Large width nearest prototype classification on general distance spaces. Theoret. Comput. Sci. 738, 65–79 (2018)MathSciNetMATHCrossRef
12.
go back to reference Atkeson, C.G., Moore, A.W., Schaal, S.: Locally weighted learning. Artif. Intell. Rev. 11(1–5), 11–73 (1997)CrossRef Atkeson, C.G., Moore, A.W., Schaal, S.: Locally weighted learning. Artif. Intell. Rev. 11(1–5), 11–73 (1997)CrossRef
13.
go back to reference Chester, U., Ratsaby, J.: Universal distance measure for images. In: Proceedings of the \(27th\) IEEE Convention of Electrical Electronics Engineers in Israel (IEEEI’12), pages 1–4, Eilat, Israel, November 14–17 (2012) Chester, U., Ratsaby, J.: Universal distance measure for images. In: Proceedings of the \(27th\) IEEE Convention of Electrical Electronics Engineers in Israel (IEEEI’12), pages 1–4, Eilat, Israel, November 14–17 (2012)
14.
go back to reference Chester, U., Ratsaby, J.: Machine learning for image classification and clustering using a universal distance measure. In: N. Brisaboa, O. Pedreira, and P. Zezula, editors, Proceedings of the 6th International Conference on Similarity Search and Applications (SISAP’13), volume 8199 of Springer Lecture Notes in Computer Science, pages 59–72 (2013) Chester, U., Ratsaby, J.: Machine learning for image classification and clustering using a universal distance measure. In: N. Brisaboa, O. Pedreira, and P. Zezula, editors, Proceedings of the 6th International Conference on Similarity Search and Applications (SISAP’13), volume 8199 of Springer Lecture Notes in Computer Science, pages 59–72 (2013)
16.
go back to reference Cleary, J.G., Trigg, K.E.: K*: An instance-based learner using and entropic distance measure. In: Proceedings of the Twelfth International Conference on International Conference on Machine Learning, ICML’95, 108–114, Morgan Kaufmann Publishers Inc, San Francisco (1995) Cleary, J.G., Trigg, K.E.: K*: An instance-based learner using and entropic distance measure. In: Proceedings of the Twelfth International Conference on International Conference on Machine Learning, ICML’95, 108–114, Morgan Kaufmann Publishers Inc, San Francisco (1995)
17.
go back to reference Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 13(1), 21–27 (1967)MATHCrossRef Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 13(1), 21–27 (1967)MATHCrossRef
18.
go back to reference Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and other Kernel-based learning methods. Cambridge University Press, Cambridge (2000)MATHCrossRef Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and other Kernel-based learning methods. Cambridge University Press, Cambridge (2000)MATHCrossRef
19.
go back to reference Deza, M., Deza, E.: Encyclopedia of Distances, volume 15 of Series in Computer Science. Springer-Verlag, (2009) Deza, M., Deza, E.: Encyclopedia of Distances, volume 15 of Series in Computer Science. Springer-Verlag, (2009)
20.
go back to reference Dudani, S.A.: The distance-weighted k-nearest-neighbor rule. IEEE Trans. Syst. Man Cybernet. SMC–6(4), 325–327 (1976)CrossRef Dudani, S.A.: The distance-weighted k-nearest-neighbor rule. IEEE Trans. Syst. Man Cybernet. SMC–6(4), 325–327 (1976)CrossRef
21.
go back to reference Duin, R.P.W., Pekalska, E., Loog, M.: Non-euclidean dissimilarities: causes, embedding and informativeness. In: Pelillo, M. (ed.) Similarity-Based Pattern Analysis and Recognition Advances in Computer Vision and Pattern Recognition. Springer, Berlin (2013) Duin, R.P.W., Pekalska, E., Loog, M.: Non-euclidean dissimilarities: causes, embedding and informativeness. In: Pelillo, M. (ed.) Similarity-Based Pattern Analysis and Recognition Advances in Computer Vision and Pattern Recognition. Springer, Berlin (2013)
22.
go back to reference Frank, E., Hall, M.A., Witten, I.: The WEKA Workbench. Practical Machine Learning Tools and Techniques. Morgan Kaufmann, fourth edition, Online Appendix for Data Mining (2016) Frank, E., Hall, M.A., Witten, I.: The WEKA Workbench. Practical Machine Learning Tools and Techniques. Morgan Kaufmann, fourth edition, Online Appendix for Data Mining (2016)
23.
go back to reference Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explorat. 11(1), 10–18 (2009)CrossRef Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explorat. 11(1), 10–18 (2009)CrossRef
24.
25.
go back to reference Mitchell, T.: Machine Learning. McGraw Hill, New York (1997)MATH Mitchell, T.: Machine Learning. McGraw Hill, New York (1997)MATH
26.
go back to reference Nadeau, C., Bengio, Y.: Inference for the generalization error. Mach. Learn. 52(3), 239–281 (2003)MATHCrossRef Nadeau, C., Bengio, Y.: Inference for the generalization error. Mach. Learn. 52(3), 239–281 (2003)MATHCrossRef
27.
go back to reference Pekalska, E., Duin, R.P.W.: The Dissimilarity Representation for Pattern Recognition: Foundations And Applications (Machine Perception and Artificial Intelligence). World Scientific Publishing Co.Inc, River Edge, NJ (2005)MATHCrossRef Pekalska, E., Duin, R.P.W.: The Dissimilarity Representation for Pattern Recognition: Foundations And Applications (Machine Perception and Artificial Intelligence). World Scientific Publishing Co.Inc, River Edge, NJ (2005)MATHCrossRef
28.
go back to reference Ratsaby, J., Sabaty, A.: Parallelizing the large width learning algorithm. In: IEEE International Conference on the Science of Electrical Engineering (ICSEE’2018), 1–5, December 14–16 (2018) Ratsaby, J., Sabaty, A.: Parallelizing the large width learning algorithm. In: IEEE International Conference on the Science of Electrical Engineering (ICSEE’2018), 1–5, December 14–16 (2018)
29.
30.
go back to reference Smola, A.J., Bartlett, P.L., Scholkopf, B., Schuurmans, D.: Advances in Large-Margin Classifiers (Neural Information Processing). MIT Press, Cambridge (2000)MATHCrossRef Smola, A.J., Bartlett, P.L., Scholkopf, B., Schuurmans, D.: Advances in Large-Margin Classifiers (Neural Information Processing). MIT Press, Cambridge (2000)MATHCrossRef
Metadata
Title
Large-width machine learning algorithm
Authors
Martin Anthony
Joel Ratsaby
Publication date
05-08-2020
Publisher
Springer Berlin Heidelberg
Published in
Progress in Artificial Intelligence / Issue 3/2020
Print ISSN: 2192-6352
Electronic ISSN: 2192-6360
DOI
https://doi.org/10.1007/s13748-020-00212-4

Other articles of this Issue 3/2020

Progress in Artificial Intelligence 3/2020 Go to the issue

Premium Partner