Skip to main content
Top
Published in: International Journal of Machine Learning and Cybernetics 3/2011

01-09-2011 | Original Article

Problem of knowledge discovery in noisy databases

Authors: Vadim Vagin, Marina Fomina

Published in: International Journal of Machine Learning and Cybernetics | Issue 3/2011

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The problem of information generalization for real data that may contain noisy data is considered. Various models of information noise are presented, and the influence of noise to the algorithms of generalization is discussed. We used the methods of constructing decision trees and forming production rules. The results of the modeling are presented.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Show more products
Literature
1.
go back to reference Michalski RS (1983) A theory and methodology of inductive learning. In: Michalski RS, Carbonel JG, Mitchell TM (eds) Machine learning: an artificial intelligence approach. Morgan Kaufman, Palo Alto, pp 83–134 Michalski RS (1983) A theory and methodology of inductive learning. In: Michalski RS, Carbonel JG, Mitchell TM (eds) Machine learning: an artificial intelligence approach. Morgan Kaufman, Palo Alto, pp 83–134
2.
go back to reference Vagin VN, Golovina EJ, Zagoryanskaya AA, Fomina MV (2008) Exact and plausible inference in intelligent systems. In: Vagin VN, Pospelov DA (eds) 2nd edn, Fizmatlit, Moscow (in Russian) Vagin VN, Golovina EJ, Zagoryanskaya AA, Fomina MV (2008) Exact and plausible inference in intelligent systems. In: Vagin VN, Pospelov DA (eds) 2nd edn, Fizmatlit, Moscow (in Russian)
3.
go back to reference Parsons S (1995) Current approaches to handling imperfect information in data and knowledge bases. IEEE Trans Knowl Data Eng 8(3):353–372MathSciNetCrossRef Parsons S (1995) Current approaches to handling imperfect information in data and knowledge bases. IEEE Trans Knowl Data Eng 8(3):353–372MathSciNetCrossRef
4.
go back to reference Quinlan JR (1986) Induction of decision trees. Mach Learn 1:81–106 Quinlan JR (1986) Induction of decision trees. Mach Learn 1:81–106
5.
go back to reference Quinlan JR (1996) Improved use of continuous attributes in C 4.5. J Artif Intell Res 4:77–90MATH Quinlan JR (1996) Improved use of continuous attributes in C 4.5. J Artif Intell Res 4:77–90MATH
6.
go back to reference Clark P, Niblett T (1989) The CN2 induction algorithm. Mach Learn 3:261–283 Clark P, Niblett T (1989) The CN2 induction algorithm. Mach Learn 3:261–283
7.
go back to reference Clark P, Boswell R (1991) Rule induction with CN2: some recent improvements. In: Machine learning—Proceedings of the Fifth European Conference (ESWL-91), Berlin, 151–163 Clark P, Boswell R (1991) Rule induction with CN2: some recent improvements. In: Machine learning—Proceedings of the Fifth European Conference (ESWL-91), Berlin, 151–163
8.
go back to reference Fayyad U, Piatetsky-Shapiro G, Smith P (1996) From data mining to knowledge discovery: an overview. In: Advances in knowledge discovery and data mining. AAAI Press/The MIT Press, Cambridge, pp 1–36 Fayyad U, Piatetsky-Shapiro G, Smith P (1996) From data mining to knowledge discovery: an overview. In: Advances in knowledge discovery and data mining. AAAI Press/The MIT Press, Cambridge, pp 1–36
9.
go back to reference Wong E (1982) A statistical approach to incomplete information in database Systems. ACM Trans Database Syst 7:470–488MATHCrossRef Wong E (1982) A statistical approach to incomplete information in database Systems. ACM Trans Database Syst 7:470–488MATHCrossRef
10.
go back to reference Weiss S, Kulikowski C (1991) Computer system that learn: classification and prediction methods from statistics neural networks machine learning and expert systems. Morgan Kaufmann, San Francisco Weiss S, Kulikowski C (1991) Computer system that learn: classification and prediction methods from statistics neural networks machine learning and expert systems. Morgan Kaufmann, San Francisco
11.
go back to reference Kolodner J (1993) Case-based reasoning. Morgan Kaufmann, San-Francisco Kolodner J (1993) Case-based reasoning. Morgan Kaufmann, San-Francisco
13.
go back to reference Utgoff PE (1989) Incremental induction of decision trees. Mach Learn 4:161–186CrossRef Utgoff PE (1989) Incremental induction of decision trees. Mach Learn 4:161–186CrossRef
14.
go back to reference Nilsson N (1998) Artificial Intelligence: a new synthesis. Computers. Morgan Kaufmann, San Francisco Nilsson N (1998) Artificial Intelligence: a new synthesis. Computers. Morgan Kaufmann, San Francisco
15.
go back to reference Heckerman D (1996) Bayesian networks for knowledge discovery. In: Advances in knowledge discovery and data mining. AAAI Press/The MIT Press, Cambridge, pp 273–306 Heckerman D (1996) Bayesian networks for knowledge discovery. In: Advances in knowledge discovery and data mining. AAAI Press/The MIT Press, Cambridge, pp 273–306
16.
go back to reference Goldberg David E (1989) Genetic algorithms in search, optimization and machine learning. Addison–Wesley Longman Publishing Co, Inc., Boston, MA, USA Goldberg David E (1989) Genetic algorithms in search, optimization and machine learning. Addison–Wesley Longman Publishing Co, Inc., Boston, MA, USA
18.
go back to reference Berisha AM, Vagin VN, Kulikov AV, Fomina MV (2005) Methods of knowledge discovery in “noisy” databases. J Comput Syst Sci Intern 44(6):973–987MathSciNet Berisha AM, Vagin VN, Kulikov AV, Fomina MV (2005) Methods of knowledge discovery in “noisy” databases. J Comput Syst Sci Intern 44(6):973–987MathSciNet
19.
go back to reference Mookerjee V, Mannino M, Gilson R (1995) Improving the performance stability of inductive expert systems under input noise. Info Syst Res 6(4):328–356CrossRef Mookerjee V, Mannino M, Gilson R (1995) Improving the performance stability of inductive expert systems under input noise. Info Syst Res 6(4):328–356CrossRef
20.
go back to reference Dasarathy BV (1991) Nearest neighbor (NN) norms: NN pattern classification techniques. IEEE Computer Society Press, Los Alamos Dasarathy BV (1991) Nearest neighbor (NN) norms: NN pattern classification techniques. IEEE Computer Society Press, Los Alamos
21.
go back to reference Quinlan JR (1989) Unknown attribute values in induction. In: Proceedings of the sixth international workshop on machine learning, pp 164–168 Quinlan JR (1989) Unknown attribute values in induction. In: Proceedings of the sixth international workshop on machine learning, pp 164–168
22.
go back to reference Domingos P (1996) Unifying instance-based and rule-based induction. Mach Learn 24:141–168 Domingos P (1996) Unifying instance-based and rule-based induction. Mach Learn 24:141–168
23.
go back to reference Vagin VN, Fomina MV, Kulikov AV (2008) The problem of object recognition in the presence of noise in original data. In: 10th Scandinavian Conference on Artificial Intelligence SCAI 2008, pp 60–67 Vagin VN, Fomina MV, Kulikov AV (2008) The problem of object recognition in the presence of noise in original data. In: 10th Scandinavian Conference on Artificial Intelligence SCAI 2008, pp 60–67
25.
go back to reference Vagin V, Fomina M (2010) Methods and algorithms of information generalization in noisy databases. In: Advances in Soft Computing. 9th Mexican International Conference on Artificial Intelligence, MICAI 2010, Proceedings, Part II/Springer Verlag, Berlin, pp 44–55 Vagin V, Fomina M (2010) Methods and algorithms of information generalization in noisy databases. In: Advances in Soft Computing. 9th Mexican International Conference on Artificial Intelligence, MICAI 2010, Proceedings, Part II/Springer Verlag, Berlin, pp 44–55
Metadata
Title
Problem of knowledge discovery in noisy databases
Authors
Vadim Vagin
Marina Fomina
Publication date
01-09-2011
Publisher
Springer-Verlag
Published in
International Journal of Machine Learning and Cybernetics / Issue 3/2011
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-011-0028-x

Other articles of this Issue 3/2011

International Journal of Machine Learning and Cybernetics 3/2011 Go to the issue

Editorial

Editorial