Skip to main content

2019 | OriginalPaper | Buchkapitel

Keys in Relational Databases with Nulls and Bounded Domains

verfasst von : Munqath Alattar, Attila Sali

Erschienen in: Advances in Databases and Information Systems

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Missing data value is an extensive problem in both research and industrial developers. Two general approaches are there to deal with the problem of missing values in databases, they either could be ignored (removed) or imputed (filled in) with new values [10]. For some SQL tables it is possible that some candidate key of the table is not null-free and this needs to be handled. Possible keys and certain keys to deal with this situation were introduced in [17]. In the present paper we introduce an intermediate concept called strongly possible keys that is based on a data mining approach using only information already contained in the SQL table. A strongly possible key is a key that holds for some possible world which is obtained by replacing any occurrences of nulls with some values already appearing in the corresponding attributes. Implication among strongly possible keys is characterized and Armstrong tables are constructed. An algorithm to verify a strongly possible key is given applying bipartite matching. Connection between matroid intersection problem and system of strongly possible keys is established.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Acuña, E., Rodriguez, C.: The treatment of missing values and its effect on classifier accuracy. In: Banks, D., McMorris, F.R., Arabie, P., Gaul, W. (eds.) Classification, Clustering, and Data Mining Applications. Studies in Classification, Data Analysis, and Knowledge Organisation, pp. 639–647. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-642-17103-1_60CrossRef Acuña, E., Rodriguez, C.: The treatment of missing values and its effect on classifier accuracy. In: Banks, D., McMorris, F.R., Arabie, P., Gaul, W. (eds.) Classification, Clustering, and Data Mining Applications. Studies in Classification, Data Analysis, and Knowledge Organisation, pp. 639–647. Springer, Heidelberg (2004). https://​doi.​org/​10.​1007/​978-3-642-17103-1_​60CrossRef
2.
Zurück zum Zitat Beeri, C., Dowd, M., Fagin, R., Statman, R.: On the structure of Armstrong relations for functional dependencies. J. ACM 31(1), 30–46 (1984)MathSciNetCrossRef Beeri, C., Dowd, M., Fagin, R., Statman, R.: On the structure of Armstrong relations for functional dependencies. J. ACM 31(1), 30–46 (1984)MathSciNetCrossRef
5.
Zurück zum Zitat Cheng, C., Wei, L., Lin, T.: Improving relational database quality based on adaptive learning method for estimating null value. In: Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007), Kumamoto, p. 81 (2007). https://doi.org/10.1109/ICICIC.2007.350 Cheng, C., Wei, L., Lin, T.: Improving relational database quality based on adaptive learning method for estimating null value. In: Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007), Kumamoto, p. 81 (2007). https://​doi.​org/​10.​1109/​ICICIC.​2007.​350
6.
Zurück zum Zitat Codd, E.F.: The Relational Model for Database Management, Version 2. Addison-Wesley Publishing Company, Boston (1990) MATH Codd, E.F.: The Relational Model for Database Management, Version 2. Addison-Wesley Publishing Company, Boston (1990) MATH
7.
Zurück zum Zitat Date, C.J.: NOT Is Not “Not”! (Notes on Three-Valued Logic and Related Matters) in Relational Database Writings 1985–1989. Addison-Wesley Reading, Boston (1990) Date, C.J.: NOT Is Not “Not”! (Notes on Three-Valued Logic and Related Matters) in Relational Database Writings 1985–1989. Addison-Wesley Reading, Boston (1990)
9.
Zurück zum Zitat Farhangfar, A., Kurgan, L.A., Pedrycz, W.: Experimental analysis of methods for imputation of missing values in databases. In: Proceedings of SPIE 5421, Intelligent Computing: Theory and Applications II, 12 April 2004. https://doi.org/10.1117/12.542509 Farhangfar, A., Kurgan, L.A., Pedrycz, W.: Experimental analysis of methods for imputation of missing values in databases. In: Proceedings of SPIE 5421, Intelligent Computing: Theory and Applications II, 12 April 2004. https://​doi.​org/​10.​1117/​12.​542509
10.
Zurück zum Zitat Farhangfar, A., Kurgan, L.A., Pedrycz, W.: A novel framework for imputation of missing values in databases. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 37(5), 692–709 (2007)CrossRef Farhangfar, A., Kurgan, L.A., Pedrycz, W.: A novel framework for imputation of missing values in databases. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 37(5), 692–709 (2007)CrossRef
11.
Zurück zum Zitat Farhangfar, A., Kurgan, L.A., Dy, J.: Impact of imputation of missing values on classification error for discrete data. Pattern Recogn. 41(12), 3692–3705 (2008)CrossRef Farhangfar, A., Kurgan, L.A., Dy, J.: Impact of imputation of missing values on classification error for discrete data. Pattern Recogn. 41(12), 3692–3705 (2008)CrossRef
13.
Zurück zum Zitat Garey, M.R., Johnson, D.S.: Computers and Intractability. A guide to the Theory of NP-Completeness. Freeman, New York (1979)MATH Garey, M.R., Johnson, D.S.: Computers and Intractability. A guide to the Theory of NP-Completeness. Freeman, New York (1979)MATH
14.
Zurück zum Zitat Hartmann, S., Kirchberg, M., Link, S.: Design by example for SQL table definitions with functional dependencies. VLDB J. 21(1), 121–144 (2012)CrossRef Hartmann, S., Kirchberg, M., Link, S.: Design by example for SQL table definitions with functional dependencies. VLDB J. 21(1), 121–144 (2012)CrossRef
15.
Zurück zum Zitat Hartmann, S., Leck, U., Link, S.: On Codd families of keys over incomplete relations. Comput. J. 54(7), 1166–1180 (2010)CrossRef Hartmann, S., Leck, U., Link, S.: On Codd families of keys over incomplete relations. Comput. J. 54(7), 1166–1180 (2010)CrossRef
17.
Zurück zum Zitat Köhler, H., Leck, U., Link, S., Zhou, X.: Possible and certain keys for SQL. VLDB J. 25(4), 571–596 (2016)CrossRef Köhler, H., Leck, U., Link, S., Zhou, X.: Possible and certain keys for SQL. VLDB J. 25(4), 571–596 (2016)CrossRef
18.
Zurück zum Zitat Köhler, H., Link, S.: SQL schema design: foundations, normal forms, and normalization. Inf. Syst. 76, 88–113 (2018)CrossRef Köhler, H., Link, S.: SQL schema design: foundations, normal forms, and normalization. Inf. Syst. 76, 88–113 (2018)CrossRef
20.
Zurück zum Zitat Levene, M., Loizou, G.: Axiomatisation of functional dependencies in incomplete relations. J. Theor. Comput. Sci. 206(1–2), 283–300 (1998)MathSciNetCrossRef Levene, M., Loizou, G.: Axiomatisation of functional dependencies in incomplete relations. J. Theor. Comput. Sci. 206(1–2), 283–300 (1998)MathSciNetCrossRef
21.
Zurück zum Zitat Mannila, H., Rähä, K.-J.: Design of Relational Databases. Addison-Wesley, Boston (1992)MATH Mannila, H., Rähä, K.-J.: Design of Relational Databases. Addison-Wesley, Boston (1992)MATH
22.
Zurück zum Zitat Sali, A., Schewe, K.-D.: Keys and Armstrong databases in trees with restructuring. Acta Cybernetica 18(3), 529–556 (2008)MathSciNetMATH Sali, A., Schewe, K.-D.: Keys and Armstrong databases in trees with restructuring. Acta Cybernetica 18(3), 529–556 (2008)MathSciNetMATH
23.
Zurück zum Zitat Welsh, D.J.A.: Matroid Theory. Academic Press, New York (1976)MATH Welsh, D.J.A.: Matroid Theory. Academic Press, New York (1976)MATH
24.
Zurück zum Zitat Zhang, S., Qin, Z., Ling, C.X., Sheng, S.: “Missing is Useful”: missing values in cost-sensitive decision trees. IEEE Trans. Knowl. Data Eng. 17(12), 1689–1693 (2005)CrossRef Zhang, S., Qin, Z., Ling, C.X., Sheng, S.: “Missing is Useful”: missing values in cost-sensitive decision trees. IEEE Trans. Knowl. Data Eng. 17(12), 1689–1693 (2005)CrossRef
Metadaten
Titel
Keys in Relational Databases with Nulls and Bounded Domains
verfasst von
Munqath Alattar
Attila Sali
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-28730-6_3