Skip to main content

2020 | OriginalPaper | Buchkapitel

Similarity-based Rough Sets and Its Applications in Data Mining

verfasst von : Dávid Nagy

Erschienen in: Transactions on Rough Sets XXII

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Pawlakian spaces rely on an equivalence relation which represent indiscernibility. As a generalization of these spaces, some approximation spaces have appeared that are not based on an equivalence relation but on a tolerance relation that represents similarity. These spaces preserve the property of the Pawlakian space that the union of the base sets gives out the universe. However, they give up the requirement that the base sets are pairwise disjoint. The base sets are generated in a way where for each object, the objects that are similar to the given object, are taken. This means that the similarity to a given object is considered. In the worst case, it can happen that the number of base sets equals those of objects in the universe. This significantly increases the computational resource need of the set approximation process and limits the efficient use of them in large databases. To overcome this problem, a possible solution is presented in this dissertation. The space is called similarity-based rough sets where the system of base sets is generated by the correlation clustering. Therefore, the real similarity is taken into consideration not the similarity to a distinguished object. The space generated this way, on the one hand, represents the interpreted similarity properly and on the other hand, reduces the number of base sets to a manageable size. This work deals with the properties and applicability of this space, presenting all the advantages that can be gained from the correlation clustering.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
The work described in this chapter was based on [57, 21, 3538].
 
2
The formal definition of these answers can also be seen in the first chapter but for better readability, they are also listed here again focusing on covering approximation spaces based on a tolerance relation.
 
3
The work described in this section was based on [38].
 
4
The work described in this section was based on [37].
 
5
The work described in this section was based on [57, 35].
 
6
The work described in this section was based on [5].
 
7
The work described in this section was based on [6, 7, 21, 35].
 
8
The work described in this section was based on [21].
 
9
The work described in this section was based on [6, 7].
 
10
The work described in this section was based on [35].
 
11
The work described in this chapter was based on [6, 35].
 
12
The work described in this section was based on [35].
 
13
The work described in this section was based on [6].
 
Literatur
2.
Zurück zum Zitat Altman, A., Tennenholtz, M.: Ranking systems: the PageRank axioms. In: Proceedings of the 6th ACM Conference on Electronic Commerce, pp. 1–8. ACM (2005) Altman, A., Tennenholtz, M.: Ranking systems: the PageRank axioms. In: Proceedings of the 6th ACM Conference on Electronic Commerce, pp. 1–8. ACM (2005)
4.
Zurück zum Zitat Aszalós, L., Hajdu, L., Pethő, A.: On a correlational clustering of integers. Indagationes Mathematicae 27(1), 173–191 (2016)MathSciNetCrossRef Aszalós, L., Hajdu, L., Pethő, A.: On a correlational clustering of integers. Indagationes Mathematicae 27(1), 173–191 (2016)MathSciNetCrossRef
5.
Zurück zum Zitat Aszalós, L., Nagy, D.: Visualization of tolerance relations. In: Gábor, Kusper; Roland, K. (ed.) Proceedings of the 10th International Conference on Applied Informatics, pp. 15–22 (2018) Aszalós, L., Nagy, D.: Visualization of tolerance relations. In: Gábor, Kusper; Roland, K. (ed.) Proceedings of the 10th International Conference on Applied Informatics, pp. 15–22 (2018)
7.
Zurück zum Zitat Aszalós, L., Nagy, D.: Selecting representatives. In: Ganzha, M., Maciaszek, L., Paprzycki, M. (eds.) Communication Papers of the 2019 Federated Conference on Computer Science and Information Systems. Annals of Computer Science and Information Systems, vol. 20, pp. 13–19. PTI (2019). https://doi.org/10.15439/2019F95 Aszalós, L., Nagy, D.: Selecting representatives. In: Ganzha, M., Maciaszek, L., Paprzycki, M. (eds.) Communication Papers of the 2019 Federated Conference on Computer Science and Information Systems. Annals of Computer Science and Information Systems, vol. 20, pp. 13–19. PTI (2019). https://​doi.​org/​10.​15439/​2019F95
9.
Zurück zum Zitat Becker, H.: A survey of correlation clustering. Advanced Topics in Computational Learning Theory, pp. 1–10 (2005) Becker, H.: A survey of correlation clustering. Advanced Topics in Computational Learning Theory, pp. 1–10 (2005)
11.
Zurück zum Zitat Brownlee, J.: Clever Algorithms: Nature-Inspired Programming Recipes. 1st edn. Lulu.com (2011) Brownlee, J.: Clever Algorithms: Nature-Inspired Programming Recipes. 1st edn. Lulu.com (2011)
12.
Zurück zum Zitat Caballero, Y., Bello, R., Alvarez, D., Gareia, M.M., Pizano, Y.: Improving the K-NN method: rough set in edit training set. In: Debenham, J. (ed.) Professional Practice in Artificial Intelligence, pp. 21–30. Springer, Boston (2006)CrossRef Caballero, Y., Bello, R., Alvarez, D., Gareia, M.M., Pizano, Y.: Improving the K-NN method: rough set in edit training set. In: Debenham, J. (ed.) Professional Practice in Artificial Intelligence, pp. 21–30. Springer, Boston (2006)CrossRef
14.
Zurück zum Zitat Csajbók, Z., Mihálydeák, T.: From vagueness to rough sets in partial approximation spaces. In: JRS2014 (submitted, 2014) Csajbók, Z., Mihálydeák, T.: From vagueness to rough sets in partial approximation spaces. In: JRS2014 (submitted, 2014)
15.
Zurück zum Zitat Csajbók, Z., Mihálydeák, T.: A general set theoretic approximation framework. In: Greco, S., Bouchon-Meunier, B., Coletti, G., Fedrizzi, M., Matarazzo, B., Yager, R. (eds.) Advances on Computational Intelligence, Communications in Computer and Information Science, vol. 297, pp. 604–612. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-31709-5_61 Csajbók, Z., Mihálydeák, T.: A general set theoretic approximation framework. In: Greco, S., Bouchon-Meunier, B., Coletti, G., Fedrizzi, M., Matarazzo, B., Yager, R. (eds.) Advances on Computational Intelligence, Communications in Computer and Information Science, vol. 297, pp. 604–612. Springer, Heidelberg (2012). https://​doi.​org/​10.​1007/​978-3-642-31709-5_​61
16.
Zurück zum Zitat Csajbók, Z., Mihálydeák, T.: Partial approximative set theory: a generalization of the rough set theory. Int. J. Comput. Inf. Syst. Ind. Manag. Appl. 4, 437–444 (2012) Csajbók, Z., Mihálydeák, T.: Partial approximative set theory: a generalization of the rough set theory. Int. J. Comput. Inf. Syst. Ind. Manag. Appl. 4, 437–444 (2012)
17.
Zurück zum Zitat Delic, D., Lenz, H.J., Neiling, M.: Improving the quality of association rule mining by means of rough sets. In: Grzegorzewski, P., Hryniewicz, O., Gil, M.Á. (eds.) Soft Methods in Probability, Statistics and Data Analysis, pp. 281–288. Physica-Verlag HD, Heidelberg (2002)CrossRef Delic, D., Lenz, H.J., Neiling, M.: Improving the quality of association rule mining by means of rough sets. In: Grzegorzewski, P., Hryniewicz, O., Gil, M.Á. (eds.) Soft Methods in Probability, Statistics and Data Analysis, pp. 281–288. Physica-Verlag HD, Heidelberg (2002)CrossRef
23.
Zurück zum Zitat Erdős, P., Rényi, A.: On the evolution of random graphs. In: Publication of the Mathematical Institute of the Hungarian Academy of Sciences, pp. 17–61 (1960) Erdős, P., Rényi, A.: On the evolution of random graphs. In: Publication of the Mathematical Institute of the Hungarian Academy of Sciences, pp. 17–61 (1960)
24.
Zurück zum Zitat Filiberto, Y., Caballero, Y., Larrua, R., Bello, R.: A method to build similarity relations into extended rough set theory. In: 2010 10th International Conference on Intelligent Systems Design and Applications, pp. 1314–1319, November 2010. https://doi.org/10.1109/ISDA.2010.5687091 Filiberto, Y., Caballero, Y., Larrua, R., Bello, R.: A method to build similarity relations into extended rough set theory. In: 2010 10th International Conference on Intelligent Systems Design and Applications, pp. 1314–1319, November 2010. https://​doi.​org/​10.​1109/​ISDA.​2010.​5687091
30.
Zurück zum Zitat Janusz, A.: Algorithms for similarity relation learning from high dimensional data. Trans. Rough Sets 17, 174–292 (2014)CrossRef Janusz, A.: Algorithms for similarity relation learning from high dimensional data. Trans. Rough Sets 17, 174–292 (2014)CrossRef
40.
Zurück zum Zitat Néda, Z., Sumi, R., Ercsey-Ravasz, M., Varga, M., Molnár, B., Cseh, G.: Correlation clustering on networks. J. Phys. A: Math. Theor. 42(34), 345003 (2009)MathSciNetCrossRef Néda, Z., Sumi, R., Ercsey-Ravasz, M., Varga, M., Molnár, B., Cseh, G.: Correlation clustering on networks. J. Phys. A: Math. Theor. 42(34), 345003 (2009)MathSciNetCrossRef
42.
Zurück zum Zitat Pawlak, Z.: Rough sets. Int. J. Parallel Prog. 11(5), 341–356 (1982)MATH Pawlak, Z.: Rough sets. Int. J. Parallel Prog. 11(5), 341–356 (1982)MATH
45.
Zurück zum Zitat Pawlak, Z., et al.: Rough sets: theoretical aspects of reasoning about data. In: System Theory, Knowledge Engineering and Problem Solving, Kluwer Academic Publishers, Dordrecht, 1991, September 1991 Pawlak, Z., et al.: Rough sets: theoretical aspects of reasoning about data. In: System Theory, Knowledge Engineering and Problem Solving, Kluwer Academic Publishers, Dordrecht, 1991, September 1991
47.
Zurück zum Zitat Polkowski, L., Skowron, A., Zytkow, J.: Rough foundations for rough sets soft computing: rough sets, fuzzy logic, neural networks, uncertainty management, knowledge discovery, pp. 55–58, May 2020 Polkowski, L., Skowron, A., Zytkow, J.: Rough foundations for rough sets soft computing: rough sets, fuzzy logic, neural networks, uncertainty management, knowledge discovery, pp. 55–58, May 2020
48.
Zurück zum Zitat do Prado, H.A., Engel, P.M., Filho, H.C.: Rough clustering: an alternative to find meaningful clusters by using the reducts from a dataset. In: Alpigini, J.J., Peters, J.F., Skowron, A., Zhong, N. (eds.) RSCTC 2002. LNCS (LNAI), vol. 2475, pp. 234–238. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-45813-1_30CrossRefMATH do Prado, H.A., Engel, P.M., Filho, H.C.: Rough clustering: an alternative to find meaningful clusters by using the reducts from a dataset. In: Alpigini, J.J., Peters, J.F., Skowron, A., Zhong, N. (eds.) RSCTC 2002. LNCS (LNAI), vol. 2475, pp. 234–238. Springer, Heidelberg (2002). https://​doi.​org/​10.​1007/​3-540-45813-1_​30CrossRefMATH
49.
51.
Zurück zum Zitat Słowiński, R., Vanderpooten, D.: A generalized definition of rough approximations based on similarity. IEEE Trans. Knowl. Data Eng. 12(2), 331–336 (2000)CrossRef Słowiński, R., Vanderpooten, D.: A generalized definition of rough approximations based on similarity. IEEE Trans. Knowl. Data Eng. 12(2), 331–336 (2000)CrossRef
52.
Zurück zum Zitat Słowiński, R., Vanderpooten, D.: Similarity relation as a basis for rough approximations. Adv. Mach. Intell. Soft-Comput. 4, 17–33 (1997) Słowiński, R., Vanderpooten, D.: Similarity relation as a basis for rough approximations. Adv. Mach. Intell. Soft-Comput. 4, 17–33 (1997)
60.
Zurück zum Zitat Zimek, A.: Correlation clustering. ACM SIGKDD Explor. Newslett. 11(1), 53–54 (2009)CrossRef Zimek, A.: Correlation clustering. ACM SIGKDD Explor. Newslett. 11(1), 53–54 (2009)CrossRef
Metadaten
Titel
Similarity-based Rough Sets and Its Applications in Data Mining
verfasst von
Dávid Nagy
Copyright-Jahr
2020
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-62798-3_5

Premium Partner