Skip to main content

2017 | OriginalPaper | Buchkapitel

Clustering Complex Data Represented as Propositional Formulas

verfasst von : Abdelhamid Boudane, Said Jabbour, Lakhdar Sais, Yakoub Salhi

Erschienen in: Advances in Knowledge Discovery and Data Mining

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Clustering has been extensively studied to deal with different kinds of data. Usually, datasets are represented as a n-dimensional vector of attributes described by numerical or nominal categorical values. Symbolic data is another concept where the objects are more complex such as intervals, multi-categorical or modal. However, new applications might give rise to even more complex data describing for example customer desires, constraints, and preferences. Such data can be expressed more compactly using logic-based representations. In this paper, we introduce a new clustering framework, where complex objects are described by propositional formulas. First, we extend the two well-known k-means and hierarchical agglomerative clustering techniques. Second, we introduce a new divisive algorithm for clustering objects represented explicitly by sets of models. Finally, we propose a propositional satisfiability based encoding of the problem of clustering propositional formulas without the need for an explicit representation of their models. Preliminary experimental results validating our proposed framework are provided.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Aggarwal, C.C., Reddy, C.K.: Data clustering: algorithms and applications. CRC Press, Boca Raton (2013)MATH Aggarwal, C.C., Reddy, C.K.: Data clustering: algorithms and applications. CRC Press, Boca Raton (2013)MATH
2.
Zurück zum Zitat Berkhin, P.: A survey of clustering data mining techniques. In: Kogan, J., Nicholas, C.K., Teboulle, M. (eds.) Grouping Multidimensional Data - Recent Advances in Clustering, pp. 25–71. Springer, Heidelberg (2006)CrossRef Berkhin, P.: A survey of clustering data mining techniques. In: Kogan, J., Nicholas, C.K., Teboulle, M. (eds.) Grouping Multidimensional Data - Recent Advances in Clustering, pp. 25–71. Springer, Heidelberg (2006)CrossRef
3.
Zurück zum Zitat Billard, L., Diday, E., Analysis, S.D.: Conceptual Statistics and Data Mining. Wiley, Hoboken (2012) Billard, L., Diday, E., Analysis, S.D.: Conceptual Statistics and Data Mining. Wiley, Hoboken (2012)
4.
Zurück zum Zitat Bock, H.H.: Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data. Springer, New York (2000)CrossRefMATH Bock, H.H.: Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data. Springer, New York (2000)CrossRefMATH
6.
Zurück zum Zitat de Carvalho, F.D.A., Csernel, M., Lechevallier, Y.: Clustering constrained symbolic data. Pattern Recogn. Lett. 30(11), 1037–1045 (2009)CrossRef de Carvalho, F.D.A., Csernel, M., Lechevallier, Y.: Clustering constrained symbolic data. Pattern Recogn. Lett. 30(11), 1037–1045 (2009)CrossRef
7.
Zurück zum Zitat de Souza, R.M., de Carvalho, F.D.A.: Clustering of interval data based on city-block distances. Pattern Recogn. Lett. 25(3), 353–365 (2004)CrossRef de Souza, R.M., de Carvalho, F.D.A.: Clustering of interval data based on city-block distances. Pattern Recogn. Lett. 25(3), 353–365 (2004)CrossRef
8.
Zurück zum Zitat Diday, E., Esposito, F.: An introduction to symbolic data analysis and the SODAS software. Intell. Data Anal. 7(6), 583–601 (2003) Diday, E., Esposito, F.: An introduction to symbolic data analysis and the SODAS software. Intell. Data Anal. 7(6), 583–601 (2003)
9.
Zurück zum Zitat Gomes, C.P., Hoffmann, J., Sabharwal, A., Selman, B.: From sampling to model counting. In: IJCAI 1997, pp. 2293–2299 (2007) Gomes, C.P., Hoffmann, J., Sabharwal, A., Selman, B.: From sampling to model counting. In: IJCAI 1997, pp. 2293–2299 (2007)
10.
Zurück zum Zitat Gowda, K.C., Diday, E.: Symbolic clustering algorithms using similarity and dissimilarity measures. In: Diday, E., Lechevallier, Y., Schader, M., Bertrand, P., Burtschy, B. (eds.) New Approaches in Classification and Data Analysis, pp. 414–422. Springer, Heidelberg (1994)CrossRef Gowda, K.C., Diday, E.: Symbolic clustering algorithms using similarity and dissimilarity measures. In: Diday, E., Lechevallier, Y., Schader, M., Bertrand, P., Burtschy, B. (eds.) New Approaches in Classification and Data Analysis, pp. 414–422. Springer, Heidelberg (1994)CrossRef
11.
Zurück zum Zitat Hotz, L., Felfernig, A., Stumptner, M., Ryabokon, A., Bagley, C., Wolter, K.: Configuration knowledge representation and reasoning. In: Knowledge-Based Configuration, chap. 6, pp. 41–72. Morgan Kaufmann (2014) Hotz, L., Felfernig, A., Stumptner, M., Ryabokon, A., Bagley, C., Wolter, K.: Configuration knowledge representation and reasoning. In: Knowledge-Based Configuration, chap. 6, pp. 41–72. Morgan Kaufmann (2014)
12.
Zurück zum Zitat Jabbour, S., Lonlac, J., Sais, L., Salhi. Y.: Extending modern SAT solvers for models enumeration. In: IEEE-IRI 2014, pp. 803–810 (2014) Jabbour, S., Lonlac, J., Sais, L., Salhi. Y.: Extending modern SAT solvers for models enumeration. In: IEEE-IRI 2014, pp. 803–810 (2014)
13.
Zurück zum Zitat Jaccard, P.: The distribution of the flora of the alpine zone. New Phytol. 11, 37–50 (1912)CrossRef Jaccard, P.: The distribution of the flora of the alpine zone. New Phytol. 11, 37–50 (1912)CrossRef
14.
Zurück zum Zitat Michalski, R.S.: Knowledge acquisition through conceptual clustering: a theoretical framework and an algorithm for partitioning data into conjunctive concepts. J. Policy Anal. Inf. Syst. 4(3), 219–244 (1980)MathSciNet Michalski, R.S.: Knowledge acquisition through conceptual clustering: a theoretical framework and an algorithm for partitioning data into conjunctive concepts. J. Policy Anal. Inf. Syst. 4(3), 219–244 (1980)MathSciNet
15.
Zurück zum Zitat Tversky, A.: Features of similarity. Psychol. Rev. 84(4), 327–352 (1977)CrossRef Tversky, A.: Features of similarity. Psychol. Rev. 84(4), 327–352 (1977)CrossRef
16.
Zurück zum Zitat Tversky, A.: Preference, Belief, and Similarity. The MIT Press, Cambridge (2003) Tversky, A.: Preference, Belief, and Similarity. The MIT Press, Cambridge (2003)
Metadaten
Titel
Clustering Complex Data Represented as Propositional Formulas
verfasst von
Abdelhamid Boudane
Said Jabbour
Lakhdar Sais
Yakoub Salhi
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-57529-2_35