Skip to main content
Erschienen in: Pattern Analysis and Applications 3/2014

01.08.2014 | Theoretical Advances

Dimensionality reduction and topographic mapping of binary tensors

verfasst von: Jakub Mažgut, Peter Tiňo, Mikael Bodén, Hong Yan

Erschienen in: Pattern Analysis and Applications | Ausgabe 3/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, a decomposition method for binary tensors, generalized multi-linear model for principal component analysis (GMLPCA) is proposed. To the best of our knowledge at present there is no other principled systematic framework for decomposition or topographic mapping of binary tensors. In the model formulation, we constrain the natural parameters of the Bernoulli distributions for each tensor element to lie in a sub-space spanned by a reduced set of basis (principal) tensors. We evaluate and compare the proposed GMLPCA technique with existing real-valued tensor decomposition methods in two scenarios: (1) in a series of controlled experiments involving synthetic data; (2) on a real-world biological dataset of DNA sub-sequences from different functional regions, with sequences represented by binary tensors. The experiments suggest that the GMLPCA model is better suited for modelling binary tensors than its real-valued counterparts. Furthermore, we extended our GMLPCA model to the semi-supervised setting by forcing the model to search for a natural parameter subspace that represents a user-specified compromise between the modelling quality and the degree of class separation.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
The updating formulas of CSA and MPCA are similar, the only difference being that MPCA subtracts the mean from the data tensors.
 
2
As a term, we denote a short and widespread sequence of nucleotides that has or may have a biological significance.
 
3
R = [3 × 3] represents a natural parameter subspace spanned by 3-row and 3-column vectors.
 
4
\(\theta\)’s are fixed current values of the parameters and should be treated as constants.
 
Literatur
1.
Zurück zum Zitat Lu H, Plataniotis KN, Venetsanopoulos AN (2008) MPCA: multilinear principal component analysis of tensor objects. IEEE Trans Neural Netw 19:18–39CrossRef Lu H, Plataniotis KN, Venetsanopoulos AN (2008) MPCA: multilinear principal component analysis of tensor objects. IEEE Trans Neural Netw 19:18–39CrossRef
2.
Zurück zum Zitat Nolker C, Ritter H (2002) Visual recognition of continuous hand postures. IEEE Trans Neural Netw 13:983–994CrossRef Nolker C, Ritter H (2002) Visual recognition of continuous hand postures. IEEE Trans Neural Netw 13:983–994CrossRef
3.
Zurück zum Zitat Jia K, Gong S (2005) Multi-modal tensor face for simultaneous super-resolution and recognition. In: 10th IEEE international conference computer vision 2:1683–1690 (Beijing) Jia K, Gong S (2005) Multi-modal tensor face for simultaneous super-resolution and recognition. In: 10th IEEE international conference computer vision 2:1683–1690 (Beijing)
4.
Zurück zum Zitat Renard N, Bourennane S (2008) An ICA-based multilinear algebra tools for dimensionality reduction in hyperspectral imagery. In: IEEE international conference acoustics, speech and signal processing, vol 4. Las Vegas, NV, pp 1345–1348 Renard N, Bourennane S (2008) An ICA-based multilinear algebra tools for dimensionality reduction in hyperspectral imagery. In: IEEE international conference acoustics, speech and signal processing, vol 4. Las Vegas, NV, pp 1345–1348
5.
Zurück zum Zitat Cai D, He X, Han J (2006) Tensor space model for document analysis. In: Proceedings 29th annual ACM SIGIR international conference research and development in information retrieval. Seatlle, WA, pp 625–626 Cai D, He X, Han J (2006) Tensor space model for document analysis. In: Proceedings 29th annual ACM SIGIR international conference research and development in information retrieval. Seatlle, WA, pp 625–626
6.
Zurück zum Zitat Pearson K (1901) On lines and planes of closest fit to systems of points in space. Philos Mag 2(6):559–572CrossRef Pearson K (1901) On lines and planes of closest fit to systems of points in space. Philos Mag 2(6):559–572CrossRef
7.
Zurück zum Zitat Lu H, Plataniotis KN, Venetsanopoulos AN (2009) Uncorrelated multilinear discriminant analysis with regularization and aggregation for tensor object recognition. IEEE Trans Neural Netw 20:103–123CrossRef Lu H, Plataniotis KN, Venetsanopoulos AN (2009) Uncorrelated multilinear discriminant analysis with regularization and aggregation for tensor object recognition. IEEE Trans Neural Netw 20:103–123CrossRef
8.
Zurück zum Zitat Zafeiriou S (2009) Discriminant nonnegative tensor factorization algorithms. IEEE Trans Neural Netw 20:217–235CrossRef Zafeiriou S (2009) Discriminant nonnegative tensor factorization algorithms. IEEE Trans Neural Netw 20:217–235CrossRef
9.
Zurück zum Zitat Panagakis Y, Kotropoulos C, Arce G (2010) Non-negative multilinear principal component analysis of auditory temporal modulations for music genre classification. IEEE Trans Audio Speech Lang Process 18:576–588CrossRef Panagakis Y, Kotropoulos C, Arce G (2010) Non-negative multilinear principal component analysis of auditory temporal modulations for music genre classification. IEEE Trans Audio Speech Lang Process 18:576–588CrossRef
10.
Zurück zum Zitat Brachat J, Comon P, Mourrain B, Tsigaridas E (2010) Symmetric tensor decomposition. Linear Algebra Appl (In Press, Corrected Proof) Brachat J, Comon P, Mourrain B, Tsigaridas E (2010) Symmetric tensor decomposition. Linear Algebra Appl (In Press, Corrected Proof)
11.
Zurück zum Zitat Schein A, Saul L, Ungar L (2003) A generalized linear model for principal component analysis of binary data. In: 9th international workshop artificial intelligence and statistics. Key West, FL Schein A, Saul L, Ungar L (2003) A generalized linear model for principal component analysis of binary data. In: 9th international workshop artificial intelligence and statistics. Key West, FL
12.
Zurück zum Zitat Acar E, Yener B (2009) “Unsupervised multiway data analysis: a literature survey. IEEE Trans Knowl Data En 21:6–20CrossRef Acar E, Yener B (2009) “Unsupervised multiway data analysis: a literature survey. IEEE Trans Knowl Data En 21:6–20CrossRef
13.
Zurück zum Zitat Wang H, Ahuja N (2005) Rank-r approximation of tensors: using image-as-matrix representation. Comput Vis Pattern Recognit 346–353 Wang H, Ahuja N (2005) Rank-r approximation of tensors: using image-as-matrix representation. Comput Vis Pattern Recognit 346–353
14.
Zurück zum Zitat De Lathauwer L, De Moor B, Vandewalle J (2000) A multilinear singular value decomposition. SIAM J Matrix Anal Appl 21 (4):1253–1278CrossRefMATHMathSciNet De Lathauwer L, De Moor B, Vandewalle J (2000) A multilinear singular value decomposition. SIAM J Matrix Anal Appl 21 (4):1253–1278CrossRefMATHMathSciNet
15.
Zurück zum Zitat Kofidis E, Regalia PA (2001) On the best rank-1 approximation of higher-order supersymmetric tensors. SIAM J Matrix Anal Appl 23(3):863–884CrossRefMathSciNet Kofidis E, Regalia PA (2001) On the best rank-1 approximation of higher-order supersymmetric tensors. SIAM J Matrix Anal Appl 23(3):863–884CrossRefMathSciNet
16.
Zurück zum Zitat Wang H, Ahuja N (2004) Compact representation of multidimensional data using tensor rank-one decomposition. In: Proceedings 17th international conference pattern recognition. Cambridge, UK, pp 44–47 Wang H, Ahuja N (2004) Compact representation of multidimensional data using tensor rank-one decomposition. In: Proceedings 17th international conference pattern recognition. Cambridge, UK, pp 44–47
17.
Zurück zum Zitat Ye J, Janardan R, Li Q (2004) Gpca: an efficient dimension reduction scheme for image compression and retrieval .In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’04. New York, NY, USA, pp 354–363, ACM, 2004. Ye J, Janardan R, Li Q (2004) Gpca: an efficient dimension reduction scheme for image compression and retrieval .In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’04. New York, NY, USA, pp 354–363, ACM, 2004.
18.
Zurück zum Zitat Xu D, Yan S, Zhang L, Lin S, Zhang H-J, Huang T (2008) Reconstruction and recognition of tensor-based objects with concurrent subspaces analysis. IEEE Trans Circuits Syst Video Technol 18:36–47CrossRef Xu D, Yan S, Zhang L, Lin S, Zhang H-J, Huang T (2008) Reconstruction and recognition of tensor-based objects with concurrent subspaces analysis. IEEE Trans Circuits Syst Video Technol 18:36–47CrossRef
19.
Zurück zum Zitat Lu H, Plataniotis KN, Venetsanopoulos AN (2009) Uncorrelated multilinear principal component analysis for unsupervised multilinear subspace learning. IEEE Trans Neural Netw 20(11):1820–1836CrossRef Lu H, Plataniotis KN, Venetsanopoulos AN (2009) Uncorrelated multilinear principal component analysis for unsupervised multilinear subspace learning. IEEE Trans Neural Netw 20(11):1820–1836CrossRef
20.
Zurück zum Zitat Inoue K, Hara K, Urahama K (2009) Robust multilinear principal component analysis. In: 12th international conference on computer vision 2009, pp 591–597, 2–29 oct 2009 Inoue K, Hara K, Urahama K (2009) Robust multilinear principal component analysis. In: 12th international conference on computer vision 2009, pp 591–597, 2–29 oct 2009
21.
Zurück zum Zitat Lu H, Plataniotis KN, Venetsanopoulos AN (2011) A survey of multilinear subspace learning for tensor data. Pattern Recognit 44:1540–1551CrossRefMATH Lu H, Plataniotis KN, Venetsanopoulos AN (2011) A survey of multilinear subspace learning for tensor data. Pattern Recognit 44:1540–1551CrossRefMATH
22.
Zurück zum Zitat Cortes C, Mohri M (2003) AUC optimization vs. error rate minimization. In: Advances in neural information processing systems, vol 16. Banff, AL, Canada, pp 313–320 Cortes C, Mohri M (2003) AUC optimization vs. error rate minimization. In: Advances in neural information processing systems, vol 16. Banff, AL, Canada, pp 313–320
23.
Zurück zum Zitat Li X, Zeng J, Yan H (2008) PCA-HPR: a principle component analysis model for human promoter recognition. Bioinformation 2(9):373–378CrossRef Li X, Zeng J, Yan H (2008) PCA-HPR: a principle component analysis model for human promoter recognition. Bioinformation 2(9):373–378CrossRef
24.
Zurück zum Zitat Sonnenburg S, Zien A, Philips P, Ratsch G (2008) POIMs: positional oligomer importance matrices–understanding support vector machine-based signal detectors. Bioinformatics 24(13): i6–i14CrossRef Sonnenburg S, Zien A, Philips P, Ratsch G (2008) POIMs: positional oligomer importance matrices–understanding support vector machine-based signal detectors. Bioinformatics 24(13): i6–i14CrossRef
25.
Zurück zum Zitat Baldi P, Brunak S (2001) Bioinformatics: the machine learning approach. The MIT Press, 2nd ed. Baldi P, Brunak S (2001) Bioinformatics: the machine learning approach. The MIT Press, 2nd ed.
26.
Zurück zum Zitat Isaev A (2007) Introduction to mathematical methods in Bioinformatics. Springer, Secaucus Isaev A (2007) Introduction to mathematical methods in Bioinformatics. Springer, Secaucus
27.
Zurück zum Zitat Wakaguri H, Yamashita R, Sugano SYS, Nakai K (2008) DBTSS: database of transcription start sites. Nucleic Acids Res 36:97–101 (no. Database-Issue) Wakaguri H, Yamashita R, Sugano SYS, Nakai K (2008) DBTSS: database of transcription start sites. Nucleic Acids Res 36:97–101 (no. Database-Issue)
28.
Zurück zum Zitat Saxonov S, Daizadeh I, Fedorov A, Gilbert W (2000) EID: the exon-intron database—an exhaustive database of protein-coding intron-containing genes. Nucleic Acids Res 28(1):185–190CrossRef Saxonov S, Daizadeh I, Fedorov A, Gilbert W (2000) EID: the exon-intron database—an exhaustive database of protein-coding intron-containing genes. Nucleic Acids Res 28(1):185–190CrossRef
29.
Zurück zum Zitat Ron D, Singer Y, Tishby N (1996) “The power of amnesia: learning probabilistic automata with variable memory length. Mach Learning 25(2–3):117–149CrossRefMATH Ron D, Singer Y, Tishby N (1996) “The power of amnesia: learning probabilistic automata with variable memory length. Mach Learning 25(2–3):117–149CrossRefMATH
30.
Zurück zum Zitat Tino P, Dorffner G (2001) Predicting the future of discrete sequences from fractal representations of the past. Mach Learning 45(2):187–217CrossRefMATH Tino P, Dorffner G (2001) Predicting the future of discrete sequences from fractal representations of the past. Mach Learning 45(2):187–217CrossRefMATH
31.
Zurück zum Zitat Cross S, Clark V, Bird A (1999) Isolation of CpG islands from large genomic clones. Nucleic Acids Res 27(10):2099–2107CrossRef Cross S, Clark V, Bird A (1999) Isolation of CpG islands from large genomic clones. Nucleic Acids Res 27(10):2099–2107CrossRef
32.
Zurück zum Zitat Bodén M, Bailey TL (2007) Associating transcription factor-binding site motifs with target GO terms and target genes. Nucleic Acids Res 36(12):4108–4117CrossRef Bodén M, Bailey TL (2007) Associating transcription factor-binding site motifs with target GO terms and target genes. Nucleic Acids Res 36(12):4108–4117CrossRef
33.
Zurück zum Zitat Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25(1):25–29CrossRef Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25(1):25–29CrossRef
34.
Zurück zum Zitat Straussman R, Nejman D, Roberts D, Steinfeld I, Blum B, Benvenisty N, Simon I, Yakhini Z, Cedar H (2009) Developmental programming of CpG island methylation profiles in the human genome. Nat Struct Mol Biol 16(5): 564–571CrossRef Straussman R, Nejman D, Roberts D, Steinfeld I, Blum B, Benvenisty N, Simon I, Yakhini Z, Cedar H (2009) Developmental programming of CpG island methylation profiles in the human genome. Nat Struct Mol Biol 16(5): 564–571CrossRef
35.
Zurück zum Zitat Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS (2009) Meme suite: tools for motif discovery and searching. Nucleic Acids Res 37:W202–W208 (Web Server issue) Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS (2009) Meme suite: tools for motif discovery and searching. Nucleic Acids Res 37:W202–W208 (Web Server issue)
36.
Zurück zum Zitat Gupta S, Stamatoyannopoulos J, Bailey T, Noble W (2007) Quantifying similarity between motifs. Genome Biol8(2): R24 Gupta S, Stamatoyannopoulos J, Bailey T, Noble W (2007) Quantifying similarity between motifs. Genome Biol8(2): R24
37.
Zurück zum Zitat Globerson A, Roweis S (2006) Metric learning by collapsing classes. Adva Neural Info Process Syst 18:451–458 Globerson A, Roweis S (2006) Metric learning by collapsing classes. Adva Neural Info Process Syst 18:451–458
Metadaten
Titel
Dimensionality reduction and topographic mapping of binary tensors
verfasst von
Jakub Mažgut
Peter Tiňo
Mikael Bodén
Hong Yan
Publikationsdatum
01.08.2014
Verlag
Springer London
Erschienen in
Pattern Analysis and Applications / Ausgabe 3/2014
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-013-0317-y

Weitere Artikel der Ausgabe 3/2014

Pattern Analysis and Applications 3/2014 Zur Ausgabe