Skip to main content

2015 | OriginalPaper | Buchkapitel

Sampling from Determinantal Point Processes for Scalable Manifold Learning

verfasst von : Christian Wachinger, Polina Golland

Erschienen in: Information Processing in Medical Imaging

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

High computational costs of manifold learning prohibit its application for large datasets. A common strategy to overcome this problem is to perform dimensionality reduction on selected landmarks and to successively embed the entire dataset with the Nyström method. The two main challenges that arise are: (i) the landmarks selected in non-Euclidean geometries must result in a low reconstruction error, (ii) the graph constructed from sparsely sampled landmarks must approximate the manifold well. We propose to sample the landmarks from determinantal distributions on non-Euclidean spaces. Since current determinantal sampling algorithms have the same complexity as those for manifold learning, we present an efficient approximation with linear complexity. Further, we recover the local geometry after the sparsification by assigning each landmark a local covariance matrix, estimated from the original point set. The resulting neighborhood selection based on the Bhattacharyya distance improves the embedding of sparsely sampled manifolds. Our experiments show a significant performance improvement compared to state-of-the-art landmark selection techniques on synthetic and medical data.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Arthur, D., Vassilvitskii, S.: k-means++: The advantages of careful seeding. In: SODA, pp. 1027–1035 (2007) Arthur, D., Vassilvitskii, S.: k-means++: The advantages of careful seeding. In: SODA, pp. 1027–1035 (2007)
2.
Zurück zum Zitat Balasubramanian, M., Schwartz, E.L.: The Isomap algorithm and topological stability. Science 295(5552), 7 (2002)CrossRef Balasubramanian, M., Schwartz, E.L.: The Isomap algorithm and topological stability. Science 295(5552), 7 (2002)CrossRef
3.
Zurück zum Zitat Belabbas, M., Wolfe, P.: On landmark selection and sampling in high-dimensional data analysis. In: PNAS (2009) Belabbas, M., Wolfe, P.: On landmark selection and sampling in high-dimensional data analysis. In: PNAS (2009)
4.
Zurück zum Zitat Belabbas, M., Wolfe, P.: Spectral methods in machine learning and new strategies for very large datasets. PNAS 106(2), 369–374 (2009)CrossRef Belabbas, M., Wolfe, P.: Spectral methods in machine learning and new strategies for very large datasets. PNAS 106(2), 369–374 (2009)CrossRef
5.
Zurück zum Zitat Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. 15(6), 1373–1396 (2003)MATHCrossRef Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. 15(6), 1373–1396 (2003)MATHCrossRef
6.
Zurück zum Zitat Bengio, Y., Paiement, J., Vincent, P., Delalleau, O., Le Roux, N., Ouimet, M.: Out-of-sample extensions for lle, isomap, mds, eigenmaps, and spectral clustering. In: NIPS (2004) Bengio, Y., Paiement, J., Vincent, P., Delalleau, O., Le Roux, N., Ouimet, M.: Out-of-sample extensions for lle, isomap, mds, eigenmaps, and spectral clustering. In: NIPS (2004)
7.
Zurück zum Zitat Borodin, A.: Determinantal point processes. In: Akemann, G., Baik, J., Francesco, P.D. (eds.) The Oxford Handbook of Random Matrix Theory, pp. 231–249. Oxford University Press, Oxford (2011) Borodin, A.: Determinantal point processes. In: Akemann, G., Baik, J., Francesco, P.D. (eds.) The Oxford Handbook of Random Matrix Theory, pp. 231–249. Oxford University Press, Oxford (2011)
8.
Zurück zum Zitat Chen, W., Song, Y., Bai, H., Lin, C., Chang, E.: Parallel spectral clustering in distributed systems. TPAMI 33(3), 568–586 (2011)CrossRef Chen, W., Song, Y., Bai, H., Lin, C., Chang, E.: Parallel spectral clustering in distributed systems. TPAMI 33(3), 568–586 (2011)CrossRef
9.
Zurück zum Zitat Deshpande, A., Rademacher, L., Vempala, S., Wang, G.: Matrix approximation and projective clustering via volume sampling. In: SODA (2006) Deshpande, A., Rademacher, L., Vempala, S., Wang, G.: Matrix approximation and projective clustering via volume sampling. In: SODA (2006)
10.
Zurück zum Zitat Deshpande, A., Vempala, S.S.: Adaptive sampling and fast low-rank matrix approximation. In: Díaz, J., Jansen, K., Rolim, J.D.P., Zwick, U. (eds.) APPROX 2006 and RANDOM 2006. LNCS, vol. 4110, pp. 292–303. Springer, Heidelberg (2006) CrossRef Deshpande, A., Vempala, S.S.: Adaptive sampling and fast low-rank matrix approximation. In: Díaz, J., Jansen, K., Rolim, J.D.P., Zwick, U. (eds.) APPROX 2006 and RANDOM 2006. LNCS, vol. 4110, pp. 292–303. Springer, Heidelberg (2006) CrossRef
11.
Zurück zum Zitat Drineas, P., Mahoney, M.W.: On the nyström method for approximating a gram matrix for improved kernel-based learning. JMLR 6, 2153–2175 (2005)MATHMathSciNet Drineas, P., Mahoney, M.W.: On the nyström method for approximating a gram matrix for improved kernel-based learning. JMLR 6, 2153–2175 (2005)MATHMathSciNet
12.
Zurück zum Zitat Farahat, A., Ghodsi, A., Kamel, M.: A novel greedy algorithm for nyström approximation. In: IASTATS (2011) Farahat, A., Ghodsi, A., Kamel, M.: A novel greedy algorithm for nyström approximation. In: IASTATS (2011)
13.
Zurück zum Zitat Fowlkes, C., Belongie, S., Chung, F., Malik, J.: Spectral grouping using the nyström method. TPAMI 26(2), 214–225 (2004)CrossRef Fowlkes, C., Belongie, S., Chung, F., Malik, J.: Spectral grouping using the nyström method. TPAMI 26(2), 214–225 (2004)CrossRef
14.
Zurück zum Zitat Hough, J., Krishnapur, M., Peres, Y., Virág, B.: Determinantal processes and independence. Probab. Surv. 3, 206–229 (2006)MATHMathSciNetCrossRef Hough, J., Krishnapur, M., Peres, Y., Virág, B.: Determinantal processes and independence. Probab. Surv. 3, 206–229 (2006)MATHMathSciNetCrossRef
15.
Zurück zum Zitat Kulesza, A., Taskar, B.: Structured determinantal point processes. In: NIPS (2010) Kulesza, A., Taskar, B.: Structured determinantal point processes. In: NIPS (2010)
16.
Zurück zum Zitat Kumar, S., Mohri, M., Talwalkar, A.: Sampling techniques for the nystrom method. J. Mach. Learn. Res. 13, 981–1006 (2012)MATHMathSciNet Kumar, S., Mohri, M., Talwalkar, A.: Sampling techniques for the nystrom method. J. Mach. Learn. Res. 13, 981–1006 (2012)MATHMathSciNet
17.
Zurück zum Zitat LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef
18.
Zurück zum Zitat Talwalkar, A., Kumar, S., Rowley, H.: Large-scale manifold learning. In: CVPR (2008) Talwalkar, A., Kumar, S., Rowley, H.: Large-scale manifold learning. In: CVPR (2008)
19.
Zurück zum Zitat Tenenbaum, J., Silva, V., Langford, J.: A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)CrossRef Tenenbaum, J., Silva, V., Langford, J.: A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)CrossRef
20.
Zurück zum Zitat Williams, C.: On a connection between kernel PCA and metric multidimensional scaling. In: NIPS, pp. 675–681 (2001) Williams, C.: On a connection between kernel PCA and metric multidimensional scaling. In: NIPS, pp. 675–681 (2001)
21.
Zurück zum Zitat Williams, C., Seeger, M.: Using the nyström method to speed up kernel machines. In: NIPS, pp. 682–688 (2001) Williams, C., Seeger, M.: Using the nyström method to speed up kernel machines. In: NIPS, pp. 682–688 (2001)
22.
Zurück zum Zitat Zhang, K., Tsang, I., Kwok, J.: Improved nyström low-rank approximation and error analysis. In: ICML, pp. 1232–1239. ACM (2008) Zhang, K., Tsang, I., Kwok, J.: Improved nyström low-rank approximation and error analysis. In: ICML, pp. 1232–1239. ACM (2008)
Metadaten
Titel
Sampling from Determinantal Point Processes for Scalable Manifold Learning
verfasst von
Christian Wachinger
Polina Golland
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-19992-4_54

Premium Partner