Skip to main content
Erschienen in: GeoInformatica 4/2022

12.04.2022

Multi-type clustering using regularized tensor decomposition

verfasst von: Charlotte L. Ellison, William R. Fields

Erschienen in: GeoInformatica | Ausgabe 4/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Geospatial analytics increasingly rely on data fusion methods to extract patterns from data; however robust results are difficult to achieve because of the need for spatial and temporal regularization and latent structures within data. Tensor decomposition is a promising approach because it can accommodate multidimensional structure of data (e.g., trajectory information about users, locations, and time periods). To address these challenges, we introduce Multi-Type Clustering using Regularized tensor Decomposition (MCRD), an innovative method for data analysis that provides insight not just about groupings within data types (e.g., clusters of users), but also about the interactions between data types (e.g., clusters of users and locations) in the latent features of complex multi-type datasets. This is done by combining two innovations. First, a tensor representing spatiotemporal data is decomposed using a novel regularization method to account for structure within the data. Next, within- and cross-type groups are found through the application of novel hypergraph community detection methods to the decomposed results. Experimentation on both synthetic and real trajectory data demonstrates MCRD’s capacity to reveal the within- and cross-type grouping in data, and MCRD outperforms related methods including tensor decomposition without regularization, unfolding of tensors, Laplacian regularization, and tensor block models. The robust and versatile analysis provided by combining new regularization and clustering techniques outlined in this paper likely have utility in geospatial analytics beyond the movement applications explicitly studied.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Because it is an internal clustering index, the CH criterion is not as meaningful for comparing the results of clustering elements based on factor matrices with different number of factors.
 
2
Several MATLAB packages for basic tensor operations are used [2, 3, 33].
 
4
The code used for this can be found at https://​github.​com/​ike002jp/​npartite.
 
Literatur
1.
Zurück zum Zitat Acar E, Kolda TG, Dunlavy DM (2011) All-at-once optimization for coupled matrix and tensor factorizations. arXiv:1105.3422 Acar E, Kolda TG, Dunlavy DM (2011) All-at-once optimization for coupled matrix and tensor factorizations. arXiv:1105.​3422
4.
Zurück zum Zitat Battaglino C, Ballard G, Kolda TG (2018) A practical randomized cp tensor decomposition. SIAM J Matrix Anal Appl 39(2):876–901MathSciNetCrossRefMATH Battaglino C, Ballard G, Kolda TG (2018) A practical randomized cp tensor decomposition. SIAM J Matrix Anal Appl 39(2):876–901MathSciNetCrossRefMATH
5.
Zurück zum Zitat Caliński T, Harabasz J (1974) A dendrite method for cluster analysis. Communications in Statistics-theory and Methods 3(1):1–27MathSciNetCrossRefMATH Caliński T, Harabasz J (1974) A dendrite method for cluster analysis. Communications in Statistics-theory and Methods 3(1):1–27MathSciNetCrossRefMATH
6.
Zurück zum Zitat Castro PS, Zhang D, Chen C, Li S, Pan G (2013) From taxi gps traces to social and community dynamics: a survey. ACM Computing Surveys (CSUR) 46(2):1–34CrossRef Castro PS, Zhang D, Chen C, Li S, Pan G (2013) From taxi gps traces to social and community dynamics: a survey. ACM Computing Surveys (CSUR) 46(2):1–34CrossRef
7.
Zurück zum Zitat Chi EC, Gaines BR, Sun WW, Zhou H, Yang J (2018) Provable convex co-clustering of tensors. arXiv:1803.06518 Chi EC, Gaines BR, Sun WW, Zhou H, Yang J (2018) Provable convex co-clustering of tensors. arXiv:1803.​06518
8.
Zurück zum Zitat Comon P, Luciani X, De Almeida AL (2009) Tensor decompositions, alternating least squares and other tales. J Chemom: A J Chemom Soc 23(7-8):393–405CrossRef Comon P, Luciani X, De Almeida AL (2009) Tensor decompositions, alternating least squares and other tales. J Chemom: A J Chemom Soc 23(7-8):393–405CrossRef
9.
Zurück zum Zitat Danon L, Diaz-Guilera A, Duch J, Arenas A (2005) Comparing community structure identification. J Stat Mech: Theory and Exp 2005(09):P09008CrossRef Danon L, Diaz-Guilera A, Duch J, Arenas A (2005) Comparing community structure identification. J Stat Mech: Theory and Exp 2005(09):P09008CrossRef
10.
Zurück zum Zitat Gauvin L, Panisson A, Cattuto C (2014) Detecting the community structure and activity patterns of temporal networks: a non-negative tensor factorization approach. PloS One 9(1):e86028CrossRef Gauvin L, Panisson A, Cattuto C (2014) Detecting the community structure and activity patterns of temporal networks: a non-negative tensor factorization approach. PloS One 9(1):e86028CrossRef
11.
Zurück zum Zitat Grauwin S, Sobolevsky S, Moritz S, Gódor I, Ratti C (2015) Towards a comparative science of cities: Using mobile traffic records in new york, london, and hong kong. In: Computational approaches for urban environments, Springer, pp 363–387 Grauwin S, Sobolevsky S, Moritz S, Gódor I, Ratti C (2015) Towards a comparative science of cities: Using mobile traffic records in new york, london, and hong kong. In: Computational approaches for urban environments, Springer, pp 363–387
12.
Zurück zum Zitat Haass MJ, Van Benthem MH, Ochoa EM (2014) Tensor analysis methods for activity characterization in spatiotemporal data. Sandia Tech Report SAND2014–1825 Haass MJ, Van Benthem MH, Ochoa EM (2014) Tensor analysis methods for activity characterization in spatiotemporal data. Sandia Tech Report SAND2014–1825
13.
14.
Zurück zum Zitat Ikematsu K, Murata T (2013) A fast method for detecting communities from tripartite networks. In: Int conferen on soc inform, Springer, pp 192–205 Ikematsu K, Murata T (2013) A fast method for detecting communities from tripartite networks. In: Int conferen on soc inform, Springer, pp 192–205
15.
Zurück zum Zitat Ioannidis VN, Zamzam AS, Giannakis GB, Sidiropoulos ND (2018) Coupled graphs and tensor factorization for recommender systems and community detection. arXiv:1809.08353 Ioannidis VN, Zamzam AS, Giannakis GB, Sidiropoulos ND (2018) Coupled graphs and tensor factorization for recommender systems and community detection. arXiv:1809.​08353
17.
Zurück zum Zitat Lee DD, Seung HS (2001) Algorithms for non-negative matrix factorization. pp 556–562 Lee DD, Seung HS (2001) Algorithms for non-negative matrix factorization. pp 556–562
18.
Zurück zum Zitat Li X, Li M, Gong YJ, Zhang XL, Yin J (2016) T-desp: Destination prediction based on big trajectory data. IEEE Transactions on Intell Transp Syst 17(8):2344–2354CrossRef Li X, Li M, Gong YJ, Zhang XL, Yin J (2016) T-desp: Destination prediction based on big trajectory data. IEEE Transactions on Intell Transp Syst 17(8):2344–2354CrossRef
19.
Zurück zum Zitat Lin YR, Sun J, Castro P, Konuru R, Sundaram H, Kelliher A (2009) Metafac: community discovery via relational hypergraph factorization. In: Proc of the 15th ACM SIGKDD int conferen on knowl discov and data min, ACM, pp 527–536 Lin YR, Sun J, Castro P, Konuru R, Sundaram H, Kelliher A (2009) Metafac: community discovery via relational hypergraph factorization. In: Proc of the 15th ACM SIGKDD int conferen on knowl discov and data min, ACM, pp 527–536
20.
Zurück zum Zitat Liu JX, Wang D, Gao YL, Zheng CH, Xu Y, Yu J (2017) Regularized non-negative matrix factorization for identifying differentially expressed genes and clustering samples: a survey. IEEE/ACM Trans on Computl Biolog and Bioinform 15(3):974–987CrossRef Liu JX, Wang D, Gao YL, Zheng CH, Xu Y, Yu J (2017) Regularized non-negative matrix factorization for identifying differentially expressed genes and clustering samples: a survey. IEEE/ACM Trans on Computl Biolog and Bioinform 15(3):974–987CrossRef
21.
Zurück zum Zitat Liu L, Andris C, Ratti C (2010) Uncovering cabdrivers’ behavior patterns from their digital traces. Comput Environ Urban Syst 34(6):541–548CrossRef Liu L, Andris C, Ratti C (2010) Uncovering cabdrivers’ behavior patterns from their digital traces. Comput Environ Urban Syst 34(6):541–548CrossRef
22.
Zurück zum Zitat Liu Y, Li Z, Xiong H, Gao X, Wu J (2010) Understanding of internal clustering validation measures. In: 2010 IEEE International conference on data mining, IEEE, pp 911–916 Liu Y, Li Z, Xiong H, Gao X, Wu J (2010) Understanding of internal clustering validation measures. In: 2010 IEEE International conference on data mining, IEEE, pp 911–916
23.
Zurück zum Zitat Moreira-Matias L, Gama J, Ferreira M, Mendes-Moreira J, Damas L (2016) Time-evolving od matrix estimation using high-speed gps data streams. Expert Systems with Applications 44:275–288CrossRef Moreira-Matias L, Gama J, Ferreira M, Mendes-Moreira J, Damas L (2016) Time-evolving od matrix estimation using high-speed gps data streams. Expert Systems with Applications 44:275–288CrossRef
25.
Zurück zum Zitat Murtagh F (1983) A survey of recent advances in hierarchical clustering algorithms. The Comput J 26(4):354–359CrossRefMATH Murtagh F (1983) A survey of recent advances in hierarchical clustering algorithms. The Comput J 26(4):354–359CrossRefMATH
26.
Zurück zum Zitat Narita A, Hayashi K, Tomioka R, Kashima H (2012) Tensor factorization using auxiliary information. Data Min and Knowl Discov 25(2):298–324MathSciNetCrossRefMATH Narita A, Hayashi K, Tomioka R, Kashima H (2012) Tensor factorization using auxiliary information. Data Min and Knowl Discov 25(2):298–324MathSciNetCrossRefMATH
27.
Zurück zum Zitat Neubauer N, Obermayer K (2010) Community detection in tagging-induced hypergraphs. In: Workshop on inform in netw. New York University NY, USA, pp 24–25 Neubauer N, Obermayer K (2010) Community detection in tagging-induced hypergraphs. In: Workshop on inform in netw. New York University NY, USA, pp 24–25
28.
Zurück zum Zitat Ouvrard X, Goff JL, Marchand-Maillet S (2017) Adjacency and tensor representation in general hypergraphs part 1: e-adjacency tensor uniformisation using homogeneous polynomials. arXiv:1712.08189 Ouvrard X, Goff JL, Marchand-Maillet S (2017) Adjacency and tensor representation in general hypergraphs part 1: e-adjacency tensor uniformisation using homogeneous polynomials. arXiv:1712.​08189
29.
Zurück zum Zitat Phithakkitnukoon S, Veloso M, Bento C, Biderman A, Ratti C (2010) Taxi-aware map: Identifying and predicting vacant taxis in the city. In: International joint conference on ambient intelligence, Springer, pp 86–95 Phithakkitnukoon S, Veloso M, Bento C, Biderman A, Ratti C (2010) Taxi-aware map: Identifying and predicting vacant taxis in the city. In: International joint conference on ambient intelligence, Springer, pp 86–95
30.
Zurück zum Zitat Shashua A, Hazan T (2005) Non-negative tensor factorization with applications to statistics and computer vision. In: Proc of the 22nd int conferen on mach learn, ACM, pp 792–799 Shashua A, Hazan T (2005) Non-negative tensor factorization with applications to statistics and computer vision. In: Proc of the 22nd int conferen on mach learn, ACM, pp 792–799
31.
Zurück zum Zitat Sun L, Axhausen KW (2016) Understanding urban mobility patterns with a probabilistic tensor factorization framework. Transp Res Part B: Methodol 91:511–524CrossRef Sun L, Axhausen KW (2016) Understanding urban mobility patterns with a probabilistic tensor factorization framework. Transp Res Part B: Methodol 91:511–524CrossRef
32.
Zurück zum Zitat Takeuchi K, Tomioka R, Ishiguro K, Kimura A, Sawada H (2013) Non-negative multiple tensor factorization. In: 2013 IEEE 13Th int conferen on data min, IEEE, pp 1199–1204 Takeuchi K, Tomioka R, Ishiguro K, Kimura A, Sawada H (2013) Non-negative multiple tensor factorization. In: 2013 IEEE 13Th int conferen on data min, IEEE, pp 1199–1204
34.
Zurück zum Zitat Wang M, Zeng Y (2019) Multiway clustering via tensor block models. In: Adv in neural inf process sys, pp 713–723 Wang M, Zeng Y (2019) Multiway clustering via tensor block models. In: Adv in neural inf process sys, pp 713–723
35.
Zurück zum Zitat Wang Y, Zheng Y, Xue Y (2014) Travel time estimation of a path using sparse trajectories. In: Proc of the 20th ACM SIGKDD int conferen on knowl discov and data min, ACM, pp 25–34 Wang Y, Zheng Y, Xue Y (2014) Travel time estimation of a path using sparse trajectories. In: Proc of the 20th ACM SIGKDD int conferen on knowl discov and data min, ACM, pp 25–34
36.
Zurück zum Zitat Wu R, Luo G, Jin Q, Shao J, Lu CT (2020) Learning evolving user’s behaviors on location-based social networks. GeoInformatica, pp 1–31 Wu R, Luo G, Jin Q, Shao J, Lu CT (2020) Learning evolving user’s behaviors on location-based social networks. GeoInformatica, pp 1–31
37.
Zurück zum Zitat Wu T, Benson AR, Gleich DF (2016) General tensor spectral co-clustering for higher-order data. In: Adv in neural inf process syst, pp 2559–2567 Wu T, Benson AR, Gleich DF (2016) General tensor spectral co-clustering for higher-order data. In: Adv in neural inf process syst, pp 2559–2567
38.
Zurück zum Zitat Xu Y, Yin W (2013) A block coordinate descent method for regularized multiconvex optimization with applications to nonnegative tensor factorization and completion. SIAM J on Imaging Sci 6(3):1758–1789MathSciNetCrossRefMATH Xu Y, Yin W (2013) A block coordinate descent method for regularized multiconvex optimization with applications to nonnegative tensor factorization and completion. SIAM J on Imaging Sci 6(3):1758–1789MathSciNetCrossRefMATH
39.
Zurück zum Zitat Yao L, Sheng QZ, Qin Y, Wang X, Shemshadi A, He Q (2015) Context-aware point-of-interest recommendation using tensor factorization with social regularization. In: Proc of the 38th int ACM SIGIR conferen on res and dev in inf retr, ACM, pp 1007–1010 Yao L, Sheng QZ, Qin Y, Wang X, Shemshadi A, He Q (2015) Context-aware point-of-interest recommendation using tensor factorization with social regularization. In: Proc of the 38th int ACM SIGIR conferen on res and dev in inf retr, ACM, pp 1007–1010
40.
Zurück zum Zitat Yılmaz KY, Cemgil AT, Simsekli U (2011) Generalised coupled tensor factorisation. In: Adv in neural inf process syst, pp 2151–2159 Yılmaz KY, Cemgil AT, Simsekli U (2011) Generalised coupled tensor factorisation. In: Adv in neural inf process syst, pp 2151–2159
41.
Zurück zum Zitat Zheng Y (2015) Trajectory data mining: an overview. ACM Trans on Intell Syst Technol (TIST) 6(3):29 Zheng Y (2015) Trajectory data mining: an overview. ACM Trans on Intell Syst Technol (TIST) 6(3):29
42.
Zurück zum Zitat Zheng Y, Liu T, Wang Y, Zhu Y, Liu Y, Chang E (2014) Diagnosing new york city’s noises with ubiquitous data. In: Proc of the 2014 ACM int jt conferen on pervasive and ubiquitous comput, ACM, pp 715–725 Zheng Y, Liu T, Wang Y, Zhu Y, Liu Y, Chang E (2014) Diagnosing new york city’s noises with ubiquitous data. In: Proc of the 2014 ACM int jt conferen on pervasive and ubiquitous comput, ACM, pp 715–725
43.
Zurück zum Zitat Zheng Y, Liu Y, Yuan J, Xie X (2011) Urban computing with taxicabs. In: Proceedings of the 13th international conference on Ubiquitous computing, pp 89–98 Zheng Y, Liu Y, Yuan J, Xie X (2011) Urban computing with taxicabs. In: Proceedings of the 13th international conference on Ubiquitous computing, pp 89–98
44.
Zurück zum Zitat Zheng Y, Zhou X (2011) Computing with spatial trajectories. Springer Science & Business Media Zheng Y, Zhou X (2011) Computing with spatial trajectories. Springer Science & Business Media
Metadaten
Titel
Multi-type clustering using regularized tensor decomposition
verfasst von
Charlotte L. Ellison
William R. Fields
Publikationsdatum
12.04.2022
Verlag
Springer US
Erschienen in
GeoInformatica / Ausgabe 4/2022
Print ISSN: 1384-6175
Elektronische ISSN: 1573-7624
DOI
https://doi.org/10.1007/s10707-021-00457-8

Weitere Artikel der Ausgabe 4/2022

GeoInformatica 4/2022 Zur Ausgabe