Skip to main content

2022 | OriginalPaper | Buchkapitel

A Simple Extension of the Bag-of-Paths Model Weighting Path Lengths by a Poisson Distribution

verfasst von : Sylvain Courtain, Marco Saerens

Erschienen in: Complex Networks & Their Applications X

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This work extends the bag-of-paths model [14] by introducing a weighting of the length of the paths in the network, provided by a Poisson probability distribution. The main advantage of this approach is that it allows to tune the mean path length parameter which is most relevant for the application at hand. Various quantities of interest, such as the probability of drawing a path from the bag of paths, or the join probability of sampling any path connecting two nodes of interest, can easily be computed in closed form from this model. In this context, a new distance measure between nodes of a network, considering a weighting factor on the length of the paths, is defined. Experiments on semi-supervised classification tasks show that the introduced distance measure provides competitive results compared to other state-of-the-art methods. Moreover, a new interpretation of the logarithmic communicability similarity measure [17] is proposed in terms of the new model.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
If the graph G is undirected, each of its undirected edge \(i \leftrightarrow j\) is considered as a set of two different directed edges going in opposite directions, \(i \rightarrow j\) and \(j \rightarrow i\).
 
2
As, for instance, a simple bag-of-words model in information retrieval or a naive Bayes classification model in data science.
 
3
In general, this matrix is non-symmetric.
 
4
The communicability kernel is also related to the exponential diffusion kernel [22].
 
5
The communicability kernel has also been investigated, but we have not included its results as these are worse compared to its logarithmic version.
 
6
We use the LIBSVM library [3] with the options ‘-s 0’ and ‘-t 4’.
 
7
Note that the kernels obtained through this transformation are already centered.
 
8
We apply the centering transformation \(\mathbf {K}=\mathbf {H}\mathbf {K}\mathbf {H}\) to the Gaussian kernel where \(\mathbf {H} = \mathbf {I} - \mathbf {e} \mathbf {e}^{\mathrm {T}}/n\) is the centering matrix, \(\mathbf {e}\) is a column vector full of 1’s, \(\mathbf {I}\) is the identity matrix and n is the number of nodes.
 
Literatur
1.
Zurück zum Zitat Akamatsu, T.: Cyclic flows, Markov process and stochastic traffic assignment. Transp. Res. B 30(5), 369–386 (1996)CrossRef Akamatsu, T.: Cyclic flows, Markov process and stochastic traffic assignment. Transp. Res. B 30(5), 369–386 (1996)CrossRef
3.
Zurück zum Zitat Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2, 27:1–27:27 (2011) Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2, 27:1–27:27 (2011)
4.
Zurück zum Zitat Chebotarev, P., Shamis, E.: The matrix-forest theorem and measuring relations in small social groups. Autom. Remote Control 58(9), 1505–1514 (1997)MATH Chebotarev, P., Shamis, E.: The matrix-forest theorem and measuring relations in small social groups. Autom. Remote Control 58(9), 1505–1514 (1997)MATH
5.
Zurück zum Zitat Chebotarev, P.: A class of graph-geodetic distances generalizing the shortest-path and the resistance distances. Discret. Appl. Math. 159(5), 295–302 (2011)MathSciNetCrossRefMATH Chebotarev, P.: A class of graph-geodetic distances generalizing the shortest-path and the resistance distances. Discret. Appl. Math. 159(5), 295–302 (2011)MathSciNetCrossRefMATH
6.
Zurück zum Zitat Consul, P.C.: Generalized Poisson Distributions: Properties and Applications. Marcel Dekker (1989) Consul, P.C.: Generalized Poisson Distributions: Properties and Applications. Marcel Dekker (1989)
7.
Zurück zum Zitat Courtain, S., Guex, G., Kivimaki, I., Saerens, M.: Relative entropy-regularized optimal transport on a graph: a new algorithm and an experimental comparison. ArXiv preprint arXiv:0912.0238v9 (2021) Courtain, S., Guex, G., Kivimaki, I., Saerens, M.: Relative entropy-regularized optimal transport on a graph: a new algorithm and an experimental comparison. ArXiv preprint arXiv:​0912.​0238v9 (2021)
8.
Zurück zum Zitat Courtain, S., Leleux, P., Kivimaki, I., Guex, G., Saerens, M.: Randomized shortest paths with net flows and capacity constraints. Inf. Sci. 556, 341–360 (2020)MathSciNetCrossRefMATH Courtain, S., Leleux, P., Kivimaki, I., Guex, G., Saerens, M.: Randomized shortest paths with net flows and capacity constraints. Inf. Sci. 556, 341–360 (2020)MathSciNetCrossRefMATH
10.
Zurück zum Zitat Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetMATH Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)MathSciNetMATH
11.
Zurück zum Zitat Devooght, R., Mantrach, A., Kivimäki, I., Bersini, H., Jaimes, A., Saerens, M.: Random walks based modularity: application to semi-supervised learning. In: Proceedings of the 23rd International World Wide Web Conference (WWW 2014), pp. 213–224 (2014) Devooght, R., Mantrach, A., Kivimäki, I., Bersini, H., Jaimes, A., Saerens, M.: Random walks based modularity: application to semi-supervised learning. In: Proceedings of the 23rd International World Wide Web Conference (WWW 2014), pp. 213–224 (2014)
13.
Zurück zum Zitat Fouss, F., Pirotte, A., Renders, J.M., Saerens, M.: Random-walk computation of similarities between nodes of a graph, with application to collaborative recommendation. IEEE Trans. Knowl. Data Eng. 19(3), 355–369 (2007)CrossRef Fouss, F., Pirotte, A., Renders, J.M., Saerens, M.: Random-walk computation of similarities between nodes of a graph, with application to collaborative recommendation. IEEE Trans. Knowl. Data Eng. 19(3), 355–369 (2007)CrossRef
14.
Zurück zum Zitat Francoisse, K., Kivimki, I., Mantrach, A., Rossi, F., Saerens, M.: A bag-of-paths framework for network data analysis. Neural Netw. 90, 90–111 (2017)CrossRefMATH Francoisse, K., Kivimki, I., Mantrach, A., Rossi, F., Saerens, M.: A bag-of-paths framework for network data analysis. Neural Netw. 90, 90–111 (2017)CrossRefMATH
15.
Zurück zum Zitat Guex, G., Courtain, S., Saerens, M.: Covariance and correlation Kernels on a graph in the generalized bag-of-paths formalism. J. Complex Netw. 8(6), 1–46 (2021) Guex, G., Courtain, S., Saerens, M.: Covariance and correlation Kernels on a graph in the generalized bag-of-paths formalism. J. Complex Netw. 8(6), 1–46 (2021)
16.
Zurück zum Zitat Guex, G., Kivimäki, I., Saerens, M.: Randomized optimal transport on a graph: framework and new distance measures. Netw. Sci. 7(1), 88–122 (2019)CrossRef Guex, G., Kivimäki, I., Saerens, M.: Randomized optimal transport on a graph: framework and new distance measures. Netw. Sci. 7(1), 88–122 (2019)CrossRef
19.
Zurück zum Zitat Kivimäki, I., Lebichot, B., Saramäki, J., Saerens, M.: Two betweenness centrality measures based on randomized shortest paths. Sci. Rep. 6(1), 1–15 (2016)CrossRef Kivimäki, I., Lebichot, B., Saramäki, J., Saerens, M.: Two betweenness centrality measures based on randomized shortest paths. Sci. Rep. 6(1), 1–15 (2016)CrossRef
20.
Zurück zum Zitat Kivimäki, I., Shimbo, M., Saerens, M.: Developments in the theory of randomized shortest paths with a comparison of graph node distances. Physica A 393, 600–616 (2014)CrossRefMATH Kivimäki, I., Shimbo, M., Saerens, M.: Developments in the theory of randomized shortest paths with a comparison of graph node distances. Physica A 393, 600–616 (2014)CrossRefMATH
22.
Zurück zum Zitat Kondor, R.I., Lafferty, J.: Diffusion kernels on graphs and other discrete structures. In: Proceedings of the 19th International Conference on Machine Learning (ICML 2002), pp. 315–322 (2002) Kondor, R.I., Lafferty, J.: Diffusion kernels on graphs and other discrete structures. In: Proceedings of the 19th International Conference on Machine Learning (ICML 2002), pp. 315–322 (2002)
24.
Zurück zum Zitat Leleux, P., Courtain, S., Françoisse, K., Saerens, M.: Design of biased random walks on a graph with application to collaborative recommendation. Submitted for publication (2020) Leleux, P., Courtain, S., Françoisse, K., Saerens, M.: Design of biased random walks on a graph with application to collaborative recommendation. Submitted for publication (2020)
25.
Zurück zum Zitat Lichman, M.: UCI machine learning repository (2013) Lichman, M.: UCI machine learning repository (2013)
26.
Zurück zum Zitat Macskassy, S.A., Provost, F.: Classification in networked data: a toolkit and a univariate case study. J. Mach. Learn. Res. 8, 935–983 (2007) Macskassy, S.A., Provost, F.: Classification in networked data: a toolkit and a univariate case study. J. Mach. Learn. Res. 8, 935–983 (2007)
27.
Zurück zum Zitat Saerens, M., Achbany, Y., Fouss, F., Yen, L.: Randomized shortest-path problems: two related models. Neural Comput. 21(8), 2363–2404 (2009)MathSciNetCrossRefMATH Saerens, M., Achbany, Y., Fouss, F., Yen, L.: Randomized shortest-path problems: two related models. Neural Comput. 21(8), 2363–2404 (2009)MathSciNetCrossRefMATH
28.
Zurück zum Zitat Schölkopf, B., Smola, A.: Learning with Kernels. MIT Press, Cambridge (2002) Schölkopf, B., Smola, A.: Learning with Kernels. MIT Press, Cambridge (2002)
31.
Zurück zum Zitat von Luxburg, U., Radl, A., Hein, M.: Getting lost in space: Large sample analysis of the commute distance. In: Advances in Neural Information Processing Systems: Proceedings of the NIPS 2010 Conference, vol. 23, pp. 2622–2630 (2010) von Luxburg, U., Radl, A., Hein, M.: Getting lost in space: Large sample analysis of the commute distance. In: Advances in Neural Information Processing Systems: Proceedings of the NIPS 2010 Conference, vol. 23, pp. 2622–2630 (2010)
32.
Zurück zum Zitat Yen, L., Fouss, F., Decaestecker, C., Francq, P., Saerens, M.: Graph nodes clustering with the sigmoid commute-time kernel: a comparative study. Data Knowl. Eng. 68(3), 338–361 (2009)CrossRef Yen, L., Fouss, F., Decaestecker, C., Francq, P., Saerens, M.: Graph nodes clustering with the sigmoid commute-time kernel: a comparative study. Data Knowl. Eng. 68(3), 338–361 (2009)CrossRef
33.
Zurück zum Zitat Yen, L., Mantrach, A., Shimbo, M., Saerens, M.: A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2008), pp. 785–793 (2008) Yen, L., Mantrach, A., Shimbo, M., Saerens, M.: A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2008), pp. 785–793 (2008)
Metadaten
Titel
A Simple Extension of the Bag-of-Paths Model Weighting Path Lengths by a Poisson Distribution
verfasst von
Sylvain Courtain
Marco Saerens
Copyright-Jahr
2022
DOI
https://doi.org/10.1007/978-3-030-93409-5_19

Premium Partner