Skip to main content
Top
Published in: Knowledge and Information Systems 1/2018

25-10-2017 | Regular Paper

Automatic feature selection for supervised learning in link prediction applications: a comparative study

Authors: Antonio Pecli, Maria Claudia Cavalcanti, Ronaldo Goldschmidt

Published in: Knowledge and Information Systems | Issue 1/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

For the last years, a considerable amount of attention has been devoted to the research about the link prediction (LP) problem in complex networks. This problem tries to predict the likelihood of an association between two not interconnected nodes in a network to appear in the future. One of the most important approaches to the LP problem is based on supervised machine learning (ML) techniques for classification. Although many works have presented promising results with this approach, choosing the set of features (variables) to train the classifiers is still a major challenge. In this article, we report on the effects of three different automatic variable selection strategies (Forward, Backward and Evolutionary) applied to the feature-based supervised learning approach in LP applications. The results of the experiments show that the use of these strategies does lead to better classification models than classifiers built with the complete set of variables. Such experiments were performed over three datasets (Microsoft Academic Network, Amazon and Flickr) that contained more than twenty different features each, including topological and domain-specific ones. We also describe the specification and implementation of the process used to support the experiments. It combines the use of the feature selection strategies, six different classification algorithms (SVM, K-NN, naïve Bayes, CART, random forest and multilayer perceptron) and three evaluation metrics (Precision, F-Measure and Area Under the Curve). Moreover, this process includes a novel ML voting committee inspired approach that suggests sets of features to represent data in LP applications. It mines the log of the experiments in order to identify sets of features frequently selected to produce classification models with high performance. The experiments showed interesting correlations between frequently selected features and datasets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Footnotes
1
Also known as feature extraction or feature engineering.
 
6
PredLig’s code is available for download at https://​github.​com/​alpecli/​predlig.
 
8
Bias is the set of characteristics that collectively influence the way an algorithm searches for hypotheses that separate the classes of a problem.
 
9
It is important to notice that we applied the Wilcoxon signed-ranks test 108 times independently. In each time, the test verified whether there was a statistical difference between two algorithms: a classification algorithm and a modified version of itself (the combination of the algorithm with a feature selection configuration).
 
10
Table 9 highlights in bold font the experiment executions associated with the 26 experiment configurations that revealed significant difference in the hypothesis test.
 
11
In fact, ES2 was the only FS configuration that significantly improved SVM’s performance.
 
Literature
1.
go back to reference Adafre SF, de Rijke M (2005) Discovering missing links in Wikipedia. In: Proceedings of the 3rd international workshop on Link discovery. ACM, pp 90–97 Adafre SF, de Rijke M (2005) Discovering missing links in Wikipedia. In: Proceedings of the 3rd international workshop on Link discovery. ACM, pp 90–97
2.
go back to reference Adamic LA, Adar E (2003) Friends and neighbors on the web. Soc Netw 25(3):211–230CrossRef Adamic LA, Adar E (2003) Friends and neighbors on the web. Soc Netw 25(3):211–230CrossRef
3.
go back to reference Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. In: VLDB’94, Proceedings of 20th international conference on very large data bases, Santiago de Chile, Chile, 12–15 September 1994, pp 487–499 Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. In: VLDB’94, Proceedings of 20th international conference on very large data bases, Santiago de Chile, Chile, 12–15 September 1994, pp 487–499
4.
go back to reference Aha D, Bankert R (1996) A comparative evaluation of sequential feature selection algorithms. In: Fisher D, Lenz H-J (eds) Learning from data, volume 112 of Lecture Notes in Statistics. Springer, New York, pp 199–206. doi:10.1007/978-1-4612-2404-4-19 Aha D, Bankert R (1996) A comparative evaluation of sequential feature selection algorithms. In: Fisher D, Lenz H-J (eds) Learning from data, volume 112 of Lecture Notes in Statistics. Springer, New York, pp 199–206. doi:10.​1007/​978-1-4612-2404-4-19
6.
go back to reference Airoldi EM, Blei DM, Fienberg SE, Xing EP (2008) Mixed membership stochastic blockmodels. J Mach Learn Res 9:1981–2014MATH Airoldi EM, Blei DM, Fienberg SE, Xing EP (2008) Mixed membership stochastic blockmodels. J Mach Learn Res 9:1981–2014MATH
7.
go back to reference Airoldi EM, Blei DM, Fienberg SE, Xing EP, Jaakkola T (2006) Mixed membership stochastic block models for relational data with application to protein–protein interactions. In: Proceedings of the international biometrics society annual meeting, pp 1–34 Airoldi EM, Blei DM, Fienberg SE, Xing EP, Jaakkola T (2006) Mixed membership stochastic block models for relational data with application to protein–protein interactions. In: Proceedings of the international biometrics society annual meeting, pp 1–34
8.
go back to reference Backstrom L, Leskovec J (2011) Supervised random walks: predicting and recommending links in social networks. In: Proceedings of the fourth ACM international conference on Web search and data mining WSDM ’11. ACM, New York, NY, USA, pp 635–644. doi:10.1145/1935826.1935914 Backstrom L, Leskovec J (2011) Supervised random walks: predicting and recommending links in social networks. In: Proceedings of the fourth ACM international conference on Web search and data mining WSDM ’11. ACM, New York, NY, USA, pp 635–644. doi:10.​1145/​1935826.​1935914
9.
go back to reference Barabasi AL, Jeong H, Neda Z, Ravasz E (2001) Evolution of the social network of scientific collaboration. Soc Netw 25:211–230 Barabasi AL, Jeong H, Neda Z, Ravasz E (2001) Evolution of the social network of scientific collaboration. Soc Netw 25:211–230
10.
go back to reference Batagelj V, Zaversnik M (2003) An O(m) algorithm for cores decomposition of networks. CoRR, cs.DS/0310049 Batagelj V, Zaversnik M (2003) An O(m) algorithm for cores decomposition of networks. CoRR, cs.DS/0310049
11.
go back to reference Benzi M, Estrada E, Klymko C (2012) Ranking hubs and authorities using matrix functions. CoRR Benzi M, Estrada E, Klymko C (2012) Ranking hubs and authorities using matrix functions. CoRR
13.
go back to reference Caruana R, Karampatziakis N, Yessenalina A (2008) An empirical evaluation of supervised learning in high dimensions. In: Proceedings of the twenty-fifth international conference machine learning (ICML 2008), Helsinki, Finland, 5–9 June 2008, pp. 96–103. doi:10.1145/1390156.1390169 Caruana R, Karampatziakis N, Yessenalina A (2008) An empirical evaluation of supervised learning in high dimensions. In: Proceedings of the twenty-fifth international conference machine learning (ICML 2008), Helsinki, Finland, 5–9 June 2008, pp. 96–103. doi:10.​1145/​1390156.​1390169
15.
go back to reference Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH
16.
go back to reference Dong Y, Tang J, Wu S, Tian J, Chawla NV, Rao J, Cao H (2012) Link prediction and recommendation across heterogeneous social networks. In: 2012 IEEE 12th international conference on data mining (ICDM). IEEE, pp 181–190 Dong Y, Tang J, Wu S, Tian J, Chawla NV, Rao J, Cao H (2012) Link prediction and recommendation across heterogeneous social networks. In: 2012 IEEE 12th international conference on data mining (ICDM). IEEE, pp 181–190
18.
go back to reference Estrada E (2011) The structure of complex networks: theory and applications. Oxford University Press, Inc., New YorkCrossRef Estrada E (2011) The structure of complex networks: theory and applications. Oxford University Press, Inc., New YorkCrossRef
19.
go back to reference Freeman LC (1978) Centrality in social networks conceptual clarification In: Social Networks, vol 1, Issue 3. Elsevier, Lausanne, pp 215–239 Freeman LC (1978) Centrality in social networks conceptual clarification In: Social Networks, vol 1, Issue 3. Elsevier, Lausanne, pp 215–239
20.
go back to reference Freitas AA (2002) Data mining and knowledge discovery with evolutionary algorithms. Springer-Verlag New York, Inc., SecaucusCrossRefMATH Freitas AA (2002) Data mining and knowledge discovery with evolutionary algorithms. Springer-Verlag New York, Inc., SecaucusCrossRefMATH
21.
go back to reference Freschi V (2009) A graph-based semi-supervised algorithm for protein function prediction from interaction maps. In: Third international conference learning and intelligent optimization, LION 3, Trento, Italy, 14–18 January 2009, Selected Papers, pp 249–258. doi:10.1007/978-3-642-11169-3-18 Freschi V (2009) A graph-based semi-supervised algorithm for protein function prediction from interaction maps. In: Third international conference learning and intelligent optimization, LION 3, Trento, Italy, 14–18 January 2009, Selected Papers, pp 249–258. doi:10.​1007/​978-3-642-11169-3-18
22.
go back to reference Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32:675–701CrossRefMATH Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32:675–701CrossRefMATH
23.
go back to reference Hagberg AA, Schult DA, Swart PJ (2008) Exploring network structure, dynamics, and function using networkX. In: Varoquaux G, Vaught T, Millman J (eds) Proceedings of the 7th Python in Science Conference. Pasadena, pp 11–15 Hagberg AA, Schult DA, Swart PJ (2008) Exploring network structure, dynamics, and function using networkX. In: Varoquaux G, Vaught T, Millman J (eds) Proceedings of the 7th Python in Science Conference. Pasadena, pp 11–15
24.
25.
go back to reference Hasan MA, Chaoji V, Salem S, Zaki M (2006) Link prediction using supervised learning. In: Proceedings of SDM 06 workshop on Link Analysis, Counterterrorism and Security Hasan MA, Chaoji V, Salem S, Zaki M (2006) Link prediction using supervised learning. In: Proceedings of SDM 06 workshop on Link Analysis, Counterterrorism and Security
26.
go back to reference Hsieh C-J, Chiang K-Y, Dhillon IS (2012) Low rank modeling of signed networks. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 507–515 Hsieh C-J, Chiang K-Y, Dhillon IS (2012) Low rank modeling of signed networks. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 507–515
27.
go back to reference Huang D, Meyn SP (2012) Feature selection for composite hypothesis testing with small samples: fundamental limits and algorithms. In: ICASSP. IEEE, pp 1917–1920 Huang D, Meyn SP (2012) Feature selection for composite hypothesis testing with small samples: fundamental limits and algorithms. In: ICASSP. IEEE, pp 1917–1920
28.
go back to reference Huang Z, Li X, Chen H (2005) Link prediction approach to collaborative filtering. In: ACM/IEEE Joint Conference on Digital Libraries, JCDL 2005, Denver, CO, USA, 7–11 June 2005, Proceedings, pp 141–142. doi:10.1145/1065385.1065415 Huang Z, Li X, Chen H (2005) Link prediction approach to collaborative filtering. In: ACM/IEEE Joint Conference on Digital Libraries, JCDL 2005, Denver, CO, USA, 7–11 June 2005, Proceedings, pp 141–142. doi:10.​1145/​1065385.​1065415
29.
go back to reference Jannach D, Zanker M, Felfernig A, Friedrich G (2010) Recommender systems: an introduction, 1st edn. Cambridge University Press, New YorkCrossRef Jannach D, Zanker M, Felfernig A, Friedrich G (2010) Recommender systems: an introduction, 1st edn. Cambridge University Press, New YorkCrossRef
30.
go back to reference Katz L (1953) A new status index derived from sociometric analysis. Psychometrika 18:39–43CrossRefMATH Katz L (1953) A new status index derived from sociometric analysis. Psychometrika 18:39–43CrossRefMATH
32.
go back to reference Krebs VE (2002) Mapping networks of terrorist cells. Connections 24:43–52 Krebs VE (2002) Mapping networks of terrorist cells. Connections 24:43–52
34.
go back to reference Lee J, Bengio S, Kim S, Lebanon G, Singer Y (2014) Local collaborative ranking. In: Proceedings of the 23rd international conference on World Wide Web WWW ’14. ACM, New York, NY, USA, pp 85–96. doi:10.1145/2566486.2567970 Lee J, Bengio S, Kim S, Lebanon G, Singer Y (2014) Local collaborative ranking. In: Proceedings of the 23rd international conference on World Wide Web WWW ’14. ACM, New York, NY, USA, pp 85–96. doi:10.​1145/​2566486.​2567970
38.
go back to reference Li X, Chen H (2009) Recommendation as link prediction: a graph kernel-based machine learning approach. In: Proceedings of the 2009 Joint International Conference on Digital Libraries, JCDL 2009, Austin, TX, USA, 15–19 June 2009, pp 213–216. doi:10.1145/1555400.1555433 Li X, Chen H (2009) Recommendation as link prediction: a graph kernel-based machine learning approach. In: Proceedings of the 2009 Joint International Conference on Digital Libraries, JCDL 2009, Austin, TX, USA, 15–19 June 2009, pp 213–216. doi:10.​1145/​1555400.​1555433
40.
go back to reference Lind PG, Gonzalez MC, Herrmann HJ (2005) Cycles and clustering in bipartite networks. Phys Rev E Stat Nonlin Soft Matter Phys 72(5 Pt 2):056127 Lind PG, Gonzalez MC, Herrmann HJ (2005) Cycles and clustering in bipartite networks. Phys Rev E Stat Nonlin Soft Matter Phys 72(5 Pt 2):056127
42.
go back to reference Lü L, Zhou T (2010) Link prediction in complex networks: a survey. Physica A 390(6):1150–1170CrossRef Lü L, Zhou T (2010) Link prediction in complex networks: a survey. Physica A 390(6):1150–1170CrossRef
43.
go back to reference Lü L, Zhou T (2010) Link prediction in weighted networks: the role of weak ties. EPL (Europhys Lett) 89:18001CrossRef Lü L, Zhou T (2010) Link prediction in weighted networks: the role of weak ties. EPL (Europhys Lett) 89:18001CrossRef
45.
go back to reference Menon A, Elkan C (2011) Link prediction via matrix factorization. In: Gunopulos D, Hofmann T, Malerba D, Vazirgiannis M (eds) Machine learning and knowledge discovery in databases, volume 6912 of Lecture Notes in Computer Science. Springer, Berlin, pp 437–452. doi:10.1007/978-3-642-23783-6-28 Menon A, Elkan C (2011) Link prediction via matrix factorization. In: Gunopulos D, Hofmann T, Malerba D, Vazirgiannis M (eds) Machine learning and knowledge discovery in databases, volume 6912 of Lecture Notes in Computer Science. Springer, Berlin, pp 437–452. doi:10.​1007/​978-3-642-23783-6-28
47.
48.
go back to reference Oyama S, Hayashi K, Kashima H (2011) Cross-temporal link prediction. In: IEEE 11th International Conference on Data Mining (ICDM). IEEE, Vancouver, pp 1188–1193 Oyama S, Hayashi K, Kashima H (2011) Cross-temporal link prediction. In: IEEE 11th International Conference on Data Mining (ICDM). IEEE, Vancouver, pp 1188–1193
49.
go back to reference Page L, Brin S, Motwani R, Winograd T (1999) The PageRank citation ranking: bringing order to the Web. Technical Report 1999-66 Stanford InfoLab. Previous number = SIDL-WP-1999-0120 Page L, Brin S, Motwani R, Winograd T (1999) The PageRank citation ranking: bringing order to the Web. Technical Report 1999-66 Stanford InfoLab. Previous number = SIDL-WP-1999-0120
50.
go back to reference Pecli A, Giovanini B, Pacheco CC, Moreira C, Ferreira F, Tosta F, Tesolin J, Dias MV, Filho S, Cavalcanti MC, Goldschmidt RR (2015) Dimensionality reduction for supervised learning in link prediction problems. In: ICEIS 2015—Proceedings of the 17th international conference on enterprise information systems, vol 1, Barcelona, Spain, 27–30 April 2015, pp 295–302 Pecli A, Giovanini B, Pacheco CC, Moreira C, Ferreira F, Tosta F, Tesolin J, Dias MV, Filho S, Cavalcanti MC, Goldschmidt RR (2015) Dimensionality reduction for supervised learning in link prediction problems. In: ICEIS 2015—Proceedings of the 17th international conference on enterprise information systems, vol 1, Barcelona, Spain, 27–30 April 2015, pp 295–302
51.
go back to reference Pedregosa F, Varoquaux G, Gramfort A, Thirion B, Grisel VM, Blondel O, Prettenhofer M, Weiss P, Dubourg R, Vanderplas V, Passos J, Cournapeau A, Brucher D, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830MathSciNetMATH Pedregosa F, Varoquaux G, Gramfort A, Thirion B, Grisel VM, Blondel O, Prettenhofer M, Weiss P, Dubourg R, Vanderplas V, Passos J, Cournapeau A, Brucher D, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830MathSciNetMATH
52.
go back to reference Pourebrahimi A, Shirazi B, Chamani T (2014) Improving link prediction in social network with population based metaheuristics algorithm. Int J Mechatron Electr Comput Technol 12: 1202–1213 Pourebrahimi A, Shirazi B, Chamani T (2014) Improving link prediction in social network with population based metaheuristics algorithm. Int J Mechatron Electr Comput Technol 12: 1202–1213
53.
go back to reference Raymond R, Kashima H (2010) Fast and scalable algorithms for semi-supervised link prediction on static and dynamic graphs. In: Proceedings of the 2010 European Conference on Machine Learning and Knowledge Discovery in Databases: Part III ECML PKDD’10. Springer, Berlin, pp 131–147 Raymond R, Kashima H (2010) Fast and scalable algorithms for semi-supervised link prediction on static and dynamic graphs. In: Proceedings of the 2010 European Conference on Machine Learning and Knowledge Discovery in Databases: Part III ECML PKDD’10. Springer, Berlin, pp 131–147
54.
go back to reference Rendle S, Freudenthaler C, Gantner Z, Schmidt-Thieme L (2009) Bpr: Bayesian personalized ranking from implicit feedback. In: Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence. AUAI Press, pp 452–461 Rendle S, Freudenthaler C, Gantner Z, Schmidt-Thieme L (2009) Bpr: Bayesian personalized ranking from implicit feedback. In: Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence. AUAI Press, pp 452–461
55.
go back to reference Rickmers AD, Todd HN (1967) Statistics: an introduction. McGraw-Hill, New YorkMATH Rickmers AD, Todd HN (1967) Statistics: an introduction. McGraw-Hill, New YorkMATH
56.
go back to reference Saramäki J, Kivelä M, Onnela J, Kaski K, Kertesz (2007) Generalizations of the clustering coefficient to weighted complex networks. Phys Rev E 75:027105 Saramäki J, Kivelä M, Onnela J, Kaski K, Kertesz (2007) Generalizations of the clustering coefficient to weighted complex networks. Phys Rev E 75:027105
57.
go back to reference Shi Y, Larson M, Hanjalic A (2010) List-wise learning to rank with matrix factorization for collaborative filtering. In: Proceedings of the fourth ACM Conference on Recommender Systems RecSys ’10. ACM, New York, NY, USA, pp 269–272. doi:10.1145/1864708.1864764 Shi Y, Larson M, Hanjalic A (2010) List-wise learning to rank with matrix factorization for collaborative filtering. In: Proceedings of the fourth ACM Conference on Recommender Systems RecSys ’10. ACM, New York, NY, USA, pp 269–272. doi:10.​1145/​1864708.​1864764
58.
go back to reference Song D, Meyer DA (2015) Recommending positive links in signed social networks by optimizing a generalized AUC. In: Twenty-ninth AAAI conference on artificial intelligence Song D, Meyer DA (2015) Recommending positive links in signed social networks by optimizing a generalized AUC. In: Twenty-ninth AAAI conference on artificial intelligence
59.
go back to reference Song D, Meyer DA, Tao D (2015) Efficient latent link recommendation in signed networks. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining KDD ’15. ACM, New York, NY, USA, pp 1105–1114. doi:10.1145/2783258.2783358 Song D, Meyer DA, Tao D (2015) Efficient latent link recommendation in signed networks. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining KDD ’15. ACM, New York, NY, USA, pp 1105–1114. doi:10.​1145/​2783258.​2783358
60.
go back to reference Souza G (2015) Recomendacao em Redes Sociais Baseada em Grafos. Technical Report S2729r Military Institute of Engineering Souza G (2015) Recomendacao em Redes Sociais Baseada em Grafos. Technical Report S2729r Military Institute of Engineering
64.
go back to reference Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evolut Comput 1:67–82CrossRef Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evolut Comput 1:67–82CrossRef
65.
go back to reference Wu S, Sun J, Tang J (2013) Patent partner recommendation in enterprise social networks. In: Sixth ACM international conference on Web Search and Data Mining, WSDM 2013, Rome, Italy, 4–8 February 2013, pp 43–52. doi:10.1145/2433396.2433404 Wu S, Sun J, Tang J (2013) Patent partner recommendation in enterprise social networks. In: Sixth ACM international conference on Web Search and Data Mining, WSDM 2013, Rome, Italy, 4–8 February 2013, pp 43–52. doi:10.​1145/​2433396.​2433404
66.
go back to reference Xu Y, Rockmore D (2012) Feature selection for link prediction. In: Proceedings of the 5th Ph.D. Workshop on Information and Knowledge. ACM, pp 25–32 Xu Y, Rockmore D (2012) Feature selection for link prediction. In: Proceedings of the 5th Ph.D. Workshop on Information and Knowledge. ACM, pp 25–32
67.
go back to reference Yang Y, Lichtenwalter RN, Chawla NV (2015) Evaluating link prediction methods. CoRR, abs/1505.04094 Yang Y, Lichtenwalter RN, Chawla NV (2015) Evaluating link prediction methods. CoRR, abs/1505.04094
68.
go back to reference Yu L, Liu H (2003) Feature selection for high-dimensional data: A fast correlation-based filter solution. In: Proceedings of the twentieth international conference machine learning (ICML 2003), 21–24 August 2003, Washington, DC, USA, pp 856–863 Yu L, Liu H (2003) Feature selection for high-dimensional data: A fast correlation-based filter solution. In: Proceedings of the twentieth international conference machine learning (ICML 2003), 21–24 August 2003, Washington, DC, USA, pp 856–863
69.
go back to reference Zhu J, Hong J, Hughes JG (2002) Using Markov models for web site link prediction. In: HYPERTEXT 2002, Proceedings of the 13th ACM conference on hypertext and hypermedia, 11–15 June 2002, University of Maryland, College Park, MD, USA, pp 169–170. doi:10.1145/513338.513381 Zhu J, Hong J, Hughes JG (2002) Using Markov models for web site link prediction. In: HYPERTEXT 2002, Proceedings of the 13th ACM conference on hypertext and hypermedia, 11–15 June 2002, University of Maryland, College Park, MD, USA, pp 169–170. doi:10.​1145/​513338.​513381
Metadata
Title
Automatic feature selection for supervised learning in link prediction applications: a comparative study
Authors
Antonio Pecli
Maria Claudia Cavalcanti
Ronaldo Goldschmidt
Publication date
25-10-2017
Publisher
Springer London
Published in
Knowledge and Information Systems / Issue 1/2018
Print ISSN: 0219-1377
Electronic ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-017-1121-6

Other articles of this Issue 1/2018

Knowledge and Information Systems 1/2018 Go to the issue

Premium Partner