Abstract
We introduce a novel ranking of search results based on a variant of the h-index for directed information networks such as the Web. The h-index was originally introduced to measure an individual researcher’s scientific output and influence; here a variant of it is applied to assess the “importance” of web pages. As with PageRank, the “importance” of a page is defined by the “importance” of the pages linking to it. However, unlike PageRank, whose computation involves the whole web graph, the h-index for web pages (the hw-rank) is computed locally: only the neighbors of the neighbors of the given node are considered. Preliminary results show a strong correlation between rankings by hw-rank and by PageRank; moreover, the hw-rank is simpler to compute than PageRank. Further, larger-scale experiments are needed to assess the applicability of the method.
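The abstract does not spell out the exact definition of the hw-rank, so the following is only a minimal sketch under a stated assumption: a page has score h if h is the largest number such that at least h of the pages linking to it each have at least h inlinks themselves. The function names `h_index` and `hw_rank_sketch` and the edge-list input format are hypothetical choices for illustration, not taken from the article.

```python
def h_index(values):
    """Largest h such that at least h of the given values are >= h."""
    h = 0
    for rank, value in enumerate(sorted(values, reverse=True), start=1):
        if value >= rank:
            h = rank
        else:
            break
    return h


def hw_rank_sketch(edges):
    """Score every page in a directed graph given as (source, target) links.

    ASSUMPTION (not confirmed by the abstract): the score of a page is the
    h-index of the in-degrees of the pages that link to it. Only the page's
    in-neighbors and their in-neighbors are inspected, so the computation
    is local rather than global.
    """
    inlinks = {}
    for src, dst in edges:
        inlinks.setdefault(src, set())
        inlinks.setdefault(dst, set())
        if src != dst:  # ignore self-links
            inlinks[dst].add(src)
    return {
        page: h_index(len(inlinks[q]) for q in linkers)
        for page, linkers in inlinks.items()
    }


# Toy graph: p is linked from a, b and c; a and b each have two inlinks.
edges = [
    ("a", "p"), ("b", "p"), ("c", "p"),
    ("x", "a"), ("y", "a"),
    ("x", "b"), ("y", "b"),
]
scores = hw_rank_sketch(edges)
print(scores["p"])  # 2: two of p's linkers have at least two inlinks each
```

Under this reading, scoring one page touches only its in-neighbors and their in-degrees, whereas PageRank needs an iterative computation over the whole graph.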
Acknowledgments
The authors thank Chatzimichalis Konstantinos for carrying out the comparisons between the hw-index and PageRank while studying at Birkbeck, University of London.
Cite this article
Bar-Ilan, J., Levene, M. The hw-rank: an h-index variant for ranking web pages. Scientometrics 102, 2247–2253 (2015). https://doi.org/10.1007/s11192-014-1477-2
DOI: https://doi.org/10.1007/s11192-014-1477-2