Google PageRanking problem: The model and the analysis

https://doi.org/10.1016/j.cam.2010.02.005Get rights and content
Under an Elsevier user license
open archive

Abstract

The spectral and Jordan structures of the Web hyperlink matrix G(c)=cG+(1c)evT have been analyzed when G is the basic (stochastic) Google matrix, c is a real parameter such that 0<c<1, v is a nonnegative probability vector, and e is the all-ones vector. Typical studies have relied heavily on special properties of nonnegative, positive, and stochastic matrices. There is a unique nonnegative vector y(c) such that y(c)TG(c)=y(c)T and y(c)Te=1. This PageRank vector y(c) can be computed effectively by the power method.

We consider a square complex matrix A and nonzero complex vectors x and v such that Ax=λx and vx=1. We use standard matrix analytic tools to determine the eigenvalues, the Jordan blocks, and a distinguished left λ-eigenvector of A(c)=cA+(1c)λxv as a function of a complex variable c. If λ is a semisimple eigenvalue of A, there is a uniquely determined projection N such that limc1y(c)=Nv for all v; this limit may fail to exist for some v if λ is not semisimple. As a special case of our results, we obtain a complex analog of PageRank for the Web hyperlink matrix G(c) with a complex parameter c. We study regularity, limits, expansions, and conditioning of y(c) and we propose algorithms (e.g., complex extrapolation, power method on a modified matrix etc.) that may provide an efficient way to compute PageRank also with c close or equal to 1. An interpretation of the limit vector Nv and a related critical discussion on the model, on its adherence to reality, and possible ways for its improvement, represent the contribution of the paper on modeling issues.

MSC

65F10
65F15
65Y20
15A18
15A21
15A51

Keywords

Google matrix
PageRanking
Surfing model
Rank-one perturbation
Brauer’s Theorem
Jordan canonical form
Principle of biorthogonality
Extrapolation formulae

Cited by (0)

This material was partly presented at the Seminar “Web Information Retrieval and Linear Algebra Algorithms”, Schloss Dagstuhl–Wadern (Germany), February 2007, see Serra-Capizzano (2007) [19], and at the Workshop “Numerical Linear Algebra in Internet Algorithms”, Monopoli - Bari (Italy), September 2007. The resulting paper could be viewed as two parts: The part regarding the model is new and that regarding the analysis is based for a large part on a joint work with Roger Horn (2008) [17] and on Serra-Capizzano (2005) [7].