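Google has been updating PR quite often lately. Many people see that their own PR has not changed and don't know why. Today I came across something Matt Cutts wrote about the PR computation model, so I'm reposting it here for everyone to share and learn from. My English is limited and a translation from me would only mislead, so it's better to read the original!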
People think about PageRank in lots of different ways. People have compared PageRank to a "random surfer" model in which PageRank is the probability that a random surfer clicking on links lands on a page. Other people think of the web as a link matrix in which the value at position (i,j) indicates the presence of links from page i to page j. In that case, PageRank corresponds to the principal eigenvector of that normalized link matrix.
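To make the two views concrete, here is a minimal sketch of "classic PageRank" computed by power iteration over a tiny link matrix. The three-page graph, the function name `pagerank`, the damping factor of 0.85 and the handling of pages with no outlinks are all assumptions chosen for illustration; none of them come from the post, and this is not how Google computes anything today.

```python
def pagerank(links, damping=0.85, iterations=100):
    """Approximate classic PageRank by power iteration.

    links[i][j] is 1 if page i links to page j, else 0.
    damping=0.85 is an assumed value, not taken from the post.
    """
    n = len(links)
    rank = [1.0 / n] * n  # start the random surfer on a uniformly random page
    for _ in range(iterations):
        # With probability (1 - damping) the surfer jumps to a random page.
        new_rank = [(1.0 - damping) / n] * n
        for i, row in enumerate(links):
            out_degree = sum(row)
            if out_degree == 0:
                # Assumption: a page with no outlinks spreads its rank evenly.
                share = damping * rank[i] / n
                for j in range(n):
                    new_rank[j] += share
            else:
                # Normalizing by out_degree splits rank evenly across outlinks.
                share = damping * rank[i] / out_degree
                for j, has_link in enumerate(row):
                    if has_link:
                        new_rank[j] += share
        rank = new_rank
    return rank


# Hypothetical three-page web: 0 -> 1, 0 -> 2, 1 -> 2, 2 -> 0
links = [
    [0, 1, 1],
    [0, 0, 1],
    [1, 0, 0],
]
ranks = pagerank(links)
print(ranks)
```

The resulting vector sums to 1, so each entry can be read as the chance that the random surfer is on that page. Repeating the update until it stops changing is one standard way to approximate the principal eigenvector of the normalized (and here damped) link matrix.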
Disclaimer: Even when I joined the company in 2000, Google was doing more sophisticated link computation than you would observe from the classic PageRank papers. If you believe that Google stopped innovating in link analysis, that's a flawed assumption. Although we still refer to it as PageRank, Google's ability to compute reputation based on links has advanced considerably over the years. I'll do the rest of my blog post in the framework of "classic PageRank" but bear in mind that it's not a perfect analogy.
Probably the most popular way to envision PageRank is as a flow that happens between documents across outlinks. In a recent talk at WordCamp I showed an image from one of the original PageRank papers:
