PHP Classes

File: README

Recommend this page to a friend!
  Classes of Cesar D. Rodas   guaranix Rank   README   Download  
File: README
Role: Documentation
Content type: text/plain
Description: Read this!
Class: guaranix Rank
Implementation of the Google page rank algorithm
Author: By
Last change: New readme
Date: 17 years ago
Size: 4,490 bytes
 

Contents

Class file image Download
IMPORTANT!! FOR RUN THIS SCRIPT YOU MUST NEED TO CREATE A TABLE WITH: CREATE TABLE `pagerank` ( `master` int(11) NOT NULL default '0', `slave` int(11) NOT NULL default '0', KEY `m` (`master`), KEY `s` (`slave`) ); Master is a page that apoints to slave... If you wanna to modify your column or table name, you must change the mysql.php file. The Guaranix Rank is an implementation of the PageRank (The Larry Page and Sergei Brin Algorithm). Google the greater Web Search Engine uses the PageRank for search the better pages first. This algorithm could be used to calculate witch page is more important that anothers. On my test, on a Sempromp 1.8 GHZ 512 RAM, it took about one hour to calculate the PageRank of 1.000.000 Web Pages ( Just 60000 are Crawled and the others are just links undowloaded jet) From Wikipedia in English. Starting with an only page that is http://en.wikipedia.org/wiki/Linus_Torvalds. Here are the top of PageRank: * +----------+-----------------------------------------------------------+---------+ | posicion | url | pr | +----------+-----------------------------------------------------------+---------+ | 1 | http://en.wikipedia.org/wiki/Special:Upload | 66.1543 | | 2 | http://en.wikipedia.org/wiki/Wikipedia:General_disclaimer | 65.72 | | 3 | http://en.wikipedia.org/wiki/Wikipedia:Featured_articles | 64.5116 | | 4 | http://en.wikipedia.org/wiki/Wikipedia:Contact_us | 64.2818 | | 5 | http://en.wikipedia.org/wiki/Special:Recentchanges | 64.025 | | 6 | http://en.wikipedia.org/wiki/Help:Contents | 63.5814 | | 7 | http://en.wikipedia.org/wiki/Wikipedia:About | 63.4931 | | 8 | http://en.wikipedia.org/wiki/Wikipedia:Community_Portal | 63.4072 | | 9 | http://en.wikipedia.org/wiki/Special:Specialpages | 63.3888 | | 10 | http://en.wikipedia.org/wiki/Portal:Current_events | 62.964 | +----------+-----------------------------------------------------------+---------+ A comparation Between OS: * +----------+---------------------------------------------+----------+ | posicion | url | pr | +----------+---------------------------------------------+----------+ | 233 | http://en.wikipedia.org/wiki/Linux | 1.26486 | | 410 | http://en.wikipedia.org/wiki/Microsoft | 0.950259 | | 428 | http://en.wikipedia.org/wiki/Unix | 0.919783 | | 1853 | http://en.wikipedia.org/wiki/FreeBSD | 0.405697 | | 2235 | http://en.wikipedia.org/wiki/Category:Unix | 0.35416 | | 2665 | http://en.wikipedia.org/wiki/Mac_OS | 0.312736 | | 5680 | http://en.wikipedia.org/wiki/Category:Linux | 0.243818 | +----------+---------------------------------------------+----------+ A Comparation Between Computer Peoples: * +----------+-----------------------------------------------+----------+ | posicion | url | pr | +----------+-----------------------------------------------+----------+ | 717 | http://en.wikipedia.org/wiki/Linus_Torvalds | 0.626179 | | 1595 | http://en.wikipedia.org/wiki/Richard_Stallman | 0.451587 | | 4636 | http://en.wikipedia.org/wiki/Bill_Gates | 0.267769 | +----------+-----------------------------------------------+----------+ A Comparation between WebSearch Engines: * +----------+-------------------------------------------------------------+----------+ | posicion | url | pr | +----------+-------------------------------------------------------------+----------+ | 194 | http://en.wikipedia.org/wiki/Google | 1.30925 | | 10956 | http://en.wikipedia.org/wiki/MSN | 0.193957 | | 64674 | http://en.wikipedia.org/wiki/List_of_acquisitions_by_Google | 0.15752 | +----------+-------------------------------------------------------------+----------+ This Results are automatic calculate by this Class, the result is not changed by our preference The PageRank will be more usefull is there is download the whole wikipedia. For better performance in the test file there is set to 68M, if you have a good machine you give more of RAM.