<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>SEO and Web Marketing Research &#187; Link Popularity Algorithms</title>
	<atom:link href="http://www.seoresearcher.com/category/link-popularity-algorithms/feed" rel="self" type="application/rss+xml" />
	<link>http://www.seoresearcher.com</link>
	<description>A comprehensive SEO and Web Marketing study</description>
	<lastBuildDate>Wed, 03 Jun 2009 01:01:12 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Guidelines to a Perfect Link Exchange Scam</title>
		<link>http://www.seoresearcher.com/guidelines-to-a-perfect-link-exchange-scam.htm</link>
		<comments>http://www.seoresearcher.com/guidelines-to-a-perfect-link-exchange-scam.htm#comments</comments>
		<pubDate>Sun, 24 Dec 2006 23:14:46 +0000</pubDate>
		<dc:creator>oleg.ishenko</dc:creator>
				<category><![CDATA[Link Popularity Algorithms]]></category>
		<category><![CDATA[Search Engine Optimization]]></category>

		<guid isPermaLink="false">http://www.seoresearcher.com/48.htm</guid>
		<description><![CDATA[Reciprocal  link exchange still is an important strategy of link popularity building  despite all the measures taken by the search engines to diminish its effect. Back  in 1999-2001 obtaining a quality link exchange was not difficult, and webmasters  used to respond more willingly to an e-mail request. But as more people [...]]]></description>
			<content:encoded><![CDATA[<p><img width="190" height="160" align="left" alt="Reciprocal link exchange" src="http://www.seoresearcher.com/images/articles/link-exchange.jpg" /><strong>Reciprocal  link exchange</strong> still is an important strategy of link popularity building  despite all the measures taken by the search engines to diminish its effect. Back  in 1999-2001 obtaining a quality link exchange was not difficult, and webmasters  used to respond more willingly to an e-mail request. But as more people became  aware of this strategy so the <strong>reciprocal linking scam</strong> started  to be a common practice.</p>
<p>Sometimes I check my old â€˜link exchangeâ€™ e-mail account I used to build    link popularity for my very first website. There are lots of people contacting    me daily with exchange proposals. Well, not actually people â€“ they are    mostly <strong>bots</strong>.</p>
<p>Probably one of the reasons I still maintain that e-mail is that those requests    are a source of a <strong>persistent amusement</strong> for me. One example:    a request in pink letters with images of dancing puppies and bouncing hearts    written by a â€˜blond chickâ€™ (picture attached) asking me to link    to her pharmacy site! Or maybe I just enjoy reading the admiring comments on    the outlook and content of my site that precede every exchange proposal?</p>
<p>Link exchange scam is an interesting theme for a study <em>per se</em> and    still awaits its researchers. But in the meanwhile the SEO community is being successful    in summarizing the <strong>guidelines for the most perfect link exchange scam</strong>.</p>
<p><span id="more-48"></span></p>
<h2>Filing an Exchange Request</h2>
<div id="advertical"><script type="text/javascript"><!--
google_ad_client = "pub-4068762382585748";
/* 160x600, created 7/15/09 */
google_ad_slot = "4312880109";
google_ad_width = 160;
google_ad_height = 600;
//-->
</script>
<script type="text/javascript"
src="http://pagead2.googlesyndication.com/pagead/show_ads.js">
</script></div>
<ul>
<li>Send an automated e-mail request or use a bot to submit it via an online      contact form. Combining the both methods is preferred whenever possible.</li>
<li>Use a free e-mail account such as Gmail, or better yet, some foreign free      e-mail service to send your message. Sending a duplicate request from your      company account is also beneficial.</li>
<li>Do send follow up e-mails. Sooner or later your victim will give up and      read one of them.</li>
<li>Send minimum 100-300 automated requests every day. Push your mail serverâ€™s      spam detection to the limits.</li>
<li>Make sure that the website you are trying to contact is absolutely unrelated      to your field.</li>
<li>Send your request to every e-mail address you can find on the target site.      Let the sales or customer support guys forward them to the webmaster.</li>
</ul>
<h2>Writing Your Request</h2>
<ul>
<li><em>Address properly</em>. No names required. Best thing is to use the websiteâ€™s      title or at least the URL: â€œ<em>Dear Blue Cheap Online Widgets</em>â€,      or â€œ<em>Hello www.bluewidgets.com</em>â€</li>
<li><em>Kiss ass</em>. Tell your victims how much you adore their websites.      Do use superlatives.</li>
<li><em>Inform</em>. Let your recipients know how important PageRank and incoming      links are. Go in depth with the mysteries and magnificence of the PageRank      and how the high PageRank will ensure them the first positions in Google.</li>
<li><em>Scare</em>. Notify them that their link popularity is low, and their      positions in search engines are threatened.</li>
<li><em>Share a secret</em>. Tell them that the three-way linking is more effective,      since search engines detect and ignore two-way links.</li>
<li><em>Threaten</em>. Notify them that their link will be removed from your      high quality directory if they do not provide a link back in the specified      number of days.</li>
<li><em> Show your scale</em>. Make your message easily detectable as a bulk      sending by setting a different font size and color for the recipientâ€™s      address and site name.</li>
<li><em>Be unofficial</em>. Use the Internet argot in your e-mail. Like â€˜u      râ€™ instead of â€˜you areâ€™. This is the Internet â€“ formalism      is unacceptable.</li>
<li><em>Threaten them again</em>. With hundreds of reminder e-mails.</li>
<li><em>Use a girlâ€™s name</em>. Most webmasters are male and should not      resist a lady asking for a favor.</li>
</ul>
<h2>Prepare a Sound Links Page</h2>
<ul>
<li>Your link page must have at least 100 outgoing links, preferably uncategorized.      Make sure that minimum 50% point to pharmacy and gambling websites.</li>
<li>Your proposed links page has to be deeply buried in a keyword-rich URL      like: <em>http://www.yoursite.com/widgets/cheap-widgets/amazingly-cheap-widgets/widgets-links/</em></li>
<li>Make sure the links page URL contains at least one poison keyword like      â€˜<em>links</em>â€™, â€˜<em>partners</em>â€™, â€˜<em>directory</em>â€™,      or â€˜<em>exchanges</em>â€™.</li>
<li>Alternatively provide a dynamic URL with a minimum of 100 characters of      meaningless parameter values.</li>
<li>Choose links pages that are in Googleâ€™s supplementary index.</li>
<li>The PageRank for your page has to be between 0 and 3 with 0 being the best.</li>
<li>Make your page look more credible by putting AdSense ads on it. â€œ<em>Well,      if Google approves this page, then it is worth having a link from it</em>â€.</li>
<li>Disguise your low PR links pages by opening them in a high PR frame.</li>
<li>Orphan pages are the best.</li>
</ul>
<h2>Use SEO Tricks</h2>
<ul>
<li>Link to your partners using one of the following options:
<ul>
<li>â€˜nofollowâ€™ attribute</li>
<li>javascript links</li>
<li>302 â€˜Foundâ€™ redirects</li>
</ul>
</li>
<li>Edit <em>robots.txt</em> to restrict spiders from indexing your links pages.</li>
<li>Double protect your links pages from indexing by adding meta â€˜<em>noindex,nofollow</em>â€™      tags.</li>
</ul>
<p>The above guidelines are compiled from my own experience and the hilarious    thread â€˜<a target="_blank" href="http://www.webmasterworld.com/forum12/3154.htm">SEO    Link Exchange</a>â€™ from the <strong>WebMasterWorld</strong> forum.</p>
<p>The list can be continued. Any suggestions?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.seoresearcher.com/guidelines-to-a-perfect-link-exchange-scam.htm/feed</wfw:commentRss>
		<slash:comments>22</slash:comments>
		</item>
		<item>
		<title>Link Popularity Building Strategies and Tips</title>
		<link>http://www.seoresearcher.com/link-popularity-building-strategies-and-tips.htm</link>
		<comments>http://www.seoresearcher.com/link-popularity-building-strategies-and-tips.htm#comments</comments>
		<pubDate>Sat, 14 Oct 2006 19:54:43 +0000</pubDate>
		<dc:creator>oleg.ishenko</dc:creator>
				<category><![CDATA[Link Popularity Algorithms]]></category>
		<category><![CDATA[Search Engine Optimization]]></category>

		<guid isPermaLink="false">http://www.seoresearcher.com/link-popularity-building-strategies-and-tips.htm</guid>
		<description><![CDATA[Link    Popularity
Link building has always been a hot topic. In the beginning of the web hyperlinks    were virtually the only way to get visitors to a site, because search engines    were in their infancy. When search engines grew to be the major source of the  [...]]]></description>
			<content:encoded><![CDATA[<h3><img width="249" height="192" align="left" src="http://www.seoresearcher.com/images/articles/link-building.jpg" />Link    Popularity</h3>
<p>Link building has always been a hot topic. In the beginning of the web hyperlinks    were virtually the only way to get visitors to a site, because search engines    were in their infancy. When search engines grew to be the major source of the    web traffic, links didnâ€™t lose their weight, as search algorithms started    to rank sites according to the quantity and quality of their incoming links.    And today links become increasingly important with the growing significance    of the new Web 2.0 social networks.<span id="more-33"></span></p>
<h3>Link Popularuty Building Strategies</h3>
<div id="advertical"><script type="text/javascript"><!--
google_ad_client = "pub-4068762382585748";
/* 160x600, created 7/15/09 */
google_ad_slot = "4312880109";
google_ad_width = 160;
google_ad_height = 600;
//-->
</script>
<script type="text/javascript"
src="http://pagead2.googlesyndication.com/pagead/show_ads.js">
</script></div>
<p>Thus, links rule the Internet. Once a routine task of a webmaster, <strong>link    building</strong> has emerged itself into a full scale industry with millions    of dollars in turnover. Ranking algorithms perceive links as a proxy for a human    judgment, or a userâ€™s positive endorsement of a page. The idea is as follows:    a user discovers a page, likes its content, links to the page, and the page    gets higher ranking. This is the so-called <strong>â€˜natural way</strong>â€™    of acquiring links. The natural way of acquiring link works too slow and    can be pretty unfair. New pages on big and established websites are far more    likely to be discovered by web users, and these pages will get the biggest part    of the new links (like 90%); while new pages on fresh sites will get trinkets.    This is a serious defect of the link ranking system which is discussed more    in details in my article <a href="http://www.seoresearcher.com/popularity-ranking-faults.htm">Popularity    Ranking Faults</a>.</p>
<p>Since the natural way of getting links for a new website can take forever,    some additional boost is required. There are many strategies of link building    able to ensure you some initial ranking and exposure, which are necessary to    make the â€˜natural wayâ€™ work. Some of these strategies can be very    tricky and do more harm than use. So it is critically important to keep in mind    the following tips of link building.</p>
<h3>Link Building Tips</h3>
<p>Be a <strong>user</strong> when building links. The point is to make your link    exchanges look like they are acquired the natural way. Make sure that your links    appear in places where search engine expect them to be. This should be pages    relevant to your content. Link must be in the page copy or in a sidebar possibly    among the other links pointing to pages also relevant to your topic. The anchor    text must look naturally â€“ so no keyword stuffing.</p>
<p>Analyze your own <strong>motives</strong> of linking to the sites you like.    What motivates you to cite a web resource? Is it a collection of online tools    or handy tutorials? Or may be it is a provoking title? Apply this â€˜reverse    engineeringâ€™ to your pages, and use unique interesting content to attract    links.</p>
<p>Avoid things that can damage your <strong>reputation</strong> in the eyes of    search engines. No link farms, suspicious looking websites or poor quality link    exchanges. Forget the reciprocal links â€“ they no longer have any significant    weight. Do not participate in three-way or similar linking schemes â€“ these    attempts to disguise reciprocal linking are easy to detect. NASA managed to    get a man on the Moon with computers less powerful than a GameBoy, so why do    you think Google canâ€™t discover link triangles with all the computing    resources at its disposal?</p>
<p><strong>Buying links</strong>. This practice is pretty much discouraged by    Google, because it undermines the idea of the proxy for human judgment. This means you    have to be especially savvy when buying links. Avoid link trading sites or any    site publicly announcing that is sells links. Donâ€™t mix buying links with    paid advertising. You pay for an advertisement on a high traffic page expecting    visitors referred by your ad. Buying links has a different purpose â€“ increasing    your link popularity.</p>
<p>Do not be obsessed with <strong>backlinks</strong>. There is an intense focus    on link building but not enough focus on content creation. Links must reflect    the quality of content. If you think your site has not enough incoming links,    you should concentrate on how to improve the quality of content and make it more    appealing, not on more link exchanges.</p>
<p><strong>Link penalties</strong>. Many people are afraid to get penalized for    linking or being linked by fishy websites. If there is a need to put a link    to a site which you do not want to be related with, use rel â€˜nofollowâ€™.    Google confirms that this attribute is critical in link analysis, so you should    be fine. Links from dubious sources to your site are out of your control and    all the major search engines assure that they donâ€™t punish people for    that. However too many links from such sites (like tens of thousands) can bring    an unwanted attention of search engine quality teams. They can ban your site    if they find you responsible for boosting your rankings, but you can always    submit reinclusion.</p>
<h3>To sum up:</h3>
<p>Make your linking strategy look natural. Avoid the known patterns of artificial    link building and do not obsess with links at the expense of content creation.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.seoresearcher.com/link-popularity-building-strategies-and-tips.htm/feed</wfw:commentRss>
		<slash:comments>23</slash:comments>
		</item>
		<item>
		<title>Authority Threshold Algorithm</title>
		<link>http://www.seoresearcher.com/authority-threshold-algorithm.htm</link>
		<comments>http://www.seoresearcher.com/authority-threshold-algorithm.htm#comments</comments>
		<pubDate>Wed, 19 Jul 2006 11:20:04 +0000</pubDate>
		<dc:creator>oleg.ishenko</dc:creator>
				<category><![CDATA[Link Popularity Algorithms]]></category>
		<category><![CDATA[Search Engines Technology]]></category>

		<guid isPermaLink="false">http://www.seoresearcher.com/blog/2006/07/19/authority-threshold-algorithm/</guid>
		<description><![CDATA[Authority Threshold Algorithm (AT(k))
The idea behind AT(k) Algorithm is using only k highest authority weights instead of calculating average weight from every authority pointed by a hub. The parameter k  is called authority threshold. A variant of an AT algorithm is MAX algorithm, where k=1, i.e. a hub is as good as the best [...]]]></description>
			<content:encoded><![CDATA[<h2>Authority Threshold Algorithm (AT(k))</h2>
<p>The idea behind <strong>AT(k) Algorithm</strong> is using only <em>k</em> highest authority weights instead of calculating average weight from every authority pointed by a hub. The parameter <em>k </em> is called <em>authority threshold</em>. A variant of an AT algorithm is <strong>MAX </strong>algorithm, where <em>k</em>=1, i.e. a hub is as good as the best authority it links to.</p>
<p>In general AT(k) algorithm uses the same formula as <a title="HITS Algorithm" href="http://www.seoresearcher.com/link-analysis-algorithms-hits.htm"><strong>HITS</strong></a>. The difference is that when calculating the weight of a hub we consider top <em>k</em> authorities only, i.e. <em>Fk(i) </em>is a subset of outgoing links <em>F(i)</em>. If the number of outgoing links <em>|F(i)| </em>is less or equal <em>k</em> than the AT(k) algorithm works exactly the same as HITS.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.seoresearcher.com/authority-threshold-algorithm.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Link Analysis Algorithms: HUBAVG</title>
		<link>http://www.seoresearcher.com/link-analysis-algorithms-hubavg.htm</link>
		<comments>http://www.seoresearcher.com/link-analysis-algorithms-hubavg.htm#comments</comments>
		<pubDate>Wed, 19 Jul 2006 11:17:05 +0000</pubDate>
		<dc:creator>oleg.ishenko</dc:creator>
				<category><![CDATA[Link Popularity Algorithms]]></category>
		<category><![CDATA[Search Engines Technology]]></category>

		<guid isPermaLink="false">http://www.seoresearcher.com/blog/2006/07/19/link-analysis-algorithms-hubavg/</guid>
		<description><![CDATA[HUBAVG Algortihm
To overcome the shortcoming of the HITS algorithm of a hub getting a high weight when it points to numerous low-quality authorities, the following refinement was suggested. While using the same formula to calculate authority weights, the hub score h is now averaged by a number of outgoing links &#124;F(i)&#124;:

So in order to achieve [...]]]></description>
			<content:encoded><![CDATA[<h2>HUBAVG Algortihm</h2>
<p>To overcome the shortcoming of the HITS algorithm of a hub getting a high weight when it points to numerous low-quality authorities, the following refinement was suggested. While using the same formula to calculate authority weights, the hub score <em>h</em> is now averaged by a number of outgoing links <em>|F(i)|</em>:</p>
<p><img alt="HUBAVG weights calculation" title="HUBAVG weights calculation" src="http://www.seoresearcher.com/images/link-algorithms/hubavg.gif" /></p>
<p>So in order to achieve a high weight a hub should link good authorities. Unfortunately this approach has its own flaw. Consider two hubs pointing to an equal number of equally good authorities. The two hubs are identical until one puts one more link to a low quality authority. The average sum of the authorities it points to sinks, and it gets penalized in weight. This is quite illogical but can be fixed by using so-called Authority Threshold Algorithm.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.seoresearcher.com/link-analysis-algorithms-hubavg.htm/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Link Analysis Algorithms: HITS</title>
		<link>http://www.seoresearcher.com/link-analysis-algorithms-hits.htm</link>
		<comments>http://www.seoresearcher.com/link-analysis-algorithms-hits.htm#comments</comments>
		<pubDate>Tue, 18 Jul 2006 17:42:41 +0000</pubDate>
		<dc:creator>oleg.ishenko</dc:creator>
				<category><![CDATA[Link Popularity Algorithms]]></category>
		<category><![CDATA[Search Engines Technology]]></category>

		<guid isPermaLink="false">http://www.seoresearcher.com/blog/2006/07/18/link-analysis-algorithms-hits/</guid>
		<description><![CDATA[HITS Algorithm
This algorithm was first described by Jon Kleinberg in his work â€œAuthoritative Sources in a Hyperlinked Environmentâ€ (1998). The idea behind the HITS (Hyperlink Induced Topic Distillation) algorithm is that the authorities and hubs mutually reinforce each other. Authority weight of a page is calculated as a sum of hub weights pointing to it, [...]]]></description>
			<content:encoded><![CDATA[<h2>HITS Algorithm</h2>
<p>This algorithm was first described by <strong>Jon Kleinberg</strong> in his work â€œ<em>Authoritative Sources in a Hyperlinked Environment</em>â€ (1998). The idea behind the <strong>HITS </strong>(Hyperlink Induced Topic Distillation) algorithm is that the authorities and hubs mutually reinforce each other. Authority weight of a page is calculated as a sum of hub weights pointing to it, and weight of a hub â€“ as a sum of weights of authorities pointed to by it. In other words a hub is as good as the authorities linked by it, and vice versa.<span id="more-11"></span></p>
<p>The notation of the algorithm is as follows. Let S be a set of pages for which hub and authority weights are being calculated, n â€“ the number of pages in the set. Then H is a subset of S containing pages acting as hubs, and A is a subset of S containing authorities. Since each page can be an authority and a hub, A and H overlap. For every page i in its hub role F(i) is the number of outgoing links. For every page i in its authority role B(i) is the number of incoming links. The n-dimensional vector of authority weights is denoted as a, and vector of hub weight â€“ as h. Then hub and authority weights are calculated by the following formula:</p>
<p><img title="HITS Algorithm calculation of weights" alt="HITS Algorithm calculation of weights" src="http://www.seoresearcher.com/images/link-algorithms/hits-weights.gif" /></p>
<p>The process is iterative. First all the weights receive value of 1. Then hubs and authority weights are calculated and the vectors are normalized. This stage is repeated until vectors a and h converge.</p>
<p>The algorithm however has a number of flaws. For example the nature of mutual reinforcement creates the following situation. Consider a hub that points to many authorities (hub B on the picture below), and a number of hubs pointing to a single authority (authority A on the picture). If the number of authorities pointed to by B is larger then the number of hubs pointing to A, then the HITS algorithm will allocate all the weight to the authorities in the right part of the picture and the authority A will get a weight near zero.</p>
<p><img title="HITS Algorithm faults" alt="HITS Algorithm faults" src="http://www.seoresearcher.com/images/link-algorithms/hits-faults.jpg" /></p>
<p>The reason is that hub B will initially get a very high score and propagate it to the authorities it links to. In the same time hubs on the left side will get very low score, and consequently A will get low weight too, although obviously it deserves more.</p>
<h2>Cited Resources</h2>
<ul>
<li>Kleinberg, J. May 1997, â€˜<a title="Authoritative sources in a hyperlinked environment" target="_blank" href="http://citeseer.ist.psu.edu/article/kleinberg98authoritative.html">Authoritative sources in a hyperlinked environment</a>â€™. Technical Report RJ 10076, IBM,. Available at http://citeseer.ist.psu.edu/article/kleinberg98authoritative.html</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.seoresearcher.com/link-analysis-algorithms-hits.htm/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Topic-Sensitive PageRank</title>
		<link>http://www.seoresearcher.com/topic-sensitive-pagerank.htm</link>
		<comments>http://www.seoresearcher.com/topic-sensitive-pagerank.htm#comments</comments>
		<pubDate>Mon, 17 Jul 2006 18:15:51 +0000</pubDate>
		<dc:creator>oleg.ishenko</dc:creator>
				<category><![CDATA[Link Popularity Algorithms]]></category>
		<category><![CDATA[Search Engines Technology]]></category>

		<guid isPermaLink="false">http://www.seoresearcher.com/blog/2006/07/17/topic-sensitive-pagerank/</guid>
		<description><![CDATA[The link structure of the Web is highly sensitive to page topic. Pages tend to contain links pointing to other pages on the same broad topic, e.g. pages on investment banking often link to other business-related resources but rarely to sports portals. While using offline PageRank scores has an advantage of faster processing, it also [...]]]></description>
			<content:encoded><![CDATA[<p>The link structure of the Web is highly sensitive to page <strong>topic</strong>. Pages tend to contain links pointing to other pages on the same broad topic, e.g. pages on investment banking often link to other business-related resources but rarely to sports portals. While using offline PageRank scores has an advantage of faster processing, it also creates a situation where some highly linked page receive higher ranking on topics for which they have no authority. A query-time adjustment of the scoring function is necessary to refine the search results. Some algorithms like <strong>HITS</strong> and <strong>Hilltop </strong>allow such an adjustment. However these algorithms have their own shortcomings that restrict their efficient use by search engines.</p>
<p><strong>HITS </strong>algorithm calculates <em>hubs </em>and <em>authorities </em>in query-time but relies on a relatively small subset of the Web â€“ the immediate neighborhood of a page, since otherwise computation time would be unacceptably long. <strong>Hilltop </strong>algorithm analyses a query and calculates score values by finding pages that seem to be experts in the query-specific topic. This algorithm restricts itself to popular queries, since it canâ€™t produce score values when no experts for an uncommon search term are found.</p>
<p><strong> Topic-Sensitive PageRank</strong> extends the original PageRank idea by adding a <em>query-time topic-sensitive adjustment</em>.<span id="more-10"></span> Instead of a single vector of PageRank values, multiple topic-specific PageRank vectors are calculated. Creation of a PageRank vectors for every possible topic would require extensive resources, so in practice the algorithm uses only 16 topic-specific ranking vectors representing the top categories of the <a title="Open Directory Project" href="http://dmoz.org/" target="_blank">ODP</a> project. Other sources of topics can be used for this purpose as well, but since ODP project is created and edited by a large number of independent volunteers, it is the least likely to be influenced by any one party. For each page in the Web a set of importance scores with respect to various topics is precomputed and stored offline. In query time the topic-specific score is combined with other scores (e.g. content analysis) to form the final ranking for a page.</p>
<p>Let <em>c<sub>j</sub> </em>be one of the <a title="ODP Top-Level Categories" href="http://dmoz.org/" target="_blank">16 top-level</a> ODP categories. For each topic <em>c<sub>j</sub></em> it is necessary to calculate a biased PageRank vector. Let <em>M </em>be a <a title="Wiki: Modofied Adjancecny Matrix" href="http://en.wikipedia.org/wiki/Modified_adjacency_matrix" target="_blank">modified adjacency matrix</a>. Each element <em>m<sub>ji</sub> </em>has value <em>1/N<sub>j</sub></em>, if there is a link from <em>j</em> to <em>i</em>, and where <em>N<sub>j</sub></em> is the number of outgoing links on page <em>j</em>. Otherwise element value is 0. Then the original PageRank formula in matrix notation looks as following:</p>
<p><img title="PageRank formula in a matrix notation" src="http://www.seoresearcher.com/images/link-algorithms/pagerank-matrix-notation.gif" alt="PageRank formula in a matrix notation" /></p>
<p>Parameter <em>Î±</em> here is the dumping factor that equals <em>1-d</em> in the original PageRank formula. The resulting vector of PageRank values is denoted as <em>PR(Î±, p)</em>.</p>
<p>With minor modifications the same formula is used to calculate the topic-sensitive ranking vectors. Let <em>T<sub>j</sub> </em>be the set of pages under a topic <em>c<sub>j</sub></em>. Then instead of the uniform distribution damping factor <em>p</em>, a non-uniform vector <em>v<sub>j</sub></em> is used, where:</p>
<p><img title="Non-uniform damping factor" src="http://www.seoresearcher.com/images/link-algorithms/dampingfactor.gif" alt="Non-uniform damping factor" /></p>
<p>Resulting PageRank vector is denoted as <em>PR(a, v<sub>ij</sub>)</em>. Additionally using all the documents under each topic <em>c<sub>j</sub></em> a term vector <em>D<sub>j</sub> </em>is constructed. Term vector elements <em>D<sub>jt</sub></em> are the numbers of occurrences of every term under topic <em>c<sub>j</sub>.</em> In order to detect a topic, to which a search query term relates to, two scenarios are considered. In the first scenario a user highlights a keyword in a page and initiates a search. In this case the search topic is defined by the page content. For example if word â€˜architectureâ€™ is highlighted in a page about famous buildings, the pages on CPU architecture should not appear among search results. So if a term <em>q</em> is highlighted in some page <em>u</em>, its context <em>qâ€™</em> would be the words in <em>u</em>.  In the second scenario a user enters a keyword into a search form in the conventional way. In this case the context of the query <em>q</em> is the search term itself:<em> qâ€™ = q</em>.  When a history of search terms is kept, it is also possible to use it as the context <em>qâ€™</em>.  In query time the proximity of query context <em>qâ€™ </em>to one of the topics <em>c<sub>j</sub></em> is calculated:</p>
<p><img title="Topic proximity value" src="http://www.seoresearcher.com/images/link-algorithms/proximity.gif" alt="Topic proximity value" /></p>
<p>Proximity value <em>P(qâ€™|c<sub>j</sub>)</em> is calculated using the term vectors <em>D<sub>j</sub></em>. Then a query-sensitive score values are computed for every page d from the index that contains the original search term <em>q</em>. For each document <em>d</em> we sum up the products of topic proximity values <em>P(c<sub>j</sub>|qâ€™)</em> and the page rank <em>r<sub>d</sub></em>. The rank<em> r<sub>d</sub></em> is an element of the page rank vector <em>PR(a, v<sub>ji</sub>)</em> of a topic to which the document d belongs to:</p>
<p><img title="Sorting value" src="http://www.seoresearcher.com/images/link-algorithms/sortingvalue.gif" alt="Sorting value" /></p>
<p>The final search results are sorted by the values of <em>s<sub>d</sub></em>. Since the PageRank calculations are performed in advance, the algorithm is able to quickly perform topic adjustments in the query time.</p>
<h2>Cited resources</h2>
<ul>
<li>Haveliwala, T.H. â€˜<a title="Topic-sensitive PageRank" href="http://citeseer.ist.psu.edu/haveliwala02topicsensitive.html" target="_blank">Topic-sensitive PageRank</a>â€™. In Proceedings of the Eleventh International World Wide Web Conference, Honolulu, Hawaii, May 2002. Available at http://citeseer.ist.psu.edu/haveliwala02topicsensitive.html</li>
<li>Haveliwala, T.H. â€˜<a title="Topic-sensitive pagerank: A context-sensitive ranking algorithm for web search" href="http://citeseer.ist.psu.edu/rd/83310218%2C578979%2C1%2C0.25%2CDownload/http://citeseer.ist.psu.edu/cache/papers/cs/27801/http:zSzzSzwww.stanford.eduzSz%7EtaherhzSzpaperszSztopic-sensitive-pagerank-tkde.pdf/haveliwala03topicsensitive.pdf" target="_blank">Topic-sensitive pagerank: A context-sensitive ranking algorithm for web search</a>â€™. IEEE Trans. Knowl. Data Eng., 15(4):784&#8211;796, 2003. Available at http://citeseer.ist.psu.edu/article/haveliwala03topicsensitive.html</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.seoresearcher.com/topic-sensitive-pagerank.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>PageRank</title>
		<link>http://www.seoresearcher.com/pagerank.htm</link>
		<comments>http://www.seoresearcher.com/pagerank.htm#comments</comments>
		<pubDate>Mon, 17 Jul 2006 17:49:07 +0000</pubDate>
		<dc:creator>oleg.ishenko</dc:creator>
				<category><![CDATA[Link Popularity Algorithms]]></category>
		<category><![CDATA[Search Engines Technology]]></category>

		<guid isPermaLink="false">http://www.seoresearcher.com/blog/2006/07/17/pagerank/</guid>
		<description><![CDATA[The PageRank Algorithm
PageRank extends the idea behind the InDegree algorithm by assigning different weights to the links. Links from high quality pages should make a stronger impact on the rank of a page. Therefore it is not only important how many incoming links a page has, but also how important the pointing pages are.
To determine [...]]]></description>
			<content:encoded><![CDATA[<h2>The PageRank Algorithm</h2>
<p><strong>PageRank </strong>extends the idea behind the <a title="InDegree Algorithm" href="http://www.seoresearcher.com/link-analysis-algorithms-indegree.htm">InDegree algorithm</a> by assigning different weights to the links. Links from high quality pages should make a stronger impact on the rank of a page. Therefore it is not only important how many incoming links a page has, but also how important the pointing pages are.</p>
<p>To determine the authority of internet pages PageRank simulates the behavior of a random web-surfer,<span id="more-9"></span> by starting its walk from some random page and then following the outgoing links. The starting point is usually drawn from a uniform distribution, but other distributions can be used as well. The process of the random walk is as follows: at the starting point an outgoing link is randomly chosen and the surfer follows it with a probability <em>1-d</em>. With probability <em>d</em> the surfer jumps to another random page without following any link (â€œgets bored clicking linksâ€). The parameter <em>d</em> is called â€œ<strong>the dumping factor</strong>â€, its value can vary but generally it is assumed that equals 0.85.</p>
<p>The PageRank of a page is calculated by a formula given below:</p>
<p><img alt="PageRank Formula" title="PageRank Formula" src="http://www.seoresearcher.com/images/link-algorithms/pagerankformula.png" /></p>
<p>The algorithm considers a set of interlinked pages <em>M(p<sub>i</sub>) </em>with the number of elements <em>N</em>. <em>PR(p<sub>j</sub>)</em> is the PageRank of a page <em>p<sub>j</sub></em> in  the set, and <em>L(p<sub>j</sub>)</em> is the number of outgoing links on that page. Each page that passes to <em>p<sub>i</sub> </em>a fraction of its own PageRank depending on how many other pages it links to.</p>
<p>The incoming links in the set<em> M(p<sub>i</sub>) </em>can be presented in a form of <a target="_blank" title="Wiki: adjacency matrix" href="http://en.wikipedia.org/wiki/Modified_adjacency_matrix">adjacency matrix</a>:</p>
<p><img alt="Modified adjacency matrix" title="Modified adjacency matrix" src="http://www.seoresearcher.com/images/link-algorithms/adjacencymatrix.gif" /></p>
<p>Where <em>l(p<sub>1</sub>,p<sub>2</sub>)</em> equals 1 if there is a link between pages <em>p<sub>1</sub></em> and <em>p<sub>2</sub></em> and zero otherwise. PageRanks of all the pages in the set <em>M(p<sub>i</sub>)</em> form a vector <em>R</em>, which is the dominant <a target="_blank" title="Wiki: Eigenvectors" href="http://en.wikipedia.org/wiki/EigenVectors">eigenvector</a> to the adjacency matrix.</p>
<p><img alt="PageRank as an eigenvector" title="PageRank as an eigenvector" src="http://www.seoresearcher.com/images/link-algorithms/pagerankeigenvector.png" /></p>
<p>Then the PageRank formula can be written as:</p>
<p><img alt="PageRank equation" title="PageRank equation" src="http://www.seoresearcher.com/images/link-algorithms/pagerankequation.png" /></p>
<p>PageRank values are precomputed and do not depend on search queries. When it is necessary to rank the results for some search term PageRank is used in conjunction with query specific scores. Having the PageRank scores precomputed allows faster results sorting.</p>
<h2>References</h2>
<ul>
<li>Borodin, A, Roberts, G.O., Rosenthal, J.S. and Tsaparas, P. â€˜<a title="Finding authorities and hubs from link structures on the World Wide Web" href="http://citeseer.ist.psu.edu/rd/0%2C608558%2C1%2C0.25%2CDownload/http://citeseer.ist.psu.edu/cache/papers/cs/30201/http:zSzzSzwww.cs.toronto.eduzSz%7EtsapzSzpublicationszSzhubs-journal.pdf/borodin01finding.pdf">Finding authorities and hubs from link structures on the World Wide Web</a>â€™. In Proceedings of the 10 th International World Wide Web Conference, Hong Kong, May 2001. Available at http://citeseer.ist.psu.edu/borodin01finding.html</li>
<li>Wikipedia: <a target="_blank" title="PageRank" href="http://en.wikipedia.org/wiki/Pagerank">PageRank</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.seoresearcher.com/pagerank.htm/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Link Analysis Algorithms: InDegree</title>
		<link>http://www.seoresearcher.com/link-analysis-algorithms-indegree.htm</link>
		<comments>http://www.seoresearcher.com/link-analysis-algorithms-indegree.htm#comments</comments>
		<pubDate>Mon, 17 Jul 2006 17:31:47 +0000</pubDate>
		<dc:creator>oleg.ishenko</dc:creator>
				<category><![CDATA[Link Popularity Algorithms]]></category>
		<category><![CDATA[Search Engines Technology]]></category>

		<guid isPermaLink="false">http://www.seoresearcher.com/blog/2006/07/17/link-analysis-algorithms-indegree/</guid>
		<description><![CDATA[The InDegree Algorithm
This simplest algorithm uses page popularity as a ranking factor. Page popularity is measured as a number of incoming links â€“ similar to document citation in the academic world. In the early days of the Web this algorithm was widely used by search engines. The InDegree algorithm is not very effective â€“ we [...]]]></description>
			<content:encoded><![CDATA[<h2>The InDegree Algorithm</h2>
<p>This simplest algorithm uses <strong>page popularity</strong> as a ranking factor. Page popularity is measured as a number of incoming links â€“ similar to document citation in the academic world. In the early days of the Web this algorithm was widely used by search engines. The InDegree algorithm is not very effective â€“ we would need to consider links not just from any page but from those which are relevant to the query. Otherwise the algorithm can be easily manipulated by obtaining thousands of links from anywhere in the Web, thus artificially inflating the link popularity (<em>link farms</em>). The popularity <em>a<sub>i</sub></em> of a page <em>i</em> is calculated by a simple formula:</p>
<p align="center"><em>a<sub>i</sub> = |B(i)|,</em></p>
<p>where <em>B(i)</em>  is a set of pages pointing to page <em>i</em>, and <em>|B(i)|</em>  is a number of elements in the set.</p>
<h2>Reference</h2>
<ul>
<li>Borodin, A, Roberts, G.O., Rosenthal, J.S. and Tsaparas, P. â€˜<a target="_blank" title="Finding authorities and hubs from link structures on the World Wide Web" href="http://citeseer.ist.psu.edu/rd/0%2C608558%2C1%2C0.25%2CDownload/http://citeseer.ist.psu.edu/cache/papers/cs/30201/http:zSzzSzwww.cs.toronto.eduzSz%7EtsapzSzpublicationszSzhubs-journal.pdf/borodin01finding.pdf">Finding authorities and hubs from link structures on the World Wide Web</a>â€™. In Proceedings of the 10 th International World Wide Web Conference, Hong Kong, May 2001. Available at http://citeseer.ist.psu.edu/borodin01finding.html</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.seoresearcher.com/link-analysis-algorithms-indegree.htm/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Link Analysis Algorithms</title>
		<link>http://www.seoresearcher.com/link-analysis-algorithms.htm</link>
		<comments>http://www.seoresearcher.com/link-analysis-algorithms.htm#comments</comments>
		<pubDate>Mon, 17 Jul 2006 17:25:10 +0000</pubDate>
		<dc:creator>oleg.ishenko</dc:creator>
				<category><![CDATA[Link Popularity Algorithms]]></category>
		<category><![CDATA[Search Engines Technology]]></category>

		<guid isPermaLink="false">http://www.seoresearcher.com/blog/2006/07/17/link-analysis-algorithms/</guid>
		<description><![CDATA[Relevance and Authority
When a user queries a search engine with a keyword, he expects more than just relevant results. For example if someone searches for â€˜Bali vacationsâ€™ he would be disappointed to get a page of a personal blog with a story about John Doeâ€™s awesome Bali vacation last summer. Obviously, what the user was [...]]]></description>
			<content:encoded><![CDATA[<h2>Relevance and Authority</h2>
<p>When a user queries a search engine with a keyword, he expects more than just relevant results. For example if someone searches for â€˜Bali vacationsâ€™ he would be disappointed to get a page of a personal blog with a story about John Doeâ€™s awesome Bali vacation last summer. Obviously, what the user was looking for was a travel agent like Expedia. Thus it is critical that users get not just <strong>relevant </strong>but also <strong>authoritative </strong>results. And the more pages appear daily in the Internet the bigger is a shift from the relevance to the authoritativeness in search algorithms.</p>
<p>Nowadays the relevance of a page is defined differently. It is not just about the keyword saturation, or copy structure. Currently the <strong>context </strong>where the page exists defines its relevance. The <em>context </em>is a set of pages linking to or linked by the given page. If these pages are about Bali vacations, then it is naturally to expect a page linked by them to be about Bali vacations as well. The page content would be used to adjust the algorithmâ€™s results in cases when links point to an irrelevant page, for example a widely used free web statistics system.</p>
<h2>Link Analysis Ranking Algorithms</h2>
<p>So how come the page content analysis is no longer enough to get relevant search results? There is a problem of <strong>abundance</strong>: the number of pages considered to be relevant basing on the page content analysis is too big for a human to digest. And this is where searching for authoritative pages helps to narrow down the results. But authority is even a vaguer notion than relevance. Authority has to express the importance and the weight of a web document. The nature of the Web as an interlinked hypertext environment suggests that links can be used to measure the degree of â€œpublic recognitionâ€ of web pages.</p>
<p>This idea has been existing since the creation of the Internet, and <strong>Jon Kleinberg</strong> was one of the first to create a workable approach, which he described in his seminal work â€œ<a title="Authoritative Sources in a Hyperlinked Environment" target="_blank" href="http://citeseer.ist.psu.edu/87928.html">Authoritative Sources in a Hyperlinked Environment</a>â€ (1998). He suggested that web pages can be either <strong>â€œhubsâ€ </strong>or <strong>â€œauthoritiesâ€</strong>. <em>Authority </em>is a page that has many incoming links or high <em>in-degree</em>. Authorities returned as relevant to some query should demonstrate an <em>overlap </em>in pages pointing to them. Those pages containing links to the relevant resources are called <em>hubs</em>. Hubs determine the relevance of authorities on a given topic and allow discarding other non-relevant pages with high in-degree.</p>
<p><img alt="Hubs and Authorities in an interlinked environment" title="Hubs and Authorities in an interlinked environment" src="http://www.seoresearcher.com/images/link-algorithms/hubs-and-authorities.gif" /></p>
<p>Link analysis ranking algorithms use hyperlink graphs similar to the shown above. The nodes of the hyperlink graph are web pages, and links are the directed edges. The graph is simple: if there is more than one link between two nodes, only one is considered and neither are the links from a page to itself. A different weight can be assigned to edges (links) according to the web page analysis or other factors, which search engines consider important, e.g. link or domain age.</p>
<p>In my further posts I will describe the most widely used link analysis algorithms.</p>
<h2>References</h2>
<ul>
<li>Kleinberg, J. May 1997, â€˜<a target="_blank" title="Authoritative sources in a hyperlinked environment" href="http://citeseer.ist.psu.edu/article/kleinberg98authoritative.html">Authoritative sources in a hyperlinked environment</a>â€™. Technical Report RJ 10076, IBM,. Available at http://citeseer.ist.psu.edu/article/kleinberg98authoritative.html</li>
<li>Borodin, A, Roberts, G.O., Rosenthal, J.S.  and Tsaparas, P. â€˜<a target="_blank" title="Finding authorities and hubs from link structures on the World Wide Web" href="http://citeseer.ist.psu.edu/borodin01finding.html">Finding authorities and hubs from link structures on the World Wide Web</a>â€™. In Proceedings of the 10 th International World Wide Web Conference, Hong Kong, May 2001. Available at http://citeseer.ist.psu.edu/borodin01finding.html</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.seoresearcher.com/link-analysis-algorithms.htm/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Trust Problem in the Link-Based Popularity Ranking</title>
		<link>http://www.seoresearcher.com/trust-problem-in-link-based-popularity-ranking.htm</link>
		<comments>http://www.seoresearcher.com/trust-problem-in-link-based-popularity-ranking.htm#comments</comments>
		<pubDate>Mon, 03 Jul 2006 22:52:44 +0000</pubDate>
		<dc:creator>oleg.ishenko</dc:creator>
				<category><![CDATA[Link Popularity Algorithms]]></category>
		<category><![CDATA[Search Engines Technology]]></category>

		<guid isPermaLink="false">http://www.seoresearcher.com/blog/?p=5</guid>
		<description><![CDATA[The idea behind the popularity ranking algorithms is that, by linking to a page, you imply that that page deserves attention. Search engines use links to determine the authority of pages in topics described by the link anchor text. The problem is that every link is considered as a positive endorsement with no regard to [...]]]></description>
			<content:encoded><![CDATA[<p>The idea behind the popularity ranking algorithms is that, by linking to a page, you imply that that page deserves attention. Search engines use links to determine the authority of pages in topics described by the link anchor text. The problem is that every link is considered as a positive endorsement with no regard to the real intention of the linking person. There is no effective way for a search engine to distinguish between positive and negative endorsements in links yet.<span id="more-5"></span></p>
<p>The earliest web-like development by Tim Berners-Lee &#8211; the Enquire program â€“ allowed the usage of various types of relationships between documents, such as similar-to, part-of, or made-by. However in his later work on the World Wide Web and HTML Tim Berners-Lee has abandoned multiple-types of relationships for simplicity. Flat links with no relationship information have been used ever since.</p>
<h2>Link Abuse</h2>
<p>The absence of this feature has been widely exploited by people trying to manipulate search engines results since the introduction of the link-based popularity ranking. The most notable examples are link farms, which used to flourish just a couple years ago. Search engines had realized the problem of abuse and started penalizing link farms and sites that link to them. But still with the anchor text and the host pageâ€™s own authority being the only parameters used to evaluate the link impact, there exist many other ways to manipulate rankings. By using anchor text large web-users communities can perform the infamous Google-bombing:  see â€˜<a rel="nofollow" href="http://www.google.com/search?hl=en&#038;lr=&#038;safe=off&#038;q=miserable+failure&#038;btnG=Search">miserable failure</a>â€™ search results in Google. And this can be employed against other search engines as well, e.g. <a target="_blank" rel="nofollow" href="http://www.yandex.ru/yandsearch?stype=www&#038;nl=0&#038;text=%E4%F0%F3%E3+%ED%E0%F0%EE%E4%E0">â€˜Ð´Ñ€ÑƒÐ³ Ð½Ð°Ñ€Ð¾Ð´Ð°â€™</a> (â€˜peopleâ€™s friendâ€™) query in the Russian major search engine <a title="Yandex" target="_blank" href="http://www.yandex.ru">Yandex.ru</a> returns V.Putinâ€™s bio on the top of the results</p>
<h2>Adding Semantics Info into Link Markup</h2>
<p>The problem lies in the very nature of HTML. Designed as a document structure markup language it is ineffective for describing the semantics of a document. Some extensions had to be introduced into the link markup to make it able to reflect the nature of the relationship to the linked document.  One example is the <em>rel </em>attribute. This attribute is used in the <strong>VoteLinks</strong> microformat proposed by the <a title="Technocrati" href="http://www.technorati.com/">Technorati.com</a> to allow users to express their opinion about linked blogs. The <em>rev </em>attribute in the <strong>VoteLinks</strong> format can have the following values: â€˜<em>vote-for</em>â€™, â€˜<em>vote-abstain</em>â€™ and â€˜<em>vote-against</em>â€™, thus reflecting positive, neutral and negative opinions. This makes it possible to augment or reduce the authority of the linked document respectively.</p>
<p>The <em>rel </em>attributes values such as <em>â€˜friendâ€™</em>, <em>â€˜colleagueâ€™</em>, <em>â€˜siblingâ€™</em>, <em>â€˜neighborâ€™ </em>or <em>â€˜spouseâ€™ </em>allow a different approach, reflecting the nature of the relationships between two documents, rather than the opinion. This approach is also used by <a title="Technocrati" target="_blank" href="http://www.technorati.com/">Technorati.com</a> and might have certain value within a community of bloggers.</p>
<p>None of these approaches is currently used by any major search engine. The <em>rel </em>attribute value â€˜<em>nofollow</em>â€™ is the only one to be taken into account by Google, MSN and Yahoo. This value is intended to show that the linking person is not certain about his approval of the linked document. The search engine crawlers are supposed not to follow such links, but many webmasters report that actually they still do. However it is possible that such links do not propagate authority.</p>
<h2>Possible Impact of Trust Links</h2>
<p>What if search engines were able to determine the trust or distrust nature of a link? Would the existing ranking be significantly different? Currently inbound links show the measure of attention to a certain page. Therefore even those pages which are linked to express distrust or even disgust might be ranked higher than positively endorsed pages for the same query.There is another approach that would have to be reconsidered. Now if page <em>A</em> links to page <em>B</em>, it propagates a fraction of its authority value to <em>B</em>. If there is a reciprocal link, some authority can be returned to <em>A</em>, so that both pages benefit from the mutual trust. But consider now that a highly ranked page<em> A</em> distrusts page <em>B</em>. Page <em>B</em> also adds a distrust link to <em>A</em> in revenge. Should we decrease the ranking of <em>A</em> because a distrusted page <em>B</em> also negatively links to it? Or should we stop considering any opinion from <em>B</em>?</p>
<p>To determine the effect of the search enginesâ€™ ability to distinguish between positive and negative opinions in web links <a target="_blank" href="http://moloko.itc.it/paoloblog/"><strong>Paolo Massa</strong></a> and <strong>Conor Hayes</strong> have conducted an <a title="Page-reRank: using trusted links to re-rank authority" target="_blank" href="http://sra.itc.it/people/massa/publications/wi05_page_rerank_massa_hayes.pdf">experiment </a>using the linkage data from the <a title="Epinions" target="_blank" href="http://www.epinions.com/">Epinions.com</a> website. <a title="Epinions" target="_balnk" rel="nofollow" href="http://www.epinions.com/">Epinions.com</a> is a web community where users can write their opinions on movies, books, goods and services. It is also possible to rate to use other usersâ€™ reviews, thus expressing trust or distrust. In the experimentâ€™s model, users are considered as web pages, while their ratings are seen as incoming links. The 20 top ranked users were sorted by their total number of ratings (positive and negative) and by the number of the positive rankings only. The total number of rankings shows the attention to the user, which is similar to the link popularity of a web page. Using statistical methods P. Massa and C. Hayes have determined the alignment factor between the attention ranking and the trust ranking. The value is <strong>0.8</strong> and it should be noted that since people tend to assign positive ranking more frequently than negative (often a disappointed user would simply not rate at all) â€“ the actual alignment value can be even less.</p>
<p>However trust linking would be ineffective in the web. The introduction of trust/distrust linking would create wide opportunities for abuse, and not just from gray spammers. Any highly-ranked website would be able to destroy authority of any less ranked competitor with a power of a single link! While in relatively small communities of bloggers (Technorati.com) or reviewers (Epinions.com) such wars can be prevented by human moderation, the existing technology still doesnâ€™t allow search engines to perform a similar task in the Web.</p>
<h2>References:</h2>
<ul>
<li>Massa, P. and Hayes, C. 2005, â€˜<a href="http://sra.itc.it/people/massa/publications/wi05_page_rerank_massa_hayes.pdf">Page-reRank: using trusted links to re-rank authority</a>â€™. Istituto Trentino di Cultura. Available at http://sra.itc.it/people/massa/publications/wi05_page_rerank_massa_hayes.pdf. Retrieved on 29.06.06</li>
<li>Ntoulas, A., Cho, J., Cho, H.K., et al. 2004, â€˜<a target="_blank" href="http://oak.cs.ucla.edu/~cho/papers/ntoulas-evolution.pdf">A Study On The Evolution Of The Web</a>â€™. University of California. Available at http://oak.cs.ucla.edu/~cho/papers/ntoulas-evolution.pdf . Retrieved on 15.06.06</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.seoresearcher.com/trust-problem-in-link-based-popularity-ranking.htm/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
