<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Advertising Digital Media &#187; Search engines</title>
	<atom:link href="http://www.addigitalmedia.com/category/search-engines/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.addigitalmedia.com</link>
	<description>Internet marketing and online advertising campaigns with experienced advertising agency for Internet promotion.</description>
	<lastBuildDate>Thu, 09 Feb 2012 20:27:31 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Spam targeting search engines (Spamdexing)</title>
		<link>http://www.addigitalmedia.com/2009/04/spam-targeting-search-engines-spamdexing/</link>
		<comments>http://www.addigitalmedia.com/2009/04/spam-targeting-search-engines-spamdexing/#comments</comments>
		<pubDate>Tue, 07 Apr 2009 19:20:35 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Spam]]></category>
		<category><![CDATA[black-hat]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[comment forms]]></category>
		<category><![CDATA[guestbook]]></category>
		<category><![CDATA[guestbooks]]></category>
		<category><![CDATA[indexing]]></category>
		<category><![CDATA[PageRank]]></category>
		<category><![CDATA[referrer]]></category>
		<category><![CDATA[Search engine optimization]]></category>
		<category><![CDATA[Spamdexing]]></category>
		<category><![CDATA[spamming]]></category>
		<category><![CDATA[weblogs]]></category>
		<category><![CDATA[white hat]]></category>
		<category><![CDATA[wiki]]></category>

		<guid isPermaLink="false">http://www.addigitalmedia.com/?p=254</guid>
		<description><![CDATA[Spamdexing (a portmanteau of spamming and indexing) refers to the practice on the World Wide Web of deliberately modifying HTML pages to increase the chance of them being placed high on search engine relevancy lists. People who do this are called search engine spammers. In layman&#8217;s terms, spamdexing is using unethical means known as &#8220;black [...]]]></description>
			<content:encoded><![CDATA[<fb:like href='http://www.addigitalmedia.com/2009/04/spam-targeting-search-engines-spamdexing/' send='true' layout='standard' show_faces='true' width='450' height='65' action='like' colorscheme='light' font='lucida+grande'></fb:like><p><strong>Spamdexing</strong> (a portmanteau of <em>spamming</em> and <em>indexing</em>)  refers to the practice on the World Wide Web of deliberately modifying HTML  pages to increase the chance of them being placed high on search engine  relevancy lists. People who do this are called search engine spammers. In  layman&#8217;s terms, spamdexing is using unethical means known as &#8220;black hat seo  techniques&#8221; to unfairly increase the rank of sites in search engines. When a  website is optimized to be indexable by a search engine, without trying to  deceive its web crawler, this is called search engine optimization. To be sure,  there is much gray area between <em>white-hat</em> search engine optimization and <em>black-hat</em> spamdexing.</p>
<h4>Blog, wiki, guestbook, and referrer spam</h4>
<p>Google&#8217;s PageRank system uses the number of links to a page as an index of  its &#8220;importance&#8221;. Ordinarily, very few pages will link to a spammer&#8217;s commercial  site, because it is of no interest to anyone else, and hence it will have a very  low PageRank score. To counter this effect, spammers attempt to create links to  their sites on other people&#8217;s pages.</p>
<p>The most common targets for this kind of spam are weblogs, the spamming then  being known as blog spam, or &#8220;blam&#8221; for short. In 2003, this type of spam took  advantage of the open nature of comments in the blogging software Movable Type  by repeatedly placing comments to various blog posts that provided nothing more  than a link to the spammer&#8217;s commercial web site. <a class="external autonumber" title="http://www.wired.com/wired/archive/12.03/google.html?pg=7" href="http://www.wired.com/wired/archive/12.03/google.html?pg=7"> [3]</a></p>
<p>Similar attacks are often performed against wikis and guestbooks, both of  which accept user contributions; something that consistantly impresses and  confounds critics of Wikipedia is its remarkable lack of spam, in spite of  having nearly one million articles and over two million pages.</p>
<p>On January 18, 2005, Google proposed a <code>rel="nofollow"</code> attribute  that could be placed on a link; doing so instructs most major search engines to  ignore the link, rendering it useless to spammers. Software is then rewritten to  add this attribute to any link embedded in a comment. As of April 2005, nofollow  has seen expanding usage, but is not yet universal. <a class="external autonumber" title="http://googleblog.blogspot.com/2005/01/preventing-comment-spam.html" href="http://googleblog.blogspot.com/2005/01/preventing-comment-spam.html"> [4]</a></p>
<p>As well as comment forms, editable pages and guestbooks, some sites publish a  list of the most common referrers to their site in order to show how readers  have found it. These lists have also been exploited by spammers with so-called  referer spam, in which the spammer makes repeated web site requests using a fake  referer URL pointing to a spam-advertised site. That URL will later appear as a  link on the site, boosting the PageRank of its target.</p>
<p>This guide is licensed under the <a href="http://www.gnu.org/copyleft/fdl.html">GNU Free Documentation License</a>.  It uses material from the <a href="http://www.wikipedia.org/">Wikipedia</a>.</p>
<p>Need an webmaster? Click <a href="mailto:nicolae@sfetcu.com">HERE</a></p>
<span class="fb_share"><fb:like href="http://www.addigitalmedia.com/2009/04/spam-targeting-search-engines-spamdexing/" layout="button_count"></fb:like></span><p><a class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fwww.addigitalmedia.com%2F2009%2F04%2Fspam-targeting-search-engines-spamdexing%2F&amp;title=Spam%20targeting%20search%20engines%20%28Spamdexing%29" id="wpa2a_2"><img src="http://www.addigitalmedia.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://www.addigitalmedia.com/2009/04/spam-targeting-search-engines-spamdexing/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Organic search engines</title>
		<link>http://www.addigitalmedia.com/2009/03/organic-search-engines/</link>
		<comments>http://www.addigitalmedia.com/2009/03/organic-search-engines/#comments</comments>
		<pubDate>Thu, 12 Mar 2009 15:41:42 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[History]]></category>
		<category><![CDATA[Search engine optimization]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[affiliations]]></category>
		<category><![CDATA[Age]]></category>
		<category><![CDATA[anchor text]]></category>
		<category><![CDATA[broken outgoing links]]></category>
		<category><![CDATA[Citations]]></category>
		<category><![CDATA[click through rates]]></category>
		<category><![CDATA[content]]></category>
		<category><![CDATA[documents]]></category>
		<category><![CDATA[External links]]></category>
		<category><![CDATA[f acquisition]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Toolbar]]></category>
		<category><![CDATA[hosting uptime]]></category>
		<category><![CDATA[HTML coding]]></category>
		<category><![CDATA[illegal content]]></category>
		<category><![CDATA[Incoming backlinks]]></category>
		<category><![CDATA[keywords]]></category>
		<category><![CDATA[link]]></category>
		<category><![CDATA[link manipulation]]></category>
		<category><![CDATA[of sub-domains]]></category>
		<category><![CDATA[on-site factors]]></category>
		<category><![CDATA[organic]]></category>
		<category><![CDATA[outward links]]></category>
		<category><![CDATA[PageRank]]></category>
		<category><![CDATA[rel=nofollow]]></category>
		<category><![CDATA[reputation]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[schemes]]></category>
		<category><![CDATA[scoring]]></category>
		<category><![CDATA[Semantic connections]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[SERP]]></category>
		<category><![CDATA[Stem-related]]></category>
		<category><![CDATA[terms]]></category>
		<category><![CDATA[time]]></category>
		<category><![CDATA[tools]]></category>
		<category><![CDATA[Uniqueness]]></category>
		<category><![CDATA[webmasters]]></category>

		<guid isPermaLink="false">http://www.addigitalmedia.com/?p=123</guid>
		<description><![CDATA[Google was started by two PhD students at Stanford University, Sergey Brin and Larry Page, and brought a new concept to evaluating web pages. This concept, called PageRank, has been from the start important to the Google algorithm [1]. PageRank relies heavily on incoming links and uses the logic that each link to a page [...]]]></description>
			<content:encoded><![CDATA[<fb:like href='http://www.addigitalmedia.com/2009/03/organic-search-engines/' send='true' layout='standard' show_faces='true' width='450' height='65' action='like' colorscheme='light' font='lucida+grande'></fb:like><p>Google was started by two PhD students at Stanford University, Sergey Brin  and Larry Page, and brought a new concept to evaluating web pages. This concept,  called PageRank, has been from the start important to the Google algorithm <a class="external autonumber" title="http://www-db.stanford.edu/~backrub/google.html" href="http://www-db.stanford.edu/~backrub/google.html"> [1]</a>. PageRank relies heavily on incoming links and uses the logic that each  link to a page is a vote for that page&#8217;s value. The more incoming links a page  had the more &#8220;worthy&#8221; it is. The value of each incoming link itself varies  directly based on the PageRank of the page it comes from and inversely on the  number of outgoing links on that page.</p>
<p>With help from PageRank, Google proved to be very good at serving relevant  results. Google became the most popular and successful search engine. Because  PageRank measured an off-site factor, Google felt it would be more difficult to  manipulate than on-page factors.</p>
<p>But manipulated it was. Webmasters had already developed link manipulation  tools and schemes to influence the Inktomi search engine. These methods proved  to be equally applicable to Google&#8217;s algorithm. Many sites focused on  exchanging, buying, and selling links on a massive scale. PageRank&#8217;s reliance on  the link as a vote of confidence in a page&#8217;s value was undermined as many  webmasters sought to garner links purely to influence Google into sending them  more traffic, irrespective of whether the link was useful to human site  visitors.</p>
<p>It was time for Google—and other search engines—to look at a wider range of  off-site factors. There were other reasons to develop more intelligent  algorithms. The Internet was reaching a vast population of non-technical users  who were often unable to use advanced querying techniques to reach the  information they were seeking and the sheer volume and complexity of the indexed  data was vastly different from that of the early days. Search engines had to  develop predictive, semantic, linguistic and heuristic algorithms.</p>
<p>A proxy for the PageRank metric is still displayed in the Google Toolbar, but  PageRank is only one of more than 100 factors that Google considers in ranking  pages.</p>
<p>Today, most search engines keep their methods and ranking algorithms secret.  A search engine may use hundreds of factors in ranking the listings on its SERPs;  the factors themselves and the weight each carries may change continually.</p>
<p>Much current SEO thinking on what works and what doesn&#8217;t is largely  speculation and informed guesses. Some SEOs have carried out controlled  experiments to gauge the effects of different approaches to search optimization.</p>
<p>The following, though, are some of the considerations search engines could be  building into their algorithms, and the list of Google patents <a class="external autonumber" title="http://appft1.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&amp;Sect2=HITOFF&amp;p=1&amp;u=/netahtml/PTO/search-bool.html&amp;r=1&amp;f=G&amp;l=50&amp;co1=AND&amp;d=PG01&amp;s1=20050071741&amp;OS=20050071741&amp;RS=20050071741" href="http://appft1.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&amp;Sect2=HITOFF&amp;p=1&amp;u=/netahtml/PTO/search-bool.html&amp;r=1&amp;f=G&amp;l=50&amp;co1=AND&amp;d=PG01&amp;s1=20050071741&amp;OS=20050071741&amp;RS=20050071741"> [2]</a> may give some indication as to what is in the pipeline:</p>
<ul>
<li>Age of site</li>
<li>Length of time domain has been registered</li>
<li>Age of content</li>
<li>Regularity with which new content is added</li>
<li>Age of link and reputation of linking site</li>
<li>Standard on-site factors</li>
<li>Negative scoring for on-site factors (for example, a dampening for sites  	with extensive keyword meta tags indicative of having being SEO-ed)</li>
<li>Uniqueness of content</li>
<li>Related terms used in content (the terms the search engine associates as  	being related to the main content of the page)</li>
<li>Google Pagerank (Only used in Google&#8217;s algorithm)</li>
<li>External links, the anchor text in those external links and in the  	sites/pages containing those links</li>
<li>Citations and research sources (indicating the content is of research  	quality)</li>
<li>Stem-related terms in the search engine&#8217;s database (finance/financing)</li>
<li>Incoming backlinks and anchor text of incoming backlinks</li>
<li>Negative scoring for some incoming backlinks (perhaps those coming from  	low value pages, reciprocated backlinks, etc.)</li>
<li>Rate of acquisition of backlinks: too many too fast could indicate  	&#8220;unnatural&#8221; link buying activity</li>
<li>Text surrounding outward links and incoming backlinks. A link following  	the words &#8220;Sponsored Links&#8221; could be ignored</li>
<li>Use of &#8220;rel=nofollow&#8221; to suggest that the search engine should ignore  	the link</li>
<li>Depth of document in site</li>
<li>Metrics collected from other sources, such as monitoring how frequently  	users hit the back button when SERPs send them to a particular page</li>
<li>Metrics collected from sources like the Google Toolbar, Google AdWords/Adsense  	programs, etc.</li>
<li>Metrics collected in data-sharing arrangements with third parties (like  	providers of statistical programs used to monitor site traffic)</li>
<li>Rate of removal of incoming links to the site</li>
<li>Use of sub-domains, use of keywords in sub-domains and volume of content  	on sub-domains… and negative scoring for such activity</li>
<li>Semantic connections of hosted documents</li>
<li>Rate of document addition or change</li>
<li>IP of hosting service and the number/quality of other sites hosted on  	that IP</li>
<li>Other affiliations of linking site with the linked site (do they share  	an IP? have a common postal address on the &#8220;contact us&#8221; page?)</li>
<li>Technical matters like use of 301 to redirect moved pages, showing a 404  	server header rather than a 200 server header for pages that don&#8217;t exist,  	proper use of robots.txt</li>
<li>Hosting uptime</li>
<li>Whether the site serves different content to different categories of  	users (cloaking)</li>
<li>Broken outgoing links not rectified promptly</li>
<li>Unsafe or illegal content</li>
<li>Quality of HTML coding, presence of coding errors</li>
<li>Actual click through rates observed by the search engines for listings  	displayed on their SERPs</li>
<li>Hand ranking by humans of the most frequently accessed SERPs</li>
</ul>
<p>This guide is licensed under the <a href="http://www.gnu.org/copyleft/fdl.html">GNU  Free Documentation License</a>. It uses material from the <a href="http://www.wikipedia.org/">Wikipedia</a>.</p>
<p>Need an webmaster? Click <a href="mailto:nicolae@sfetcu.com">HERE</a></p>
<span class="fb_share"><fb:like href="http://www.addigitalmedia.com/2009/03/organic-search-engines/" layout="button_count"></fb:like></span><p><a class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fwww.addigitalmedia.com%2F2009%2F03%2Forganic-search-engines%2F&amp;title=Organic%20search%20engines" id="wpa2a_4"><img src="http://www.addigitalmedia.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://www.addigitalmedia.com/2009/03/organic-search-engines/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Early search engines</title>
		<link>http://www.addigitalmedia.com/2009/03/early-search-engines/</link>
		<comments>http://www.addigitalmedia.com/2009/03/early-search-engines/#comments</comments>
		<pubDate>Mon, 02 Mar 2009 14:05:16 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[History]]></category>
		<category><![CDATA[Search engine optimization]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[algorithms]]></category>
		<category><![CDATA[early]]></category>
		<category><![CDATA[Meta tags]]></category>
		<category><![CDATA[programs]]></category>
		<category><![CDATA[ranking]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[SERP]]></category>
		<category><![CDATA[spiders]]></category>
		<category><![CDATA[webmasters]]></category>

		<guid isPermaLink="false">http://www.addigitalmedia.com/?p=60</guid>
		<description><![CDATA[SEO began in the mid-1990s, as the first search engines were cataloging the early Web. Initially, all a webmaster needed to do was submit a site to the various engines which would run spiders, programs to &#8220;crawl&#8221; the site, and store the collected data. The search engines then sorted the information by topic, and served [...]]]></description>
			<content:encoded><![CDATA[<fb:like href='http://www.addigitalmedia.com/2009/03/early-search-engines/' send='true' layout='standard' show_faces='true' width='450' height='65' action='like' colorscheme='light' font='lucida+grande'></fb:like><p>SEO began in the mid-1990s, as the first search engines were cataloging the  early Web. Initially, all a webmaster needed to do was submit a site to the  various engines which would run spiders, programs to &#8220;crawl&#8221; the site, and store  the collected data. The search engines then sorted the information by topic, and  served results based on pages they had spidered. As the number of documents  online kept growing, and more webmasters realized the value of organic search  listings, so popular search engines began to sort their listings so they could  display the most relevant pages first. This was the start of a search engine  versus webmaster game that continues to this day.</p>
<p>At first search engines were guided by the webmasters themselves. Early  versions of search algorithms relied on webmaster-provided information such as  category and keyword meta tags. Meta tags provided a guide to each page&#8217;s  content. When some webmasters began to abuse meta tags, causing their pages to  rank for irrelevant searches, search engines abandoned their consideration of  Meta tags and instead developed more complex ranking algorithms, taking into  account factors that were more diverse, including:</p>
<ul>
<li>Text within the title tag</li>
<li>Domain name</li>
<li>URL directories and file names</li>
<li>HTML tags: headings, bold and emphasized text</li>
<li>Keyword density</li>
<li>Keyword proximity</li>
<li>Alt attributes for images</li>
<li>Text within NOFRAMES tags</li>
</ul>
<p>By relying so extensively on factors that were still within the webmasters&#8217;  exclusive control, search engines continued to suffer from abuse and ranking  manipulation. In order to provide better results to their users, search engines  had to adapt to ensure their SERPs showed the most relevant search results,  rather than useless pages stuffed with keywords by unscrupulous webmasters. This  led to the rise of a new kind of search engine.</p>
<p>This guide is licensed under the <a href="http://www.gnu.org/copyleft/fdl.html">GNU Free Documentation License</a>.  It uses material from the <a href="http://www.wikipedia.org/">Wikipedia</a>.</p>
<span class="fb_share"><fb:like href="http://www.addigitalmedia.com/2009/03/early-search-engines/" layout="button_count"></fb:like></span><p><a class="a2a_dd a2a_target addtoany_share_save" href="http://www.addtoany.com/share_save#url=http%3A%2F%2Fwww.addigitalmedia.com%2F2009%2F03%2Fearly-search-engines%2F&amp;title=Early%20search%20engines" id="wpa2a_6"><img src="http://www.addigitalmedia.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://www.addigitalmedia.com/2009/03/early-search-engines/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced

Served from: www.addigitalmedia.com @ 2012-02-10 14:33:36 -->
