<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: 4 unusual tricks to not give SE-robots a chance to waste all your traffic</title>
	<atom:link href="http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/feed/" rel="self" type="application/rss+xml" />
	<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/</link>
	<description>web, technologies, seo, web2.0, web3.0 (why not?), etc.</description>
	<lastBuildDate>Sun, 31 Jul 2011 20:16:07 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: L. Stetz</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-28460</link>
		<dc:creator>L. Stetz</dc:creator>
		<pubDate>Sun, 02 Nov 2008 14:12:46 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-28460</guid>
		<description>I&#039;ve been hit by a bot attack from Yahoo. Over 30 hours this week, and last week too. They&#039;ve been getting more aggressive for weeks.
My question: shouldn&#039;t I limit the Yahoo crawler (crawl.yahoo.net)? I mean, what&#039;s with this &quot;Slurp&quot;?
MSN is doing it too, with their inktomisearch.com search bot.
They all just send a different bot every time.

I&#039;ve got lots of images and want them in the searches. Of course, as you say, Google doesn&#039;t update, and a lot of the images they&#039;ve plucked from my pages don&#039;t match search queries.

I just wonder about the NAMES to use to limit the bots. Can you give any suggestions?</description>
		<content:encoded><![CDATA[<p>I&#8217;ve been hit by a bot attack from Yahoo. Over 30 hours this week, and last week too. They&#8217;ve been getting more aggressive for weeks.<br />
My question: shouldn&#8217;t I limit the Yahoo crawler (crawl.yahoo.net)? I mean, what&#8217;s with this &#8220;Slurp&#8221;?<br />
MSN is doing it too, with their inktomisearch.com search bot.<br />
They all just send a different bot every time.</p>
<p>I&#8217;ve got lots of images and want them in the searches. Of course, as you say, Google doesn&#8217;t update, and a lot of the images they&#8217;ve plucked from my pages don&#8217;t match search queries.</p>
<p>I just wonder about the NAMES to use to limit the bots. Can you give any suggestions?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John H. Gohde</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-25308</link>
		<dc:creator>John H. Gohde</dc:creator>
		<pubDate>Mon, 09 Jun 2008 03:12:13 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-25308</guid>
		<description>I tend to agree that excessive hits from search engines are a waste of bandwidth. If half of your hits are coming from search engines, then your site definitely has a problem.

I think that most sites really don&#039;t want to have their images indexed. It is an issue of copyright violation. But most have really made no effort to prevent it.

In the past, I have experienced excessive indexing from Yahoo. Google, for its part, will index the same web pages over and over again while totally ignoring new content.</description>
		<content:encoded><![CDATA[<p>I tend to agree that excessive hits from search engines are a waste of bandwidth. If half of your hits are coming from search engines, then your site definitely has a problem.</p>
<p>I think that most sites really don&#8217;t want to have their images indexed. It is an issue of copyright violation. But most have really made no effort to prevent it.</p>
<p>In the past, I have experienced excessive indexing from Yahoo. Google, for its part, will index the same web pages over and over again while totally ignoring new content.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: I made update of tricks with saving traffic and SE-robots</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-9887</link>
		<dc:creator>I made update of tricks with saving traffic and SE-robots</dc:creator>
		<pubDate>Fri, 20 Apr 2007 11:59:40 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-9887</guid>
		<description>[...] April 20, 2007 at 7:59 am &#183; Tags: update&#160;&#160; tutorial   For a long time I was going to update my tutorial &#8216;3 unusual tricks to not give SE-robots a chance to waste all your traffic&#8217;, and I am finally done with it. Now it&#8217;s &#8216;4 unusual tricks to not give SE-robots a chance to waste all your traffic&#8217;. [...]</description>
		<content:encoded><![CDATA[<p>[...] April 20, 2007 at 7:59 am &#183; Tags: update&nbsp;&nbsp; tutorial   For a long time I was going to update my tutorial &#8216;3 unusual tricks to not give SE-robots a chance to waste all your traffic&#8217;, and I am finally done with it. Now it&#8217;s &#8216;4 unusual tricks to not give SE-robots a chance to waste all your traffic&#8217;. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: samlowry</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-200</link>
		<dc:creator>samlowry</dc:creator>
		<pubDate>Tue, 05 Sep 2006 19:17:16 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-200</guid>
		<description>&gt;if they come back often, it&#039;s because they think your site is worth spidering deeper.
You are absolutely wrong. I mean a huge number of requests received in a short time. Crazy Yahoo or MSN robots can make several thousand hits per day on a small, well-indexed site. I experienced a real DDoS attack on one of my servers from Yahoo Slurp before I set Crawl-delay: 10 for all sites on that server.

&gt;get a real host if you&#039;re serious about the web
I have several ;) And I am very serious.</description>
		<content:encoded><![CDATA[<p>>if they come back often, it&#8217;s because they think your site is worth spidering deeper.<br />
You are absolutely wrong. I mean a huge number of requests received in a short time. Crazy Yahoo or MSN robots can make several thousand hits per day on a small, well-indexed site. I experienced a real DDoS attack on one of my servers from Yahoo Slurp before I set Crawl-delay: 10 for all sites on that server.</p>
<p>>get a real host if you&#8217;re serious about the web<br />
I have several <img src='http://web3.0log.org/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />  And I am very serious.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Maurice</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-197</link>
		<dc:creator>Maurice</dc:creator>
		<pubDate>Tue, 05 Sep 2006 16:04:50 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-197</guid>
		<description>Yeah

Right, I want to stop search engines spidering my site - if they come back often, it&#039;s because they think your site is worth spidering deeper.

Get a real host if you&#039;re serious about the web, or leave it to the pros.</description>
		<content:encoded><![CDATA[<p>Yeah</p>
<p>Right, I want to stop search engines spidering my site &#8211; if they come back often, it&#8217;s because they think your site is worth spidering deeper.</p>
<p>Get a real host if you&#8217;re serious about the web, or leave it to the pros.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Easy Webbers &#187; Blog Archive &#187; Speedlinking</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-158</link>
		<dc:creator>Easy Webbers &#187; Blog Archive &#187; Speedlinking</dc:creator>
		<pubDate>Mon, 04 Sep 2006 19:03:43 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-158</guid>
		<description>[...] - An ajax enhancement for wordpress blogs. - 3 unusual tricks to not give se-robots a chance to waste all your traffic. - Generate a screenshot of your website in about 5 seconds, for free. - 10 things businesses should know before building a website. - Pageviews are obsolete? - If you must have a shoutbox, this is the one to use. - And finally some usability guidelines. [...]</description>
		<content:encoded><![CDATA[<p>[...] &#8211; An ajax enhancement for wordpress blogs. &#8211; 3 unusual tricks to not give se-robots a chance to waste all your traffic. &#8211; Generate a screenshot of your website in about 5 seconds, for free. &#8211; 10 things businesses should know before building a website. &#8211; Pageviews are obsolete? &#8211; If you must have a shoutbox, this is the one to use. &#8211; And finally some usability guidelines. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jonathan</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-110</link>
		<dc:creator>Jonathan</dc:creator>
		<pubDate>Sun, 03 Sep 2006 12:16:35 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-110</guid>
		<description>Some good advice. However, I find people often visit my site through an image search, so I wouldn&#039;t want to disable that. I suggest people check their logs first to see where visitors come from. The image crawlers don&#039;t visit too often: Google image search is always quite out of date with lots of broken links (which isn&#039;t good from the user&#039;s point of view!)</description>
		<content:encoded><![CDATA[<p>Some good advice. However, I find people often visit my site through an image search, so I wouldn&#8217;t want to disable that. I suggest people check their logs first to see where visitors come from. The image crawlers don&#8217;t visit too often: Google image search is always quite out of date with lots of broken links (which isn&#8217;t good from the user&#8217;s point of view!)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: The Web Design Blog</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-99</link>
		<dc:creator>The Web Design Blog</dc:creator>
		<pubDate>Sun, 03 Sep 2006 01:28:53 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-99</guid>
		<description>[...] If bandwidth is an issue for you or your clients, this technique will stop search engine spiders from over-crawling your website and, more importantly, stop the spiders from crawling and indexing your images, because images tend to use more bandwidth and, let&#8217;s face it, no one really views your site when using image search. Big waste of bandwidth. [...]</description>
		<content:encoded><![CDATA[<p>[...] If bandwidth is an issue for you or your clients, this technique will stop search engine spiders from over-crawling your website and, more importantly, stop the spiders from crawling and indexing your images, because images tend to use more bandwidth and, let&#8217;s face it, no one really views your site when using image search. Big waste of bandwidth. [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

