<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: 4 unusual tricks to not give SE-robots a chance to waste all your traffic</title>
	<atom:link href="http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/feed/" rel="self" type="application/rss+xml" />
	<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/</link>
	<description>web, technologies, seo, web2.0, web3.0 (why not?), etc.</description>
	<pubDate>Fri, 12 Mar 2010 05:02:17 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: L. Stetz</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-28460</link>
		<dc:creator>L. Stetz</dc:creator>
		<pubDate>Sun, 02 Nov 2008 14:12:46 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-28460</guid>
		<description>I've been hit by a bot attack from Yahoo. Over 30 hours this week and last week too. they've been getting more aggressive for weeks.
My question: Don't I limit Yahoo crawler?(crawl.yahoo.net) I mean, what's with this "Slurp"?
 Also MSN is doing it too with their inktomisearch.com search bot. 
They all just send a different bot every time.

I've got lots of images and want them on the searches. of course, as you say, Google doesn't update and  alot of the images they 've plucked from my pages don't match search queries.

I just wonder about the NAMES to limit the bots. Can you give any suggestions?</description>
		<content:encoded><![CDATA[<p>I&#8217;ve been hit by a bot attack from Yahoo. Over 30 hours this week and last week too. they&#8217;ve been getting more aggressive for weeks.<br />
My question: Don&#8217;t I limit Yahoo crawler?(crawl.yahoo.net) I mean, what&#8217;s with this &#8220;Slurp&#8221;?<br />
 Also MSN is doing it too with their inktomisearch.com search bot.<br />
They all just send a different bot every time.</p>
<p>I&#8217;ve got lots of images and want them on the searches. of course, as you say, Google doesn&#8217;t update and  alot of the images they &#8216;ve plucked from my pages don&#8217;t match search queries.</p>
<p>I just wonder about the NAMES to limit the bots. Can you give any suggestions?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John H. Gohde</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-25308</link>
		<dc:creator>John H. Gohde</dc:creator>
		<pubDate>Mon, 09 Jun 2008 03:12:13 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-25308</guid>
		<description>I tend to agree that excessive hits from search engines is a waste of bandwidth. If half of your hits are coming from search engines then your site definitely has a problem.

I think think that most sites really don't want to have their images indexed.  It is an issue of copyright violation.  But, really have made no effort to prevent it.

In the past, I have experienced excessive indexing from Yahoo.  With Google, it will index the same web pages over and over again while totally ignoring new content.</description>
		<content:encoded><![CDATA[<p>I tend to agree that excessive hits from search engines is a waste of bandwidth. If half of your hits are coming from search engines then your site definitely has a problem.</p>
<p>I think think that most sites really don&#8217;t want to have their images indexed.  It is an issue of copyright violation.  But, really have made no effort to prevent it.</p>
<p>In the past, I have experienced excessive indexing from Yahoo.  With Google, it will index the same web pages over and over again while totally ignoring new content.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: I made update of tricks with saving traffic and SE-robots</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-9887</link>
		<dc:creator>I made update of tricks with saving traffic and SE-robots</dc:creator>
		<pubDate>Fri, 20 Apr 2007 11:59:40 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-9887</guid>
		<description>[...] April 20, 2007 at 7:59 am &#183; Tags: update&#160;&#160; tutorial   For long time I was going to update my tutorial &#8216;3 unusual tricks to not give SE-robots a chance to waste all your traffic&#8217;, finally done with this. Now it&#8217;s &#8216;4 unusual tricks to not give SE-robots a chance to waste all your traffic&#8216;. [...]</description>
		<content:encoded><![CDATA[<p>[...] April 20, 2007 at 7:59 am &#183; Tags: update&nbsp;&nbsp; tutorial   For long time I was going to update my tutorial &#8216;3 unusual tricks to not give SE-robots a chance to waste all your traffic&#8217;, finally done with this. Now it&#8217;s &#8216;4 unusual tricks to not give SE-robots a chance to waste all your traffic&#8216;. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: samlowry</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-200</link>
		<dc:creator>samlowry</dc:creator>
		<pubDate>Tue, 05 Sep 2006 19:17:16 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-200</guid>
		<description>&gt;if they come back often is beacuse they think your site is worth spidering.deeper.
You are absolutely wrong. I mean huge number of requests recieved in short time. Yahoo or MSN crazy robots can make several K of hits perd day to a small well indexed site. I experienced the real DDOS attack to one of my server from Yahoo Slurp before I made Crawl-delay:10 for all sites on that server.

&gt;get a real host if your serious about the web
I have several ;) And I am very serious.</description>
		<content:encoded><![CDATA[<p>>if they come back often is beacuse they think your site is worth spidering.deeper.<br />
You are absolutely wrong. I mean huge number of requests recieved in short time. Yahoo or MSN crazy robots can make several K of hits perd day to a small well indexed site. I experienced the real DDOS attack to one of my server from Yahoo Slurp before I made Crawl-delay:10 for all sites on that server.</p>
<p>>get a real host if your serious about the web<br />
I have several <img src='http://web3.0log.org/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> And I am very serious.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Maurice</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-197</link>
		<dc:creator>Maurice</dc:creator>
		<pubDate>Tue, 05 Sep 2006 16:04:50 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-197</guid>
		<description>Yeh 

Right I want to stop search engines spiding my site - if they come back often is beacuse they think your site is worth spidering.deeper.

get a real host if your serious about the web or leave it to the pros</description>
		<content:encoded><![CDATA[<p>Yeh </p>
<p>Right I want to stop search engines spiding my site - if they come back often is beacuse they think your site is worth spidering.deeper.</p>
<p>get a real host if your serious about the web or leave it to the pros</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Easy Webbers &#187; Blog Archive &#187; Speedlinking</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-158</link>
		<dc:creator>Easy Webbers &#187; Blog Archive &#187; Speedlinking</dc:creator>
		<pubDate>Mon, 04 Sep 2006 19:03:43 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-158</guid>
		<description>[...] - An ajax enhancement for wordpress blogs. - 3 unusual tricks to not give se-robots a chance to waste all your traffic. - Generate a screenshot of your website in about 5 seconds, for free. - 10 things businesses should know before building a website. - Pageviews are obsolete? - If you must have a shoutbox, this is the one to use. - And finally some usability guidelines. [...]</description>
		<content:encoded><![CDATA[<p>[...] - An ajax enhancement for wordpress blogs. - 3 unusual tricks to not give se-robots a chance to waste all your traffic. - Generate a screenshot of your website in about 5 seconds, for free. - 10 things businesses should know before building a website. - Pageviews are obsolete? - If you must have a shoutbox, this is the one to use. - And finally some usability guidelines. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jonathan</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-110</link>
		<dc:creator>Jonathan</dc:creator>
		<pubDate>Sun, 03 Sep 2006 12:16:35 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-110</guid>
		<description>Some good advice. However, I find people often visit my site through an image search, so I wouldn't want to disable that. I suggest people check their logs first to see where visitors come from. The image crawlers don't visit too often: Google image search is always quite out of date with lots of broken links (which isn't good from the user's point of view!)</description>
		<content:encoded><![CDATA[<p>Some good advice. However, I find people often visit my site through an image search, so I wouldn&#8217;t want to disable that. I suggest people check their logs first to see where visitors come from. The image crawlers don&#8217;t visit too often: Google image search is always quite out of date with lots of broken links (which isn&#8217;t good from the user&#8217;s point of view!)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: The Web Design Blog</title>
		<link>http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/comment-page-1/#comment-99</link>
		<dc:creator>The Web Design Blog</dc:creator>
		<pubDate>Sun, 03 Sep 2006 01:28:53 +0000</pubDate>
		<guid isPermaLink="false">http://web3.0log.org/2006/09/02/3-unusual-tricks-to-not-give-se-robots-a-chance-to-waste-all-your-traffic/#comment-99</guid>
		<description>[...] If bandwidth is an issue for you or your clients, this technique will stop search engine spiders from over-crawling your website, and more importantly, stop the spiders from crawling and indexing your images, because images tend to use more bandwidth, and lets face it, no-one really views your site when using image search. Big waste of bandwidth.       &#171; Turning visitors into users &#160; [...]</description>
		<content:encoded><![CDATA[<p>[...] If bandwidth is an issue for you or your clients, this technique will stop search engine spiders from over-crawling your website, and more importantly, stop the spiders from crawling and indexing your images, because images tend to use more bandwidth, and lets face it, no-one really views your site when using image search. Big waste of bandwidth.       &laquo; Turning visitors into users &nbsp; [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>
