PHP Web Host - Quality Web Hosting For All PHP Applications Just Great Software
  Login or Register
 • Home • Downloads • Your Account • Forums • 

View next topic
View previous topic


Google
 
Web RavenPHPScripts (This Site)
Post new topic   Reply to topic
Author Message
nb1
Regular
Regular


Joined: Mar 03, 2005
Posts: 92
Location: OZ

PostPosted: Sat Jul 14, 2007 1:50 am Reply with quote Back to top

I am posting in hopes that it may help someone else.

Yahoo has changed its crawling method and also added more crawlers and the way I read it for a short time there will be excessive amount of crawling of web sites for a period
of time
NetRange: 74.6.0.0 - 74.6.255.255
In effect may result in a higher cpu usage , and a higher number of hits also causing the pages to load slower ,

So as a Webmaster this is something you may want to be informd about

The links in this post explains what can be done to help slow The crawling down by using a delayed method

Only registered users can see links on this board!
Get registered or login to the forums!

Only registered users can see links on this board!
Get registered or login to the forums!

Only registered users can see links on this board!
Get registered or login to the forums!



thank you for your time
::NB::
View user's profile Send private message Visit poster's website AIM Address Yahoo Messenger MSN Messenger
kguske
Site Admin


Joined: Jun 04, 2004
Posts: 4873

PostPosted: Sat Jul 14, 2007 7:45 am Reply with quote Back to top

Thanks! In short, here's the relevant stuff from the links above:

There is a Yahoo! Slurp-specific extension to robots.txt which allows you to set a lower limit on our crawler request rate.

You can add a "Crawl-delay: xx" instruction, where "xx" is a delay value between successive crawler accesses. If the crawler rate is a problem for your server, you can set the delay up to 5 or 10 or a comfortable value for your server.

Setting a crawl-delay of 10 for Yahoo! Slurp would look something like:

User-agent: Slurp
Crawl-delay: 10
View user's profile Send private message
montego
Site Admin


Joined: Aug 29, 2004
Posts: 7481
Location: Arizona

PostPosted: Sat Jul 14, 2007 8:44 am Reply with quote Back to top

This also helps keep the behaving search engine bots from getting banned with the Flood Blocker if you use "User-agent: *".

At one point Google said that they do not support that directive, but they have a way from their webmaster tools to set up a crawl delay. (Sorry if this particular info is outdated - have had both the robots.txt and Google set this way for quite some time - the old adage: "if it ain't broke, don't fix it!". LOL).
View user's profile Send private message Visit poster's website
nb1
Regular
Regular


Joined: Mar 03, 2005
Posts: 92
Location: OZ

PostPosted: Sat Jul 14, 2007 11:23 am Reply with quote Back to top

humm outdated It may be the only thing I know is several other sites I visit that show you visitors ip addresses are usually flooded with that range and yes it also sets off the Flood Blocker quite often
But if this helps any one it's a good thing,
I was wondering do you have a direct link to your Google
site map in your robots text file ?
Sitemap:http://YOUR SITE/sitemap.xml

The reason I ask Beginning in April or may befor all major search engines are able to read this method
and I have had some trouble getting Google to validate my site map

Any follow up on this ?
View user's profile Send private message Visit poster's website AIM Address Yahoo Messenger MSN Messenger
montego
Site Admin


Joined: Aug 29, 2004
Posts: 7481
Location: Arizona

PostPosted: Sun Jul 15, 2007 9:51 am Reply with quote Back to top

nb1, I use nukeSEO to provide my XML sitemap. Wink
View user's profile Send private message Visit poster's website
nb1
Regular
Regular


Joined: Mar 03, 2005
Posts: 92
Location: OZ

PostPosted: Sun Jul 15, 2007 11:28 am Reply with quote Back to top

so do i
View user's profile Send private message Visit poster's website AIM Address Yahoo Messenger MSN Messenger
montego
Site Admin


Joined: Aug 29, 2004
Posts: 7481
Location: Arizona

PostPosted: Mon Jul 16, 2007 5:36 am Reply with quote Back to top

Then have you asked kguske about it over on nukeSEO.com? Mine has always validated...
View user's profile Send private message Visit poster's website
Display posts from previous:       
Post new topic   Reply to topic

View next topic
View previous topic
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
Forums ©
 

All logos and trademarks in this site are property of their respective owner.
The comments are property of their posters, all the rest © 2002-2008 by Raven
Proud to be listed at Lobo Links Web Directory

You can syndicate our news using the file xml

CSE HTML Validator Helped Clean up This Page! [Valid RSS] valid RSS 2.0 Valid robots.txt Stop Spam Harvesters, Join Project Honey Pot

Website engines core code is © copyright by PHP-Nuke but has been heavily patched and modified by myself and others.
PHP-Nuke is a free software released under the GNU/GPL.


:: fisubice phpbb2 style by Daz :: PHP-Nuke theme by www.nukemods.com ::

:: fisubice Theme Recoded To 100% W3C CSS & HTML 4.01 Transitional Compliance by Raven and 64bitguy ::

:: W3C CSS Compliance Validation :: W3C HTML 4.01 Transitional Compliance Validation ::

zerosum