Ravens PHP Scripts: Forums
Forum Index • Forum FAQ • Search • Memberlist • Usergroups • Profile • Log in to check your private messages • Log in |
Search found 10 matches |
Ravens PHP Scripts And Web Hosting Forum Index |
Author | Message |
---|---|
Topic: Majestic12.co.uk | |
majestic-12 Replies: 25 Views: 15172 ![]() |
![]() |
Its okay - I am not happy with this solution but in short term its probably best -- I _DID_ change url loading software to fix your urls, but some from previous load may have slipped in.
If you do ... |
|
Topic: Majestic12.co.uk | |
majestic-12 Replies: 25 Views: 15172 ![]() |
![]() |
Would it be a good idea to ban all your IP´s for the next 1-3 month or how long ?
No, it would not be a good idea because we use distributed model and number of IPs is very high with new ones cons ... |
|
Topic: Majestic12.co.uk | |
majestic-12 Replies: 25 Views: 15172 ![]() |
![]() |
Susann -- these are likely to be old urls - the change you make on site do not have immediate effect on old crawled data -- note that these urls you referenced exibit same error we discussed above - n ... | |
Topic: Majestic12.co.uk | |
majestic-12 Replies: 25 Views: 15172 ![]() |
![]() |
Yes, I understand, but how do I do this exactly ?
Well, you will need to edit the code that appends those SIDs, from what I can see you must be using some internal re-writing to have nice ".html"' ... |
|
Topic: Majestic12.co.uk | |
majestic-12 Replies: 25 Views: 15172 ![]() |
![]() |
just the SID disable it completely That isn t so easy.
You probably right about this -- but you definately need to fix your URLs by changing & to ?, because without it a URL parsing routine wi ... |
|
Topic: Majestic12.co.uk | |
majestic-12 Replies: 25 Views: 15172 ![]() |
![]() |
Your rewrite is fine, its just the SID bit that's the problem, if I were you I'd disable it completely because even though my bot understands it (provided URL is properly formatted), but others won't. | |
Topic: Majestic12.co.uk | |
majestic-12 Replies: 25 Views: 15172 ![]() |
![]() |
I am back!
As promised I tested my code to see if there was a bug. Now my code was NOT removing your session ID, but there is a good reason for it -- your URL is actually not correct because you us ... |
|
Topic: Majestic12.co.uk | |
majestic-12 Replies: 25 Views: 15172 ![]() |
![]() |
Thanks - best wishes to whatever you do in cyber life too! ![]() I did have friendly discussion with Yacy people but they did not agree with me, which is fine -- perhaps I am wrong and P2P is possible ... |
|
Topic: Majestic12.co.uk | |
majestic-12 Replies: 25 Views: 15172 ![]() |
![]() |
The bot does NOT ignore robots.txt and it support Crawl-Delay parameter to have bigger than normal (1 sec) delay between requests.
I do have SID filtering implemented, however I am going to recheck ... |
|
Topic: Majestic12.co.uk | |
majestic-12 Replies: 25 Views: 15172 ![]() |
![]() |
Hi there,
I am the creator of the bot -- found this forum just like you found my bot - from the log file ![]() I am suprised session ID was present in the URL because a few months ago I implemented ... |
|
Ravens PHP Scripts And Web Hosting Forum Index |
Powered by phpBB © 2001-2007 phpBB Group
All times are GMT - 6 Hours