[pmwiki-users] Being pounded

Patrick R. Michaud pmichaud at pobox.com
Tue Dec 19 13:47:51 CST 2006


On Tue, Dec 19, 2006 at 11:32:37AM -0800, pmwiki at 911networks.com wrote:
> My site is http://www.sqlhacks.com
> 
> I use the fixflow skin [thanks Hans, you are listed in the credits]
> with the (:noaction:). So you can't see the Edit, Print, Recent
> Changes, Source...
> 
> When I create the sitemap through http://www.xml-sitemaps.com it
> finds properly 30 pages, but I check my logs it get:
> 
> sqlhacks.com 72.52.140.189 - - [15/Dec/2006:09:36:27 -0800]
> "GET /index.php/Retrieve/Retrieve HTTP/1.1" 404 9187 "-" "Mozil
> la/6.0 (MSIE 6.0; Windows NT 5.1;)" sqlhacks.com 72.52.140.189 - -
> [15/Dec/2006:09:36:28 -0800]
> "GET /index.php/Retrieve/Retrieve?action=source HTTP/1.1" 200 9 538
> "-" "Mozilla/6.0 (MSIE 6.0; Windows NT 5.1;)" sqlhacks.com
> ...

Somewhere on the site there is probably a (broken) link to a page 
called Retrieve.Retrieve.  Since that page doesn't have (:noaction:) 
in it, the links appear and the xml-sitemaps.com spider follows them.

(And, as I've previously commented, I think the xml-sitemaps.com
spider is fairly broken and can't be relied upon to give good sitemaps
for any modern or database-backed site.  Basically, any spider that
ignores 'nofollow,noindex' in meta robots tags (as xml-sitemaps.com's
spider apparently does) just isn't going to do what you want here.)

Pm




More information about the pmwiki-users mailing list