View Full Version : Google Sitemap indexing only about 10% of site?
Kevuk2k
04-12-2006, 01:45 PM
I have been looking at the Google sitemap in admin and wondered why Google were only crawling a certain amount of pages. I have a few thousand categories for certain in the one in question so was slightly concerned when I cut and pasted the entire sitemap to a text editor, took all the crap away leaving just the bare links. I was worried to say the leasr when it only shows 277 links when I know for a fact there are nearer 2000 guaranteed.
Can someone advise me on this anomoly and if it is the reason why my site doesn't get crawled correctly by Google.
Thanks in advance,
Kev
Kevuk2k
04-12-2006, 10:38 PM
*BUMP*
clubracer
04-13-2006, 01:16 PM
get incoming links (with good anchors) to your unlisted pages!
if some pages from your sitemap are listed thats a good sign, but you need a litle help from outside to get the lower rankend pages on the surface.
good SEM is always thinking outside the box :)
just my 2c
gearoid
04-15-2006, 12:57 AM
Hi Kevuk2k,
Would you consider using your xml/rss, to help?
For example only; submit some of your deep category feeds to the various feed directories. And preferably, the ones with content if that applies.
This should help if you never had a sitemap.
Consider http://www.rorweb.com/rormap.htm perhaps or as well, for the other s.e.'s. It might grab their attention for 1000 pages?
gearoid
04-15-2006, 01:07 AM
Sorry,
This one is supposed to be good for 5000 urls http://www.rorweb.com/rormap.htm#download . Have a quick scan of the urls yourself though before you save the file at the server.
Msn search reckon I have over three thousand urls, which I shouldn't have, lol. (it may have being this, at an early point, but is fine now)
Kevuk2k
04-15-2006, 01:42 AM
Hi there mate, this one is ok, I used it before, the only problem with it is that it reads without discrination, in other words it will read both .html and php in the rss feeds. This causes duplicate sites being indexed and my be a problem if Google caught on. There is a script out there that allows you to eliminate the problem of php and htm, html being read at the same time and even allows you to ignore PHPSESSID's if you want. The url to this is http://www.softswot.com/sitemapxml.php
Hope this helps.
I want the sitemap to work however it goes but would prefer the one on this script to work though.
Thanks for the help though.
Kev
vBulletin® v3.8.0, Copyright ©2000-2012, Jelsoft Enterprises Ltd.