Google Site Search (commercial) not indexing files in sitemap

Posted by melat0nin on Pro Webmasters See other posts from Pro Webmasters or by melat0nin
Published on 2013-02-07T15:09:18Z Indexed on 2013/11/04 22:15 UTC
Read the original article Hit count: 209

I have a client for whom we have purchased Google Site Search. It works well for HTML pages served by the CMS, but files aren't being reliably indexed.

I wrote a script to generate an XML feed (sitemap) of all the files in the CMS which I've plugged in to Google Webmaster Tools for the site. It says that for that sitemap 923 URLs have been submitted, but only 26 have been indexed.

The client relies heavily on searching within files, which is why we decided to use Google search, so this is a bit of a problem.

Many of the files aren't linked to from any page on the site, as they are old and therefore don't merit having a page of their own. But they still need to be accessible through search for archiving purposes.

The file archive xml can be found at www.sniffer.org.uk/file-archive and the standard xml sitemap (of pages) can be found at www.sniffer.org.uk/sitemap.xml.

Any thought would be much appreciated!

© Pro Webmasters or respective owner

Related posts about google-index

Related posts about xml-sitemap