Making the url check case insensitive


  • Making the url check case insensitive

    Hi,

    Is it possible to tell the indexer that URLs which differ only in case are the same page and should not be indexed more than once? This came up when I indexed a link that had query string data in it. The link appeared on two different pages with different capitalisation, and the spider indexed it twice because it treated the two URLs as different pages when they were in fact the same. The CRC check failed to catch the duplicate because the query string data was displayed on the page itself.

    The same goes for the skip options. Is it possible to make this check case insensitive as well, so that links containing e.g. &Date= and &date= are both skipped without needing two separate entries?

  • #2
    URLs are case sensitive on most servers, because Linux and Unix file systems are case sensitive. Zoom must therefore treat URLs as case sensitive as well in order to index these sites correctly. There is no option to change this behaviour.

    But as you say, you can add entries to the skip list to avoid one or both case variants being indexed.
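
    If there are many skip entries, the case variants could be generated with a small script rather than typed by hand. The following is only a rough sketch and is not part of Zoom: the function name case_variants is hypothetical, and it assumes that covering the lowercase, uppercase and first-letter-capitalised forms is enough for your site. The output would still need to be pasted into the skip list manually.

```python
def case_variants(entry):
    """Return common capitalisation variants of a skip-list entry.

    Hypothetical helper, not a Zoom feature. Covers the entry as given,
    all lowercase, all uppercase, and first letter capitalised.
    """
    lowered = entry.lower()
    variants = [entry, lowered, entry.upper()]

    # Capitalise the first alphabetic character, skipping leading
    # punctuation such as "&" (e.g. "&date=" becomes "&Date=").
    for i, ch in enumerate(lowered):
        if ch.isalpha():
            variants.append(lowered[:i] + ch.upper() + lowered[i + 1:])
            break

    # Remove duplicates while keeping the original order.
    unique = []
    for v in variants:
        if v not in unique:
            unique.append(v)
    return unique


if __name__ == "__main__":
    for skip_entry in ["&date=", "&SessionId="]:
        for variant in case_variants(skip_entry):
            print(variant)
```

    Mixed-case forms in the middle of a word (such as &sessionID=) are not generated by this sketch, so any that actually appear on the site would still have to be added by hand.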
