Absolute links when doing a local crawl


  • Absolute links when doing a local crawl

    I recently crawled and indexed my website on my local machine. The site uses absolute links throughout rather than relative links, so I was surprised that Zoom was able to follow them on the local machine and index all of the files locally. When the Zoom files were uploaded to the server, the search engine worked accurately. I was pleasantly surprised, but how is that possible?

  • #2
    If you were using offline mode (rather than spider mode), links are not followed at all. It might just be a happy coincidence that it worked.



    • #3
      Yes, I was in offline mode

      Yes, I was in offline mode. You might check into it for yourself. It definitely worked.



      • #4
        As David said, offline mode does not rely on links to find the pages of your site. It will simply index all files within a given folder (and its subfolders) which satisfy the scan and skip options in the Configuration window. This means that, yes, it will find all the files for your website (assuming all the files are within the start folder specified), regardless of the links.
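
To illustrate the idea of indexing by folder rather than by link, here is a minimal Python sketch. It is not Zoom's actual implementation: the function name `find_files_offline` and the parameters `scan_extensions` and `skip_words` are hypothetical stand-ins for the scan and skip options. It only shows that a folder walk never needs to read a link.

```python
import os

def find_files_offline(start_folder, scan_extensions, skip_words=()):
    """Hypothetical sketch: list every file under start_folder whose extension
    is in scan_extensions and whose relative path contains none of skip_words.
    File contents (and therefore links) are never read."""
    matches = []
    for folder, _subfolders, filenames in os.walk(start_folder):
        for name in filenames:
            path = os.path.join(folder, name)
            # Normalise to forward slashes so skip words behave the same on any OS.
            relative = os.path.relpath(path, start_folder).replace(os.sep, "/")
            extension = os.path.splitext(name)[1].lower()
            if extension not in scan_extensions:
                continue  # not one of the file types to scan
            if any(word in relative for word in skip_words):
                continue  # path matches a skip word
            matches.append(relative)
    return sorted(matches)
```

Calling `find_files_offline("C:/mysite", {".html", ".htm"}, skip_words=("private/",))` on a hypothetical site folder would return every matching page, whether or not any other page links to it, and whether those links are absolute or relative.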

        This is one of the benefits of offline mode over spider mode: the files do not need to be well-linked for the indexer to find them. Offline mode is also much faster and uses no internet traffic. Its main disadvantage is that it cannot index dynamically generated pages (such as PHP or ASP pages), which must be executed by the server before a meaningful page is rendered. For sites with such pages, you would need to use spider mode.
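
The "well-linked" point can be shown with a toy model. The Python sketch below is an assumption-laden illustration, not how Zoom's spider mode is implemented: the site, its file names, and the function `pages_found_by_following_links` are all hypothetical. It compares what a link-following crawl reaches with what is actually on disk.

```python
from collections import deque

def pages_found_by_following_links(start_page, links):
    """Breadth-first walk over a {page: [linked pages]} map,
    roughly what any link-following crawler has to do."""
    seen = {start_page}
    queue = deque([start_page])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return sorted(seen)

# Hypothetical site: orphan.html exists on disk but no page links to it.
files_on_disk = ["about.html", "index.html", "orphan.html"]
links = {"index.html": ["about.html"], "about.html": ["index.html"]}

linked = pages_found_by_following_links("index.html", links)
missed = sorted(set(files_on_disk) - set(linked))
```

Here `linked` holds only the two pages reachable from `index.html`, and `missed` holds `orphan.html`, which a folder-based scan would still pick up.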
        --Ray
        Wrensoft Web Software
        Sydney, Australia
        Zoom Search Engine



        • #5
          Ray, thank you for your explanation. I found that satisfying.
