Problem indexing https site

  • Problem indexing https site

    Hello,

    We are using Zoom Search Engine 3.1, and all was well until recently.

    We changed our site to https. The site itself works fine, but the search engine can no longer index it. After moving to https, we did the following to try to get search working:

    We changed the spidering URL to https://www.mysite.com/home.asp.
    We checked that .asp is in the extensions list.

    Please help us fix the problem.

    Thanks
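
The extension check described in the first post can be sketched in a few lines of Python. This is only an illustration of the idea, not how Zoom itself decides what to scan; the function name `has_indexed_extension` is hypothetical. It compares the extension of the URL path, ignoring any query string, against a list such as the one shown in the indexer log.

```python
import posixpath
from urllib.parse import urlparse


def has_indexed_extension(url, extensions):
    """Return True if the URL's path ends in one of the given extensions.

    The query string (everything after '?') is ignored, so
    'page.php3?productid=1' is treated as a '.php3' file.
    """
    path = urlparse(url).path
    ext = posixpath.splitext(path)[1].lower()
    return ext in {e.lower() for e in extensions}
```

For example, `has_indexed_extension("https://www.mysite.com/home.asp", [".htm", ".html", ".asp"])` is true, while a URL ending in `.gif` would not match that list.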

  • #2
    Indexing https pages should be no problem. I did a quick test on our secure order page and it worked as expected. See the trace below.

    Code:
    06:57:42 - Zoom Search Engine Indexer (Professional Edition)
    06:57:42 - Version 4.0 (Build: 1014) on Windows XP
    06:57:42 - Copyright Wrensoft 2000-2004 (http://www.wrensoft.com/)
    06:57:42 - Plugin for DOC files found. DOC file support enabled.
    06:57:42 - Plugin for PDF files found. PDF file support enabled.
    06:57:42 - Plugin for PPT files found. PPT file support enabled.
    06:57:42 - Plugin for XLS files found. XLS file support enabled.
    06:57:42 - Config file loaded: D:\Program Files\Zoom Search Engine 4.0\zoom.zcfg
    06:58:13 - Start indexing (spider mode) at Sat Mar 19 06:58:13 2005
    06:58:13 - Maximum number of words: 50000
    06:58:13 - Maximum number of pages: 1000
    06:58:13 - Will scan files with extensions
    06:58:13 -      .htm
    06:58:13 -      .html
    06:58:13 -      .php
    06:58:13 -      .asp
    06:58:13 -      .cfm
    06:58:13 -      .aspx
    06:58:13 -      .php3
    06:58:13 -      .php4
    06:58:13 -      .txt
    06:58:13 - Spider from: https://www.regsoft.net/regsoft/vieworderpage.php3?productid=60810
    06:58:13 - Web site URL: https://www.regsoft.net/regsoft/
    06:58:13 - Estimated RAM required during index process: 35654 KB
    06:58:13 - Initiating HTTP session (thread #1) ...
    06:58:13 - [DOWNLOAD] Downloading file https://www.regsoft.net/regsoft/vieworderpage.php3?productid=60810  (1024 bytes)
    06:58:15 - [SCANNED] Scanning https://www.regsoft.net/regsoft/vieworderpage.php3?productid=60810
    06:58:15 - Initiating HTTP session (thread #2) ...
    06:58:16 - [DOWNLOAD] Downloading file https://www.regsoft.net/regsoft/vieworderpage.php3?productid=60810&ordertypeid=3&pc=&rc=&template=001&ww_sid=&affiliateid=

    Can you post your trace so that we can see any error messages?

    Also, can you download V4 of Zoom to see whether the behaviour differs between V3.1 and V4?

    I also assume that you can reach your spider start page from Internet Explorer and that you have checked it is not a firewall problem?

    ------
    David
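
The reachability check suggested in #2 can also be done from a script. The following is a minimal sketch, assuming Python is available on the indexing machine; `check_start_page` is a hypothetical helper, and it exercises Python's own HTTP and TLS stack rather than the one the indexer uses, so a success here narrows the problem down but does not prove the indexer will succeed.

```python
import ssl
import urllib.error
import urllib.request


def check_start_page(url, fetch=urllib.request.urlopen, timeout=10):
    """Try one GET of the spider start page and return (ok, detail).

    `fetch` defaults to urllib.request.urlopen; it is a parameter only so
    the function can be exercised without network access.
    """
    try:
        with fetch(url, timeout=timeout) as response:
            return True, f"HTTP {response.status}"
    except urllib.error.HTTPError as err:
        # The server was reached but refused or failed the request.
        return False, f"HTTP {err.code}"
    except urllib.error.URLError as err:
        # urllib wraps certificate and handshake failures in URLError.
        kind = "TLS" if isinstance(err.reason, ssl.SSLError) else "connection"
        return False, f"{kind} error: {err.reason}"
    except OSError as err:
        # Timeouts and low-level socket failures, e.g. a blocking firewall.
        return False, f"connection error: {err}"
```

Calling `check_start_page("https://www.mysite.com/home.asp")` returns a pair such as `(True, "HTTP 200")` when the page is reachable; a connection error or timeout would be consistent with a firewall or proxy problem.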


    • #3
      Thanks, we resolved the problem. We were using version 3.1 build 1003. We just downloaded the update to build 1022, and that fixed it.

      Just to let you know, the software is awesome. Feature rich and reliable.

      A different question: I was looking at setting up categories. At the moment I can set up categories according to URL path. Is there a way to categorise by meta data?

      Thanks


      • #4
        In the current version of the software, V4.0, categories can only be defined by file location (matching part of the URL in spider mode, or the file name in offline mode).

        There is no option for creating categories from the page meta data.
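
To illustrate what "matching part of the URL" means in #4, here is a minimal sketch in Python. It is not Zoom's implementation: the `categorize` function, the rule format, and the sample category names are all hypothetical, and it assumes plain case-sensitive substring matching, with a page allowed to fall into more than one category.

```python
def categorize(url, rules, default="Uncategorized"):
    """Return the names of all categories whose pattern occurs in the URL.

    `rules` is a list of (category_name, url_substring) pairs. If no rule
    matches, a single-element list holding `default` is returned.
    """
    matches = [name for name, pattern in rules if pattern in url]
    return matches or [default]


# Hypothetical rules: each category is tied to a folder on the site.
EXAMPLE_RULES = [
    ("Products", "/products/"),
    ("Support", "/support/"),
]
```

With these rules, `categorize("https://www.mysite.com/products/widget.asp", EXAMPLE_RULES)` gives `["Products"]`, while the start page `https://www.mysite.com/home.asp` matches neither pattern and falls into the default category. Nothing in the page content, including meta tags, is consulted.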
