Help with ZoomStop and Start Tags.

  • Help with ZoomStop and Start Tags.

    Do you see any issues with how the Stop and Start tags are set up on these two pages?

    http://www.ncsl.org/default.aspx?tabid=19506
    or
    http://www.ncsl.org/default.aspx?tabid=14515

    When we search our site for "International", this page shows up in the results. When I view the source, I only see "International" in the menu, which is where I have my start and stop tags. We have changed the search results to display in a grid built from the XML output; I can send you a link where you can post to the CGI directly and see the same results. Before submitting this to you, I made sure that the insertion into the grid was not what was causing it. Let me know if you see something.
    EDIT: I am currently using version 6.0.1019.

    Thanks
    Last edited by chriscsanyi; Mar-03-2010, 05:15 PM. Reason: Update Version using.

  • #2
    Your ZOOMSTOP and ZOOMRESTART tags appear to be fine.
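
One way to double-check a page by hand is to strip everything between the tags from the saved page source and see whether the word still occurs elsewhere. The following is only a rough sketch, assuming the tags are written as the HTML comments `<!--ZOOMSTOP-->` and `<!--ZOOMRESTART-->`; the function names are hypothetical and this is not how the Zoom indexer itself parses pages.

```python
import re

# Matches from a ZOOMSTOP comment up to the next ZOOMRESTART comment,
# or to the end of the document if the region is never reopened.
STOP_REGION = re.compile(
    r"<!--\s*ZOOMSTOP\s*-->.*?(?:<!--\s*ZOOMRESTART\s*-->|\Z)",
    re.IGNORECASE | re.DOTALL,
)


def strip_stop_regions(html: str) -> str:
    """Return the page source with every ZOOMSTOP..ZOOMRESTART region removed."""
    return STOP_REGION.sub(" ", html)


def word_outside_stop_regions(html: str, word: str) -> bool:
    """True if `word` still occurs anywhere outside the excluded regions.

    The remaining source is searched as-is, so a hit may also come from
    markup such as a meta tag or a link title, not only from visible text.
    """
    remaining = strip_stop_regions(html)
    pattern = r"\b" + re.escape(word) + r"\b"
    return re.search(pattern, remaining, re.IGNORECASE) is not None
```

If this returns True for a saved copy of the page, the word is present somewhere outside the menu, and the place to look is the remaining source rather than the tags.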

    If you have just added these tags in (or very soon after an indexing attempt), you may be indexing from a cached copy of the page from before you added the tags. Click "Configure"->"Spider options" and check "Reload all files (do not use cache)" to avoid this.

    I tried to index these two pages from here but I'm getting this issue where the website is redirecting my requests:

    Downloading file http://www.ncsl.org/default.aspx?tabid=19506
    URL redirected to: http://www.ncsl.org/Default.aspx?tabid=19506&error=Value+cannot+be+null.%0d%0aParameter+name%3a+String&content=0 [thread #1]
    It ends up indexing the error page instead. There's no occurrence of the word "International" there so I assume this isn't your problem. Nonetheless, it is documented here:
    http://www.wrensoft.com/forum/showthread.php?t=3537

    You may be using an older version of DNN which had a bug in that it mis-identified certain spiders. Note that you may have worked around this if you have changed your User-Agent string in Zoom.
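
To see whether the site treats a given User-Agent differently, the redirect can be checked outside of Zoom. This is a minimal sketch using only the Python standard library; the function names are hypothetical, and the `error` query parameter is simply what appears in the redirected URL quoted above, not a documented DNN contract.

```python
import urllib.request
from urllib.parse import parse_qs, urlsplit


def fetch_final_url(url: str, user_agent: str, timeout: float = 30.0) -> str:
    """Request `url` with the given User-Agent and return the URL we end up on.

    urlopen follows redirects, so geturl() reports the final location.
    """
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.geturl()


def looks_like_error_redirect(requested_url: str, final_url: str) -> bool:
    """True if we were sent somewhere else and the new URL carries an `error` parameter."""
    if requested_url == final_url:
        return False
    params = parse_qs(urlsplit(final_url).query)
    return "error" in params
```

Calling `fetch_final_url` once with a browser-like User-Agent string and once with the string configured in Zoom, then passing each result to `looks_like_error_redirect`, should show whether the spider is being redirected to the error page.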
    --Ray
    Wrensoft Web Software
    Sydney, Australia
    Zoom Search Engine


    • #3
      Ray, thanks for looking at this so quickly, as your team always does. I double-checked my logs from last night, and here is the file being indexed with no errors that I can see. I have had the skins in place with the stop and start tags for a long time. Also, we don't load from cache each night.

      Is there anything else to look for in the log file that might be a clue? I can send you a PM with a location where I have the search.cgi and a copy of the files to search against, so you can see that the pages are being returned.

      Code:
      14|03/04/10 03:05:30|DL Thread #3, got URL (http://www.ncsl.org/default.aspx?tabid=19506) off queue
      14|03/04/10 03:05:30|Index Thread got ready buffer for http://www.ncsl.org/default.aspx?tabid=19432 (Content-type: HTML text)
      04|03/04/10 03:05:30|Downloading file http://www.ncsl.org/default.aspx?tabid=19506
      00|03/04/10 03:05:30|Indexing http://www.ncsl.org/default.aspx?tabid=19432
      14|03/04/10 03:05:31|DL Thread #1, got URL (http://www.ncsl.org/default.aspx?tabid=19604) off queue
      14|03/04/10 03:05:31|Index Thread got ready buffer for http://www.ncsl.org/default.aspx?tabid=18886 (Content-type: HTML text)
      04|03/04/10 03:05:31|Downloading file http://www.ncsl.org/default.aspx?tabid=19604
      00|03/04/10 03:05:31|Indexing http://www.ncsl.org/default.aspx?tabid=18886
      14|03/04/10 03:05:31|DL Thread #9, got URL (http://www.ncsl.org/default.aspx?tabid=19605) off queue
      04|03/04/10 03:05:31|Downloading file http://www.ncsl.org/default.aspx?tabid=19605
      14|03/04/10 03:05:31|Index Thread got ready buffer for http://www.ncsl.org/default.aspx?tabid=18017 (Content-type: HTML text)
      00|03/04/10 03:05:31|Indexing http://www.ncsl.org/default.aspx?tabid=18017
      14|03/04/10 03:05:31|DL Thread #3, got URL (http://www.ncsl.org/default.aspx?tabid=19682) off queue
      04|03/04/10 03:05:31|Downloading file http://www.ncsl.org/default.aspx?tabid=19682
      14|03/04/10 03:05:31|Index Thread got ready buffer for http://www.ncsl.org/default.aspx?tabid=19506 (Content-type: HTML text)
      00|03/04/10 03:05:31|Indexing http://www.ncsl.org/default.aspx?tabid=19506
      14|03/04/10 03:05:31|DL Thread #1, got URL (http://www.ncsl.org/default.aspx?tabid=19703) off queue
      04|03/04/10 03:05:31|Downloading file http://www.ncsl.org/default.aspx?tabid=19703
      14|03/04/10 03:05:31|Index Thread got ready buffer for http://www.ncsl.org/default.aspx?tabid=19604 (Content-type: HTML text)
      00|03/04/10 03:05:31|Indexing http://www.ncsl.org/default.aspx?tabid=19604
      14|03/04/10 03:05:32|DL Thread #9, got URL (http://www.ncsl.org/default.aspx?tabid=19704) off queue
      04|03/04/10 03:05:32|Downloading file http://www.ncsl.org/default.aspx?tabid=19704
      14|03/04/10 03:05:32|Index Thread got ready buffer for http://www.ncsl.org/default.aspx?tabid=19605 (Content-type: HTML text)
      00|03/04/10 03:05:32|Indexing http://www.ncsl.org/default.aspx?tabid=19605
      Code:
      12|03/04/10 03:05:41|Start index time: 02:30:02 (2010/03/04)
      12|03/04/10 03:05:41|Elapsed index time: 00:35:39
      12|03/04/10 03:05:41|Peak physical memory used: 275 MB
      12|03/04/10 03:05:41|Peak virtual memory used: 528 MB
      12|03/04/10 03:05:41|Errors: 0
      Last edited by chriscsanyi; Mar-04-2010, 03:58 PM. Reason: UPdate info on errors
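
With several download and index threads interleaved, it is easier to follow one page if the log is filtered by URL. The sketch below assumes only the `code|timestamp|message` layout visible in the excerpt above; the names are hypothetical and the numeric codes are carried along without being interpreted.

```python
import re
from typing import Iterable, Iterator, NamedTuple


class LogEntry(NamedTuple):
    code: str       # leading numeric field, meaning not interpreted here
    timestamp: str  # e.g. "03/04/10 03:05:30"
    message: str


def parse_log(lines: Iterable[str]) -> Iterator[LogEntry]:
    """Split each 'code|timestamp|message' line; skip lines that do not fit."""
    for line in lines:
        parts = line.strip().split("|", 2)
        if len(parts) == 3:
            yield LogEntry(*parts)


def entries_for_url(lines: Iterable[str], url: str) -> Iterator[LogEntry]:
    """Yield entries whose message mentions `url`, compared case-insensitively.

    The lookahead keeps tabid=19506 from also matching tabid=195061.
    """
    pattern = re.compile(re.escape(url) + r"(?!\w)", re.IGNORECASE)
    for entry in parse_log(lines):
        if pattern.search(entry.message):
            yield entry
```

For example, `entries_for_url(open("indexlog.txt"), "http://www.ncsl.org/default.aspx?tabid=19506")` would list the queue, download and index lines for that page in order (the file name here is a placeholder). Because the match ignores case, a redirect line pointing at `Default.aspx?tabid=19506&error=...` would also be included if one were logged.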


      • #4
        We're still getting the "Value cannot be null" error when we try to index that page from here. So we can't really tell much from that.

        Did you look at the other thread I linked to above? I still believe this problem exists on your website. However, it may not be happening for you because the behaviour changes depending on the IP address. Are you indexing on the web server itself or within its LAN?

        If you can address that and send us your .zcfg file (containing your saved indexer configuration), we can take a closer look.

        Other things to check:
        - Make sure you have NOT modified any of the search files (including settings.zdat)
        - Make sure you are using a consistent set of search files from the same indexing session and you are not mixing files from different sessions.
        - Make sure the set of files you are hosting are actually the index files generated from the latest index (people sometimes upload the files to the wrong folder and not realize they have two sets of files on their web server and are looking at the wrong one).
        --Ray
        Wrensoft Web Software
        Sydney, Australia
        Zoom Search Engine


        • #5
          We are indexing from one location within our site that contains all the links; you will see that in our setup file. It works like a site map, so that we can control what gets picked up by the indexer. I just tried from my machine with the free version, pointing directly at the page, and it indexed the page without any issues. We are running the x64 version on the server, but pointing it at the page with all the listings. I stated before that we were using 6.19, but I guess it was 6.17.

          It is strange that you can't index it. I looked at the other post, but since I was able to index the page and we don't have any errors in the index log, I did not think it was the issue. I will send you an email shortly with our last two nights of index log files and the setup file. I will also send you a link to a test page with today's version of the search files. It returns the results in XML format, which displays well in IE, so you can see the results that we are putting into our grid format.

          Thanks and have a great weekend.
