PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

Categories

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Categories

    I'm using v4.0 and am having a hard time setting up categories.

    For "Name" I have Auto and Computer
    For "Pattern" I have /ndrive_docs/auto and /ndrive_docs/computer, respectively

    The default name is "All"

    Problem is, when I search for a document that I know is within the "/ndrive_docs/auto" directory (path is http://www.mysite.com/ndrive_docs/auto/filename.pdf") it does not show up, however it will under if I select "All"

    Also, it appears that "All" is listed twice, one at the top and one at the bottom of the category listing.

    Thoughts?

  • #2
    There is always an "ALL" category, you don't need to define one. So the default category should not be called "ALL". A better name would be "Other documents" or "Unmatched documents".

    The default category picks up all files that didn't match any other category. This explains why you see ALL twice.

    As for the missing document, from what you have said your document,
    http://www.mysite.com/ndrive_docs/auto/filename.pdf
    didn't get put into the
    /ndrive_docs/auto
    category for some reason.

    Can you send me your Zoom configuration file (xxxxxx.zcfg) and the exact URL of the real document. It might just be a simple typo or something like that.

    ------
    David

    Comment


    • #3
      Ordering category names in drop-down list

      Is it possible to order category names in way that Default Category Name appears on the second place in drop-down list?

      Well, the first is All categories, that is pretty obvious. But why should Default Category Name be on the last place in list. I know that default category represents all pages not covered by any other category patterns. I would know how to do that if I choose to create my own search form, but I didn't because I will loose some very usefull ZOOM functions.

      Maybe you can add some control to let user organize even and Default Category Name in drop-down list.

      Any suggestion?
      Regards,
      Nenad

      Comment


      • #4
        The default (or catch all) category always appears at the bottom of the list. You can arrange the order of the other categories however. This is done using the up and down arrows in the category window.

        If you want more control than this in the current version, then you will need to create your own search form.

        Sections 2.3.2 and 5.7 of the users guide can help with defining your own form. But you can also just copy the code that is generated by the script and manually re-arrange the order of the default category if need be.

        -------
        David

        Comment


        • #5
          multiple patterns in categories?

          Hi,

          I’m experimenting with categories.

          My site uses a UBB forum for support. I’d like to create a search category called “Support Forums”.

          The problem is the UBB forum software creates two types of page:

          cgi-bin/forumdisplay.cgi?action= {some parameters} – looks similar to viewforum.php

          and

          ubb/Forumn/HTML/ {HTML pages} – looks similar to viewtopic.php

          Is there any way I can create single category that matches more than one pattern? For example, “.cgi OR ubb”.

          At the moment I can either match the forum topics or the forum posts, but not both, unless they go into the "rest of site", which is hardly ideal.

          If not, can you suggest another way I can achieve the same result?


          Kind regards


          David Chapman

          Comment


          • #6
            The current version of Zoom does not support matching categories by multiple patterns. We might look into it for a future version.

            An alternative would be to allow all URLs containing the word "forum" to be grouped in the "Support forums" category. Then determine the parameters and keywords to add to the "Page and folder skip list" (under Skip options in the Configuration window) which will prevent irrelevant forum pages (such as "user profile", or "sort postings") from being indexed. For example, in the case of this discussion board, I would skip "profile.php", "posting.php", "&mark=topics", "&view=", "&watch=", etc.
            --Ray
            Wrensoft Web Software
            Sydney, Australia
            Zoom Search Engine

            Comment


            • #7
              Re: Categories

              Originally posted by bpagan
              I'm using v4.0 and am having a hard time setting up categories.

              For "Name" I have Auto and Computer
              For "Pattern" I have /ndrive_docs/auto and /ndrive_docs/computer, respectively

              The default name is "All"

              Problem is, when I search for a document that I know is within the "/ndrive_docs/auto" directory (path is http://www.mysite.com/ndrive_docs/auto/filename.pdf") it does not show up, however it will under if I select "All"
              I'm having the same problem as this.

              Be interested to know if you (or Wrensoft) resolved it.

              I have 4 categories, plus a default category, which I call 'Miscellaneous'

              When I search within a category using a keyword that I know is in one of the pages in that category, no results appear.

              And yet if I switch it to the default category ('Miscellaneous') using the dropdown box, (or switch it to 'All'), then the listing shows up, but it's got the 'Miscellaneous' next to it instead of the actual category name.

              Any ideas?

              My four categories are four absolute URLs to different folders on the same server. Also, I had the same site working fine in Zoom 3, but it's since I've changed over to 4 that it's started doing this.

              TIA

              Comment


              • #8
                (...oh, and forgot to add - I'm using PHP and indexing files offline, if that helps)

                Comment


                • #9
                  a bit more info...

                  I just looked inside the file zoom_catpages.zdat and there's a whole column of 4's and nothing else.

                  My previous zoom_catpages.zdat file (the one that Zoom 3 correctly generated) had a combination of numbers from 0 to 4, each representing one of the four categories (5 if you include the catch-all category).

                  Playing around with swapping numbers and uploading it, and it seems that the 4 does indeed refer to the 'catch-all' category.

                  Any ideas what I could be doing wrong in the Zoom indexer that might be generating this? Thx.

                  Comment


                  • #10
                    None of the files indexed matched the patterns specified for your categories. As such, they were all filed under your default/catchall "Miscellaneous" category.

                    Can you tell us what your categories and patterns are? Also give us an example of the URL for some pages indexed. You could send us your ZCFG file if you want us to take a closer look.

                    Note that the category "pattern" is matched against the full URL of a web page or file indexed. For example, a pattern of "/cars" will catch the following:

                    Code:
                    http://cars.mysite.com/index.html
                    http://www.mysite.com/cars/somepages.html
                    http://www.mysite.com/carsandstuff/
                    http://mysite.com/cars.html
                    ... etc.
                    Also be careful not to have extra spaces in the pattern, or they will also not match unless the space characters are also found in the URL.
                    --Ray
                    Wrensoft Web Software
                    Sydney, Australia
                    Zoom Search Engine

                    Comment


                    • #11
                      Well thanks to those good people at Wrensoft, I have got the Categories working as they should! (btw, i'm the Guest author of the above posts)

                      For the purpose of anyone else who stumbles upon this thread with the same problem, here's what it was...

                      Since I was scanning in offline mode, I had the category patterns specified incorrectly.

                      Instead of...
                      http://www.xyz.com/apple
                      http://www.xyz.com/banana
                      http://www.xyz.com/cherry

                      ...they had to be entered along the lines of...
                      \apple\
                      \banana\
                      \cherry\

                      If you're indexing in offline mode, the pattern is not matched against the final URL, but rather the file path which the files were found for indexing.

                      Thanks again to Ray!

                      Comment

                      Working...
                      X