PDA

View Full Version : Categories


Anonymous
12-14-2004, 04:23 PM
I'm using v4.0 and am having a hard time setting up categories.

For "Name" I have Auto and Computer
For "Pattern" I have /ndrive_docs/auto and /ndrive_docs/computer, respectively

The default name is "All"

Problem is, when I search for a document that I know is within the "/ndrive_docs/auto" directory (path is http://www.mysite.com/ndrive_docs/auto/filename.pdf") it does not show up, however it will under if I select "All"

Also, it appears that "All" is listed twice, one at the top and one at the bottom of the category listing.

Thoughts?

wrensoft
12-14-2004, 10:03 PM
There is always an "ALL" category, you don't need to define one. So the default category should not be called "ALL". A better name would be "Other documents" or "Unmatched documents".

The default category picks up all files that didn't match any other category. This explains why you see ALL twice.

As for the missing document, from what you have said your document,
http://www.mysite.com/ndrive_docs/auto/filename.pdf
didn't get put into the
/ndrive_docs/auto
category for some reason.

Can you send me your Zoom configuration file (xxxxxx.zcfg) and the exact URL of the real document. It might just be a simple typo or something like that.

------
David

Nenad
12-22-2004, 08:15 AM
Is it possible to order category names in way that Default Category Name appears on the second place in drop-down list?

Well, the first is All categories, that is pretty obvious. But why should Default Category Name be on the last place in list. I know that default category represents all pages not covered by any other category patterns. I would know how to do that if I choose to create my own search form, but I didn't because I will loose some very usefull ZOOM functions.

Maybe you can add some control to let user organize even and Default Category Name in drop-down list.

Any suggestion?

wrensoft
12-22-2004, 09:15 AM
The default (or catch all) category always appears at the bottom of the list. You can arrange the order of the other categories however. This is done using the up and down arrows in the category window.

If you want more control than this in the current version, then you will need to create your own search form.

Sections 2.3.2 and 5.7 of the users guide can help with defining your own form. But you can also just copy the code that is generated by the script and manually re-arrange the order of the default category if need be.

-------
David

chapman
01-04-2005, 06:52 PM
Hi,

I’m experimenting with categories.

My site uses a UBB forum for support. I’d like to create a search category called “Support Forums”.

The problem is the UBB forum software creates two types of page:

cgi-bin/forumdisplay.cgi?action= {some parameters} – looks similar to viewforum.php

and

ubb/Forumn/HTML/ {HTML pages} – looks similar to viewtopic.php

Is there any way I can create single category that matches more than one pattern? For example, “.cgi OR ubb”.

At the moment I can either match the forum topics or the forum posts, but not both, unless they go into the "rest of site", which is hardly ideal.

If not, can you suggest another way I can achieve the same result?


Kind regards


David Chapman

Ray
01-04-2005, 11:56 PM
The current version of Zoom does not support matching categories by multiple patterns. We might look into it for a future version.

An alternative would be to allow all URLs containing the word "forum" to be grouped in the "Support forums" category. Then determine the parameters and keywords to add to the "Page and folder skip list" (under Skip options in the Configuration window) which will prevent irrelevant forum pages (such as "user profile", or "sort postings") from being indexed. For example, in the case of this discussion board, I would skip "profile.php", "posting.php", "&mark=topics", "&view=", "&watch=", etc.

Anonymous
02-07-2005, 08:24 PM
I'm using v4.0 and am having a hard time setting up categories.

For "Name" I have Auto and Computer
For "Pattern" I have /ndrive_docs/auto and /ndrive_docs/computer, respectively

The default name is "All"

Problem is, when I search for a document that I know is within the "/ndrive_docs/auto" directory (path is http://www.mysite.com/ndrive_docs/auto/filename.pdf") it does not show up, however it will under if I select "All"
I'm having the same problem as this.

Be interested to know if you (or Wrensoft) resolved it.

I have 4 categories, plus a default category, which I call 'Miscellaneous'

When I search within a category using a keyword that I know is in one of the pages in that category, no results appear.

And yet if I switch it to the default category ('Miscellaneous') using the dropdown box, (or switch it to 'All'), then the listing shows up, but it's got the 'Miscellaneous' next to it instead of the actual category name.

Any ideas?

My four categories are four absolute URLs to different folders on the same server. Also, I had the same site working fine in Zoom 3, but it's since I've changed over to 4 that it's started doing this.

TIA

Anonymous
02-07-2005, 09:14 PM
(...oh, and forgot to add - I'm using PHP and indexing files offline, if that helps)

Anonymous
02-07-2005, 10:02 PM
a bit more info...

I just looked inside the file zoom_catpages.zdat and there's a whole column of 4's and nothing else.

My previous zoom_catpages.zdat file (the one that Zoom 3 correctly generated) had a combination of numbers from 0 to 4, each representing one of the four categories (5 if you include the catch-all category).

Playing around with swapping numbers and uploading it, and it seems that the 4 does indeed refer to the 'catch-all' category.

Any ideas what I could be doing wrong in the Zoom indexer that might be generating this? Thx.

Ray
02-07-2005, 11:52 PM
None of the files indexed matched the patterns specified for your categories. As such, they were all filed under your default/catchall "Miscellaneous" category.

Can you tell us what your categories and patterns are? Also give us an example of the URL for some pages indexed. You could send us your ZCFG file if you want us to take a closer look.

Note that the category "pattern" is matched against the full URL of a web page or file indexed. For example, a pattern of "/cars" will catch the following:

http://cars.mysite.com/index.html
http://www.mysite.com/cars/somepages.html
http://www.mysite.com/carsandstuff/
http://mysite.com/cars.html
... etc.
Also be careful not to have extra spaces in the pattern, or they will also not match unless the space characters are also found in the URL.

5tevooo
02-09-2005, 08:10 AM
Well thanks to those good people at Wrensoft, I have got the Categories working as they should! (btw, i'm the Guest author of the above posts)

For the purpose of anyone else who stumbles upon this thread with the same problem, here's what it was...

Since I was scanning in offline mode, I had the category patterns specified incorrectly.

Instead of...
http://www.xyz.com/apple
http://www.xyz.com/banana
http://www.xyz.com/cherry

...they had to be entered along the lines of...
\apple\
\banana\
\cherry\

If you're indexing in offline mode, the pattern is not matched against the final URL, but rather the file path which the files were found for indexing.

Thanks again to Ray! :D