PDA

View Full Version : Seaching quoted strings within a Meta Description


Scott
01-05-2005, 02:13 PM
Does version 4.0 of the Zoom Search Engine support quoted string searches within a meta description.

I've added the following information within the PDF Description. The search finds the words when unquoted, but quoted strings such as "Drawer 1" return 0 found.

<Size:2,042KB>
<Category:Drawer 1-Carbon Steels-Alloy Steel>
<Type:Chart>
<Publisher:The Duriron Company, Inc>
<Author:K. Aoki>

Ray
01-05-2005, 10:14 PM
Do you mean the above was added to a .desc file for the pdf (eg. "mydocument.pdf.desc") or added as meta info within the PDF file itself? Check if the information is being indexed at all by searching for keywords such as "duriron aoki" etc.

If you are sure that it is being indexed, then you can check why exact phrase is not matching. The most likely reason here, is if you have hyphen ("-") selected as a word-join character in the configuration window (under "Indexing Options" tab). In this case, the word "1-Carbon" would be treated as a single word, and the exact phrase match would only find "Drawer 1-Carbon". If you disable the hyphen as a word-join character, you should then be able to find it with a search such as "Drawer 1". Remember that spaces are important in determining what is the start and end of a word. Alternatively, name your categories like "Drawer 1 - Carbon Steels - Alloy Steels".

Scott
01-06-2005, 02:41 PM
Ray,

Thank you for your input. With your suggestions, the quoted search of my meta descriptions is working. I used the suggestion of putting a space separator between the categories. Now the quoted search for "Drawer 1" works when the following item is placed in the Meta Description of the PDF file.

<Category : Drawer 1 - Carbon Steels - Alloy Steel>

If you ever need a reference for the Zoom Search Engine, please ask. This was my third question posted on the forum. Each and every question has been answered and resulted in a solution for my problem.

Thank you for the great product and support.

Scott :D

Scott
01-21-2005, 09:59 PM
Below illustrates two examples of Meta Subject (Description) data inserted into a PDF file. One works and the other doesn't

Works
Search: "Drawer 3"
< Categories : Drawer 3 : Welding : Unusual Techniques >

Does not Work
Search: "Drawer 2"
< Categories : Drawer 2 : Stainless Steels : High Temperature >

The What to Index tab under Configuration has the following checked:

Title of Page
Page content
Meta description
Meta keyword

Indexing words selected:

Dots
Hyphens
Underscores
Apostrophes


Does anyone have any suggestion why this why the "Search 2" does not work.[/b][/list]

wrensoft
01-21-2005, 11:24 PM
Is your web site search function online somewhere where we can see it and have a look at the problem. What is the URL ?

It is probably something trival like a spelling mistake or typo or the document appearing in the results, but far enough down the list that you didn't notice it.

-------
David

Scott
01-24-2005, 01:31 PM
The Zoom Search Engine is behind an intranet firewall. The Search Engine currently has about 2,500 documents. These 2,500 drawers are fairly equally distributed between Drawer 1, Drawer 2, and Drawer 3.

Below is a post of the results for a Search: of "Drawer 3"

1. Welding practices that minimize corrosion - Part II
< NumPages : 4 > < FileSize : 1.83 MB > < Categories : Drawer 3 : Welding : Unusual Techniques > ...
... <NumPages: 4> <FileSize: 1.83 MB> <Categories: Drawer 3: Welding: Unusual Techniques> ...
Terms matched: 1 - 21 Jan 2005 - URL: http://goldwing/TechFiles/Drawer 3/Welding/Unusual Techniques/1470 Welding practices that minimize corrosion - Part II.pdf

2. Friction welding saves money and metal
< NumPages : 7 > < FileSize : 3. MB > < Categories : Drawer 3 : Welding : Unusual Techniques > ...
... <NumPages: 7> <FileSize: 3. MB> <Categories: Drawer 3: Welding: Unusual Techniques> ...
Terms matched: 1 - 21 Jan 2005 - URL: http://goldwing/TechFiles/Drawer 3/Welding/Unusual Techniques/1471 Friction welding saves money and metal.pdf

3. Weld Fabrication of Titanium Equipment
< NumPages : 30 > < FileSize : 5.58 MB > < Categories : Drawer 3 : Welding : Titanium > ...
... <NumPages: 30> <FileSize: 5.58 MB> <Categories: Drawer 3: Welding: Titanium> ...
Terms matched: 1 - 21 Jan 2005 - URL: http://goldwing/TechFiles/Drawer 3/Welding/Titanium/1468 Weld Fabrication of Titanium Equipment.pdf

-----------------------

As mentioned before "Drawer 1" and "Drawer 2" does not return a search result. But if you search for just Drawer and skim through numerous results you can see that the search values are in the results.

1206. Hardness Values for Steels (Alonized - Substrate)
< NumPages : 1 > < FileSize : .12 MB > < Categories : Drawer 2 : Metallic Coatings : Alonizing > ...
... <NumPages: 1> <FileSize: .12 MB> <Categories: Drawer 2: Metallic Coatings: Alonizing> ...
Terms matched: 1 - 22 Jan 2005 - URL: http://goldwing/TechFiles/Drawer 2/Metallic Coatings/Alonizing/1310 Hardness Values for Steels (Alonized - Substrate).pdf

1301. A611 Material for Sulphuric Acid
< NumPages : 1 > < FileSize : .27 MB > < Categories : Drawer 1 : Stainless Steels : High Silicon : VEW Saramet Duromet 5 > ...
... <NumPages: 1> <FileSize: .27 MB> <Categories: Drawer 1: Stainless Steels: High Silicon: VEW Saramet Duromet 5> ...
Terms matched: 1 - 22 Jan 2005 - URL: http://goldwing/TechFiles/Drawer 1/Stainless Steels/High Silicon/VEW Saramet Duromet 5/0791 A611 Material for Sulphuric Acid.pdf

1302. High Silicon Stainless Steel
< NumPages : 1 > < FileSize : .23 MB > < Categories : Drawer 1 : Stainless Steels : High Silicon : VEW Saramet Duromet 5 > ...
... <NumPages: 1> <FileSize: .23 MB> <Categories: Drawer 1: Stainless Steels: High Silicon: VEW Saramet Duromet 5> ...
Terms matched: 1 - 22 Jan 2005 - URL: http://goldwing/TechFiles/Drawer 1/Stainless Steels/High Silicon/VEW Saramet Duromet 5/0792 High Silicon Stainless Steel.pdf

Ray
01-24-2005, 10:52 PM
Check if you have the latest build of Zoom (Version 4.0 Build 1007). It might be related to a recent bug we've fixed regarding meta descriptions:
http://www.wrensoft.com/zoom/whatsnew.html

If you still get this problem, please zip up your search files (all .zdat files, search.php, settings.php, search_template.html) and e-mail them to us (info on the contact us page of our website). We'll take a closer look at the files.

Also - are you using exact phrases for your search queries? (ie: you have enabled exact phrase in config window, and you are enclosing your search terms in quotation marks?)

Scott
01-25-2005, 06:19 PM
Raymond,

Thank you for your assistance in resolving this issue. I could have been more helpful in providing the message received when the search string returned 0 results.

The phrase "drawer 1" contains words which are found to be too common, resulting in a limited search. Try to specify a more specific phrase for better results.

In your previous note you mentioned the Exact Phrase configuration option. When I bumped up the Max Context Seek value to 5000 the common words "Drawer 1", "Drawer 2" were found.

Before doing this I pulled down Build 1007 of version 4. The previous build was 1006. This new build returned 0 results with the "Drawer 1" search string.

Thanks again resolving my problem.

Scott

Ray
01-27-2005, 10:50 PM
Yes, when you need to perform accurate exact phrase matching on more "common" search terms, you may need to increase the "Max context seeks" value in the configuration window.

Usually, if you often see the "phrase... contains words which are found to be too common" message, then its a good idea to up the context seeks value.

I assume you meant that you got zero results with the new build before you increased the max context seeks value. And that once you increased "max context seeks" with this new build, you did get results.