View Full Version : PDF metadata not indexed



andre2p
01-30-2009, 02:01 PM
Hello:

I tried to solve this on my own for too long. Need help ...

Trying to index just PDFs for my search page. The search can find words within the PDFs, but nothing in the metadata. I inserted a test word in the title field of the PDF, but the search can't find it. I checked all the settings I could find by searching the forum, but nothing works.

Example: http://130.11.167.14/wetlands/Documents/search.asp

If you search for Baileys, the report comes up, and you can see that its title says cTechnicalReport. If I search for cTechnicalReport, the search can't find the same report. I tried different words in the title field and it still doesn't work. Please help.

Thanks,
Andrew

andre2p
01-30-2009, 05:07 PM
Follow up ...

If I index the same documents in spider mode, the metadata gets indexed. Unfortunately, since the PDFs are large, I would rather index from the local folder if possible.

Thanks.

wrensoft
01-30-2009, 10:32 PM
I tried to have a look at your site via the IP address, but the site appears to be down. I got this error for all pages on your site:
The server at 130.11.167.14 is taking too long to respond

PDF titles are indexed in both spider mode and offline mode. This assumes you have checked the "Retrieve internal metadata" box, which is in the configuration window for the PDF file extension in the Scan options window.

Also, in the Indexing options configuration window, make sure you are indexing page titles.
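
Before changing indexer settings, it can help to confirm that the PDF really carries the fields you expect. The following is only a rough sketch, not part of Zoom: the function name find_info_fields is made up for illustration, and it assumes the document information entries (/Title, /Keywords, /Subject) are stored uncompressed as plain literal strings. Files that are encrypted, that keep these entries in compressed object streams, or that use hex or nested-parenthesis strings will not be read correctly, so an empty result does not prove the metadata is missing.

```python
import re


def find_info_fields(pdf_bytes, fields=("Title", "Keywords", "Subject")):
    """Return the first literal-string value found for each named PDF info key.

    Heuristic only: scans the raw bytes for '/Name (value)' and does not
    parse the PDF structure.
    """
    found = {}
    for name in fields:
        # '/Title' followed by optional whitespace and a (...) literal string;
        # backslash escapes inside the string are skipped over, not decoded.
        pattern = rb"/" + name.encode("ascii") + rb"\s*\(((?:\\.|[^\\)])*)\)"
        match = re.search(pattern, pdf_bytes)
        if match:
            found[name] = match.group(1).decode("latin-1")
    return found


# Tiny hand-written fragment standing in for a real file's info dictionary.
sample = b"<< /Title (cTechnicalReport) /Keywords (gData) >>"
print(find_info_fields(sample))
```

For a real file, read it in binary mode, for example open(path, "rb").read(), and pass the bytes to the function.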

andre2p
02-03-2009, 03:22 PM
Hello again. My problem is not solved yet. I have now created two examples, using the exact same settings for both tests. The first link below is a search form built from an index created in spider mode. The second one was created in offline mode.

If you search for the keyword gData, the first one (spider mode) will find it in a few reports. The second one will not. For some reason, the offline mode is not indexing the keywords within the PDFs.

Thanks.
Andrew

http://137.227.242.32/test1/search.asp
http://137.227.242.32/test2/search.asp
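
The side-by-side test above can be repeated quickly by building the same query URL for both search pages. This is a minimal sketch: the parameter name zoom_query is an assumption about how the search form passes its query, so check the form's HTML if the links do not return results. The helper name build_query_url is made up for illustration.

```python
from urllib.parse import urlencode

# Assumption: the search page reads its query from a "zoom_query" parameter.
QUERY_PARAM = "zoom_query"

# The two test pages from the post above, keyed by indexing mode.
PAGES = {
    "spider": "http://137.227.242.32/test1/search.asp",
    "offline": "http://137.227.242.32/test2/search.asp",
}


def build_query_url(search_page, term):
    """Return the search page URL with the term URL-encoded as the query."""
    return search_page + "?" + urlencode({QUERY_PARAM: term})


# Print one link per mode so the same keyword can be checked in both indexes.
for mode, page in PAGES.items():
    print(mode, build_query_url(page, "gData"))
```

Opening both printed links in a browser and comparing the result lists shows whether the keyword is indexed in one mode but not the other.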

Ray
02-04-2009, 02:21 AM
We have confirmed that this is a bug in the current release. Offline Mode is indexing neither meta keywords nor meta descriptions from plugin files (titles are indexed correctly).

This will be fixed in the next release (V6.0 build 1009).

andre2p
02-04-2009, 12:45 PM
I spent hours trying to figure this out. Good to know it wasn't something I did wrong. Andrew.