Not searching file
dbdemaio
04-30-2008, 08:15 PM
I have about 10 PDFs on my site. One of them is particularly important to my users, but Zoom is not indexing it. The size was slightly over 1M, and I was getting a "file too large" message, so I reduced it to 904K. I no longer get any message, but Zoom isn't indexing it either.
Any suggestions?
Thanks.
Don
wrensoft
04-30-2008, 09:49 PM
If you are using the pro or enterprise edition, you can adjust this file size limit from the limits tab in the Zoom configuration window.
If you look through your log of indexing activity, is this PDF mentioned at all? Email us the log and the document if you want.
See also these related FAQs:
Q. Why are some of my pages being skipped by the indexer? (http://www.wrensoft.com/zoom/support/faq_problems.html#skipped)
Q. Why are links in my Javascript menus being skipped? (http://www.wrensoft.com/zoom/support/faq_problems.html#javascriptmenus)
Q. I am indexing with spider mode but it is not finding all the pages on my web site (http://www.wrensoft.com/zoom/support/faq_problems.html#spider_finding)
wrensoft
05-04-2008, 04:55 AM
The log file you sent shows that the file in question is in fact being indexed:
Queued URL: http://www.lakesestates.org/documents/ACC_Guidelines.pdf
DL Thread #1, got URL (http://www.lakesestates.org/documents/ACC_Guidelines.pdf) off queue
Downloading file http://www.lakesestates.org/documents/ACC_Guidelines.pdf
Index Thread got ready buffer for http://www.lakesestates.org/documents/ACC_Guidelines.pdf (Content-type: Acrobat document)
Downloading file http://www.lakesestates.org/documents/ACC_Guidelines.pdf
Processing PDF file http://www.lakesestates.org/documents/ACC_Guidelines.pdf
Indexing http://www.lakesestates.org/documents/ACC_Guidelines.pdf
Why do you think the file is not indexed?
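For anyone who wants to repeat this check on their own log, here is a minimal Python sketch. It assumes the indexing log is available as plain text lines that look like the excerpt above; the function name `lines_mentioning` and the sample list are made up for illustration and are not part of Zoom.

```python
def lines_mentioning(log_lines, needle):
    """Return the log lines that contain `needle` (case-insensitive)."""
    needle = needle.lower()
    return [line.rstrip("\n") for line in log_lines if needle in line.lower()]


# Sample lines copied from the log excerpt above.
sample_log = [
    "Queued URL: http://www.lakesestates.org/documents/ACC_Guidelines.pdf",
    "Processing PDF file http://www.lakesestates.org/documents/ACC_Guidelines.pdf",
    "Indexing http://www.lakesestates.org/documents/ACC_Guidelines.pdf",
]

hits = lines_mentioning(sample_log, "ACC_Guidelines.pdf")
for hit in hits:
    print(hit)
```

If the file name never shows up, the indexer probably never reached the file. If an "Indexing" line is present, as it is here, the file was processed and the problem lies elsewhere.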
Note, however, that the text in the document is largely garbage. It looks like a bad OCR job was done on it, so a typical text extract from the document looks like this:
fm ces and walls
awnings and shM ers
declcs and balconies
patio. terracas and grolmd level
scr- O ciosures
recreadon and play equipment
qwimmm g px ls
nmilboxes and house numbers
sir s
These are nonsense words for the most part, but you can still use Zoom to search for the words that are there, nonsense or not.
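A rough way to spot this kind of OCR damage before indexing is to measure how many extracted words are missing from a known word list. The sketch below is only an illustration: `unknown_word_ratio` and the tiny `vocabulary` set are hypothetical, and a real check would load a proper dictionary file instead.

```python
import re


def unknown_word_ratio(text, vocabulary):
    """Fraction of alphabetic words in `text` that are not in `vocabulary`."""
    words = re.findall(r"[a-z]+", text.lower())
    if not words:
        return 0.0
    unknown = [w for w in words if w not in vocabulary]
    return len(unknown) / len(words)


# Hypothetical, deliberately tiny word list; use a real dictionary in practice.
vocabulary = {
    "and", "walls", "awnings", "balconies", "patio", "level",
    "play", "equipment", "house", "numbers", "fences", "decks",
}

# First three lines of the extract quoted above.
extract = "fm ces and walls\nawnings and shM ers\ndeclcs and balconies"

ratio = unknown_word_ratio(extract, vocabulary)
print(f"{ratio:.0%} of the extracted words are not in the word list")
```

A high ratio suggests the PDF text layer came from poor OCR and that the document should be rescanned or recreated from the original source.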
Please also see this FAQ:
Q. Why can't I find words from my scanned PDF files? (PDFs created from scanning in physical documents) (http://www.wrensoft.com/zoom/support/faq_plugins.html#scannedpdfs)