Zoom Search Engine FAQ - Features
Yes. Wildcard searches such as "wren*" and "wr?nsoft" will match "wrensoft", etc.
Yes. As of V7, all editions (Free, Professinal, and Enterprise) support the use of plugins to index Word document (.DOC) and Acrobat (.PDF) file types. Plugins are now included with the installation of Zoom. For V6 and earlier, Plug-ins are only avaliable to purchased editions. Users of earlier versions will have to download the plugins and install them. See the plugins page for more information.
Yes. Starting with V7, Zoom Search Engine software is available for Windows, Mac/OSX and Linux.
There are two parts to the Zoom software. The indexer and the search scripts. You need to run the indexer at least once to create a set of index file. The search script is run each time there is a search performed on the web site.
The indexing must be executed on a computer with the indexer installed. But you only need to do this every once in a while (depending on how often your website updates). However once you have made a set of index files you upload them to just about any machine. Unix, Linux, Windows, Mac, etc...
So people with Linux and Unix servers (including ourselves) create a set of index files on their desktop PC and upload the
index files to the server on a periodic basis. The indexing and uploading can be scheduled from your desktop to happen automatically. OR
starting with V7, Linux users now have the option to install the indexer directly on their server and use the web interface to control their indexing.
A Mac version is now available starting since V7.
A Linux version is now available starting since V7.
Note #2: It seems Zoom runs OK on Intel based Mac's that run the Parallels Desktop on OS X. We haven't tested this ourselves but some users have reported it works.
Note that the indexer only needs to be run once (or how ever often you update your website) in order to create and update the "index data" that your search page needs. Once these index files are created from a Windows machine running the Indexer, you can host your search page on any web server, regardless of the Operating System.
Yes, Zoom can be installed on EC2 Linux or Windows instances.
To use Zoom Search Engine on AWS, you will need to have an existing account with Amazon. You can purchase a license for Zoom (one off purchase), install onto your instance and just pay the EC2 usage fees charged by Amazon.
After you have completed your index on your instance, you have a couple of options on how to proceed. You can leave your search on your Amazon Elastic Cloud Compute (EC2) instance and link to your search page or you can transfer the files off of your EC2 instance and place them on your own webserver.
Note: If you chose to leave them on the EC2 instance, you may want to look at implementing an Elastic IP Address for your instance.
Version 6 (and newer) of Zoom supports PHP 5.0 and later.
Due to increased functionality and technical requirements, the current version of Zoom no longer supports PHP 4.0 or earlier. However, the PHP4 script is available in our older V5 release (which you can find here) but they will not be updated with new features or enhancements. We recommend users to upgrade to the latest version of PHP where possible.
PHP4 was discontinued by the developers of PHP in 2007. So PHP5 should be now available on all hosts that support PHP, at least as an option. Contact your hosting company for details.
Yes. The spider indexing mode is specifically designed for this, as it indexes the pages it finds by crawling each webpage for links and scanning the pages remotely. Just make sure you have the file extension required (.php, .asp) in the extensions list of the configuration window.
Zoom can index and search a ColdFusion site using Spider mode indexing. This allows it to index any dynamically generated website.
Experienced ColdFusion developers can also wrap the CGI version in a .CF page by coding up a page which executes the CGI binary and return the results within the CF page. Although we do not currently have example code for ColdFusion, you can take a look at the examples available here for PHP and ASP as reference. While the code and syntax will not be the same, the same functionality should be achievable using the equivalent function calls on the ColdFusion platform.
Yes. Just enter the protocol as part of the Spider URL, for example, "https://yoursecurewebsite.com/index.html", and it will scan through the site accordingly.
Zoom can index database-driven sites, regardless of the database backend (eg. MySQL, Postgres, Access, Oracle, etc.). Because Zoom provides a Spider indexing mode, any content which can be accessed by you via a web browser, can also be indexed by the spider.
Note that database content which is not accessible via server-side scripted pages of your website (eg. PHP, ASP, CFM, ASPX etc. pages), will not be indexed. Zoom does not directly connect to the database, because it only return results in the form of URLs or file paths. A database "row" or "column" would not be useful. However, you can create a dynamically generated database index to make your database search engine friendly and allow that aspect of your site to be indexed. See How can I use Zoom to index portions of my site stored in a database (eg. SQL)? for more information.
Yes. Spider indexing mode follows .htaccess redirections for all files.
However, there is one exception: if you have "Use description files for plugin extensions" enabled (only available in Standard and Pro edition), redirecting ".desc" files are not supported due to the way Zoom needs to determine if the file exists or not.
Yes! Zoom now supports "robots.txt" files and will locate and obey their instructions by default. Please see this page for more information.
Q. Does Zoom support very large sites? What are the practical limits of the size of sites that Zoom can be used with?
We have greatly improved the capability of Zoom to support large sites, and it now utilises less memory than before, making it possible to index much larger sites on your computer.
You can always expect reasonable performance (with any of the search option) for up to around ten thousand pages depending on the number of unique words found in your site's content as well as your web server's specifications (processor, memory, server load, etc.).
For the PHP and ASP versions, there is a technical restriction of 65,500 pages due to performance limitations of the scripting environments used. You can however, surpass this by using the CGI version.
The CGI version provides high-performance support for very large sites. It is a binary application so it does not have any of the overhead issues of scripting languages such as PHP, ASP or Perl.
The CGI version can support sites containing a million pages or more (again, depending on the content). It would still be limited by the resources available on your web server (such as the CPU and memory available, or the limitation of your file system). Also note that indexing a site (or collection of sites) to such a grand scale would require a very resourceful computer. The limits are higher on 64bit systems as more RAM is available.
For detailed information on indexing gigantic sites with helpful tips and advice on how to do this (and a real-life case study of indexing the entirety of Wikipedia), please see this support article: Indexing Enormous Sites.
For a distributed system which performs parallel searching on multiple servers and indexing computers, see our MasterNode solution.
See our benchmarks page for some practical comparisons of the above scripting options and the performance you can expect on sites of various sizes.
For larger sites running on CD-ROM or DVD-ROM or any situation where a web server is not available, take a look at this FAQ which allows you to run the CGI or PHP version without a web server.
Q. Does Zoom suit forensics use? Can I use Zoom to search and index a computer for digital forensics purposes?
Yes. While Zoom is designed for web sites and Intranet files, we have a product known as OSForensics which is designed specifically for forensics purposes.
It uses a custom built version of the Zoom engine to index not only your document files, but also unallocated clusters, and the suspect's e-mails. It is the most definitive tool for digital forensics investigation yet. Please find out more at our OSForensics website.
Return to the Zoom Search Engine Support page