Zoom V5 - Spider index for DNN site


  • Zoom V5 - Spider index for DNN site

    Hi,

    I have been using Zoom to index my company's intranet for almost a month. Since the site is still in development, the indexing is being done on a local machine.

    The website being indexed is a DNN website, reached at the following URL:

    http://gerald/technical/

    The site contains over 20,000 pages (mainly HTML pages from a documentation system), and we have purchased the Professional edition.

    Before adding the documentation system to the website, we dropped just a few HTML pages onto it while using the free version, and these were picked up promptly.

    But now it doesn't seem to index the HTML files.

    I am enclosing my log file below. I also tried to attach my configuration file (zoom2.cfg). I'm pretty sure I must have got the config or the spider URL wrong.

    (How do I add attachments in this forum?)

    10/31/08 10:15:30 - Start indexing (spider mode)
    10/31/08 10:15:30 - Maximum number of words: 300000
    10/31/08 10:15:30 - Maximum number of files: 65500
    10/31/08 10:15:30 - Will scan files with extensions
    10/31/08 10:15:30 - .php
    10/31/08 10:15:30 - .asp
    10/31/08 10:15:30 - .cfm
    10/31/08 10:15:30 - .aspx
    10/31/08 10:15:30 - .php3
    10/31/08 10:15:30 - .php4
    10/31/08 10:15:30 - .txt
    10/31/08 10:15:30 - .doc
    10/31/08 10:15:30 - .ae
    10/31/08 10:15:30 - .aef
    10/31/08 10:15:30 - .ocx
    10/31/08 10:15:30 - .msi
    10/31/08 10:15:30 - .dll
    10/31/08 10:15:30 - .exe
    10/31/08 10:15:30 - .zip
    10/31/08 10:15:30 - .rar
    10/31/08 10:15:30 - .gif
    10/31/08 10:15:30 - .jpeg
    10/31/08 10:15:30 - .jpg
    10/31/08 10:15:30 - .bmp
    10/31/08 10:15:30 - .png
    10/31/08 10:15:30 - .htm
    10/31/08 10:15:30 - .html
    10/31/08 10:15:30 - .xml
    10/31/08 10:15:30 - Spider from: http://gerald/technical/Home/tabid/36/Default.aspx
    10/31/08 10:15:30 - Web site URL: http://gerald/technical/Home/tabid/36/
    10/31/08 10:15:30 - Estimated RAM required during index process: 472287 KB
    10/31/08 10:15:31 - Initiating HTTP session (thread #1) ...
    10/31/08 10:15:31 - DL Thread #1, got URL (http://gerald/technical/Home/tabid/36/Default.aspx) off queue
    10/31/08 10:15:31 - Downloading file http://gerald/technical/Home/tabid/36/Default.aspx
    10/31/08 10:15:31 - Index Thread got ready buffer for http://gerald/technical/Home/tabid/36/Default.aspx (Content-type: HTML text)
    10/31/08 10:15:31 - Initiating HTTP session (thread #2) ...
    10/31/08 10:15:31 - Spidering for links on http://gerald/technical/Home/tabid/36/Default.aspx
    10/31/08 10:15:31 - Queued URL: http://gerald/technical/Default.aspx
    10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/0/Newest%20Logo.gif
    10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Skins/skins/media/menuitemsel_l.gif
    10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Skins/skins/media/menuitemsel_r.gif
    10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Skins/skins/media/menuitem_l.gif
    10/31/08 10:15:31 - Queued URL: http://gerald/technical/Downloads/tabid/55/Default.aspx
    10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Skins/skins/media/menuitem_r.gif
    10/31/08 10:15:31 - Queued URL: http://gerald/technical/IssueManagement/tabid/56/Default.aspx
    10/31/08 10:15:31 - Queued URL: http://gerald/technical/ContactUs/tabid/62/Default.aspx
    10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Skins/skins/spacer.gif
    10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Containers/containers/media/blue3_tl.gif
    10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Containers/containers/media/blue3_tr.gif
    10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Containers/containers/media/blue3_ml.gif
    10/31/08 10:15:31 - Queued URL: http://gerald/Technical/images/min.gif
    10/31/08 10:15:31 - Skipping https://support.stayinfront.co.nz/technical/documentation/Version%2011/138145/138186/141718/StayinFront%20CRM%2011%20Product%20Sheet.pdf (External site - does not match base URL)
    10/31/08 10:15:31 - Skipping https://support.stayinfront.co.nz/technical/documentation/Version%2011/103514/104772/141442/StayinFront%20Analytics%2011%20User%20Guide.pdf (External site - does not match base URL)

    10/31/08 10:15:31 - Writing index data for ASP search... (Please wait)
    10/31/08 10:15:31 - Created pagedata data file (zoom_pagedata.zdat)
    10/31/08 10:15:31 - Created pagetext data file (zoom_pagetext.zdat)
    10/31/08 10:15:31 - Created pageinfo data file (zoom_pageinfo.zdat)
    10/31/08 10:15:31 - Created categories data file (zoom_cats.zdat)
    10/31/08 10:15:31 - Created spelling data file (zoom_spelling.zdat)
    10/31/08 10:15:31 - Created dictionary data file (zoom_dictionary.zdat)
    10/31/08 10:15:31 - Created wordmap data file (zoom_wordmap.zdat)
    10/31/08 10:15:31 - Created script settings file (settings.asp)
    10/31/08 10:15:31 - Indexing completed
    10/31/08 10:15:31 - INDEX SUMMARY
    10/31/08 10:15:31 - Files indexed: 8
    10/31/08 10:15:31 - Files skipped: 19
    10/31/08 10:15:31 - Files filtered: 0
    10/31/08 10:15:31 - Files downloaded: 8
    10/31/08 10:15:31 - Unique words found: 1233
    10/31/08 10:15:31 - Total words found: 4382
    10/31/08 10:15:31 - Avg. unique words per page: 154.13
    10/31/08 10:15:31 - Avg. words per page: 547
    10/31/08 10:15:31 - Start index time: 10:15:30 (2008/10/31)
    10/31/08 10:15:31 - Elapsed index time: 00:00:01
    10/31/08 10:15:31 - Errors: 1
    10/31/08 10:15:31 - URLs visited by spider: 32
    10/31/08 10:15:31 - URLs in spider queue: 0
    10/31/08 10:15:31 - Start points indexed: 2 / 2
    10/31/08 10:15:31 - Total bytes scanned/downloaded: 187833
    10/31/08 10:15:31 - File extensions:
    10/31/08 10:15:31 - .php indexed: 0
    10/31/08 10:15:31 - .asp indexed: 0
    10/31/08 10:15:31 - .cfm indexed: 0
    10/31/08 10:15:31 - .aspx indexed: 8
    10/31/08 10:15:31 - .php3 indexed: 0
    10/31/08 10:15:31 - .php4 indexed: 0
    10/31/08 10:15:31 - .txt indexed: 0
    10/31/08 10:15:31 - .doc indexed: 0
    10/31/08 10:15:31 - .ae indexed: 0
    10/31/08 10:15:31 - .aef indexed: 0
    10/31/08 10:15:31 - .ocx indexed: 0
    10/31/08 10:15:31 - .msi indexed: 0
    10/31/08 10:15:31 - .dll indexed: 0
    10/31/08 10:15:31 - .exe indexed: 0
    10/31/08 10:15:31 - .zip indexed: 0
    10/31/08 10:15:31 - .rar indexed: 0
    10/31/08 10:15:31 - .gif indexed: 0
    10/31/08 10:15:31 - .jpeg indexed: 0
    10/31/08 10:15:31 - .jpg indexed: 0
    10/31/08 10:15:31 - .bmp indexed: 0
    10/31/08 10:15:31 - .png indexed: 0
    10/31/08 10:15:31 - .htm indexed: 0
    10/31/08 10:15:31 - .html indexed: 0
    10/31/08 10:15:31 - .xml indexed: 0
    10/31/08 10:15:31 - No extensions indexed: 0
    10/31/08 10:15:31 - Cleaning up memory used for index data... please wait.
    10/31/08 10:15:31 - Finished cleaning up memory.

  • #2
    There is no way to add attachments to forum posts, but you can e-mail us log files and configuration files. It would be useful to see the whole log rather than just the extract you posted above.

    There is also a tutorial for adding Zoom search to a DNN web site. However, you can also run the search function outside of DNN, in which case the tutorial is less relevant.

    If files are not being found by the indexer, see these FAQs:
    Q. Why are some of my pages being skipped by the indexer?

    Q. Why are links in my Javascript menus being skipped?

    Q. I am indexing with spider mode but it is not finding all the pages on my web site
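
When working through a long log against those FAQs, it can help to tally which URLs were queued and which were skipped, along with the reason given. The following is only a rough Python sketch, not part of Zoom: the function name `summarize_log` is made up for illustration, and the regular expressions assume the log lines look exactly like the "Queued URL:" and "Skipping ... (reason)" lines in the extract above.

```python
import re
from collections import Counter

# Assumed line shape, taken from the extract above:
#   10/31/08 10:15:31 - <message>
LINE_RE = re.compile(r"^\s*\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2} - (.*)$")
# Assumed message shapes (other message types are ignored):
QUEUED_RE = re.compile(r"^Queued URL: (\S+)$")
SKIP_RE = re.compile(r"^Skipping (\S+) \((.+)\)$")


def summarize_log(text):
    """Collect queued URLs, skipped URLs, and a count of each skip reason."""
    queued = []
    skipped = []  # list of (url, reason) pairs
    for raw in text.splitlines():
        line = LINE_RE.match(raw)
        if not line:
            continue  # blank line or a line in an unexpected format
        message = line.group(1).strip()
        q = QUEUED_RE.match(message)
        if q:
            queued.append(q.group(1))
            continue
        s = SKIP_RE.match(message)
        if s:
            skipped.append((s.group(1), s.group(2)))
    reasons = Counter(reason for _url, reason in skipped)
    return {"queued": queued, "skipped": skipped, "skip_reasons": reasons}


if __name__ == "__main__":
    import sys

    # Usage: python summarize_log.py path/to/logfile.txt
    with open(sys.argv[1], encoding="utf-8", errors="replace") as fh:
        summary = summarize_log(fh.read())
    print("Queued URLs:", len(summary["queued"]))
    for reason, count in summary["skip_reasons"].most_common():
        print(f"Skipped ({reason}): {count}")
```

Run against the full log rather than an extract, the per-reason counts should show at a glance whether pages are mostly being dropped for a single cause (for example the "does not match base URL" message above), which narrows down which FAQ entry applies.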
