PDA

View Full Version : Zoom 5 and SharePoint 2007


BobHank
01-16-2008, 09:03 PM
Hi,
Using Zoom 5, i'm unable to spider my SharePoint 2007 site.
I have a non-sharepoint site that i can spider successfully.
But for my SharePoint site, i get these entries in the Index log:

01/16/08 14:44:22 - Config file loaded: C:\Program Files\Zoom Search Engine 5.0\Iowa-SharePoint-Devel-BrokenLinks-2008.zcfg
01/16/08 14:44:24 - Start indexing (spider mode)
01/16/08 14:44:25 - Spidering for links on http://devi.wfhealthcareia.org/

01/16/08 14:44:25 - Queued URL: http://devi.wfhealthcareia.org/index.aspx?sid=d9a00d87-8af1-4438-bf6c-8417dac6e1dd&aid=6&pos=1&Keywords=Health+Care&qs=06oENya4ZGJbLUnVKHHBbs2XH_zE7eFZGJUEcfhoh6PnEgy IZ3ScqEjYdRMvfsMP1kz3lc_0mGO-uK7anLJZxJitpCIe_BXCbn4RP-DLfGL-jV47b_IqsN77A7xVgoiFun3K_v7aatuAF_szcR24677mH_4fnZ QfbvQcuxxeCiF4qquL4OLosDU8QFCcUxtpqSySYgQouFx8PNTJ YFSk4c-1lhN9Y.,YT0xO0w9SGVhbHRoIENhcmU7Uj0xO1M9TSM.

01/16/08 14:44:25 - Queued URL: http://devi.wfhealthcareia.org/index.aspx?sid=d9a00d87-8af1-4438-bf6c-8417dac6e1dd&aid=6&pos=2&Keywords=Health&qs=06oENya4ZGJbLUnVKHHBbs2XH_zE7eFZGJUEcfhoh6PnEgy IZ3ScqEjYdRMvfsMP1kz3lc_0mGO-uK7anLJZxJitpCIe_BXCbn4RP-DLfGL-jV47b_IqsN77A7xVgoiFun3K_v7aatuAF_szcR24677mH_4fnZ QfbvQcuxxeCiF4qquL4OLosDU8QFCcUxtpqSySYgQouFx8PNTJ YFSk4c-1lhN9Y.,YT0xO0w9SGVhbHRoO1I9MjtTPW8jLSMzZw..

01/16/08 14:44:25 - Queued URL: http://devi.wfhealthcareia.org/index.aspx?sid=d9a00d87-8af1-4438-bf6c-8417dac6e1dd&aid=6&pos=7&Keywords=Medicine&qs=06oENya4ZGJbLUnVKHHBbs2XH_zE7eFZGJUEcfhoh6PnEgy IZ3ScqEjYdRMvfsMP1kz3lc_0mGO-uK7anLJZxJitpCIe_BXCbn4RP-DLfGL-jV47b_IqsN77A7xVgoiFun3K_v7aatuAF_szcR24677mH_4fnZ QfbvQcuxxeCiF4qquL4OLosDU8QFCcUxtpqSySYgQouFx8PNTJ YFSk4c-1lhN9Y.,YT0xO0w9TWVkaWNpbmU7Uj03O1M9byMtIzNn
--
This format repeats throughout the file.
I'm not sure why Zoom write URLs out with the .org/index.asp?sid=... format.

(Note: the site url listed above is on a private network, so you won't be able to bring it up.)
I can send the results from Zoom for both my SharePoint (non-working) site, and my non-SharePoint(working) site, if that will help.

Thanks for your help.

Ray
01-17-2008, 12:47 AM
It's not Zoom that's writing out the URLs in that format, it's SharePoint that's giving those URLs when the Zoom Spider requests a page.

The "sid=d9a000d87...etc." part appears to be a session ID. This is usually utilized by an ASP page when it requires session tracking (or authentication) and cookies are not available. So the first thing to try is enabling cookie support in Zoom (under the "Authentication" tab of the Configuration window).

If your site does require a user login or password to access, you should also take a look at this FAQ:
Q. How do I index protected parts of my website requiring user authentication? (http://www.wrensoft.com/zoom/support/auth.html)

BobHank
01-17-2008, 02:12 PM
Thanks for your clear explanation Ray.
I'll give that a try, then post on the results.
-bob