I'm having trouble with IPB Invision v2.1.4 (discussion forum).
The search index correctly displays the first topic in a thread, but none of the subsequent replies. Here is my skip list:
/forums/cache
/forums/install
/forums/ips_kernel
/forums/lofiversion
/forums/jscripts
/forums/lang
/forums/modules
/forums/retail
/forums/skin_acp
/forums/skin_cache
/forums/sources
/forums/style_avatars
/forums/style_images
/forums/upgrade
/forums/uploads
?s=
?act=calendar
?act=rssout&id
Thanks a million
David
There are some comments about how to index forum and message boards here,
http://www.wrensoft.com/zoom/support...html#msgboards
But we don't have specific instructions for the IPB Invision forum.
My understanding was thas the initially post and subsequent replies are all on a single page with IPB. So it is hard to see how you could see the post and not the replies.
Can you gives URLs to your search function and an example where it doesn't work.
------
David
Hi David,
Yes, I read the FAQ about forums but that was more designed for phpBB, not IPB.
With my current skip list, the index finds the following file:
http://www.newcomers-network.de/forums/index.php?showtopic=17
but not
http://www.newcomers-network.de/forums/index.php?showtopic=17&pid=99&mode=threaded&show=& st=&#entry99
Any ideas would be most appreciated.
David
PS: By the way, I think you software is great. It works very well and is well documented.
I took a quick look at trying to index your IPB board.
It seems that your IPB is configured to return links with a session ID parameter when accessed by a client with cookie support disabled. This means that when this happens, all links on the page will look like the following:
http://www.newcomers-network.de/forums/index.php?showtopic=17&pid=34&mode=threaded&show=& st=&
Note the extra "?s=e62119..." parameter in the URL compared to the links you mentioned above.
Now from your original post, I can see that you have a skip page entry for "?s=", which would cause the indexer to skip this URL accordingly. You can see this by turning on "Verbose" mode.
If you remove this skip page entry, it should scan and index the pages you want, but this would probably not be ideal because you will end up with a search engine that return links containing the same session ID.
An alternative is that you can enable cookie support in Zoom. This can be found in the Configuration window, under the "Authentication" tab. You might also want to make sure you have "Reload all pages (do not use cache)" enabled so that it will take effect immediately.
Doing the above allows Zoom to use cookies shared with IE and Windows. By doing so, you should find that the "?s=e62119..." parameter will disappear from the links. However you should note that because this is using the same cookie as IE, this means that Zoom will see pages as that user would.
Hope that helps.
Thanks for the great support.
I did what you said and now have 21 search entries, where I only need 3.
Below are the list of the URL I want to SKIP and NOT SKIP. Perhaps you can let me know what I need to enter in my config:
NOT SKIP
http://www.newcomers-network.de/forums/index.php?showtopic=17&pid=98&mode=threaded&show=& st=&
SKIP
http://www.newcomers-network.de/forums/index.php?act=Post&CODE=02&f=6&t=17&qpid=100
http://www.newcomers-network.de/forums/index.php?showtopic=17&mode=linear
http://www.newcomers-network.de/forums/index.php?showtopic=17&mode=linearplus
http://www.newcomers-network.de/forums/index.php?act=post&do=reply_post&f=6&t=17
http://www.newcomers-network.de/forums/index.php?act=Print&client=printer&f=6&t=17
http://www.newcomers-network.de/forums/index.php?act=Search&nav=au&CODE=show&searchid=bcb 4eab9a88851e1f24dbac256e69a55&search_in=posts&resu lt_type=posts
Thanks a million
Your guesses would be as good as mine, since I have not looked any further into indexing IPB at this point. But from your examples, I would try the following:
?act=Post
?act=post
&mode=linear
?act=Print
?act=Search
Basically any bit of text in the URL that distinguishes one of the links to skip from the ones you do not want to skip.