PDA

View Full Version : Japanese - UTF8


BAIRSIA
01-20-2008, 05:47 AM
Hi,
I converted several of my web pages from Shift-JIS (x-jis) to UTF8 and then ran Zoomindexer (v5.1b1011). The Japanese.zlang file works fine, its the search results, page-titles and page-descriptions that are not coding correctly at all.

Funnily enough, if I REMOVE the Meta-tag "character set" in one of my HTML pages and then view that page in my browser, the very un-Japanese browser page-title is identical to what zoom search as listed in its "zoom_pageinfo.js" file.

"http://www.iainbairstow.com/hiopter/company.html","u","˔‚ŁAƖς܂BЂ̋ZpƐiB",""],

I am obviously screwing up somewhere, i.e not specifying UTF8 somewhere.I just don`t know where. Appreciate your advice.

As a curiosity, if I set the zoom indexer configuration to "windows-850 (Multilingual)" and the zlang file to English, then hey presto I get Kanji titles and descriptions etc (about 90% okay) in both the "zoom_pageinfo.js" file and my search results.

brgds
Iain


All my Japanese web pages are coded as follows -

<head>
<meta http-equiv="Site-Exit" content="revealTrans(Duration=1,Transition=2)">
<meta http-equiv="Site-Enter" content="revealTrans(Duration=2,Transition=3)">
<meta http-equiv="Page-Exit" content="revealTrans(Duration=1,Transition=6)">
<meta http-equiv="Page-Enter" content="revealTrans(Duration=2,Transition=7)">
<meta http-equiv="content-type" content="text/html; charset=UTF8">
<meta name="author" content="Iain Bairstow">
<meta name="description" content="反射板で、照明が変わります。当社の技術と製品をご覧下さい。">
<meta name="keywords" content="反射板,ディスプレイ,照明,省エネ,ハイオプター,看板,ランプ,蛍光灯,CO2,コストダ ウン,Hi-Opter,HiOpter">
<meta http-equiv="imagetoolbar" content="no">
<title>蛍光商事株式会社</title>

wrensoft
01-20-2008, 09:11 PM
In your search_template.html file did you include the UTF-8 meta data code page tag?

Also in Zoom you need to select both the Japanexe language file and the UTF-8 code page. Do not use windows-850 in Zoom if your pages are UTF-8.

See also this page for general adviace in using Zoom with non English languages (http://www.wrensoft.com/zoom/support/languages.html).

Ray
01-21-2008, 01:30 AM
As you are using the Javascript version, the search template file you need to specify "UTF-8" in is "search.html", rather than "search_template.html".

Ray
01-21-2008, 01:34 AM
<meta http-equiv="content-type" content="text/html; charset=UTF8">

This should be:

<meta http-equiv="content-type" content="text/html; charset=UTF-8">


Note the hyphen in UTF-8.