Hello everybody,

I sometimes have strange problems with codepages on a few sites. 99% of the time everything is 
OK, but a few sites are indexed incorrectly. On the search results page they show "?" symbols 
instead of Cyrillic characters, while other sites on the same results page look fine. 
It looks like this:

----------
1. somesite.ru
...0, ? ????? ??? ? ????? ??????? ????? ????????? ???? ?? ???????. 300 ???????, ? ??? 
????????? ????? ???????? ?????? ?? ??????????????? ?????. 2002 � ???????. somesite .ru 
... ...
http://www.somesite.ru/index.html 
-----------
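
For illustration only, here is a minimal Python sketch of one way such "?" characters can appear: Cyrillic text being converted to a charset that has no Cyrillic letters. Whether this is what actually happens in the indexer is an assumption, not something confirmed here. The helper name `to_charset` and the sample string are hypothetical.

```python
def to_charset(text, charset):
    """Encode text into the given charset, substituting '?' for
    every character that the charset cannot represent."""
    return text.encode(charset, errors="replace")


# Hypothetical sample text, not taken from the affected site.
sample = "Привет, мир"

# windows-1251 (cp1251) contains Cyrillic, so the round trip is lossless.
ok = to_charset(sample, "cp1251")

# latin-1 has no Cyrillic letters, so each one becomes '?',
# while ASCII punctuation and spaces survive.
lossy = to_charset(sample, "latin-1")

print(lossy)
```

Digits, spaces and punctuation survive such a conversion while every Cyrillic letter is lost, which matches the pattern in the result shown above.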

I looked in the database, in the urlwordsXX tables: the data there is in the 1251 charset, and the 
charset column is equal to "1251". The only difference from sites where everything is OK is that 
the lang column is 'en'.
What could the problem be? And how can I solve it?

Here are my conf parameters related to charsets:

aspseek.conf

  Include ucharset.conf
  CharSet windows-1251
  # I added this so that servers think the client is IE
  # and return pages in the 1251 codepage
  HTTPHeader User-Agent: Mozilla/4.0 (compatible; MSIE 5.0; WinNT) 

searchd.conf

  Include ucharset.conf 
  LocalCharset windows-1251

ucharset.conf

  # this is the only line I uncommented
  CharsetTableU1 windows-1251 ru tables/windows-1251.txt  

s.htm

  CharsetTable koi8-r     ru charsets/koi8r
  CharsetTable cp1251    ru charsets/cp1251
  LocalCharset cp1251
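
The files above spell the charset in two ways, windows-1251 and cp1251. As a rough sanity check, the sketch below uses Python's codec registry to confirm that the two names refer to the same encoding. Python is only a stand-in here: how ASPseek itself resolves charset names is not shown, and the helper `same_codec` is hypothetical.

```python
import codecs


def same_codec(name_a, name_b):
    """Return True if both names resolve to the same codec
    in Python's codec registry."""
    return codecs.lookup(name_a).name == codecs.lookup(name_b).name


# The two spellings used in the config files are aliases in Python.
print(same_codec("windows-1251", "cp1251"))

# koi8-r is a different Cyrillic encoding.
print(same_codec("koi8-r", "cp1251"))
```

If the names are treated as aliases by the search engine as well, the mixed spelling is probably harmless; this sketch cannot verify that.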

Regards
Ivan
