Hello everybody, I have sometimes strange problems with codepages on few sites. In 99% everything is ok, but few sites indexed incorrect. At the result search page they have "?" symbols instead of cyrillic charecters, when other site on this result page looks ok. It is looks like this:
---------- 1. somesite.ru ...0, ? ????? ??? ? ????? ??????? ????? ????????? ???? ?? ???????. 300 ???????, ? ??? ????????? ????? ???????? ?????? ?? ??????????????? ?????. 2002 � ???????. somesite .ru ... ... http://www.somesite.ru/index.html ----------- I look in database, in tables urlwordsXX - there data is in 1251 charset, column charset is equal to "1251" ... the only difference from sites where everything is ok, column lang = 'en' Where can be the problem? And how can I solve it? Here are my conf parameters related to charsets: aspseek.conf Include ucharset.conf CharSet windows-1251 # I add this, so that servers thought that it is IE, # and return pages in 1251 codepage HTTPHeader User-Agent: Mozilla/4.0 (compatible; MSIE 5.0; WinNT) searchd.conf Include ucharset.conf LocalCharset windows-1251 ucharset.conf # it is the only string, I unrem CharsetTableU1 windows-1251 ru tables/windows-1251.txt s.htm CharsetTable koi8-r ru charsets/koi8r CharsetTable cp1251 ru charsets/cp1251 LocalCharset cp1251 Regards Ivan
