Author: Mark Roebuck
Email: [EMAIL PROTECTED]
Message:
In the documentation for v3.1.19 I have found the following:
Document charset detection
--
indexer detects document character set in this order:
1) Content-type: text/html; charset=xxx
2) META NAME=Content
Author: Arthur Zimens
Email:
Message:
Hello All,
I tried to index web site with Cyrillic koi8-r charset,
but indexer didn't store any russian words in dict table, only latin.
As result, I can search latin words, but not russian
indexer.conf:
-
# This is a minimal sample indexer
Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Check HTTP headers which are sent by your web-server.
Try this: wget -s http://localhost/
What can you see in Content-Type header?
Hello All,
I tried to index web site with Cyrillic koi8-r charset,
but indexer didn't store any
Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Please contact me by email
% wget -s http://localhost/
--22:45:45-- http://localhost/
=gt; `index.shtml'
Connecting to localhost:80... connected!
HTTP request sent, awaiting response... 200 OK
Length: unspecified