Congrats, but I feel there are still some bugs with charsets etc.

I still cannot figure out the problem with the schwa character. CharsetTableU2 was supposed to be the solution to the problem, but ASPSeek still ignores the character while indexing.

I asked this question before, but received no answers.

Anyway, where can I find out more about indexing 2-byte characters?

Thanks
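For reference, a minimal Python sketch (not ASPSeek code) of why the schwa counts as a 2-byte character: U+0259 has no slot in a single-byte charset such as ISO-8859-1, and takes two bytes in UTF-8 or UTF-16. The helper name `encoded_forms` is made up for illustration.

```python
def encoded_forms(ch):
    """Return the bytes of ch in a few encodings, or None where it cannot be encoded."""
    forms = {}
    for name in ("iso-8859-1", "utf-8", "utf-16-be"):
        try:
            forms[name] = ch.encode(name)
        except UnicodeEncodeError:
            # The character has no representation in this charset.
            forms[name] = None
    return forms

schwa = "\u0259"  # LATIN SMALL LETTER SCHWA
for name, raw in encoded_forms(schwa).items():
    print(name, raw.hex() if raw is not None else "not representable")
```

This only shows how the character is encoded; it says nothing about how ASPSeek handles it internally.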
----- Original Message -----
From: Thomas -Balu- Walter <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Sunday, December 15, 2002 9:32 AM
Subject: Re: [aseek-users] Umlauts, Charset?

> On Sun, Dec 15, 2002 at 05:37:37PM +0259, Kir Kolyshkin wrote:
> > Hey guys, this is a small puzzle for you. Try to guess the charset of the
> > document, based on information from the below quote. Neither me nor
> > ASPseek's index can solve that.
> >
> > HINT to Balu: tune your apache to return proper charset in 'Content-Type:'
> > line. AFAIK newer Apache versions (since 1.3.12) does that by default.
>
> Urx... Okay, gotta take the stupid-hat for tomorrow...
>
> balu@foghorn:~$ telnet www2.jugendpolitik.net 80
> Trying 212.124.46.100...
> Connected to foghorn.bartels-schoene.de.
> Escape character is '^]'.
> GET / HTTP/1.1
> Host: www2.jugendpolitik.net
>
> HTTP/1.1 200 OK
> Date: Sun, 15 Dec 2002 17:27:08 GMT
> Server: Apache/1.3.26 (Unix) Debian GNU/Linux PHP/4.1.2 mod_ssl/2.8.9
> OpenSSL/0.9.6g
> X-Powered-By: PHP/4.1.2
> Transfer-Encoding: chunked
> Content-Type: text/html; charset=iso-8859-1
>
> > >[kir@kir kir]$ telnet www.jugendpolitik.net 80
> > >Trying 212.124.46.7...
> > >Connected to www.jugendpolitik.net.
> > >Escape character is '^]'.
> > >GET /foerdermittel/c_knete.html HTTP/1.0
> > >Host: www.jugendpolitik.net
> > >
> > >HTTP/1.1 200 OK
> > >Date: Sun, 15 Dec 2002 14:52:50 GMT
> > >Server: mod_perl/1.18 Apache/1.3.4 (Unix) (SuSE/Linux) PHP/3.0.7
> > >mod_ssl/2.2.4 SSLeay/0.9.0b
> > >Connection: close
> > >Content-Type: text/html
>
> You are right of course - It's better to index the site you are
> just setting up (www2.*) than the one on the old server they should be
> moved from (and that doesn't support the charset per default)...
> And if you have problems, make sure you are checking the right host...
>
> Stupid mistake :-], thanks for saving me trying more nonsense, Kir.
>
> Balu
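
The two telnet sessions in the quoted message differ in whether the Content-Type header carries a charset parameter. A minimal Python sketch of that check, using only the standard library; the function name `charset_from_content_type` is hypothetical, and this is not how ASPSeek itself parses headers.

```python
from email.message import Message

def charset_from_content_type(value):
    """Return the charset parameter of a Content-Type header value, or None if absent."""
    msg = Message()
    msg["Content-Type"] = value
    # get_content_charset() lowercases the value and returns None when no charset is given.
    return msg.get_content_charset()

# Header values taken from the two responses quoted above.
print(charset_from_content_type("text/html; charset=iso-8859-1"))  # new server (www2)
print(charset_from_content_type("text/html"))                      # old server (www)
```

When the function returns None, an indexer has to guess the charset from the document itself, which is the puzzle Kir describes.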
