actualy, there is no need to set encoding. i only need to read files using 
proper decoding and then lucene stores it index properly, so when i retrive 
docs, they are proper strings with letters with accents.

i tought it can't be so simple. the whole thing is in reading and decoding, 
lucene takes care of the rest.

thanks everybody for suggestions

dario




>From: "redpineseed" <[EMAIL PROTECTED]>
>Reply-To: "Lucene Users List" <[EMAIL PROTECTED]>
>To: "Lucene Users List" <[EMAIL PROTECTED]>
>Subject: Re: setting encoding
>Date: Mon, 20 May 2002 13:29:58 -0700
>
>
>convert your native code to unicode (UTF16) with the following lines:
>
>File f = new File('cp1252_input');
>FileInputStream tmp = new FileInputStream(f);
>BufferedReader  brin = new BufferedReader( new InputStreamReader( tmp, 
>"CP1252"));
>String inputString = brin.readLine();
>
>not sure your code designater is CP1252, chech that out in Java Docs.
>
>
>redpineseed


_________________________________________________________________
Chat with friends online, try MSN Messenger: http://messenger.msn.com


--
To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Reply via email to