actualy, there is no need to set encoding. i only need to read files using proper decoding and then lucene stores it index properly, so when i retrive docs, they are proper strings with letters with accents.
i tought it can't be so simple. the whole thing is in reading and decoding, lucene takes care of the rest. thanks everybody for suggestions dario >From: "redpineseed" <[EMAIL PROTECTED]> >Reply-To: "Lucene Users List" <[EMAIL PROTECTED]> >To: "Lucene Users List" <[EMAIL PROTECTED]> >Subject: Re: setting encoding >Date: Mon, 20 May 2002 13:29:58 -0700 > > >convert your native code to unicode (UTF16) with the following lines: > >File f = new File('cp1252_input'); >FileInputStream tmp = new FileInputStream(f); >BufferedReader brin = new BufferedReader( new InputStreamReader( tmp, >"CP1252")); >String inputString = brin.readLine(); > >not sure your code designater is CP1252, chech that out in Java Docs. > > >redpineseed _________________________________________________________________ Chat with friends online, try MSN Messenger: http://messenger.msn.com -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>