Hi, That was the problem , Thanks :-) . still i am strugling to get lucene to search non english unicode content . it works partially will simple analyser but doesn't return any results with standard analyser . is there a way by which i can output the exact contents that are going into the index .
Thanks and regards, Harpreet ----- Original Message ----- From: "Ian Lea" <[EMAIL PROTECTED]> To: "Harpreet S Walia" <[EMAIL PROTECTED]> Cc: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Monday, June 10, 2002 5:15 PM Subject: Re: Problem in unicode field value retrival > I don't think you can retrieve the contents of Fields that have > been loaded by a Reader. From the javadoc for Field: > > Text(String name, Reader value) > > Constructs a Reader-valued Field that is tokenized and indexed, but is > not stored in the index verbatim. > > > -- > Ian. > [EMAIL PROTECTED] > > > > [EMAIL PROTECTED] (Harpreet S Walia) wrote > > > > Hi > > > > I am trying to index and search unicode (utf - 8) . the code i am using to index the documents is as follows : > > > > /*************************************************************************** ***********************************************************/ > > IndexWriter iw = new IndexWriter("d:\\jakarta-tomcat3.2.3\\webapps\\lucene\\index", new SimpleAnalyzer(), true); > > String dirBase = "d:\\jakarta-tomcat3.2.3\\webapps\\lucene\\docs"; > > File docDir = new File(dirBase); > > String[] docFiles = docDir.list(); > > InputStreamReader isr; > > InputStream is; > > Document doc; > > for(int i=0;i<docFiles.length;i++) > > { > > File tempFile = new File(dirBase + "\\" + docFiles[i]); > > if(tempFile.isFile()==true) > > { > > System.out.println("Indexing File :" + docFiles[i]); > > is = new FileInputStream(tempFile); > > isr=new InputStreamReader(is,"utf-8"); > > doc= new Document(); > > doc.add(Field.UnIndexed("path",tempFile.toString())); > > doc.add(Field.Text("abc",(Reader)isr)); > > doc.add(Field.Text("all","sansui")); > > iw.addDocument(doc); > > is.close(); > > isr.close(); > > doc=null; > > } > > } > > iw.close(); > > is=null; > > isr=null; > > iw=null; > > docDir=null; > > > > System.out.println("Indexing Complete"); > > > > /*************************************************************************** ***********************************************************/ > > > > Now when i try to search the contents and get the field called abc by using the method doc.get("abc") , i get null as the output. > > > > Can anyone please tell me where i am going wrong . > > > > Thanks And Regards > > Harpreet > > > ---------------------------------------------------------------------- > Searchable personal storage and archiving from http://www.digimem.net/ > > ---------------------------------------------------------------------------- ---- > -- > To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> > For additional commands, e-mail: <mailto:[EMAIL PROTECTED]> -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>