Hello
first, you need a parser for each file type: pdf, txt, word, etc.
and use a java api to iterate zip content, see:
http://java.sun.com/j2se/1.4.2/docs/api/java/util/zip/ZipInputStream.html
use getNextEntry() method
little example:
ZipInputStream zis = new ZipInputStream(fileInputStream);
]
To: Lucene Users List lucene-user@jakarta.apache.org
Sent: Tuesday, March 01, 2005 10:48 AM
Subject: Re: Zip Files
Hello
first, you need a parser for each file type: pdf, txt, word, etc.
and use a java api to iterate zip content, see:
http://java.sun.com/j2se/1.4.2/docs/api/java/util/zip
the directory. But this would greatly slow indexing and use up
disk space.
Luke
- Original Message -
From: Ernesto De Santis [EMAIL PROTECTED]
To: Lucene Users List lucene-user@jakarta.apache.org
Sent: Tuesday, March 01, 2005 10:48 AM
Subject: Re: Zip Files
Hello
first, you need