Re: MSPowerPointExtractor problem

2004-08-02 Thread Ralph Scheuer
Ryan, thanks for your reply. I have also seen the posts from Sudhakar on this subject who seems to be contributing a whole lot of code here - which is a great thing but in this code the problem also persists so I think we solve this encoding problem in your code (which is simpler - the fix

Re: MSPowerPointExtractor problem

2004-08-02 Thread Koundinya \(Sudhakar Chavali\)
Hm, Basically we have concentrated on English language. So we never faced any problems. It become a new task for our team now :-) Thanks to Ralph in pointing that problem. We Will work on related and let the Jakarta team knows :-) Regards Sudhakar --- Ralph Scheuer [EMAIL PROTECTED]

RE: MSPowerPointExtractor problem

2004-08-01 Thread Ryan Rhodes
Hi Ralph, I haven't tested the PPT extractor with any other languages. I remember reading about other people having problems with different character sets though. Could you send a before and after example file here or to bugzilla? -Ryan Rhodes -Original Message- From: Ralph Scheuer

RE: MSPowerPointExtractor problem

2004-08-01 Thread Koundinya \(Sudhakar Chavali\)
Check this, http://wiki.apache.org/jakarta-lucene-data/attachments/PowerPoint/attachments/PPT2Text.java --- Ryan Rhodes [EMAIL PROTECTED] wrote: Hi Ralph, I haven't tested the PPT extractor with any other languages. I remember reading about other people having problems with different

RE: MSPowerPointExtractor problem

2004-08-01 Thread Koundinya \(Sudhakar Chavali\)
Hello All, This was my first contribution http://wiki.apache.org/jakarta-lucene-data/attachments/PowerPoint/attachments/PPT2Text.java for jakarta team. And it seems another expert(Ryan Rhodes- [EMAIL PROTECTED]) has already started working on that based on my first given contribution. That