Re: ePub-Collection Reader

2012-06-19 Thread Diman Karagiozov
Hi Nils, try this one... http://www.siegmann.nl/epublib I found it handy and easy to integrate. Another possibility is to user apache TIKA - there is an EPubParser but I have not tried it so far. http://tika.apache.org/0.8/api/org/apache/tika/parser/epub/EpubParser.html greetings Diman On

Re: UTF8 Encoded documents processing

2012-05-27 Thread Diman Karagiozov
Hi there, I had a similar problem with UTF-8 encoded text files with BOM. When I removed the BOM (uisng commons-io 2.3) the processing of the texts went OK. greetings Diman On 5/27/2012 8:43 PM, Marshall Schor wrote: please force a stack trace dump in Java once it is hung in the last line,

Re: UIMA AS - CAS consumer patterns

2010-10-13 Thread Diman Karagiozov
Thanks a lot for the suggestions. It appeared that the best approach in my environment is to add the client specific information into the CAS as system annotations. Not very clean, but working :) The idea that UIMA AS is working in request-response mode is quite interesting and I may

UIMA AS - CAS consumer patterns

2010-10-12 Thread Diman Karagiozov
Hi there, I am working on a project that utilizes UIMA Async Scale-out capabilities. There are two client applications that send messages in an input.queue and several UIMA aggregate engines that fetch messages from this input queue. What I want to achieve is to send the annotated serialized

Re: AW: Convert Text to UIMA CAS

2010-09-27 Thread Diman Karagiozov
[mailto:user-return-3209-armin.wegner=bka.bund...@uima.apache.org] Im Auftrag von Diman Karagiozov Gesendet: Montag, 27. September 2010 15:26 An: user@uima.apache.org Betreff: Re: Convert Text to UIMA CAS Hi there, I suppose that you have to create an UIMA primitive engine and in its process