It may seem odd, but for this to work, you need to follow the
instructions on how to set up the UIMA example code in Eclipse:
http://uima.apache.org/d/uimaj-2.3.1/overview_and_setup.html#ugr.ovv.eclipse_setup.example_code
Note to self: the link in the tutorial is broken. Can we have
links that do
I've installed the latest version of Eclipse, the EMF, and the UIMA
tools (everything that was selectable) and I'm trying to work my way
through the tutorial found here:
http://uima.apache.org/doc-uima-annotator.html
However, when I get to the point where it says to:
Expand "Java Application
I agree with Jorn, I think that's the faster way.
Tommaso
2011/2/14 Jörn Kottmann
> On 2/14/11 4:49 AM, Radhouane Aniba wrote:
>
>> Hello everyone,
>>
>> Quite unusual request to this list, I am wondering if there is any
>> analysis
>> engine that allow to mine MBOX like formats such as the famo
On 2/14/11 4:49 AM, Radhouane Aniba wrote:
Hello everyone,
Quite unusual request to this list, I am wondering if there is any analysis
engine that allow to mine MBOX like formats such as the famous mailman
mailing list archives in a way that it allow to structure these kind of data
into messages
Radhouane,
Not an exact answer to your question, but Perl's CPAN has an MBOX format parser:
http://search.cpan.org/dist/Mail-Mbox-MessageParser/
If you write a 3-liner to convert from MBOX to XML, the UIMA analyzer might be
easier to write, unless you want everything implemented in one language.
Not sure where you want to go with this, but one
approach might be to preprocess your data into some
better structured format, and only start your UIMA
analysis after you've done that.
I have used a project called mstor on sourceforge to
process mbox files, and then you can use javamail
to get at
Hi,
I am newbie to UIMA, and I want to know how I can print out all the
annotations that my Aggregate Engine detects (on a text document), into a
text file. Specifically, I have an aggregate engine that processes a
document and annotates it using different annotation types. I want to print
out all