Re: XMI/annotation viewer for browser

2013-12-03 Thread Jörn Kottmann
Have a look at the brat project; they provide a JS library which can display annotations in a browser. http://brat.nlplab.org/ HTH, Jörn On 12/03/2013 10:44 AM, Roberto Franchini wrote: Hi to all, I'm looking for an annotation viewer for the web. We have a REST API that wraps our nlp

Re: XMI/annotation viewer for browser

2013-12-03 Thread Jörn Kottmann
On 12/03/2013 11:23 AM, Seid Muhie wrote: And also WebAnno https://code.google.com/p/webanno/, which uses the brat visualization but the server side is UIMA annotation (currently for POS, Named Entity, Coreference, Lemma and Dependency Parsing) Well, they are just using the Brat JS component and

Re: XMI/annotation viewer for browser

2013-12-03 Thread Jörn Kottmann
On 12/03/2013 12:28 PM, Roberto Franchini wrote: On Tue, Dec 3, 2013 at 11:54 AM, Jörn Kottmann kottm...@gmail.com wrote: On 12/03/2013 11:23 AM, Seid Muhie wrote: And also WebAnno https://code.google.com/p/webanno/, which uses the brat visualization but the server side is UIMA annotation

Re: Building on Mac; simple NER example?

2013-10-23 Thread Jörn Kottmann
On 10/23/2013 01:02 AM, Paul J. Lucas wrote: Assuming I can get past this, my primary goal is to use UIMA to perform NER, i.e., give it some text and get people and places back. I couldn't find any examples of this. Can this be done using UIMA? Example C++ code? Thanks. You could try Apache

Re: Best approach for analyzing a set of documents

2013-10-03 Thread Jörn Kottmann
On 10/03/2013 03:57 PM, Marshall Schor wrote: Later, it became clear that the Collection Reader and Cas Consumer were just parameterizations of normal Analysis Engines, so they were replaced by those. The older classes still work, though. We should deprecate them and communicate this better to

Re: Multi-view CAS and sofa-unaware AE

2013-04-03 Thread Jörn Kottmann
Yes, you can use the sofa mapping to map some view to the _InitialView. Have a look here: http://uima.apache.org/d/uimaj-2.4.0/tutorials_and_users_guides.html#ugr.tug.mvs.sofa_name_mapping Jörn On 04/03/2013 02:19 PM, Peter Klügl wrote: Hi, sorry for this beginner question: Is there a

Re: Apache UIMA is on Ohloh

2013-02-07 Thread Jörn Kottmann
The Apache UIMA page is here: http://www.ohloh.net/p/7917 If anyone on the dev team is interested in joining the project, just ask on Ohloh to be added as a manager. Jörn On 02/06/2013 07:44 PM, Marshall Schor wrote: There's a website, https://www.ohloh.net, that tracks various things about open

Re: Eclipse Cas Editor

2012-09-24 Thread Jörn Kottmann
An easy way of using JCas inside Eclipse is to use the Eclipse buddy policy to enable the UIMA Runtime Plugin to load the classes from your plugin. Thanks to Peter for pointing that out to me. Jörn On 09/22/2012 11:51 PM, Jörn Kottmann wrote: Hello, using JCas doesn't work here because

Re: Eclipse Cas Editor

2012-09-22 Thread Jörn Kottmann
Hello, using JCas doesn't work here because the UIMA Runtime doesn't import your JCas class. To get things working you can do one of two things: - Do not use JCas; you can do all operations with the CAS API - Fix the imports in the UIMA Runtime OSGi manifest and only use the modified bundle HTH,

Re: CAS Editor Plugin

2012-08-08 Thread Jörn Kottmann
Hello, not sure what you are doing. My best guess is that you use the Eclipse launch support. If that's the case, you are probably missing the uimaj-runtime dependency. Try to compile everything with Maven first, refresh all Eclipse projects and start it again. The Maven build generates the

Re: Sofa mapping for a CM

2012-07-11 Thread Jörn Kottmann
comments on proposed fix? Eddie On Wed, Jul 4, 2012 at 6:53 AM, Jörn Kottmann kottm...@gmail.com wrote: Hi all, are sofa mappings for CMs supposed to work? Jörn

Re: CAS Editor: link feature structure with other FSes

2012-06-21 Thread Jörn Kottmann
Hello, any suggestions how linking FSes together can be improved in the Cas Editor? You always need to find the FSes or annotations first to link them and that always requires a few clicks. It is easy to make this much faster if there is dedicated view support for a specific labeling task.

Re: Exception thrown during CAS serialization for Remote UIMA-AS Service

2012-06-14 Thread Jörn Kottmann
You write a string to the CAS which contains a non-XML character. This character cannot be serialized into XMI, and that's what this exception is about. Have a look at our documentation explaining the issue:
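Separately from the documentation referred to above, one common workaround is to clean the text before it is written to the CAS. The following is only a minimal sketch under that assumption: the class name `XmlCharFilter` and its methods are hypothetical and not part of UIMA, and the check follows the `Char` production of the XML 1.0 specification. Invalid characters are replaced by a space rather than removed, so string length, and therefore annotation offsets, stay unchanged.

```java
// Hypothetical helper, not part of UIMA: filters characters XML 1.0 cannot represent.
final class XmlCharFilter {

    /** Returns true if the code point matches the XML 1.0 "Char" production. */
    static boolean isValidXml10(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
                || (cp >= 0x20 && cp <= 0xD7FF)
                || (cp >= 0xE000 && cp <= 0xFFFD)
                || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    /** Replaces every code point that is not valid in XML 1.0 with a single space. */
    static String sanitize(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            out.appendCodePoint(isValidXml10(cp) ? cp : ' ');
            // Advance by one char for BMP code points, two for surrogate pairs.
            i += Character.charCount(cp);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // The form feed between "a" and "b" is not a valid XML 1.0 character.
        System.out.println(sanitize("a\fb"));
    }
}
```

Calling `sanitize` on the document text before setting it on the CAS would avoid the serialization failure, at the cost of silently altering the input; whether that is acceptable depends on the application.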

Re: How to deploy a UIMA-AS Service with a Cas Multiplier?

2012-06-13 Thread Jörn Kottmann
The error was linked to my delegate imports in the AAE; one delegate was imported twice. It worked at first, and when I tried to scale it I got the error message in dd2spring. After removing the duplicate import it worked. The Reader is an AE which creates a new view, so it's not a CR. Did you fix

Re: How to deploy a UIMA-AS Service with a Cas Multiplier?

2012-06-13 Thread Jörn Kottmann
I am running the 2.3.1 release version on my cluster. Jörn On 06/13/2012 01:02 PM, Jörn Kottmann wrote: The error was linked to my delegate imports in the AAE; one delegate was imported twice. It worked at first, and when I tried to scale it I got the error message in dd2spring. After removing

How to deploy a UIMA-AS Service with a Cas Multiplier?

2012-06-11 Thread Jörn Kottmann
Hello all, I have an AAE which uses a Cas Multiplier to segment an input CAS into a couple of CASes. My pipeline is constructed like this: ReaderAE -> SegmentDetectionAE (just outputs segment annotations) -> SegmentationCM (cuts based on the previously detected annotations) -> WriterAE (should write

Re: How to deploy a UIMA-AS Service with a Cas Multiplier?

2012-06-11 Thread Jörn Kottmann
: XPTY0004: A sequence of more than one item is not allowed as the first argument of string() (@key, @key) After fixing it my delegates can be scaled as needed. Jörn On 06/11/2012 04:06 PM, Jörn Kottmann wrote: Hello all, I have an AAE which uses a Cas Multiplier to segment an input CAS

How to load XMI CAS from a DB inside a Reader AE?

2012-05-08 Thread Jörn Kottmann
Hi all, I have a database which contains XMIs and would like to process these with UIMA-AS. My pipeline looks like this: Database Reader AE -> Result Writer AE. I made an AAE composed of the two AEs; the first gets a CAS with an ID encoded into its sofa. It then fetches the XMI from the DB,

Re: CAS Editor instructions

2012-04-24 Thread Jörn Kottmann
On 04/24/2012 01:25 AM, Nicolas Hernandez wrote: I have not tested type system inclusion with the CAS editor. TypeSystem imports should work in 2.4.0. Lots of problems were fixed for this version. Jörn

Re: CAS Editor instructions

2012-04-22 Thread Jörn Kottmann
Looks like you are using an old version. The 2.4.0 version works better. I suggest that you try that one instead. In this version it asks you to point it to a type system if none for the XMI file can be found in the configured location (default is TypeSystem.xml in the root of the project).

Re: UIMA Annotator Library

2012-04-18 Thread Jörn Kottmann
Yes, you can use Apache OpenNLP. It's a very well written library for standard NLP tasks. See here: opennlp.apache.org Hope this helps, Jörn On 04/14/2012 10:34 AM, Jeremy Villalobos wrote: Hello: I have studied the User's Guide on creating Annotators, testing them and deploying them with a

Re: How do I assign more heap memory to CAS Editor Analysis Engine when running Training?

2012-04-13 Thread Jörn Kottmann
On 04/12/2012 11:32 AM, Peter Thygesen wrote: Strange. Still problems. I reduced the corpus files to 10 files. running with RunAE still doesn't produce any events, but when I run it with UIMA Analysis Engine configuration it works. That sounds strange, because it should not make a difference

Re: How do I assign more heap memory to CAS Editor Analysis Engine when running Training?

2012-03-23 Thread Jörn Kottmann
No, we added an eclipse launcher to start AEs, so that you do not have to use some kind of script. Jörn On 03/23/2012 02:06 AM, Fitch, Britt wrote: I assume you are running one of the scripts in the bin directory, if so, you can add the java -Xmx param to where it gets called. If you look in
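Regardless of how the JVM is launched (script or Eclipse launcher), it can help to confirm which heap limit is actually in effect. The following is a minimal sketch using only the standard `Runtime` API; the class name `HeapCheck` is a hypothetical helper for illustration and not part of UIMA or the CAS Editor.

```java
// Hypothetical helper for illustration: reports the heap limit of the running JVM.
final class HeapCheck {

    /** Returns the maximum heap the JVM will attempt to use, in MiB. */
    static long maxHeapMiB() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        // Run with e.g. "java -Xmx2g HeapCheck.java" to see the effect of -Xmx.
        System.out.println("Max heap: " + maxHeapMiB() + " MiB");
    }
}
```

Logging this value at the start of a training run shows whether an -Xmx setting reached the process that is doing the work.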

Re: How do I assign more heap memory to CAS Editor Analysis Engine when running Training?

2012-03-23 Thread Jörn Kottmann
On 03/23/2012 12:34 AM, Peter Thygesen wrote: Trying to build a token model with UIMA CAS editor. But I get an out of memory exception. The run configuration is a UIMA Analysis Engine configuration and does not have an arguments tab (like plain java apps) where I can set the command line arguments

Re: Merging CASes

2011-12-16 Thread Jörn Kottmann
At OpenNLP we take the gold data, extract the text, run it through one of our components and compute the scores on the fly. In your case I would do it similarly. I would take the XMI files generated from the gold standard corpus, copy the text and other input data from the initial view to a
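The scoring step mentioned above can be illustrated with a small sketch. This is not OpenNLP's evaluator; it assumes that gold and predicted annotations have already been extracted from their respective views and reduced to exact-match keys of the form `begin:end:type`. The class `SpanScorer` and its methods are hypothetical names.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch, not OpenNLP code: exact-match precision, recall and F1 over spans.
final class SpanScorer {

    /** Encodes an annotation as "begin:end:type" so that spans can be compared as set members. */
    static String span(int begin, int end, String type) {
        return begin + ":" + end + ":" + type;
    }

    /** Returns {precision, recall, f1} for predicted spans measured against gold spans. */
    static double[] score(Set<String> gold, Set<String> predicted) {
        // True positives are the predicted spans that also occur in the gold set.
        Set<String> hits = new HashSet<>(predicted);
        hits.retainAll(gold);

        double precision = predicted.isEmpty() ? 0.0 : (double) hits.size() / predicted.size();
        double recall = gold.isEmpty() ? 0.0 : (double) hits.size() / gold.size();
        double f1 = (precision + recall) == 0.0
                ? 0.0
                : 2 * precision * recall / (precision + recall);
        return new double[] {precision, recall, f1};
    }

    public static void main(String[] args) {
        Set<String> gold = Set.of(span(0, 5, "person"), span(10, 15, "location"));
        Set<String> predicted = Set.of(span(0, 5, "person"), span(20, 25, "person"));
        double[] s = score(gold, predicted);
        System.out.printf("P=%.2f R=%.2f F1=%.2f%n", s[0], s[1], s[2]);
    }
}
```

Exact matching on begin, end and type is the strictest choice; a real evaluation might accumulate the counts across all documents before computing the final scores.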

UIMA AS client and unknown exception classes

2011-12-08 Thread Jörn Kottmann
Hi all, I have a pretty generic UIMA AS client I reuse for various analysis pipelines. The client gets its input from an ActiveMQ queue and then sends it on to my UIMA AS services via the UIMA AS Client API. The loop which is sending requests to the UIMA AS service is using the sendCAS method.

Re: How to create an annotator that should reload its resources at runtime (without restarting the pipeline)?

2011-11-18 Thread Jörn Kottmann
On 11/18/11 3:17 PM, Spico Florin wrote: I'm using UIMA ConceptMapper Annotator. Due to the fact that its dictionary can be changed outside the UIMA pipeline (we have this scenario), I would like to reload the content of the used dictionary without restarting the whole pipeline. The pipeline is

Re: jcas, annotation, address, and id

2011-10-11 Thread Jörn Kottmann
On 10/11/11 10:33 PM, Coarr, Matt wrote: First, is the getAddress() method on Annotation (derived from FeatureStructureImpl according to the javadocs) the same value as the id value in the xmi serialization? As far as I know the ids are re-mapped at least when the CAS is created from an xmi

Re: Scaling using Hadoop

2011-10-06 Thread Jörn Kottmann
On 10/6/11 7:43 AM, Thilo Götz wrote: We use hadoop with UIMA. Here's the fit, in one case: 1) UIMA runs as the map step; we put the uima pipeline into the mapper. Hadoop has a configure (?) method where you can stick the creation and set up of the uima pipeline, similar to UIMA's

Re: UIMA- Support for HTML, PDF, Doc files

2011-09-29 Thread Jörn Kottmann
Hello, UIMA itself is just a framework to build analysis pipelines. To analyze HTML, PDF or Word documents you need a component which can extract the text from these formats. You can use Apache Tika together with our Tika integration in the addons project to extract text from various data

Re: Using JMX console with EC2

2011-08-24 Thread Jörn Kottmann
On 8/24/11 8:38 PM, Richard Eckart de Castilho wrote: I recently had trouble connecting to a JMX server running on Linux (not EC2 though). It worked after I specified a java.rmi.server.hostname The parameters I used on the server side in the end were: -Djava.rmi.server.hostname=192.168.0.25

Re: UIMA-AS Deployment Descriptor Editor installation fails

2011-08-14 Thread Jörn Kottmann
On 8/9/11 3:07 PM, Richard Eckart de Castilho wrote: That's true for users. It would still be nice for the developer one-time-setup to mention that this bundle needs to be installed. Because when I run e.g. the CAS Editor module as an Eclipse application, Eclipse does not try to download

Re: How to deploy multiple UIMA-AS Services?

2011-07-29 Thread Jörn Kottmann
I am just looking for a best practice or some advice on how this is usually done. Jörn On 7/29/11 2:05 PM, Jörn Kottmann wrote: Hi all, I changed my UIMA-AS Services a bit, and now have one analysis pipeline per language. Up to now I only had one pipeline and deployed it with the provided

Re: Cas Editor: selecting annotation type

2011-05-05 Thread Jörn Kottmann
On 5/5/11 12:37 PM, Jens Grivolla wrote: It seems to have picked it up ok, but I'm getting errors when opening an XMI with the Annotation editor: Caused by: org.eclipse.core.internal.resources.ResourceException: Resource '/OneOfMyClosedProjects' is not open. at

Re: Cas Editor: selecting annotation type

2011-05-05 Thread Jörn Kottmann
On 5/5/11 12:55 PM, Jens Grivolla wrote: On 05/05/2011 12:37 PM, Jens Grivolla wrote: I'm getting errors when opening an XMI with the Annotation editor: Caused by: org.eclipse.core.internal.resources.ResourceException: Resource '/OneOfMyClosedProjects' is not open. at

Re: Cas Editor: selecting annotation type

2011-05-05 Thread Jörn Kottmann
On 5/5/11 2:41 PM, Jens Grivolla wrote: On 05/05/2011 01:55 PM, Jörn Kottmann wrote: On 5/5/11 1:44 PM, Jörn Kottmann wrote: That sounds like one more good reason to do that. Another one I thought of is that it is confusing when you add an annotation which you cannot see afterward. So let's

Re: Cas Editor: selecting annotation type

2011-05-05 Thread Jörn Kottmann
On 5/5/11 6:09 PM, Jens Grivolla wrote: On 05/05/2011 03:04 PM, Jörn Kottmann wrote: On 5/5/11 2:41 PM, Jens Grivolla wrote: On 05/05/2011 01:55 PM, Jörn Kottmann wrote: On 5/5/11 1:44 PM, Jörn Kottmann wrote: That sounds like one more good reason to do that. Another one I thought

Re: Cas Editor: selecting annotation type

2011-05-04 Thread Jörn Kottmann
On 5/4/11 11:10 AM, Jens Grivolla wrote: Hi, I have recently started using the Annotation Editor (as installed in Eclipse from http://www.apache.org/dist/uima/eclipse-update-site/, i.e. the official 2.3.1 version). In order to add annotations it seems that you need to select the annotation

Re: How to detect text sofa?

2011-04-04 Thread Jörn Kottmann
On 4/4/11 2:17 PM, Richard Eckart de Castilho wrote: Hi Jörn, what is the suggested way to detect a text sofa? As far as I know the suggested way of doing it is via the mime type, right? Which options remain when the mime type is not set? Is CAS.getDocumentText != null appropriate? in my

Solr now supports UIMA

2011-04-04 Thread Jörn Kottmann
Hi all, some might already know it, the new Solr 3.1 has now support for UIMA: http://wiki.apache.org/solr/SolrUIMA Jörn

Re: How to detect text sofa?

2011-04-04 Thread Jörn Kottmann
On 4/4/11 8:43 PM, Eddie Epstein wrote: On Mon, Apr 4, 2011 at 6:52 AM, Jörn Kottmann kottm...@gmail.com wrote: Hi all, what is the suggested way to detect a text sofa? As far as I know the suggested way of doing it is via the mime type, right? Which options remain when the mime type is not

Re: Referring to file resources

2011-03-02 Thread Jörn Kottmann
On 3/2/11 3:25 PM, Adam Lally wrote: file:com/apache.d.txt is a relative path; relative paths for resources are resolved using the classpath and/or the datapath. For the file protocol, URLs starting with file:/ or file:/// are absolute. What is the default data path directory when deploying
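The distinction between relative and absolute file URLs described above can be checked with the standard `java.net.URI` class. This is a minimal sketch only: the class `FileUrlKind` is a hypothetical helper, and it does not show how UIMA itself resolves relative resource paths against the classpath or datapath.

```java
import java.net.URI;

// Hypothetical helper for illustration: classifies file URLs as absolute or relative.
final class FileUrlKind {

    /** Returns true for file URLs with an absolute path, such as file:/x or file:///x. */
    static boolean isAbsoluteFileUrl(String url) {
        URI uri = URI.create(url);
        // "file:com/apache.d.txt" parses as an opaque URI: its scheme-specific part
        // does not begin with a slash, so it has no path component at all.
        return "file".equalsIgnoreCase(uri.getScheme())
                && !uri.isOpaque()
                && uri.getPath() != null
                && uri.getPath().startsWith("/");
    }

    public static void main(String[] args) {
        for (String url : new String[] {
                "file:com/apache.d.txt", "file:/com/apache.d.txt", "file:///com/apache.d.txt"}) {
            System.out.println(url + " absolute? " + isAbsoluteFileUrl(url));
        }
    }
}
```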

Re: UIMA-AS Service gets killed when idle

2011-03-01 Thread Jörn Kottmann
On 3/1/11 3:25 PM, Jaroslaw Cwiklik wrote: Hmm, strange. Perhaps putting the process in a background would help like so: nohup bin/deployAsyncService.sh Should be easy to test. Start the process and exit the shell. See what happens. Anything in the log or console indicating OOM or some other

Re: Analysis Engines for mbox like data

2011-02-14 Thread Jörn Kottmann
On 2/14/11 4:49 AM, Radhouane Aniba wrote: Hello everyone, Quite an unusual request to this list: I am wondering if there is any analysis engine that allows mining MBOX-like formats, such as the famous mailman mailing list archives, in a way that allows structuring this kind of data into

Re: CAS Editor questions

2011-01-25 Thread Jörn Kottmann
On 1/25/11 1:15 PM, Peter Klügl wrote: Hello, how can I open an xmiCAS that's not in a corpus folder? I'm using the version from the current update site. Is this functionality already included there, or not until the next release? Yes that should be possible. Just double click the xmi file

Re: CAS Editor questions

2011-01-25 Thread Jörn Kottmann
On 1/25/11 1:26 PM, Peter Klügl wrote: Sorry, some additional questions: Can I somehow automatically activate the annotation mode of several types? We want to annotate more than 12 different types and it takes some time for each document to select these types. Yes that should just work.

Re: CAS Editor type system questions

2011-01-20 Thread Jörn Kottmann
On 1/20/11 1:40 PM, Peter Klügl wrote: Hello, when using the CAS Editor I have two problems and I want to ask if I'm doing something wrong: I can't open an xmi if it contains annotations of types that are not defined in the specified type system. Yes, that is a limitation.

Re: CAS Editor type system questions

2011-01-20 Thread Jörn Kottmann
On 1/20/11 2:00 PM, Peter Klügl wrote: Hello Jörn, On 20.01.2011 13:49, Jörn Kottmann wrote: On 1/20/11 1:40 PM, Peter Klügl wrote: Hello, when using the CAS Editor I have two problems and I want to ask if I'm doing something wrong: I can't open an xmi if it contains annotations

Re: CAS Editor as a stand-alone application?

2010-12-02 Thread Jörn Kottmann
On 12/2/10 5:05 PM, Philip Ogren wrote: I am wondering if it is possible to run the CAS Editor as a stand-alone application or if it is only available as a plugin within Eclipse. It could be a part of an RCP application. There might still be a few minor issues which must be solved, but I am

Serialize only one view of a CAS

2010-10-20 Thread Jörn Kottmann
Hi everyone, is it possible to serialize only one view of a CAS? In my case the _InitialView contains an XML document, which is transformed by an AE into a plain text representation, and placed in a view called Article. The serialization code now should only write the Article view into the