Re: Right to Left Languages in CVD and Document Analyzer

2015-10-24 Thread Thilo Götz
I don't know about the Document Analyzer, but in CVD, we use a standard Swing text area. It may depend on the JVM and the platform, but it has worked for me for Arabic in the past. When your text is in Unicode, the directionality of the writing is part of the information that comes with the cha

Re: jcas, annotation, address, and id

2011-10-12 Thread Thilo Götz
I'm probably missing something here: why aren't you just passing a reference to the JCas annotation object? You store them as object references in your non-UIMA aware module and cast them back to annotations when you get back to UIMA land. --Thilo On 12/10/11 14:34, Coarr, Matt wrote: > Thanks J

Re: Scaling using Hadoop

2011-10-05 Thread Thilo Götz
our hadoop setup, but for the batch scenarios, it's not worth the trouble. --Thilo On 06/10/11 07:43, Thilo Götz wrote: > On 05/10/11 22:43, Marshall Schor wrote: >> We use hadoop with UIMA. Here's the "fit", in one case: >> >> 1) UIMA runs as the map step; we

Re: Scaling using Hadoop

2011-10-05 Thread Thilo Götz
to single failures of disk drives. > > HTH. -Marshall > > On 10/5/2011 2:24 PM, Greg Holmberg wrote: >> On Tue, 27 Sep 2011 01:06:02 -0700, Thilo Götz wrote: >> >>> On 26/09/11 22:31, Greg Holmberg wrote: >>>> >>>> This is what I'm d

Re: Simple Server

2011-09-29 Thread Thilo Götz
On 29/09/11 17:20, Uli Held wrote: > Hi, > > I'm trying to use SimpleServer to create a Website, that uses NLP services. > Unfortunately I can't get past the PEAR installation process of the > SimpleServerServlet. For the tokenizer I use a resource binding which is > located in root/resources a

Re: remoteAnalysisEngine services not scaling to effect

2011-09-27 Thread Thilo Götz
On 26/09/11 22:31, Greg Holmberg wrote: > Arun-- > > > I don't know what the cause of your specific technical issue is, but in my > opinion, there's a better way to slice the problem. > > What you're doing is taking each step in your analysis engine and running it > on > one or more machines.

Re: Iterate over annotations with multiple types

2011-09-07 Thread Thilo Götz
On 07/09/11 15:21, Richard Eckart de Castilho wrote: > It really depends on the data in your CAS. As far as I know, there is > typically only one big annotation index - if you get an iterator for a > specific type, a filtered iterator is created internally and returned. The > only thing to speed

Re: iterate over annotations

2011-08-23 Thread Thilo Götz
On 23/08/11 16:13, Richard Eckart de Castilho wrote: > Hello Alex, > > you'll probably have to use cas.getIndexRepository().getAllIndexedFS(type) to > get to your custom annotations. You can also define a custom index and then > use cas.getIndexRepository().getIndex(label). What Richard says.

Re: XMI Cas Serialization and Control Characters

2011-07-22 Thread Thilo Götz
And a volunteer to add base64 encoding/decoding to the XML serialization code would be welcome as well. That would get rid of all these issues. --Thilo On 22/07/11 02:44, Marshall Schor wrote: > Also, see the main documentation about this, here: > http://uima.apache.org/d/uimaj-2.3.1/tutorials_a

Re: PEAR packaging and maven

2011-05-27 Thread Thilo Götz
On 27/05/11 09:57, Jens Grivolla wrote: > On 05/26/2011 08:37 PM, Greg Holmberg wrote: > >> [...] What I want may simply be outside the design target of PEAR files. My >> expectations of PEAR files were based on how other archive formats in >> Java work. JAR files, WAR files, etc. These can all be

Re: CR+LF = 1 character?

2011-04-20 Thread Thilo Götz
On 4/20/2011 14:31, Steven Bethard wrote: > On Wed, Apr 20, 2011 at 10:58 AM, Jens Grivolla wrote: >> As it turns out, the other system considers CR+LF (Windows style line >> endings) to be two characters, while UIMA sees it as one. > > As Jörn suggested, this is probably a bug in the code somewh

Re: Eclipse Run Configurations...

2011-02-15 Thread Thilo Götz
Eclipse, there could be a different solution. --Thilo > > On Tue, Feb 15, 2011 at 2:11 AM, Thilo Götz wrote: >> It may seem odd, but for this to work, you need to follow the >> instructions on how to set up the UIMA example code in Eclipse: >> http://uima.apa

Re: Eclipse Run Configurations...

2011-02-14 Thread Thilo Götz
It may seem odd, but for this to work, you need to follow the instructions on how to set up the UIMA example code in Eclipse: http://uima.apache.org/d/uimaj-2.3.1/overview_and_setup.html#ugr.ovv.eclipse_setup.example_code Note to self: the link in the tutorial is broken. Can we have links that do

Re: Analysis Engines for mbox like data

2011-02-14 Thread Thilo Götz
Not sure where you want to go with this, but one approach might be to preprocess your data into some better structured format, and only start your UIMA analysis after you've done that. I have used a project called mstor on sourceforge to process mbox files, and then you can use javamail to get at

Re: How to use a pear package, classpath problem with CAS Visual Debugger

2011-01-27 Thread Thilo Götz
g the directory paths. I'm not following. What's the difference between directory and file paths? Can you give us an example? > > I think it would be very useful to have the same expansion mechanism > for the classpath of the VM. > > Thomas > > 2011/1/27 Thilo Gö

Re: How to use a pear package, classpath problem with CAS Visual Debugger

2011-01-27 Thread Thilo Götz
On 1/26/2011 19:20, Thomas Heitz wrote: >>> Dear UIMA users, >>> >>> What is the correct way to use a PEAR package for a final customer? > > So I runPearInstaller.sh/bat is the normal way to install a pear > package for a final customer. > And any other tools not provided by Apache UIMA should w

Re: How to use a pear package, classpath problem with CAS Visual Debugger

2011-01-26 Thread Thilo Götz
I had to moderate your messag through, please subscribe to the list. On 1/26/2011 17:29, Thomas Heitz wrote: > Dear UIMA users, > > What is the correct way to use a PEAR package for a final customer? > > We used the following tools: > - Pear Installer > - CAS Visual Debugger > > Unfortunately w

Re: CAS Editor type system questions

2011-01-20 Thread Thilo Götz
On 1/20/2011 13:49, Jörn Kottmann wrote: > On 1/20/11 1:40 PM, Peter Klügl wrote: >> Hello, >> >> when using the CAS Editor I have two problems and I want to ask if I'm doing >> something wrong: >> >> I can't open a xmi if it contains annotations of types that are not defined >> in >> the specifi

Re: runPearMerger on already merged PEARs

2010-12-15 Thread Thilo Götz
On 12/14/2010 19:18, Jens Grivolla wrote: > It seems that runPearMerger.sh does not correctly adjust the paths when the > input PEARs are already a merge. > > On first run `runPearMerger.sh ae1.pear ae2.pear -n ae12` the paths to > resources > get adjusted from $main_root/X to $main_root/ae1/X or

Re: SimpleServer configuration with Sofas

2010-12-08 Thread Thilo Götz
Hi Ben, the SimpleServer is not Sofa-aware, and neither am I ;-). I don't think there should be an exception, though. Can you please post the full stack trace, maybe that will help. To get the PDF extractor working, can't you change the default view somehow, so that CAS.getDocumentText() will r

Re: type/feature standard name

2010-11-29 Thread Thilo Götz
On 11/29/2010 15:43, Andrea Turbati wrote: > Hello, > > is there a standard regarding the name of Types and Features in UIMA? I've > search in the tutorial files, but I've not found a precise reference to this > aspect. I know that Types can have "." in them (as java package), but that's > all >

Re: Initialization of annotator class "foo" failed.

2010-11-23 Thread Thilo Götz
I'd suggest you take the working room number sample and refactor it one little step at a time until you have your POS tagger. Perhaps while doing so you'll figure out what went wrong in the first place. --Thilo On 11/23/2010 10:07, A. Schaller wrote: > Thilo Götz writes: >

Re: Initialization of annotator class "foo" failed.

2010-11-22 Thread Thilo Götz
Does your class really live in Lesson4/src/de.tudarmstadt.ukp.teaching.tutorial.lesson4/PosTagger.java? Shouldn't that be de/tudarmstadt/upk/...? --Thilo On 11/22/2010 10:30, A. Schaller wrote: > Hey together, > > I am having troubles with UIMA running a AE Descriptor or CPE Descriptor. > When

Re: Problems installing UIMA on W7

2010-11-17 Thread Thilo Götz
On 11/16/2010 18:47, Erik-Lân Do Dinh wrote: > Thilo Götz writes: > >> I think the problem is not the UIMA install path, but the Java install >> path. It chokes on the closing paren of the (x86) part (great idea, MS). >> >> I don't have the time to tr

Re: Problems installing UIMA on W7

2010-11-16 Thread Thilo Götz
On 11/16/2010 16:44, Erik-Lân Do Dinh wrote: > Hello everyone, > > I want to install UIMA on a Windows 7 (x64) machine, Java version is 1.6 > (x86). > I installed UIMA according to http://uima.apache.org/doc-uima-examples.html, > double checked all path entries and install path. Now, when I try t

Re: Training and Learning

2010-11-12 Thread Thilo Götz
On 11/12/2010 15:39, borobudur wrote: > Hi, I had a look at the UIMA architecture and I was asking myself if there is > a > konzept for a training and learning phase in UIMA. > What I learned is that UIMA chains Analysis Engines together. There is no two > phase concept like training and extracti

Re: BigDecimal as CAS type

2010-11-10 Thread Thilo Götz
On 11/10/2010 15:40, Frank Schilder wrote: > Hi all, > > I'm trying to represent the value of a money amount as BigDecimal. For > example $3.25 should be annotated with a Money annotation. The Money > annotation should have a feature called val and the value of this feature > should be of type Big

Re: printing features values

2010-11-09 Thread Thilo Götz
On 11/9/2010 16:04, Andrea Turbati wrote: > Hi all, > I'm trying to make a simple function which take an annotation and a Type and > print all the value associated with all the feature present in the Type. > This is the recursive function: > > public void printTypeSystem(Type uimaType, Annotation

Re: ResourceInitializationException: Could not access the resource data at file:english/BrownModel.dat. with the HMMTagger

2010-10-13 Thread Thilo Götz
On 10/13/2010 06:42, Zunaid Kazi wrote: > Hello, > > I checked out the HMM tagger and was trying to execute it via eclipse (I > have eclipse setup for UIMA and am able to run the various uimaj examples > with no problem). > > I get an exception when I try to load the AE via the CVD. I am appendin

Re: Literatur

2010-10-12 Thread Thilo Götz
On 10/12/2010 09:30, armin.weg...@bka.bund.de wrote: > Hi, > > can you recommend some good literature for an UIMA noobie? Can your > recommend some good literature about Apache UIMA at all? > > Thanks, > > Armin The only textbook I'm aware of is Graham Wilcock‌'s "Introduction to Linguistic Ann

Re: Mutable text and annotations...

2010-09-09 Thread Thilo Götz
Hi, On 9/9/2010 01:00, Jim Hargrave wrote: > I apologize if my terminology doesn't match with normal UIMA usage - but > hopefully the general idea will be understandable. > > Is it always assumed that UIMA's document text is immutable? yes. > Let's say you have some text and with several posi

Re: libs of Annotator

2010-07-29 Thread Thilo Götz
On 7/29/2010 12:00, hawk wrote: > Hello, >I have checked out a copy of Dictionary form sandbox, but there is no lib > directory under the project. would you please tell me how to get the > dict_annot_xml_types.jar? This jar gets generated during the build. You need to build with Maven, as des

Re: Classpath/Classloader issue

2010-06-30 Thread Thilo Götz
On 6/29/2010 19:26, Serg wrote: > Serg writes: >> >> If I put WS jars into JMV /ext dir -it will work. >> I created another version of web-service client based on Axis. And it works >> without moving Axis jars to JVM re/lib/ext directory. All Axis jars are in >> PEAR/lib directory. Surprise >> >>

Re: Classpath/Classloader issue

2010-06-25 Thread Thilo Götz
On 6/25/2010 17:11, Serg wrote: > Hi all! > I have a problem with executing PEAR file under cpeGui and others. > My scenario: > 1) UIMA 2.2 or 2.3 > 2) IBM Java 6 (doesn't support JAX-WS v.2.1) > 3) JAX-WS v.2.1 libraries inside PEAR /lib dir. > 4) PEAR file that call web service via JAX-WS librari

Re: pretty print annotations in txt format

2010-06-24 Thread Thilo Götz
On 6/24/2010 11:22, Alice Lewnes wrote: > hi, > > just wondering if there's any readily available uima component/ > cas consumer that does the above mentioned task? > for example, given PersonTitleAnnotator and input text > "Mr. Smith loves to play golf.", i want my output to look like > "Mr.S

Re: Problem with xml namespace handling in XmiCasDocSerializer

2010-06-22 Thread Thilo Götz
Hi, attachments don't work on the mailing lists. Please open a Jira issue and attach the patch there. Thanks. --Thilo On 6/22/2010 16:51, Martin Schafföner wrote: > Hi, > > while experimenting with UIMA I observed a problem similar to the one > already posted here: http://tinyurl.com/32spzv9.

Re: Exception trying to run Analysis Example

2010-06-22 Thread Thilo Götz
Could be a Java bug. Java 5 is at service release 22, you're at 10. Please get the latest version and try again. --Thilo On 6/22/2010 15:07, Alex Rodriguez Lopez wrote: > Hi UIMA List!! > > This is my first message to the list, we are starting a project that > will involve NLP and semantic stu

Re: Eclipse update site empty?

2010-05-06 Thread Thilo Götz
I think Marshall is working on it. Please check back later today. --Thilo On 5/6/2010 09:11, Greg Holmberg wrote: > The website says this is the Eclipse update site > > http://www.apache.org/dist/uima/eclipse-update-site/ > > but only /dist/uima exists, and it is empty. > > Has it moved? > >