Hello again! One down, one to go. Are there best practices or tricks to reduce Ruta's memory needs? I tried to use the following script to merge names.
Document{->GREEDYANCHORING(true)}; First+ Full {->MARK(Full)}; Full Last+ {->MARK(Full)}; First+ Last+ {->MARK(Full)}; Document{->GREEDYANCHORING(false)}; Full{PARTOFNEQ(Full) -> UNMARK(Full)}; First{PARTOF(Full) -> UNMARK(First)}; Last{PARTOF(Full) -> UNMARK(Last)}; The engine description is create by ruta-maven-plugin:2.4.0 and used with uimaFIT's AnalysisEngineFactory.createEngineDescription("fullyQualifiedDescriptorNameWithoutXmlExtension"). For a 18 Mbyte text, it needs Gbytes of RAM. Cheers, Armin
pgpeFgGCH1gq3.pgp
Description: PGP signature