ote:
> Hi,
>
> Is anything in
> https://uima.apache.org/d/uimaj-2.10.2/tutorials_and_
> users_guides.html#ugr.tug.mvs.name_mapping_application
> helpful?
>
> If not, could you add some details that says why not?
>
> -Marshall
>
>
> On 7/5/2018 8:52 AM, Jens
ators use that to select
> the view(s) to operate on.
>
> Also sounds possible to have a single new component use the cascopier to
> create a new view that is always the one processed.
>
> Regards,
> Eddie
>
> On Thu, Jul 5, 2018 at 8:52 AM, Jens Grivolla wrote:
>
> &
Hi,
I'm trying to run an already instantiated AE on a view other than
_InitialView. Unfortunately, I can't just call process() on the desired
view, as there is a call to Util.getStartingView(...)
in PrimitiveAnalysisEngine_impl that forces it back to _InitialView.
The view mapping methods I
Hi Ben,
if I understand correctly you want to run a process once the whole
collection has been analyzed. You can have an AnalysisEngine that does this
by implementing
http://uima.apache.org/d/uimaj-2.10.0/apidocs/org/apache/uima/analysis_engine/AnalysisEngine.html#collectionProcessComplete()
You
Is there a specific reason to use PEARs?
As far as I remember (but I could be wrong, it's been a few years), the
main advantages of using them (automatic class path configuration, some
degree of isolation between components) was lost when we wanted to change
configuration parameters because then
And I guess you don't get JCAS classes for your type system without going
through JCasGen, which is another disadvantage to generating the types on
the fly. It also kind of goes against the fact that the type system should
be something you can rely on for communication between components, so it
t) and a Java API method
> (HttpSolrClient.getById()) to do so.
>
> Thanks to your help, I have a constantly fast queries, now.
>
> Cheers,
> Armin
>
> -Ursprüngliche Nachricht-
> Von: j...@grivolla.net [mailto:j...@grivolla.net] Im Auftrag von Jens
> Grivolla
Solr is known not to be very good at deep paging, but rather getting the
top relevant results. Running a query asking for the millionth document is
pretty much the worst you can do as it will have to rank all documents
again, up to the millionth, and return that one. It can also be unreliable
if
ected time slot for
> uimaFIT is in December.
>
> Best,
>
> -- Richard
>
> > On 23.10.2015, at 11:09, Jens Grivolla <j+...@grivolla.net> wrote:
> >
> > I'd really like to have that functionality also (we'll need to do
> something
> > like that qui
I'd really like to have that functionality also (we'll need to do something
like that quite soon), so I just voted on the issue...
I haven't tested the patch yet. José, have you been using this over the
last few months?
-- Jens
On Sun, Feb 1, 2015 at 2:04 AM, José Tomás Atria
Hi Matt,
As Richard said, using Views is more designed for having "parallel"
information, such as separate layers of audio, transcript, video, etc.
referring to the same content or "document".
I'm not quite sure why you want to split your document for processing
(which you could do with a CAS
Hi Khirod,
could it be that your single-word document doesn't get marked as a
sentence? You have SpanFeatureStructure set to com.naukri.parse.type.Sentence,
so ConceptMapper only works on things that are within a Sentence
annotation. Tokens that are not part of a sentence will not be seen at all.
On Mon, Jun 15, 2015 at 8:43 AM, Mario Gazzo mario.ga...@gmail.com wrote:
I am referring to to this Github repo:
https://github.com/apache/uima-uimafit
https://github.com/apache/uima-uimafit
Thought it was published by you as a mirror of the SVN repo or the other
way around.
Yes, this
Hi Peter, while I don't think I will be using the HtmlConverter right away,
I would vote for using the length of the document annotation for
annotations that relate to the whole document (such as metadata). That
makes them show up nicely in the CasEditor/Viewer and you could maintain it
in all
Hi Silvestre,
there doesn't seem to be anything RUTA-specific in your question. In
principle, UIMA-AS allows parallel scaleout and merges the results (though
I personally have never used it this way), but there are of course a few
things to take into account.
First, you will of course need to
The workshop program, along with links to the full papers, is now
available: http://glicom.upf.edu/OIAF4HLT/Program.html
I'm looking forward to seeing many of you there. I'll be staying at DCU
(College Park).
-- Jens
On Tue, Jul 1, 2014 at 6:52 PM, Jens Grivolla j+...@grivolla.net wrote
26, 2014 at 2:34 PM, Jens Grivolla j+...@grivolla.net wrote:
Workshop on Open Infrastructures and Analysis Frameworks for HLT
http://glicom.upf.edu/OIAF4HLT/
At the 25th International Conference on Computational Linguistics
, with a strong UIMA presence as
well as a chance to see how other frameworks deal with many of the same
issues that we encounter.
I hope to see many of you there. And thanks to those who have already
submitted their paper to the workshop. :-)
-- Jens
On Thu, May 1, 2014 at 12:13 AM, Jens Grivolla j
2014
You can find the workshop description and CFP at
http://glicom.upf.edu/OIAF4HLT/
I hope to see you there and look forward to your contributions.
-- Jens
On Wed, Mar 26, 2014 at 2:34 PM, Jens Grivolla j+...@grivolla.net wrote:
Workshop on Open Infrastructures and Analysis Frameworks
On Mon, Mar 31, 2014 at 10:12 PM, Marshall Schor m...@schor.com wrote:
On 3/26/2014 9:44 AM, Jens Grivolla wrote:
Finally, despite the fact that UIMA does not appear in the title anymore,
would it be possible to have an announcement on the UIMA web page?
I think so (unless others disagree
, despite the fact that UIMA does not appear in the title
anymore, would it be possible to have an announcement on the UIMA web page?
-- Jens
On 05/02/14 11:46, Jens Grivolla wrote:
We have been asked to merge our workshop with a similar one focusing on
open infrastructures. The result is a Workshop
/jgrivolla/coling2014-nlp-framework-workshop/blob/master/cfp.md
-- Jens
On 19/01/14 15:40, Jens Grivolla wrote:
I have sent the proposal, we'll see what they say...
-- Jens
On 17/01/14 15:02, Jens Grivolla wrote:
On 15/01/14 20:51, Richard Eckart de Castilho wrote:
On 15.01.2014, at 15:10
I have sent the proposal, we'll see what they say...
-- Jens
On 17/01/14 15:02, Jens Grivolla wrote:
On 15/01/14 20:51, Richard Eckart de Castilho wrote:
On 15.01.2014, at 15:10, Jens Grivolla
j+...@grivolla.net wrote:
The CFP itself must still be rewritten to be less UIMA-centric, other
On 15/01/14 20:51, Richard Eckart de Castilho wrote:
On 15.01.2014, at 15:10, Jens Grivolla j+...@grivolla.net wrote:
The CFP itself must still be rewritten to be less UIMA-centric, other than that
this is starting to look quite good.
GATE developer Mark A. Greenwood did the rewrite and sent
Thanks, fixed.
On 14/01/14 19:04, Peter Klügl wrote:
Hi,
Just a small correction:
The last workshop had nine paper presentations and one invited talk.
Best,
Peter
Am 14.01.2014 18:11, schrieb Jens Grivolla:
Hello, there's only 5 days remaining to submit the workshop proposal.
Please
Dear Luca and Sylvain,
as you can see the workshop is still in the proposal phase. If it is
accepted by the COLING organizers pricing etc. will be set by them.
It will of course be possible to attend without presenting a paper, and
on the other hand we are open to all kinds of contributions,
that this is starting to look quite good.
Any input is welcome, so if you have any suggestions hurry up...
-- Jens
On 15/01/14 10:41, Jens Grivolla wrote:
Thanks, fixed.
On 14/01/14 19:04, Peter Klügl wrote:
Hi,
Just a small correction:
The last workshop had nine paper presentations and one
-- Jens
On 19/12/13 13:29, Jens Grivolla wrote:
On 19/12/13 13:08, Peter Klügl wrote:
Am 19.12.2013 12:31, schrieb Jens Grivolla:
Ok, it's time to seriously get started on this.
I guess we can start with the GSCL workshop description, and maybe
make it more inclusive for other frameworks (GATE
On Jan 14, 2014, at 12:11 PM, Jens Grivolla j+...@grivolla.net wrote:
Hello, there's only 5 days remaining to submit the workshop proposal. Please
anybody interested get in touch.
I sent a mail to the GATE user list to get some input from them. The proposal
draft is here:
https://github.com
with a few already
confirmed PC members (Richard, ...)
I'll get started with a first draft. Any input is welcome.
Please also indicate if you plan to submit an article, in order to have
a first idea of what to expect...
Thanks,
Jens
On 21/10/13 11:44, Jens Grivolla wrote:
Hi, at GSCL 2013 we
is out-of-sync
with the video, like lots of u-tube things seem to be), etc.
-Marshall
On 12/4/2013 9:31 AM, Jens Grivolla wrote:
Hi, we're now starting the EUMSSI project, which deals with integrating
annotation layers coming from audio, video and text analysis.
We're thinking to base it all
annotations after mapping the offsets.
-- Jens
On 05/12/13 10:04, Jens Grivolla wrote:
I agree that it might make more sense to model our needs more directly
instead of trying to squeeze it into the schema we normally use for text
processing. But at the same time I would of course like to avoid having
Hi, we're now starting the EUMSSI project, which deals with integrating
annotation layers coming from audio, video and text analysis.
We're thinking to base it all on UIMA, having different views with
separate audio, video, transcribed text, etc. sofas. In order to align
the different views
from Annotation? The getCoveredText() will
not work anyway if you are working with audio/video data.
-- Richard
On 04.12.2013, at 12:31, Jens Grivolla j+...@grivolla.net wrote:
Hi, we're now starting the EUMSSI project, which deals with integrating
annotation layers coming from audio, video
to handle them?
Cheers,
-- Richard
On 27.11.2013, at 07:47, Jens Grivolla j+...@grivolla.net wrote:
Hi,
so far we were using PEARs to manage different configurations of components,
e.g. having a CountryMapper, CityMapper, PersonMapper, etc., all based on
ConceptMapper but with different settings
Hi,
so far we were using PEARs to manage different configurations of
components, e.g. having a CountryMapper, CityMapper, PersonMapper, etc.,
all based on ConceptMapper but with different settings/models.
How would I do that in uimaFIT? Basically I would like to create
components that just
parameter to createEngineDescription()?
Bye,
Jens
On 10/24/2013 05:28 PM, Jens Grivolla wrote:
Hi, I'm trying to run ConceptMapper from uimaFIT, but
createDependencyAndBind doesn't seem to allow to separate declaring the
external resource (with a name) and binding that name to a key. I
looked through
Hi, at GSCL 2013 we talked a bit about options for the next UIMA
workshop. How about trying to have it at COLING 2014?
WORKSHOP TIMELINE
• 19th January 2014: Workshop proposals due
• 26th January 2014: Notification of workshop acceptances
• 18th July 2014:
On 10/18/2013 10:06 AM, Armin Wegner wrote:
What are you doing with very large text documents in an UIMA Pipeline, for
example 9 GB in size.
Just out of curiosity, how can you possibly have 9GB of text that
represent one document? From a quick look at project gutenberg it seems
that a full
Wegner wrote:
Hi Jens,
It's a log file.
Cheers,
Armin
-Ursprüngliche Nachricht-
Von: Jens Grivolla [mailto:j+...@grivolla.net]
Gesendet: Freitag, 18. Oktober 2013 11:05
An: user@uima.apache.org
Betreff: Re: Working with very large text documents
On 10/18/2013 10:06 AM, Armin Wegner
created UIMA-3346 (https://issues.apache.org/jira/browse/UIMA-3346) as
for other descriptor based workflows it would still be very useful to
have automatically generated descriptors that are ready to use with type
system imports.
Bye,
Jens
On 10/08/2013 12:04 PM, Jens Grivolla wrote:
Hi, I'm
?
-- Richard
On 14.10.2013, at 11:30, Jens Grivolla j+...@grivolla.net wrote:
I gave up on integrating uimaFIT-based builds with PEAR packaging, there are
fundamental differences that I don't know how to resolve cleanly, in particular:
uimaFIT: 1 maven artifact = N analysis engines = N
It sounds to me like it would be much easier to just have a custom
collection reader that outputs one CAS per document (i.e. multiple CASes
per input file), rather than having a CR that outputs one CAS per file
(with just metadata) plus an additional AE to generate the real CASes
from there.
On 04.10.2013, at 12:41, Jens Grivolla j+...@grivolla.net wrote:
Hi,
I tried using the uimafit maven plugin, in particular the generate goal
(trying to make it play nice with the pear packaging plugin). However, the generated
descriptor does not include the type system imports, even though
Hi,
I tried using the uimafit maven plugin, in particular the generate
goal (trying to make it play nice with the pear packaging plugin).
However, the generated descriptor does not include the type system
imports, even though they are specified through types.txt.
Is there some way to get
Same here, our own stuff relies on higher versions of Java anyway.
Jens
On 07/29/2013 07:55 AM,
armin.weg...@bka.bund.de wrote:
No, not for me. You can even switch to Java 7.
Armin
-Ursprüngliche Nachricht-
Von: Marshall Schor [mailto:m...@schor.com]
Gesendet: Sonntag, 28. Juli 2013
Hi,
we are sometimes running into problems with Maven when we want to define
tasks to move resources into specific locations, etc. This seems to
often lead to having to use quite a few Maven plugins and makes the POM
hard to manage.
Would Gradle be a better option, in order to have the
Hi,
we currently (almost always) use the CPE to run our AEs (packaged as
PEARs and then installed). However, we would like to start packaging our
AEs differently to make it easier to also use them programatically, or
e.g. include them in Solr using SolrUima. To do so we have started to
Hi, while not strictly a UIMA issue, we have a problem that seems very
relevant in the context of UIMA analysis engines: how to manage large
binary resources such as trained models used by an AE, etc.
So far, we have managed to achieve a good separation between code
development and the actual
What do you want to do? Map-reduce is batch processing, whereas a UIMA
AE works online, so this doesn't really fit.
In Mahout map-reduce is usually used for training, not e.g. for applying
a trained classifier. So you would train whichever way you want (e.g.
using map-reduce, etc.), but your
Hi,
the UIMA-HPC page contains a nice screenshot of what looks like a
graphical tool for configuring UIMA flows. Is it (or anything like it)
available to the public?
Thanks,
Jens
Hi,
we're thinking of using UIMA for multimodal multimedia annotation (text,
video, audio, ...), but have found little information of people actually
doing that. I did find an old post by Burn Lewis about donating the GALE
type system (Donation of a widely used type system for multi-modal
Hi all,
from what I understand this does not involve CAS multipliers at all, but
simply a flow where all CAS consumers are done in one parallel step.
Apparently this can't be done in a CPE so you would need an aggregate of
all the CAS consumers, and have a parallel flow controller for that
This sounds like you are actually looking for the project next door: Mahout.
UIMA really doesn't have a lot to do with clustering (although you could
do some things). We do use UIMA for information extraction *before*
clustering and sending it to Solr, though, as a sort of preprocessing to
We actually do that all the time, it works perfectly. Some archive
managers even let you edit the file without unpacking it. You may need
to rename it from .pear to .zip and back to .pear when you're done.
Jens
On 04/26/2012 06:10 PM, Marshall Schor wrote:
Thanks Thilo.
Could you unzip the
On 03/25/2012 03:35 PM, Eric Buist wrote:
[UIMA chooses bad look-and-feel on some platforms]
Fortunately, I found a workaround: pass
-Dswing.systemlaf=com.sun.java.swing.plaf.gtk.GTKLookAndFeel in the JVM
arguments. That overrides the bad guess of the JVM and fallback to
Metal. Note that the
Hi,
it appears that InlineXMLCasConsumer depends on the system locale for
some internal transformations. The output appears to be written in UTF8
(outStream.write(xmlAnnotations.getBytes(UTF-8));) but when used on a
machine with a locale of ASCII all accented characters get broken.
I
On 02/21/2012 04:08 PM, Thilo Goetz wrote:
On 21/02/12 15:59, Jens Grivolla wrote:
it appears that InlineXMLCasConsumer depends on the system locale for
some internal transformations. The output appears to be written in UTF8
(outStream.write(xmlAnnotations.getBytes(UTF-8));) but when used
Hi Nicolas,
we haven't really made any progress. Right now we're using only Java
within the UIMA pipeline (and one C++ annotator).
We then generate XMIs (or in some cases inline XML to get annotations
aligned automatically) and work on that in Python, without a library and
probably not even
Hi,
that's basically what we are doing, too.
If the PEAR is configured correctly, the CLASSPATH and uima.datapath
should appear in install.xml and setenv.txt, and you could use those to
set your classpath in your executor.bat. You would then avoid having to
define
On 11/15/2011 02:55 PM, Spico Florin wrote:
Hello!
I have an UIMA AS application that is using third party libraries. I
would like to know the following:
1. Where (location) we can add these third libraries such that the
deployed application to be aware of them and not throwing
I'm not sure how you would want to expose that functionality. Since
input and output would be done through the API, those are basically your
Reader and your Consumer. How would you expose other CollectionReaders
and CasConsumers as a web service?
AAEs are obviously no problem, since they are
On 05/26/2011 08:37 PM, Greg Holmberg wrote:
[...] What I want may simply be outside the design target of PEAR files. My
expectations of PEAR files were based on how other archive formats in
Java work. JAR files, WAR files, etc. These can all be use in-place,
without any re-writing of their
On 05/10/2011 10:13 AM, Richard Eckart de Castilho wrote:
[package names vs. type hierarchy]
For a technically-oriented user, the package names are probably
better. But for a linguist or knowledge-engineer, I am pretty sure
that the inheritance hierarchy is more interesting. One dives down to
Hi,
I was wondering if it wouldn't be more useful to group annotation types
in the mode and similar menus by namespace rather than inheritance.
I don't think most users care much about supertypes, and mostly don't
know about them, whereas the namespace seems to me to be a more natural
way
On 05/05/2011 09:30 PM, Jörn Kottmann wrote:
On 5/5/11 6:09 PM, Jens Grivolla wrote:
On 05/05/2011 03:04 PM, Jörn Kottmann wrote:
On 5/5/11 2:41 PM, Jens Grivolla wrote:
At least on my system (Eclipse Helios on Ubuntu 10.10) the Shift+Enter
shortcut does not work, and will be treated
On 05/04/2011 02:44 PM, Jörn Kottmann wrote:
On 5/4/11 2:33 PM, Jens Grivolla wrote:
How do I best update to the
trunk version?
You can either build the trunk version yourself or pick up a
distribution from our build server.
I've got a local build based on trunk.
I am not sure what
On 05/05/2011 12:37 PM, Jens Grivolla wrote:
I'm getting errors when opening an
XMI with the Annotation editor:
Caused by: org.eclipse.core.internal.resources.ResourceException:
Resource '/OneOfMyClosedProjects' is not open.
at
org.eclipse.core.internal.resources.Project.checkAccessible
On 05/05/2011 12:59 PM, Jörn Kottmann wrote:
On 5/5/11 12:55 PM, Jens Grivolla wrote:
On 05/05/2011 12:37 PM, Jens Grivolla wrote:
I'm getting errors when opening an
XMI with the Annotation editor:
Caused by: org.eclipse.core.internal.resources.ResourceException:
Resource
On 05/05/2011 03:04 PM, Jörn Kottmann wrote:
On 5/5/11 2:41 PM, Jens Grivolla wrote:
On 05/05/2011 01:55 PM, Jörn Kottmann wrote:
On 5/5/11 1:44 PM, Jörn Kottmann wrote:
That sounds like one more good reason to do that. Another one I thought
of is that it is confusing when you add
Hi,
what's the current status on combining UIMA and Python?
I know that it should be possible to write AEs in Python using either
the BSF Annotator (and jython) or Pythonnator (using SWIG). I haven't
tried either one yet, so I'm open to recommendations on which to use.
I would also very
Hi,
I have recently started using the Annotation Editor (as installed in
Eclipse from http://www.apache.org/dist/uima/eclipse-update-site/, i.e.
the official 2.3.1 version).
In order to add annotations it seems that you need to select the
annotation type through the Mode context menu, which
On 05/04/2011 11:21 AM, Jörn Kottmann wrote:
On 5/4/11 11:10 AM, Jens Grivolla wrote:
In order to add annotations it seems that you need to select the
annotation type through the Mode context menu, which is quite time
consuming (and error prone) if you have a large type system, and
especially
Hi,
while working on the integration between UIMA and a different text
annotation system we ran into problems with differing offsets between
the two systems.
As it turns out, the other system considers CR+LF (Windows style line
endings) to be two characters, while UIMA sees it as one.
Thanks Bhavani,
I think I will just stay with the 2.3.0-incubating uimacpp for now then.
Jens
On 04/05/2011 10:16 PM, Bhavani Iyer wrote:
Hi Jens
The 2.3.0-incubating uimacpp will work with the 2.3.1 releases of the
uimaj and uima-as. It should work with the ActiveMQ broker 5.4.1
included
Hi,
what's the current status of UIMA-CPP? While uimaj and uima-as have
been released as 2.3.1, uimacpp hasn't and I haven't read of any plans
to release 2.3.1 so far.
Does 2.3.0-incubating uimacpp work with the 2.3.1 versions of uimaj and
uima-as, or should I better build it from trunk?
It seems that runPearMerger.sh does not correctly adjust the paths when
the input PEARs are already a merge.
On first run `runPearMerger.sh ae1.pear ae2.pear -n ae12` the paths to
resources get adjusted from $main_root/X to $main_root/ae1/X or
$main_root/ae2/X respectively.
However, on
77 matches
Mail list logo