On Jul 17, 2010, at 22:23, Martin wrote:
Hi there,
I'm trying to extend the PythonTokenizer class to build my own
custom tokenizer, but seem to get stuck pretty much soon after that.
I know that I'm supposed to extend the incrementToken() method, but
what exactly am I dealing with in th
Hi there,
I'm trying to extend the PythonTokenizer class to build my own custom
tokenizer, but seem to get stuck pretty much soon after that. I know
that I'm supposed to extend the incrementToken() method, but what
exactly am I dealing with in there and what should it return? My goal is
to co
Yea, it was a message from 2007, couldn't find any other mention of
TermPositionVector anywhere else so..
Thanks a lot for the tip, works like a charm!
On Jul 17, 2010, at 15:04, Martin wrote:
Hi,
this seems to be missing in the 3.0.2. release, or at least not
working - I'm getting Attrib
On Jul 17, 2010, at 15:04, Martin wrote:
Hi,
this seems to be missing in the 3.0.2. release, or at least not
working - I'm getting AttributeError: 'TermFreqVector' object has no
attribute 'toTermPositionVector'. Has it been accidentally/
intentionally removed in the last couple of years
Hi,
this seems to be missing in the 3.0.2. release, or at least not working
- I'm getting AttributeError: 'TermFreqVector' object has no attribute
'toTermPositionVector'. Has it been accidentally/intentionally removed
in the last couple of years and/or is there an alternative to getting a
Ter
On Sat, 17 Jul 2010, Darren Govoni wrote:
I wonder why attach locks under processes but not threads though? Is it jcc
related or JVM?
If you're running things in sub-processes, I don't think it makes any sense
to call attachCurrentThread() into the parent JVM at all, nor do I expect
calls i
On Fri, 16 Jul 2010, Darren Govoni wrote:
I corrected some things with my threads, so I'm sure they are being reused
now.
But I was using python's multiprocessing module and my worker threads were
actually processes.
This is where I saw the lock up after a period of time.
Ah ha, so nothing t