[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wettin updated LUCENE-1016: Attachment: (was: LUCENE-1016-Tanimoto.txt) > TermVectorAccessor, transparent vector space acc

[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wettin updated LUCENE-1016: Attachment: LUCENE-1016.txt In this patch: * Java 1.4 for real And then I removed everything tha

[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wettin updated LUCENE-1016: Attachment: (was: LUCENE-1016.txt) > TermVectorAccessor, transparent vector space access > --

[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wettin updated LUCENE-1016: Attachment: (was: LUCENE-1016.txt) > TermVectorAccessor, transparent vector space access > --

[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wettin updated LUCENE-1016: Attachment: (was: out.png) > TermVectorAccessor, transparent vector space access > --

[jira] Updated: (LUCENE-1016) TermVectorAccessor, transparent vector space access

2007-10-29 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wettin updated LUCENE-1016: Attachment: (was: LUCENE-1016-clusterer.txt) > TermVectorAccessor, transparent vector space ac

Hudson build is back to normal: Lucene-Nightly #259

2007-10-29 Thread hudson
See http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/259/changes

More about the TermVectorMapper

2007-10-29 Thread Karl Wettin
On 25 Oct 2007, at 01:23, Karl Wettin wrote: The use case is that I want a normalized frequency, and I'd like to do that by loading the factor from an IndexReader in setExpectations. Moving along in this code, how about passing down the current document number to the mapper? Perhaps in setExp

[jira] Commented: (LUCENE-1035) Optional Buffer Pool to Improve Search Performance

2007-10-29 Thread Ning Li (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12538638 ] Ning Li commented on LUCENE-1035: - > The question is whether such situations are common enough to warrant adding >

Re: Gate Framework

2007-10-29 Thread Sandeep Mahendru
Hi Steven, Thanks for helping me out. I have now installed an SVN client and downloaded the latest Lucene code. I will now start working on implementing an analyzer for the Hindi language. I will take the following logical steps to achieve this: 1. Identify the UTF-8 or Unicode charcte

[jira] Updated: (LUCENE-743) IndexReader.reopen()

2007-10-29 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-743: - Attachment: lucene-743-take3.patch Ok here is the next one :-)... This patch implements the refC

[jira] Commented: (LUCENE-1035) Optional Buffer Pool to Improve Search Performance

2007-10-29 Thread robert engels (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12538582 ] robert engels commented on LUCENE-1035: --- Again, see my previous code in issue 414. That it only works NioFile

Re: Gate Framework

2007-10-29 Thread Sandeep Mahendru
Thanks for helping me out. On 10/29/07, Steven Rowe <[EMAIL PROTECTED]> wrote: > > Hi Sandeep, > > Sandeep Mahendru wrote: > > Where can I download SVN from? > > http://subversion.tigris.org/project_packages.html > > -- > Steve Rowe > Center for Natural Language Processing > http://www.cnlp.org/te

[jira] Commented: (LUCENE-1035) Optional Buffer Pool to Improve Search Performance

2007-10-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12538576 ] Doug Cutting commented on LUCENE-1035: -- Ning, I didn't mean to sound negative about this. Your benchmarks do s

Re: Gate Framework

2007-10-29 Thread Steven Rowe
Hi Sandeep, Sandeep Mahendru wrote: > Where can I download SVN from? http://subversion.tigris.org/project_packages.html -- Steve Rowe Center for Natural Language Processing http://www.cnlp.org/tech/lucene.asp

Re: Gate Framework

2007-10-29 Thread Sandeep Mahendru
Hi All, Okay, the first step is to download the latest source code of Lucene. I have always used CVS, but I see on the Lucene site that the source code is hosted on the SVN trunk. I will now install SVN on my laptop. Where can I download SVN from? Regards, Sandeep. On 10/29/07, Sandeep Ma

[jira] Updated: (LUCENE-1015) FieldCache should support longs and doubles

2007-10-29 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated LUCENE-1015: Attachment: LUCENE-1015.patch Added tests, fixed some documentation bugs. Will commit ton

Gate Framework

2007-10-29 Thread Sandeep Mahendru
Hi, I have been to the http://gate.ac.uk/gate-examples/doc/index.html site. Thanks for pointing me to that. It appears they have implemented a plugin for Hindi grammar. I will try to use it. Initially I was planning on using JavaCC for writing the grammar. I have installed GATE. I w

Re: Per-document Payloads

2007-10-29 Thread Michael McCandless
"Michael Busch" <[EMAIL PROTECTED]> wrote: > Michael McCandless wrote: > > > > Michael, are you thinking that the storage would/could be non-sparse > > (like norms), and loaded/cached once in memory, especially for fixed > > size fields? EG a big array of ints of length maxDocID? In John's > >

[jira] Commented: (LUCENE-1015) FieldCache should support longs and doubles

2007-10-29 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12538492 ] Grant Ingersoll commented on LUCENE-1015: - Actually, the FieldCache already supports byte and shorts. > Fie

[jira] Updated: (LUCENE-1036) Unreleased 2.3 version of IndexWriter.optimize() consistly throws java.lang.IllegalArgumentException out-of-the-box

2007-10-29 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1036: --- Fix Version/s: 2.3 > Unreleased 2.3 version of IndexWriter.optimize() consistly thr

[jira] Commented: (LUCENE-1036) Unreleased 2.3 version of IndexWriter.optimize() consistly throws java.lang.IllegalArgumentException out-of-the-box

2007-10-29 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12538432 ] Michael McCandless commented on LUCENE-1036: I just committed the above patch. I think likely that was

Re: Build failed in Hudson: Lucene-Nightly #258

2007-10-29 Thread Michael McCandless
This build failure seems to be an XML parsing issue with svn -- the checkout failed with this error: > svn: Processing REPORT request response failed: The element type "S:txdelta" > must be terminated by the matching end-tag "</S:txdelta>". > (/repos/asf/!svn/vcc/default) I was able to do a full checkout

Re: Per-document Payloads

2007-10-29 Thread Michael Busch
Michael McCandless wrote: > > Michael, are you thinking that the storage would/could be non-sparse > (like norms), and loaded/cached once in memory, especially for fixed > size fields? EG a big array of ints of length maxDocID? In John's > original case, every doc has this UID int field; I think

Re: Per-document Payloads

2007-10-29 Thread Michael McCandless
> Michael Busch wrote: > > > Doug Cutting wrote: > > > > If this is really required, perhaps it ought to appear as an > > attribute for stored fields, indicating that the field should be > > stored in a separate "column store". This would permit efficient > > enumeration of values of just that f