[jira] [Created] (LUCENE-4001) Grouping module shouldn't depend on queries module

2012-04-19 Thread Martijn van Groningen (Created) (JIRA)
Grouping module shouldn't depend on queries module
--

 Key: LUCENE-4001
 URL: https://issues.apache.org/jira/browse/LUCENE-4001
 Project: Lucene - Java
  Issue Type: Task
Reporter: Martijn van Groningen


Spin off from LUCENE-3997. Basically move FunctionValues and ValueSource to 
core.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3972) Improve AllGroupsCollector implementations

2012-04-11 Thread Martijn van Groningen (Created) (JIRA)
Improve AllGroupsCollector implementations
--

 Key: LUCENE-3972
 URL: https://issues.apache.org/jira/browse/LUCENE-3972
 Project: Lucene - Java
  Issue Type: Improvement
  Components: modules/grouping
Reporter: Martijn van Groningen


I think that the performance of TermAllGroupsCollectorm, 
DVAllGroupsCollector.BR and DVAllGroupsCollector.SortedBR can be improved by 
using BytesRefHash to store the groups instead of an ArrayList.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3938) Add query time parent child search

2012-03-30 Thread Martijn van Groningen (Created) (JIRA)
Add query time parent child search
--

 Key: LUCENE-3938
 URL: https://issues.apache.org/jira/browse/LUCENE-3938
 Project: Lucene - Java
  Issue Type: New Feature
  Components: modules/join
Reporter: Martijn van Groningen


At the moment there is support for index time parent child search with two 
queries implementations and a collector. The index time parent child search 
requires that documents are indexed in a block, this isn't ideal for 
updatability. For example in the case of tv content and subtitles (both being 
separate documents). Updating already indexed tv content with subtitles would 
then require to also re-index the subtitles.

This issue focuses on the collector part for query time parent child search. I 
started a while back with implementing this. Basically a two pass search 
performs a parent child search. In the first pass the top N parent child 
documents are resolved. In the second pass the parent or top N children are 
resolved (depending if the hit is a parent or child) and are associated with 
the top N parent child relation documents. Patch will follow soon.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3936) Rename StringIndexDocValues to DocTermsIndexDocValues

2012-03-29 Thread Martijn van Groningen (Created) (JIRA)
Rename StringIndexDocValues to DocTermsIndexDocValues
-

 Key: LUCENE-3936
 URL: https://issues.apache.org/jira/browse/LUCENE-3936
 Project: Lucene - Java
  Issue Type: Improvement
  Components: modules/other
Reporter: Martijn van Groningen
 Fix For: 4.0


StringIndex doesn't exists any more in the trunk, so the name DocTermsIndex 
should be used and this is also what it is using.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3856) Create docvalues based grouped facet collector

2012-03-07 Thread Martijn van Groningen (Created) (JIRA)
Create docvalues based grouped facet collector
--

 Key: LUCENE-3856
 URL: https://issues.apache.org/jira/browse/LUCENE-3856
 Project: Lucene - Java
  Issue Type: Bug
  Components: modules/grouping
Reporter: Martijn van Groningen


Create docvalues based grouped facet collector. Currently only term based 
collectors have been implemented (LUCENE-3802).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3802) Grouping collector that computes grouped facet counts

2012-02-19 Thread Martijn van Groningen (Created) (JIRA)
Grouping collector that computes grouped facet counts
-

 Key: LUCENE-3802
 URL: https://issues.apache.org/jira/browse/LUCENE-3802
 Project: Lucene - Java
  Issue Type: New Feature
  Components: modules/grouping
Reporter: Martijn van Groningen


Spinoff from issue SOLR-2898. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3778) Create a grouping convenience class

2012-02-14 Thread Martijn van Groningen (Created) (JIRA)
Create a grouping convenience class
---

 Key: LUCENE-3778
 URL: https://issues.apache.org/jira/browse/LUCENE-3778
 Project: Lucene - Java
  Issue Type: Improvement
  Components: modules/grouping
Reporter: Martijn van Groningen


Currently the grouping module has many collector classes with a lot of 
different options per class. I think it would be a good idea to have a 
GroupUtil (Or another name?) convenience class. I think this could be a 
builder, because of the many options 
(sort,sortWithinGroup,groupOffset,groupCount and more) and implementations 
(term/dv/function) grouping has.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3759) Support joining a distributed environment.

2012-02-07 Thread Martijn van Groningen (Created) (JIRA)
Support joining a distributed environment.
--

 Key: LUCENE-3759
 URL: https://issues.apache.org/jira/browse/LUCENE-3759
 Project: Lucene - Java
  Issue Type: Improvement
  Components: modules/join
Reporter: Martijn van Groningen


Add two more methods in JoinUtil to support joining in a distributed manner.
* Method to retrieve all from values.
* Method to create a TermsQuery based on a set of from terms.

With these two methods distributed joining can be supported following these 
steps:
# Retrieve from values from each shard
# Merge the retrieved from values. 
# Create a TermsQuery based on the merged from terms and send this query to all 
shards. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3602) Add join query to Lucene

2011-11-27 Thread Martijn van Groningen (Created) (JIRA)
Add join query to Lucene


 Key: LUCENE-3602
 URL: https://issues.apache.org/jira/browse/LUCENE-3602
 Project: Lucene - Java
  Issue Type: New Feature
  Components: modules/join
Reporter: Martijn van Groningen


Solr has (psuedo) join query for a while now. I think this should also be 
available in Lucene.  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-2898) Support grouped faceting

2011-11-14 Thread Martijn van Groningen (Created) (JIRA)
Support grouped faceting


 Key: SOLR-2898
 URL: https://issues.apache.org/jira/browse/SOLR-2898
 Project: Solr
  Issue Type: New Feature
Reporter: Martijn van Groningen


Support grouped faceting. As described in LUCENE-3097 (matrix counts).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3496) Support grouping by IndexDocValues

2011-10-07 Thread Martijn van Groningen (Created) (JIRA)
Support grouping by IndexDocValues
--

 Key: LUCENE-3496
 URL: https://issues.apache.org/jira/browse/LUCENE-3496
 Project: Lucene - Java
  Issue Type: New Feature
  Components: modules/grouping
Reporter: Martijn van Groningen


Although IDV is not yet finalized (More particular the SortedSource). I think 
we already can discuss / investigate implementing grouping by IDV.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3483) Move Function grouping collectors from Solr to grouping module

2011-10-03 Thread Martijn van Groningen (Created) (JIRA)
Move Function grouping collectors from Solr to grouping module
--

 Key: LUCENE-3483
 URL: https://issues.apache.org/jira/browse/LUCENE-3483
 Project: Lucene - Java
  Issue Type: Improvement
  Components: modules/grouping
Affects Versions: 4.0
Reporter: Martijn van Groningen
Assignee: Martijn van Groningen
Priority: Minor
 Fix For: 4.0


Move the Function*Collectors from Solr (inside Grouping source file) to 
grouping module.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3482) Refactor grouping module to be more maintainable

2011-10-02 Thread Martijn van Groningen (Created) (JIRA)
Refactor grouping module to be more maintainable


 Key: LUCENE-3482
 URL: https://issues.apache.org/jira/browse/LUCENE-3482
 Project: Lucene - Java
  Issue Type: Improvement
  Components: modules/grouping
Affects Versions: 4.0
Reporter: Martijn van Groningen
 Fix For: 4.0


Currently we have 4 types of grouping collectors and 8 concrete subclasses in 
Lucene / Solr. In current architecture for each type of collector two concrete 
subclasses need to be created. An implementation optimized for single term 
based groups and a more general implementation that works with MutableValue to 
also support grouping by functions. If we want for example group by 
IndexDocValues each type of grouping collector needs to have three concrete 
subclasses. This design isn't very maintainable.

I think it is best to introduce a concept that knows how deals with dealing 
groups for all the different sources. Therefore the grouping module should 
depend on the queries module, so that grouping can reuse the ValueSource 
concept. A term based concrete impl. of this concept knows for example to use 
the DocValues.ord() method. Or more generic concrete impl. will use 
DocValues.ValueFiller. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-3468) FirstPassGroupingCollector should use pollLast()

2011-09-27 Thread Martijn van Groningen (Created) (JIRA)
FirstPassGroupingCollector should use pollLast()


 Key: LUCENE-3468
 URL: https://issues.apache.org/jira/browse/LUCENE-3468
 Project: Lucene - Java
  Issue Type: Improvement
  Components: modules/grouping
Affects Versions: 4.0
Reporter: Martijn van Groningen
Assignee: Martijn van Groningen
 Fix For: 4.0


Currently FirstPassGroupingCollector uses last and remove method (TreeSet) for 
replacing a more relevant grouping during grouping.
This can be replaced by pollLast since Lucene trunk is now Java 6. 
TermFirstPassGroupingCollectorJava6 in Solr can be removed as well.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org