[jira] Commented: (LUCENE-1478) Missing possibility to supply custom FieldParser when sorting search results

Michael McCandless (JIRA) Fri, 30 Jan 2009 03:22:25 -0800

    [ 
https://issues.apache.org/jira/browse/LUCENE-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12668818#action_12668818
 ]


Michael McCandless commented on LUCENE-1478:
--------------------------------------------


bq. the parser should be a singleton for all field caches or have 
hashCode()/equals().

I guess I don't understand your proposed workaround then -- I thought
you were explicitly proposing *not* using a singleton, so that you
could force re-parsing of all values (even for old segments) when a
new external file was being used.

(Also likely my lack of understanding of Solr's ExternalFileField is
adding to my confusion here -- I haven't yet looked at it.)

bq. The Cache of FieldCache instances in FieldCacheImpl is a WeakHashMap, so 
unused FieldCache instances for no longer used readers/parsers are GC'ed.

The WeakHashMap is keyed by the IndexReader, and then there's a normal
HashMap (innerCache) keyed on field,Parser.  So it's only when the
IndexReader (SegmentReader) is no longer referenced elsewhere that the
innerCache, in entirety, can be GCd.  There's no reclaiming of stale
entries in the innerCache.


> Missing possibility to supply custom FieldParser when sorting search results
> ----------------------------------------------------------------------------
>
>                 Key: LUCENE-1478
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1478
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Search
>    Affects Versions: 2.4
>            Reporter: Uwe Schindler
>            Assignee: Michael McCandless
>             Fix For: 2.9
>
>         Attachments: LUCENE-1478-cleanup.patch, 
> LUCENE-1478-no-superinterface.patch, LUCENE-1478.patch, LUCENE-1478.patch, 
> LUCENE-1478.patch, LUCENE-1478.patch, LUCENE-1478.patch
>
>
> When implementing the new TrieRangeQuery for contrib (LUCENE-1470), I was 
> confronted by the problem that the special trie-encoded values (which are 
> longs in a special encoding) cannot be sorted by Searcher.search() and 
> SortField. The problem is: If you use SortField.LONG, you get 
> NumberFormatExceptions. The trie encoded values may be sorted using 
> SortField.String (as the encoding is in such a way, that they are sortable as 
> Strings), but this is very memory ineffective.
> ExtendedFieldCache gives the possibility to specify a custom LongParser when 
> retrieving the cached values. But you cannot use this during searching, 
> because there is no possibility to supply this custom LongParser to the 
> SortField.
> I propose a change in the sort classes:
> Include a pointer to the parser instance to be used in SortField (if not 
> given use the default). My idea is to create a SortField using a new 
> constructor
> {code}SortField(String field, int type, Object parser, boolean reverse){code}
> The parser is "object" because all current parsers have no super-interface. 
> The ideal solution would be to have:
> {code}SortField(String field, int type, FieldCache.Parser parser, boolean 
> reverse){code}
> and FieldCache.Parser is a super-interface (just empty, more like a 
> marker-interface) of all other parsers (like LongParser...). The sort 
> implementation then must be changed to respect the given parser (if not 
> NULL), else use the default FieldCache.getXXXX without parser.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] Commented: (LUCENE-1478) Missing possibility to supply custom FieldParser when sorting search results

Reply via email to