[ 
https://issues.apache.org/jira/browse/SOLR-10803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16035800#comment-16035800
 ] 

Hoss Man commented on SOLR-10803:
---------------------------------


Quick side digressions...

bq. Maybe also similar stuff to prevent FieldCache usage? ... I have seen that 
Solr 7 allows to merge non-docvalues segments to ones with docvalues using 
uninverter with a special mergepolicy. ...

That feels like a _very_ orthogonal idea (or 2?) that should really be 
discussed in its own Jira(s), since it is broader in scope than just Trie vs 
Point.  (I'm not familiar enough with what would be involved to even create the 
Jira.)


bq. I'd suggest to also enable DocValues by default for all string/numeric/date 
fields, unless explicitly disabled.

I like this idea -- but it's definitely orthogonal to the topic at hand.

I've spun that off into SOLR-10808


----

bq. I put the blocker priority since I think it is a better experience if all 
7.x indices can be used with Solr 8, but there is also the possibility of just 
removing Trie*Field in 8.0 and refusing to open any index that would make use 
of those fields, even if they were created in 7.x.

Coincidentally, sarowe & cassandra & I were just talking yesterday about our 
concern that, beyond the "known gaps" in Solr features that work with Trie 
fields but not (yet) Point fields (ex:  SOLR-9989, SOLR-10503, SOLR-9985, 
etc...), a larger problem as we move towards 7.0 is that test coverage of 
PointFields in Solr is currently pretty shallow.  We don't really have a very 
good idea of what does/doesn't work with PointFields, which is disconcerting 
for pushing them as the "default" (or "recommended") numeric types in Solr -- 
let alone forbidding the use of (new) Trie fields as suggested here.

Which is why I've started working on SOLR-10807 -- the current aim is a quick 
and dirty way to help identify all of the potentially problematic areas as 
quickly as possible, by forcing every test to use PointFields instead of 
TrieFields.  See comments in that Jira for details, but the nutshell is: at 
this point it's hard to guess how many features/tests might fail if we cut over 
to PointFields -- because a big portion of our tests are using/expecting the 
'id' field to be numeric, and before we can even get to the meat of the test, 
using a Point based numeric as the 'id' field causes all sorts of problems 
because Point fields don't have any 'Terms' for updateDocument/deleteDocument.
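
To make that last point concrete, here is a toy model in Python. It is not 
Lucene or Solr code: the {{ToyIndex}} class and all of its methods are 
hypothetical, and it only assumes that an update keyed on 'id' is a 
delete-by-term followed by an add. A field indexed only as points never shows 
up in the terms dictionary, so the delete step matches nothing and the "update" 
leaves a duplicate behind.

```python
class ToyIndex:
    """Toy model (NOT Lucene): a terms dictionary plus a separate point structure."""

    def __init__(self):
        self.docs = {}      # doc number -> field values (live docs only)
        self.terms = {}     # (field, text) -> set of doc numbers
        self.points = []    # (field, numeric value, doc number)
        self.next_doc = 0

    def add(self, fields, term_fields=(), point_fields=()):
        doc = self.next_doc
        self.next_doc += 1
        self.docs[doc] = dict(fields)
        for f in term_fields:       # Trie-like: the value is indexed as a term
            self.terms.setdefault((f, str(fields[f])), set()).add(doc)
        for f in point_fields:      # Point-like: the value is indexed as a point only
            self.points.append((f, fields[f], doc))
        return doc

    def delete_by_term(self, field, text):
        """Delete every doc listed under the term; return how many were deleted."""
        hits = self.terms.pop((field, text), set())
        for doc in hits:
            self.docs.pop(doc, None)
        return len(hits)

    def update(self, field, text, fields, **how_to_index):
        """Delete-by-term followed by add (the assumed shape of an update by 'id')."""
        self.delete_by_term(field, text)
        return self.add(fields, **how_to_index)

    def range(self, field, lo, hi):
        """Range lookup over points; still works for point-only fields."""
        return [d for (f, v, d) in self.points
                if f == field and lo <= v <= hi and d in self.docs]


# 'id' indexed with terms: the second update replaces the first doc.
trie_like = ToyIndex()
trie_like.update("id", "1", {"id": 1, "v": "a"}, term_fields=["id"])
trie_like.update("id", "1", {"id": 1, "v": "b"}, term_fields=["id"])

# 'id' indexed as points only: delete-by-term finds nothing to delete.
point_like = ToyIndex()
point_like.update("id", "1", {"id": 1, "v": "a"}, point_fields=["id"])
point_like.update("id", "1", {"id": 1, "v": "b"}, point_fields=["id"])
```

In this sketch {{trie_like}} ends up with one live document and {{point_like}} 
with two, even though range lookups on the point-only 'id' still work.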

bq. In addition, the merge policy could also be used to convert Trie* to Point* 
values by first uninverting (if no docvalues on trie) and reindexing the 
fields during merging... (not sure how to do this, but should work somehow).

If we think it's viable to create a MergePolicy (Wrapper) that could convert 
Trie fields to Point fields, then my straw man suggestion would be that in 7.x 
we only discourage Trie fields, but not ban them completely -- with some 
strongly worded warnings that Trie Fields will be completely removed in 8.0, 
and any index that uses them will require manual upgrade using a special 
conversion tool.

As things stand today, even if that MergePolicy/tool doesn't yet exist when 7.0 
comes out, I'd rather say "7.x indexes using Trie fields *MAY* require 
reindexing in 8.0, pending possible development of a tool to upgrade Trie 
fields to Point fields" than ban Trie fields outright.

(Hell: As things stand today, even if we were confident it would be 
_impossible_ to create such a tool/mergepolicy, I'd still rather say "7.x 
indexes using Trie fields will *REQUIRE* reindexing in 8.0" than change the 
default configsets in 7.0 to use Points, let alone ban new Trie fields.)





> Solr should refuse to create Trie*Field instances in 7.0 indices
> ----------------------------------------------------------------
>
>                 Key: SOLR-10803
>                 URL: https://issues.apache.org/jira/browse/SOLR-10803
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Adrien Grand
>            Priority: Blocker
>             Fix For: master (7.0)
>
>
> If we want to be able to remove support for legacy numerics from Solr in 8.0, 
> we need to forbid the use of Trie*Field in indices that are created on or 
> after 7.0.


