[ https://issues.apache.org/jira/browse/LUCENE-4369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13455856#comment-13455856 ]
Jack Krupansky edited comment on LUCENE-4369 at 9/15/12 2:14 AM: ----------------------------------------------------------------- At this stage of the discussion, is there any intention that the replacement for "string" fields will permit analysis, at least CharFilter analysis? After all, that is one of the main reasons people get confused. I'm okay with "ExactTextField" except that character filtering would be really nice and avoid confusion for Solr users. Of course, it would be ironic to call it "exact text" when it needs to be filtered. OTOH, at the Lucene level, especially the Lucene query parser, I can see why you would want the "string" field to prevent analysis - because there is no field-specific analysis, just one analyzer for all fields. Hmmm... maybe that should be proposed for the Lucene query parser to side step that particular rationale for wanting strings to be unanalyzed - provide a map of field-specific analyzers. At the Solr schema level, we could simply have "string" be a TextField with only KeywordTokenizer and then users can copy and/or customize as they wish. This begs the question of how or whether the Solr schema side of the house will rename the "string" field type, or keep it as string and simply change the StrField class name. was (Author: jkrupan): At this stage of the discussion, is there any intention that the replacement for "string" fields will permit analysis, at least CharFilter analysis? After all, that is one of the main reasons people get confused. I'm okay with "ExactTextField" except that character filtering would be really nice and avoid confusion for Solr users. Of course, it would be ironic to call it "exact text" when it needs to be filtered. OTOH, at the Lucene level, especially the Lucene querey parser, I can see why you would want the "string" field to prevent analysis - because there is no field-specific analysis, just one analyzer for all fields. Hmmm... maybe that should be proposed for the Lucene query parser to side step that particular rationale for wanting strings to be unanalyzed - provide a map of field-specific analyzers. At the Solr schema level, we could simply have "string" be a TextField with only KeywordTokenizer and then users can copy and/or customize as they wish. This begs the question of how or whether the Solr schema side of the house will rename the "string" field type, or keep it as string and simply change the StrField class name. > StringFields name is unintuitive and not helpful > ------------------------------------------------ > > Key: LUCENE-4369 > URL: https://issues.apache.org/jira/browse/LUCENE-4369 > Project: Lucene - Core > Issue Type: Bug > Reporter: Robert Muir > Attachments: LUCENE-4369.patch > > > There's a huge difference between TextField and StringField, StringField > screws up scoring and bypasses your Analyzer. > (see java-user thread "Custom Analyzer Not Called When Indexing" as an > example.) > The name we use here is vital, otherwise people will get bad results. > I think we should rename StringField to MatchOnlyField. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org