[ https://issues.apache.org/jira/browse/SOLR-1690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16074869#comment-16074869 ]
Mohamed edited comment on SOLR-1690 at 7/5/17 2:45 PM: -------------------------------------------------------- +1 for this tokenizer index was (Author: med): +1 for this analyzer > JSONKeyValueTokenizerFactory -- JSON Tokenizer > ---------------------------------------------- > > Key: SOLR-1690 > URL: https://issues.apache.org/jira/browse/SOLR-1690 > Project: Solr > Issue Type: New Feature > Components: Schema and Analysis > Reporter: Ryan McKinley > Priority: Minor > Attachments: noggit-1.0-A1.jar, > SOLR-1690-JSONKeyValueTokenizerFactory.patch > > > Sometimes it is nice to group structured data into a single field. > This (rough) patch, takes JSON input and indexes tokens based on the key > values pairs in the json. > {code:xml|title=schema.xml} > <!-- JSON Field Type --> > <fieldtype name="json" class="solr.TextField" positionIncrementGap="100" > omitNorms="true"> > <analyzer type="index"> > <tokenizer class="solr.JSONKeyValueTokenizerFactory" keepArray="true" > hierarchicalKey="false"/> > <filter class="solr.TrimFilterFactory"/> > <filter class="solr.LowerCaseFilterFactory"/> > </analyzer> > <analyzer type="query"> > <tokenizer class="solr.KeywordTokenizerFactory"/> > <filter class="solr.TrimFilterFactory" /> > <filter class="solr.LowerCaseFilterFactory"/> > </analyzer> > </fieldtype> > {code} > Given text: > {code} > { "hello": "world", "rank":5 } > {code} > indexed as two tokens: > || term position | 1 | 2 | > || term text | hello:world | rank:5 | > || term type | word | word | > || source start,end | 12,17 | 27,28 | -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org