[ 
https://issues.apache.org/jira/browse/LUCENE-5989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14159825#comment-14159825
 ] 

David Smiley commented on LUCENE-5989:
--------------------------------------

bq. This is supremely expert, I wonder if anyone out there has succeeded in 
doing so?

{{org.apache.lucene.spatial.prefix.CellTokenStream}} :-)  Though this doesn't
count since it's in Lucene.

+1 to making this easier via a BinaryField.  With BinaryField and
auto-prefixing, CellTokenStream won't be needed for indexing a point.  It will
still be needed for other shapes and to support heat-map style faceting.

Jack's point that the "Keyword" name is far from obvious really resonated with
me.  Despite Shai's reasonable explanation, I don't think changing the status
quo to something non-obvious is helpful.  Nor would it read as the text
equivalent of BinaryField -- for that, the current name is perfect, I think.
That said, I do like the idea of simply having StringField accept a byte[]
too, so that there is no separate BinaryField.  Either way works for me.

> Add BinaryField, to index a single binary token
> -----------------------------------------------
>
>                 Key: LUCENE-5989
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5989
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>             Fix For: 5.0, Trunk
>
>         Attachments: LUCENE-5989.patch
>
>
> 5 years ago (LUCENE-1458) we "enabled" fully binary terms in the
> lowest levels of Lucene (the codec APIs) yet today, actually adding an
> arbitrary byte[] binary term during indexing is far from simple: you
> must make a custom Field with a custom TokenStream and a custom
> TermToBytesRefAttribute, as far as I know.
> This is supremely expert, I wonder if anyone out there has succeeded
> in doing so?
> I think we should make indexing a single byte[] as simple as indexing
> a single String.
> This is a precursor for issues like LUCENE-5596 (encoding IPv6
> address as byte[16]) and LUCENE-5879 (encoding native numeric values
> in their simple binary form).
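
To make the byte[16] case from the description concrete, here is a minimal
sketch of producing such a term value using only the JDK.  It does not use any
Lucene API and is not taken from the attached patch; the class and method
names (Ipv6BinaryTerm, encode, compareTerms) are hypothetical.  It assumes the
input is an IP literal, and it widens IPv4 input to the IPv4-mapped form so
every value is 16 bytes.  The comparison uses unsigned byte order, which is
how Lucene's BytesRef orders terms, so the sketch also shows why a signed
byte comparison would sort addresses incorrectly.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;
import java.util.Arrays;

class Ipv6BinaryTerm {

    // Hypothetical helper: turn an IP literal into a fixed 16-byte term value.
    static byte[] encode(String literal) {
        byte[] raw;
        try {
            // For a literal address this only parses the text; no DNS lookup.
            raw = InetAddress.getByName(literal).getAddress();
        } catch (UnknownHostException e) {
            throw new IllegalArgumentException("cannot parse: " + literal, e);
        }
        if (raw.length == 16) {
            return raw;
        }
        // IPv4 input comes back as 4 bytes: widen to ::ffff:a.b.c.d.
        byte[] mapped = new byte[16];
        mapped[10] = (byte) 0xff;
        mapped[11] = (byte) 0xff;
        System.arraycopy(raw, 0, mapped, 12, 4);
        return mapped;
    }

    // Unsigned lexicographic comparison (Arrays.compareUnsigned needs Java 9+).
    static int compareTerms(byte[] a, byte[] b) {
        return Arrays.compareUnsigned(a, b);
    }

    public static void main(String[] args) {
        byte[] low = encode("2001:db8::1");
        byte[] high = encode("2001:db8::ff");
        System.out.println("length = " + low.length);
        System.out.println("low sorts first = " + (compareTerms(low, high) < 0));
    }
}
```

Whatever field type ends up accepting the byte[], the value handed to it would
look like the output of encode() above; how it is wrapped for indexing is up to
the patch.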



