Actually Lucene terms can be arbitrary/fully binary tokens in the
low-level postings APIs.

It's just that our analysis APIs are geared towards analyzing text,
but using StringField you can easily index an arbitrary single-token
byte[].

Mike McCandless

http://blog.mikemccandless.com


On Tue, Sep 1, 2015 at 10:07 AM, Mark Hanfland
<markhanfl...@yahoo.com.invalid> wrote:
> You are correct that Lucene only works with text (no binary or primitives), 
> Base64 would be the way I would suggest.
>
>
>      On Monday, August 31, 2015 11:19 AM, Dan Smith <dsm...@pivotal.io> wrote:
>
>
>  What's the best way to index binary data in Lucene? I'm adding a Lucene
> index to a key value store, and I want to be able to delete documents based
> on a binary key.
>
> As far as I can tell the Lucene API does not support indexing binary data,
> and I was looking into Base64 encoding the key. Is there a better way?
>
> -Dan
>
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to