Actually Lucene terms can be arbitrary/fully binary tokens in the low-level postings APIs.
It's just that our analysis APIs are geared towards analyzing text, but using StringField you can easily index an arbitrary single-token byte[]. Mike McCandless http://blog.mikemccandless.com On Tue, Sep 1, 2015 at 10:07 AM, Mark Hanfland <markhanfl...@yahoo.com.invalid> wrote: > You are correct that Lucene only works with text (no binary or primitives), > Base64 would be the way I would suggest. > > > On Monday, August 31, 2015 11:19 AM, Dan Smith <dsm...@pivotal.io> wrote: > > > What's the best way to index binary data in Lucene? I'm adding a Lucene > index to a key value store, and I want to be able to delete documents based > on a binary key. > > As far as I can tell the Lucene API does not support indexing binary data, > and I was looking into Base64 encoding the key. Is there a better way? > > -Dan > > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org