map unicode process-internal codepoints to replacement character
----------------------------------------------------------------
Key: LUCENE-2019
URL: https://issues.apache.org/jira/browse/LUCENE-2019
Project: Lucene - Java
Issue Type: Improvement
Components: Index
Reporter: Robert Muir
Priority: Minor
A spinoff from LUCENE-2016.
Unicode reserves several code points for process-internal use. We should not
store these in the index.
Instead, they should be mapped to the replacement character (U+FFFD), so that
they remain available for process-internal use.
For example, Lucene Java currently uses U+FFFF process-internally, so that
code point cannot appear in the index without causing problems.
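
The mapping described above could be sketched roughly as follows. This is only
an illustration, not Lucene's actual code: the class and method names
(NoncharacterMapper, isNoncharacter, mapNoncharacters) are hypothetical, and it
assumes that "process-internal codepoints" means the 66 noncharacters defined
by the Unicode Standard (U+FDD0..U+FDEF plus the last two code points of every
plane, such as U+FFFE and U+FFFF).

```java
// Hypothetical helper, not part of Lucene: replaces Unicode noncharacters
// with U+FFFD so they never reach the index.
final class NoncharacterMapper {
    private static final int REPLACEMENT_CHAR = 0xFFFD;

    private NoncharacterMapper() {}

    // Assumption: "process-internal" means the Unicode noncharacters,
    // i.e. U+FDD0..U+FDEF and any code point whose low 16 bits are
    // FFFE or FFFF (the last two code points of each plane).
    static boolean isNoncharacter(int codePoint) {
        return (codePoint >= 0xFDD0 && codePoint <= 0xFDEF)
            || (codePoint & 0xFFFE) == 0xFFFE;
    }

    // Walks the string by code point (not by char) so that supplementary
    // noncharacters such as U+1FFFE are detected as well.
    static String mapNoncharacters(String input) {
        StringBuilder out = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); ) {
            int cp = input.codePointAt(i);
            out.appendCodePoint(isNoncharacter(cp) ? REPLACEMENT_CHAR : cp);
            i += Character.charCount(cp);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // U+FFFF is the code point the issue says Lucene uses internally.
        String term = "foo\uFFFFbar";
        String mapped = mapNoncharacters(term);
        System.out.println(mapped.equals("foo\uFFFDbar"));
    }
}
```

Where such a mapping would actually be applied (for instance, while terms are
encoded for the index) is left open here; the sketch only shows the code point
test and the substitution.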