StringHelper
------------
Key: LUCY-38
URL: https://issues.apache.org/jira/browse/LUCY-38
Project: Lucy
Issue Type: Improvement
Components: Core
Reporter: Marvin Humphrey
Assignee: Marvin Humphrey
Lucy::Util::StringHelper provides two types of routines. First, there are
miscellaneous string manipulation routines such as overlap() and to_base36().
Second, there are UTF-8 helper routines: utf8_valid(), encode_utf8_char(),
decode_utf8_char(), is_whitespace(), and back_utf8_char().
The UTF-8 utilities are primarily used by CharBuf, and it may make sense to
either move them into CharBuf.c or publish them as static inline functions so
that CharBuf can optimize around them -- particularly encode_utf8_char() and
decode_utf8_char(). However, they are also needed other places within the
core - e.g. any class that reads character data from the index will need to
check UTF-8 validity.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.