siddharthteotia commented on pull request #8384: URL: https://github.com/apache/pinot/pull/8384#issuecomment-1079441867
> Want to have some discussion on the syntax of the new text index. Currently we are using Java regexp format (with `^$.*` instead of `_%` in sql LIKE) for the native FST, which is okay (not sure if all regexp are supported though, sql LIKE syntax is much simpler). Do we want to reuse the current `TEXT_MATCH` which currently takes a lucene search query, or come up with a new function that performs regexpLike/Like + logical operators? > > cc @kishoreg My suggestion would be to use a new function that accepts potentially ANSI SQL LIKE style + delta needed to handle phrase, fuzzy and is not tied to lucene syntax currently used by TEXT_MATCH. On the other hand, exposing this new index via TEXT_MATCH and Lucene syntax along with index rebuilding in SegmentPreprocessor can help with easy migration of users currently using existing lucene text index -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org For additional commands, e-mail: commits-h...@pinot.apache.org