siddharthteotia commented on pull request #8384:
URL: https://github.com/apache/pinot/pull/8384#issuecomment-1079441867


   > Want to have some discussion on the syntax of the new text index. 
Currently we are using Java regexp format (with `^$.*` instead of `_%` in sql 
LIKE) for the native FST, which is okay (not sure if all regexp are supported 
though, sql LIKE syntax is much simpler). Do we want to reuse the current 
`TEXT_MATCH` which currently takes a lucene search query, or come up with a new 
function that performs regexpLike/Like + logical operators?
   > 
   > cc @kishoreg
   
   My suggestion would be to use a new function that accepts potentially ANSI 
SQL LIKE style + delta needed to handle phrase, fuzzy and is not tied to lucene 
syntax currently used by TEXT_MATCH. 
   
   On the other hand, exposing this new index via TEXT_MATCH and Lucene syntax 
along with index rebuilding in SegmentPreprocessor can help with easy migration 
of users currently using existing lucene text index 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to