[ 
https://issues.apache.org/jira/browse/ATLAS-4225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17841448#comment-17841448
 ] 

chaitali borole commented on ATLAS-4225:
----------------------------------------

In below case this above patch won't work :

Description field contains "THIS IS DESCRIPTION FIELD". Now, if we search for 
each word separately, i.e., (THIS, IS, DESCRIPTION, or FIELD), we will get the 
output. However, if we search for "THIS IS" or "DESCRIPTION FIELD", it will not 
work as the space acts as a delimiter here. So (THIS - 1st Token) (IS 2nd 
Token) and so on..

Similarly, in the case of CJK, "期日" each character is a token. So, if we search 
for "期", we will get the response or if we search for "日", we will get the 
response. However, if we search for "期日" together, we will not get an answer.

though this works as per design similar to English characters.

> Support for Chinese character in entity data.
> ---------------------------------------------
>
>                 Key: ATLAS-4225
>                 URL: https://issues.apache.org/jira/browse/ATLAS-4225
>             Project: Atlas
>          Issue Type: New Feature
>            Reporter: Mayank Jain
>            Assignee: chaitali borole
>            Priority: Major
>         Attachments: ATLAS-4225-3.patch
>
>
> Currently we only allow English characters to be used to adding entity data 
> that is labels , Custom Attributes and Business-Metadata.
>  
> We need to support for Chinese and other languages as well.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to