fsi-yuvraj opened a new pull request, #356:
URL: https://github.com/apache/atlas/pull/356

   ## What changes were proposed in this pull request?
   JIRA : https://issues.apache.org/jira/browse/ATLAS-5032
   
   ### Background:
   When performing a search using long entity names with attributes like 
qualifiedName and the StartsWith operator, results are not returned as expected.
   
   ### Root Cause:
   The qualifiedName attribute is an indexed key. However, Solr's default 
standard tokenizer has a maximum token length of 255 characters. When entity 
names exceed this length, the tokenizer fails to parse the value correctly, 
leading to search failures.
   
   
   ### Changes Proposed:
   #### Approach 1 : Update Solr Configuration to Increase Max Token Length
   - Modify the Solr schema to increase the maxTokenLength option.
   - This allows Solr to properly tokenize field values as per set length.
   - Impact:
   -- Requires full reindexing of all existing data to apply the new schema.
     
   #### [Existing PR] Approach 2 : Approach 2: Handle Long Value Search by 
Querying JanusGraph
   - For long value searches on indexed keys, query to janusgraph instead of 
solr
   - Impact:
   -- This will affect performance as query is executed at janus
   
   ## How was this patch tested?
   - Added unit tests to validate the logic.
   - Manually tested the scenario described in the JIRA issue to ensure 
accurate search results with long entity names.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@atlas.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to