Jackie-Jiang opened a new issue, #10508:
URL: https://github.com/apache/pinot/issues/10508

   When a column is configured as raw in a real-time table, the committed 
segment will always have the column metadata with:
   - isSorted: false
   - cardinality: Integer.MIN_VALUE
   
   Because of this behavior, when converting a raw column to a dictionary-encoded 
column, we shouldn't rely on these 2 stats from the column metadata. Instead, 
we should collect the stats again, and read these 2 stats from the 
`ColumnStatistics` (stats collector).
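
   As a rough illustration of what "collect the stats again" means, here is a minimal sketch that recomputes both values by scanning the column. `RawColumnStats` and its methods are hypothetical names used only for this example; they are not Pinot classes, and the real stats collector may work differently. The only detail taken from this issue is that `Integer.MIN_VALUE` is the placeholder cardinality in the committed metadata.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical helper for illustration only; not part of Pinot.
final class RawColumnStats {
  final boolean sorted;
  final int cardinality;

  private RawColumnStats(boolean sorted, int cardinality) {
    this.sorted = sorted;
    this.cardinality = cardinality;
  }

  // Scans the values once, tracking ascending order and the distinct count.
  static RawColumnStats collect(int[] values) {
    Set<Integer> distinct = new HashSet<>();
    boolean sorted = true;
    for (int i = 0; i < values.length; i++) {
      if (i > 0 && values[i] < values[i - 1]) {
        sorted = false;
      }
      distinct.add(values[i]);
    }
    return new RawColumnStats(sorted, distinct.size());
  }

  // Assumption: a cardinality of Integer.MIN_VALUE means the metadata
  // value is a placeholder and must not be trusted.
  static boolean metadataCardinalityIsPlaceholder(int metadataCardinality) {
    return metadataCardinality == Integer.MIN_VALUE;
  }
}
```

   With stats recomputed this way, a column whose values happen to be in order would be reported as sorted even though its committed metadata says `isSorted: false`.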
   
   Currently, for a committed segment, even if the column values are sorted, we 
won't generate a sorted index for it because the column metadata always marks it 
as unsorted.
   
   Related to #10497

