amansinha100 commented on PR #5091:
URL: https://github.com/apache/hive/pull/5091#issuecomment-2085871834
Chiming in late on this topic ..
I think that the CBO path should be the main focus since it has been the
default for quite some time now - with Hive 3 and Hive 4. In this path, to
avoid breaking change, we should not throw an exception for the string argument
type for the aggregate function even though the SQL standard suggests
otherwise. In an ETL operation, there could be one row out of millions that
has 'dirty' data with string type but Hive has been permissive for such data,
otherwise the whole job would fail. If it means that we should let SUM('text')
return NULL (similar to AVG('text')), it seems a better option than erroring
out.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]