[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=835389=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-835389 ]

ASF GitHub Bot logged work on HIVE-26754:
-----------------------------------------
    Author: ASF GitHub Bot
    Created on: 22/Dec/22 19:07
    Start Date: 22/Dec/22 19:07
    Worklog Time Spent: 10m
    Work Description: ayushtkn merged PR #3806:
URL: https://github.com/apache/hive/pull/3806

Issue Time Tracking
-------------------
    Worklog Id: (was: 835389)
    Time Spent: 5.5h (was: 5h 20m)

> Implement array_distinct UDF to return an array after removing duplicates in it
> -------------------------------------------------------------------------------
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Labels: pull-request-available
> Time Spent: 5.5h
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['b', 'd', 'a']

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
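The behaviour described in the issue text above (return an array of the same type with duplicates removed) can be illustrated outside Hive with a minimal Python sketch. This is not the UDF's implementation. The helper name `array_distinct` is reused here only for illustration, and keeping the first occurrence of each value in its original position is an assumption, since the issue text does not say how the result is ordered.

```python
from typing import Hashable, List, TypeVar

T = TypeVar("T", bound=Hashable)


def array_distinct(values: List[T]) -> List[T]:
    """Return a new list with duplicates removed.

    Assumption (not stated in the issue): the first occurrence of each
    value is kept and the original relative order is preserved.
    """
    seen = set()      # values already emitted
    result = []       # distinct values in first-seen order
    for value in values:
        if value not in seen:
            seen.add(value)
            result.append(value)
    return result


if __name__ == "__main__":
    # Same input as the example in the issue description.
    print(array_distinct(["b", "d", "d", "a"]))
```

The sketch leaves out anything specific to Hive, such as NULL elements, empty arrays, or nested types. How the real UDF treats those cases is not covered by the issue text.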
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=835074=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-835074 ]

ASF GitHub Bot logged work on HIVE-26754:
-----------------------------------------
    Author: ASF GitHub Bot
    Created on: 21/Dec/22 15:13
    Start Date: 21/Dec/22 15:13
    Worklog Time Spent: 10m
    Work Description: sonarcloud[bot] commented on PR #3806:
URL: https://github.com/apache/hive/pull/3806#issuecomment-1361463823

Kudos, SonarCloud Quality Gate passed!

0 Bugs (rating A)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
4 Code Smells (rating A)
No Coverage information
No Duplication information

Issue Time Tracking
-------------------
    Worklog Id: (was: 835074)
    Time Spent: 5h 20m (was: 5h 10m)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=834252=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-834252 ]

ASF GitHub Bot logged work on HIVE-26754:
-----------------------------------------
    Author: ASF GitHub Bot
    Created on: 16/Dec/22 22:38
    Start Date: 16/Dec/22 22:38
    Worklog Time Spent: 10m
    Work Description: sonarcloud[bot] commented on PR #3806:
URL: https://github.com/apache/hive/pull/3806#issuecomment-1355740091

Kudos, SonarCloud Quality Gate passed!

0 Bugs (rating A)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
4 Code Smells (rating A)
No Coverage information
No Duplication information

Issue Time Tracking
-------------------
    Worklog Id: (was: 834252)
    Time Spent: 5h 10m (was: 5h)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=834200=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-834200 ]

ASF GitHub Bot logged work on HIVE-26754:
-----------------------------------------
    Author: ASF GitHub Bot
    Created on: 16/Dec/22 16:57
    Start Date: 16/Dec/22 16:57
    Worklog Time Spent: 10m
    Work Description: sonarcloud[bot] commented on PR #3806:
URL: https://github.com/apache/hive/pull/3806#issuecomment-1355217340

Kudos, SonarCloud Quality Gate passed!

0 Bugs (rating A)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
4 Code Smells (rating A)
No Coverage information
No Duplication information

Issue Time Tracking
-------------------
    Worklog Id: (was: 834200)
    Time Spent: 5h (was: 4h 50m)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=834059=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-834059 ]

ASF GitHub Bot logged work on HIVE-26754:
-----------------------------------------
    Author: ASF GitHub Bot
    Created on: 16/Dec/22 07:08
    Start Date: 16/Dec/22 07:08
    Worklog Time Spent: 10m
    Work Description: sonarcloud[bot] commented on PR #3806:
URL: https://github.com/apache/hive/pull/3806#issuecomment-1354308391

Kudos, SonarCloud Quality Gate passed!

0 Bugs (rating A)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
4 Code Smells (rating A)
No Coverage information
No Duplication information

Issue Time Tracking
-------------------
    Worklog Id: (was: 834059)
    Time Spent: 4h 50m (was: 4h 40m)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=834048=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-834048 ]

ASF GitHub Bot logged work on HIVE-26754:
-----------------------------------------
    Author: ASF GitHub Bot
    Created on: 16/Dec/22 05:50
    Start Date: 16/Dec/22 05:50
    Worklog Time Spent: 10m
    Work Description: sonarcloud[bot] commented on PR #3806:
URL: https://github.com/apache/hive/pull/3806#issuecomment-1354254825

Kudos, SonarCloud Quality Gate passed!

0 Bugs (rating A)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
4 Code Smells (rating A)
No Coverage information
No Duplication information

Issue Time Tracking
-------------------
    Worklog Id: (was: 834048)
    Time Spent: 4h 40m (was: 4.5h)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=833950=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-833950 ]

ASF GitHub Bot logged work on HIVE-26754:
-----------------------------------------
    Author: ASF GitHub Bot
    Created on: 15/Dec/22 19:32
    Start Date: 15/Dec/22 19:32
    Worklog Time Spent: 10m
    Work Description: sonarcloud[bot] commented on PR #3806:
URL: https://github.com/apache/hive/pull/3806#issuecomment-1353607482

Kudos, SonarCloud Quality Gate passed!

0 Bugs (rating A)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
4 Code Smells (rating A)
No Coverage information
No Duplication information

Issue Time Tracking
-------------------
    Worklog Id: (was: 833950)
    Time Spent: 4.5h (was: 4h 20m)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=833760=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-833760 ]

ASF GitHub Bot logged work on HIVE-26754:
-----------------------------------------
    Author: ASF GitHub Bot
    Created on: 15/Dec/22 10:13
    Start Date: 15/Dec/22 10:13
    Worklog Time Spent: 10m
    Work Description: sonarcloud[bot] commented on PR #3806:
URL: https://github.com/apache/hive/pull/3806#issuecomment-1352838848

Kudos, SonarCloud Quality Gate passed!

0 Bugs (rating A)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
4 Code Smells (rating A)
No Coverage information
No Duplication information

Issue Time Tracking
-------------------
    Worklog Id: (was: 833760)
    Time Spent: 4h 20m (was: 4h 10m)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=832001=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-832001 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 08/Dec/22 10:17 Start Date: 08/Dec/22 10:17 Worklog Time Spent: 10m Work Description: sonarcloud[bot] commented on PR #3806: URL: https://github.com/apache/hive/pull/3806#issuecomment-1342458652 Kudos, SonarCloud Quality Gate passed! 0 Bugs (rating A), 0 Vulnerabilities (rating A), 0 Security Hotspots (rating A), 4 Code Smells (rating A), No Coverage information, No Duplication information. Issue Time Tracking --- Worklog Id: (was: 832001) Time Spent: 4h 10m (was: 4h) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 4h 10m > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed.
> Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=831667=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-831667 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 07/Dec/22 07:59 Start Date: 07/Dec/22 07:59 Worklog Time Spent: 10m Work Description: sonarcloud[bot] commented on PR #3806: URL: https://github.com/apache/hive/pull/3806#issuecomment-1340545069 Kudos, SonarCloud Quality Gate passed! 0 Bugs (rating A), 0 Vulnerabilities (rating A), 0 Security Hotspots (rating A), 5 Code Smells (rating A), No Coverage information, No Duplication information. Issue Time Tracking --- Worklog Id: (was: 831667) Time Spent: 4h (was: 3h 50m) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 4h > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed.
> Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830962=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830962 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 05/Dec/22 09:00 Start Date: 05/Dec/22 09:00 Worklog Time Spent: 10m Work Description: tarak271 commented on PR #3806: URL: https://github.com/apache/hive/pull/3806#issuecomment-1336982493 The test failure seems unrelated to these changes. The test 'orc_ppd_basic.q' fails even without my changes. Issue Time Tracking --- Worklog Id: (was: 830962) Time Spent: 3h 50m (was: 3h 40m) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 3h 50m > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed. > Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830689=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830689 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 02/Dec/22 17:15 Start Date: 02/Dec/22 17:15 Worklog Time Spent: 10m Work Description: sonarcloud[bot] commented on PR #3806: URL: https://github.com/apache/hive/pull/3806#issuecomment-133464 Kudos, SonarCloud Quality Gate passed! 0 Bugs (rating A), 0 Vulnerabilities (rating A), 0 Security Hotspots (rating A), 5 Code Smells (rating A), No Coverage information, No Duplication information. Issue Time Tracking --- Worklog Id: (was: 830689) Time Spent: 3h 40m (was: 3.5h) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 3h 40m > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed.
> Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830621=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830621 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 02/Dec/22 11:37 Start Date: 02/Dec/22 11:37 Worklog Time Spent: 10m Work Description: tarak271 commented on code in PR #3806: URL: https://github.com/apache/hive/pull/3806#discussion_r1038057429
## ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFArrayDistinct.java: ##
@@ -0,0 +1,59 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements. See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership. The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License. You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.hadoop.hive.ql.udf.generic;
+
+import org.apache.hadoop.hive.ql.exec.Description;
+import org.apache.hadoop.hive.ql.metadata.HiveException;
+import org.apache.hadoop.hive.serde2.objectinspector.ListObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector;
+
+import java.util.ArrayList;
+import java.util.List;
+import java.util.stream.Collectors;
+
+/**
+ * Generic UDF for distinct array
+ * ARRAY_DISTINCT(array(obj1, obj2, obj3...)).
+ *
+ * @see org.apache.hadoop.hive.ql.udf.generic.GenericUDF
+ */
+@Description(name = "array_distinct",
+    value = "_FUNC_(array(obj1, obj2,...)) - "
+        + "The function returns an array of the same type as the input array with distinct values.",
+    extended = "Example:\n"
+        + " > SELECT _FUNC_(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;\n"
+        + " ['b', 'd', 'a']")
+public class GenericUDFArrayDistinct extends AbstractGenericUDFArrayBase {
+
+  public GenericUDFArrayDistinct() {
+    super("ARRAY_DISTINCT", 1, 1, ObjectInspector.Category.LIST);
+  }
+
+  @Override
+  public Object evaluate(DeferredObject[] arguments) throws HiveException {
+
+    Object array = arguments[ARRAY_IDX].get();
+
+    if (arrayOI.getListLength(array) <= 0) {
+      return new ArrayList();
+    }
Review Comment: Also added another test class, TestGenericUDFArray, to simulate a null object; it can be reused across array functions.
Issue Time Tracking --- Worklog Id: (was: 830621) Time Spent: 3.5h (was: 3h 20m) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 3.5h > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed. > Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830619=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830619 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 02/Dec/22 11:35 Start Date: 02/Dec/22 11:35 Worklog Time Spent: 10m Work Description: tarak271 commented on code in PR #3806: URL: https://github.com/apache/hive/pull/3806#discussion_r1038055787
## ql/src/test/queries/clientpositive/udf_array_distinct.q: ##
@@ -0,0 +1,33 @@
+--! qt:dataset:src
+set hive.fetch.task.conversion=more;
Review Comment: Added flag
## ql/src/java/org/apache/hadoop/hive/ql/udf/generic/AbstractGenericUDFArrayBase.java: ##
@@ -106,4 +106,4 @@ ObjectInspector initListOI(ObjectInspector[] arguments) {
     return ObjectInspectorFactory.getStandardListObjectInspector(initOI(arguments));
   }
-}
+}
Review Comment: Done
Issue Time Tracking --- Worklog Id: (was: 830619) Time Spent: 3h 10m (was: 3h) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 3h 10m > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed. > Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830618=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830618 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 02/Dec/22 11:35 Start Date: 02/Dec/22 11:35 Worklog Time Spent: 10m Work Description: tarak271 commented on code in PR #3806: URL: https://github.com/apache/hive/pull/3806#discussion_r1038055541
## ql/src/test/queries/clientpositive/udf_array_distinct.q: ##
@@ -0,0 +1,33 @@
+--! qt:dataset:src
+set hive.fetch.task.conversion=more;
+
+DESCRIBE FUNCTION array_distinct;
+DESCRIBE FUNCTION EXTENDED array_distinct;
+
+
Issue Time Tracking --- Worklog Id: (was: 830618) Time Spent: 3h (was: 2h 50m) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 3h > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed. > Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830620=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830620 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 02/Dec/22 11:35 Start Date: 02/Dec/22 11:35 Worklog Time Spent: 10m Work Description: tarak271 commented on code in PR #3806: URL: https://github.com/apache/hive/pull/3806#discussion_r1038056071
## ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFArrayDistinct.java: ##
@@ -0,0 +1,59 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements. See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership. The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License. You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.hadoop.hive.ql.udf.generic;
+
+import org.apache.hadoop.hive.ql.exec.Description;
+import org.apache.hadoop.hive.ql.metadata.HiveException;
+import org.apache.hadoop.hive.serde2.objectinspector.ListObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector;
+
+import java.util.ArrayList;
+import java.util.List;
+import java.util.stream.Collectors;
+
+/**
+ * Generic UDF for distinct array
+ * ARRAY_DISTINCT(array(obj1, obj2, obj3...)).
+ *
+ * @see org.apache.hadoop.hive.ql.udf.generic.GenericUDF
+ */
+@Description(name = "array_distinct",
+    value = "_FUNC_(array(obj1, obj2,...)) - "
+        + "The function returns an array of the same type as the input array with distinct values.",
+    extended = "Example:\n"
+        + " > SELECT _FUNC_(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;\n"
+        + " ['b', 'd', 'a']")
+public class GenericUDFArrayDistinct extends AbstractGenericUDFArrayBase {
+
+  public GenericUDFArrayDistinct() {
+    super("ARRAY_DISTINCT", 1, 1, ObjectInspector.Category.LIST);
+  }
+
+  @Override
+  public Object evaluate(DeferredObject[] arguments) throws HiveException {
+
+    Object array = arguments[ARRAY_IDX].get();
+
+    if (arrayOI.getListLength(array) <= 0) {
+      return new ArrayList();
+    }
Review Comment: Changed as recommended
Issue Time Tracking --- Worklog Id: (was: 830620) Time Spent: 3h 20m (was: 3h 10m) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 3h 20m > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed. > Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830337=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830337 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 01/Dec/22 13:37 Start Date: 01/Dec/22 13:37 Worklog Time Spent: 10m Work Description: ayushtkn commented on code in PR #3806: URL: https://github.com/apache/hive/pull/3806#discussion_r1037110890
## ql/src/test/queries/clientpositive/udf_array_distinct.q: ##
@@ -0,0 +1,33 @@
+--! qt:dataset:src
+set hive.fetch.task.conversion=more;
Review Comment: There are too many select *, better to add ``
Issue Time Tracking --- Worklog Id: (was: 830337) Time Spent: 2h 50m (was: 2h 40m) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 2h 50m > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed. > Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830158=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830158 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 30/Nov/22 17:43 Start Date: 30/Nov/22 17:43 Worklog Time Spent: 10m Work Description: sonarcloud[bot] commented on PR #3806: URL: https://github.com/apache/hive/pull/3806#issuecomment-1332522742 Kudos, SonarCloud Quality Gate passed! 0 Bugs (rating A), 0 Vulnerabilities (rating A), 0 Security Hotspots (rating A), 0 Code Smells (rating A), No Coverage information, No Duplication information. Issue Time Tracking --- Worklog Id: (was: 830158) Time Spent: 2h 40m (was: 2.5h) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 2h 40m > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed.
> Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830032=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830032 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 30/Nov/22 12:18 Start Date: 30/Nov/22 12:18 Worklog Time Spent: 10m Work Description: tarak271 commented on code in PR #3806: URL: https://github.com/apache/hive/pull/3806#discussion_r1035898672
## ql/src/test/queries/clientpositive/udf_array_distinct.q: ##
@@ -0,0 +1,20 @@
+--! qt:dataset:src
+set hive.fetch.task.conversion=more;
+
+DESCRIBE FUNCTION array_distinct;
+DESCRIBE FUNCTION EXTENDED array_distinct;
+
+
Issue Time Tracking --- Worklog Id: (was: 830032) Time Spent: 2h 20m (was: 2h 10m) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 2h 20m > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed. > Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830033=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830033 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 30/Nov/22 12:18 Start Date: 30/Nov/22 12:18 Worklog Time Spent: 10m Work Description: tarak271 commented on code in PR #3806: URL: https://github.com/apache/hive/pull/3806#discussion_r1035898966
## ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFArrayDistinct.java: ##
@@ -0,0 +1,59 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements. See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership. The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License. You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.hadoop.hive.ql.udf.generic;
+
+import org.apache.hadoop.hive.ql.exec.Description;
+import org.apache.hadoop.hive.ql.metadata.HiveException;
+import org.apache.hadoop.hive.serde2.objectinspector.ListObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector;
+
+import java.util.List;
+import java.util.stream.Collectors;
+
+/**
+ * Generic UDF for distinct array
+ * ARRAY_DISTINCT(array(obj1, obj2, obj3...)).
+ *
+ * @see org.apache.hadoop.hive.ql.udf.generic.GenericUDF
+ */
+@Description(name = "array_distinct",
+    value = "_FUNC_(array(obj1, obj2,...)) - "
+        + "The function returns an array of the same type as the input argument where all duplicate"
+        + " values have been removed.",
+    extended = "Example:\n"
+        + " > SELECT _FUNC_(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;\n"
+        + " ['b', 'd', 'a']")
+public class GenericUDFArrayDistinct extends AbstractGenericUDFArrayBase {
+
+  public GenericUDFArrayDistinct() {
+    super("ARRAY_DISTINCT", 1, 1, ObjectInspector.Category.LIST);
+  }
+
+  @Override
+  public Object evaluate(DeferredObject[] arguments) throws HiveException {
+
+    Object array = arguments[ARRAY_IDX].get();
+
+    if (arrayOI.getListLength(array) <= 0) {
+      return null;
+    }
+
+    List retArray = ((ListObjectInspector) argumentOIs[ARRAY_IDX]).getList(array);
+    return convertArray(retArray.stream().distinct().collect(Collectors.toList()));
Review Comment: Made changes as recommended
Issue Time Tracking --- Worklog Id: (was: 830033) Time Spent: 2.5h (was: 2h 20m) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 2.5h > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed. > Example: > > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1; > ['b', 'd', 'a'] -- This message was sent by Atlassian Jira (v8.20.10#820010)
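For readers following the thread outside a Hive build, the de-duplication call quoted in the hunk above, `retArray.stream().distinct().collect(Collectors.toList())`, can be tried on a plain `java.util.List`. The sketch below is only an illustration under stated assumptions: the class `ArrayDistinctSketch` and its `distinct` helper are hypothetical names that do not appear in the PR, and Hive's object inspectors and `convertArray` are left out entirely. It shows that `Stream.distinct()` on a list-backed stream keeps the first occurrence of each element in encounter order, which agrees with the `['b', 'd', 'a']` result given in the UDF's `@Description`.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical stand-alone sketch; this is NOT the Hive class from the PR.
final class ArrayDistinctSketch {

    // Removes duplicates and keeps the first occurrence of each element.
    // Stream.distinct() is stable for ordered streams such as List.stream().
    static <T> List<T> distinct(List<T> input) {
        return input.stream().distinct().collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Same input as the example in the issue description.
        System.out.println(distinct(Arrays.asList("b", "d", "d", "a"))); // prints [b, d, a]
    }
}
```

Equality here is whatever `equals` and `hashCode` report for the element type; how that maps onto Hive's writable types is outside the scope of this sketch.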
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830031&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830031 ]

ASF GitHub Bot logged work on HIVE-26754:
---

    Author: ASF GitHub Bot
    Created on: 30/Nov/22 12:13
    Start Date: 30/Nov/22 12:13
    Worklog Time Spent: 10m
    Work Description: tarak271 commented on code in PR #3806:
URL: https://github.com/apache/hive/pull/3806#discussion_r1035894140

## ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFArrayDistinct.java: ##
@@ -0,0 +1,59 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements. See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership. The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License. You may obtain a copy of the License at
+ *
+ *     http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.hadoop.hive.ql.udf.generic;
+
+import org.apache.hadoop.hive.ql.exec.Description;
+import org.apache.hadoop.hive.ql.metadata.HiveException;
+import org.apache.hadoop.hive.serde2.objectinspector.ListObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector;
+
+import java.util.List;
+import java.util.stream.Collectors;
+
+/**
+ * Generic UDF for distinct array
+ * ARRAY_DISTINCT(array(obj1, obj2, obj3...)).
+ *
+ * @see org.apache.hadoop.hive.ql.udf.generic.GenericUDF
+ */
+@Description(name = "array_distinct",
+    value = "_FUNC_(array(obj1, obj2,...)) - "
+        + "The function returns an array of the same type as the input argument where all duplicate"
+        + " values have been removed.",
+    extended = "Example:\n"
+        + " > SELECT _FUNC_(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;\n"
+        + " ['b', 'd', 'a']")
+public class GenericUDFArrayDistinct extends AbstractGenericUDFArrayBase {
+
+  public GenericUDFArrayDistinct() {
+    super("ARRAY_DISTINCT", 1, 1, ObjectInspector.Category.LIST);
+  }
+
+  @Override
+  public Object evaluate(DeferredObject[] arguments) throws HiveException {
+
+    Object array = arguments[ARRAY_IDX].get();
+
+    if (arrayOI.getListLength(array) <= 0) {
+      return null;
+    }

Review Comment:
   changed to return empty array, return new ArrayList();

Issue Time Tracking
---

    Worklog Id: (was: 830031)
    Time Spent: 2h 10m (was: 2h)

> Implement array_distinct UDF to return an array after removing duplicates in it
> ---
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Labels: pull-request-available
> Time Spent: 2h 10m
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['a', 'b', 'c']

--
This message was sent by Atlassian Jira (v8.20.10#820010)
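The reply in the message above says the early return was changed to hand back an empty array (`new ArrayList()`) instead of `null`. A minimal sketch of that guard with plain Java lists could look like the following. `EmptyArrayHandlingSketch` and `distinctOrEmpty` are hypothetical names, and treating a `null` input the same way as an empty one is an assumption carried over from the quoted `<= 0` check, not something the comment confirms.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical illustration of the "return an empty array" guard; not the Hive code.
final class EmptyArrayHandlingSketch {

    static <T> List<T> distinctOrEmpty(List<T> input) {
        // Assumption: a missing list and an empty list both yield an empty result.
        if (input == null || input.isEmpty()) {
            return new ArrayList<>();
        }
        // Otherwise keep the first occurrence of each element.
        return input.stream().distinct().collect(Collectors.toList());
    }

    public static void main(String[] args) {
        System.out.println(distinctOrEmpty(new ArrayList<String>())); // prints []
    }
}
```

Returning an empty list lets callers iterate over the result without a null check, which is one plausible reason for the change. The thread itself does not state the motivation.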
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=830030&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-830030 ]

ASF GitHub Bot logged work on HIVE-26754:
---

    Author: ASF GitHub Bot
    Created on: 30/Nov/22 12:11
    Start Date: 30/Nov/22 12:11
    Worklog Time Spent: 10m
    Work Description: tarak271 commented on code in PR #3806:
URL: https://github.com/apache/hive/pull/3806#discussion_r1035892692

## ql/src/java/org/apache/hadoop/hive/ql/udf/generic/GenericUDFArrayDistinct.java: ##
@@ -0,0 +1,59 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements. See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership. The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License. You may obtain a copy of the License at
+ *
+ *     http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.hadoop.hive.ql.udf.generic;
+
+import org.apache.hadoop.hive.ql.exec.Description;
+import org.apache.hadoop.hive.ql.metadata.HiveException;
+import org.apache.hadoop.hive.serde2.objectinspector.ListObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector;
+
+import java.util.List;
+import java.util.stream.Collectors;
+
+/**
+ * Generic UDF for distinct array
+ * ARRAY_DISTINCT(array(obj1, obj2, obj3...)).
+ *
+ * @see org.apache.hadoop.hive.ql.udf.generic.GenericUDF
+ */
+@Description(name = "array_distinct",
+    value = "_FUNC_(array(obj1, obj2,...)) - "
+        + "The function returns an array of the same type as the input argument where all duplicate"
+        + " values have been removed.",

Review Comment:
   Changed description as recommended

Issue Time Tracking
---

    Worklog Id: (was: 830030)
    Time Spent: 2h (was: 1h 50m)

> Implement array_distinct UDF to return an array after removing duplicates in it
> ---
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Labels: pull-request-available
> Time Spent: 2h
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['a', 'b', 'c']

--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=829334&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-829334 ]

ASF GitHub Bot logged work on HIVE-26754:
---

    Author: ASF GitHub Bot
    Created on: 28/Nov/22 14:09
    Start Date: 28/Nov/22 14:09
    Worklog Time Spent: 10m
    Work Description: ayushtkn commented on code in PR #3806:
URL: https://github.com/apache/hive/pull/3806#discussion_r1033556476

## ql/src/test/queries/clientpositive/udf_array_distinct.q: ##
@@ -0,0 +1,20 @@
+--! qt:dataset:src
+set hive.fetch.task.conversion=more;
+
+DESCRIBE FUNCTION array_distinct;
+DESCRIBE FUNCTION EXTENDED array_distinct;
+
+

Issue Time Tracking
---

    Worklog Id: (was: 829334)
    Time Spent: 1h 50m (was: 1h 40m)

> Implement array_distinct UDF to return an array after removing duplicates in it
> ---
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Labels: pull-request-available
> Time Spent: 1h 50m
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['a', 'b', 'c']

--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=828849&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-828849 ]

ASF GitHub Bot logged work on HIVE-26754:
---

    Author: ASF GitHub Bot
    Created on: 25/Nov/22 08:57
    Start Date: 25/Nov/22 08:57
    Worklog Time Spent: 10m
    Work Description: sonarcloud[bot] commented on PR #3806:
URL: https://github.com/apache/hive/pull/3806#issuecomment-1327176696

Kudos, SonarCloud Quality Gate passed!

0 Bugs (rating A)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
1 Code Smell (rating A)
No Coverage information
No Duplication information

Issue Time Tracking
---

    Worklog Id: (was: 828849)
    Time Spent: 1h 40m (was: 1.5h)

> Implement array_distinct UDF to return an array after removing duplicates in it
> ---
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Labels: pull-request-available
> Time Spent: 1h 40m
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['a', 'b', 'c']

--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=828742&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-828742 ]

ASF GitHub Bot logged work on HIVE-26754:
---

    Author: ASF GitHub Bot
    Created on: 24/Nov/22 14:50
    Start Date: 24/Nov/22 14:50
    Worklog Time Spent: 10m
    Work Description: sonarcloud[bot] commented on PR #3806:
URL: https://github.com/apache/hive/pull/3806#issuecomment-1326550765

Kudos, SonarCloud Quality Gate passed!

0 Bugs (rating A)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
8 Code Smells (rating A)
No Coverage information
No Duplication information

Issue Time Tracking
---

    Worklog Id: (was: 828742)
    Time Spent: 1.5h (was: 1h 20m)

> Implement array_distinct UDF to return an array after removing duplicates in it
> ---
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Labels: pull-request-available
> Time Spent: 1.5h
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['a', 'b', 'c']

--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=828724&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-828724 ]

ASF GitHub Bot logged work on HIVE-26754:
---

    Author: ASF GitHub Bot
    Created on: 24/Nov/22 14:06
    Start Date: 24/Nov/22 14:06
    Worklog Time Spent: 10m
    Work Description: tarak271 commented on PR #3777:
URL: https://github.com/apache/hive/pull/3777#issuecomment-1326500204

Closing this as duplicate of https://github.com/apache/hive/pull/3806

Issue Time Tracking
---

    Worklog Id: (was: 828724)
    Time Spent: 1h 10m (was: 1h)

> Implement array_distinct UDF to return an array after removing duplicates in it
> ---
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Labels: pull-request-available
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['a', 'b', 'c']

--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=828723&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-828723 ]

ASF GitHub Bot logged work on HIVE-26754:
---

    Author: ASF GitHub Bot
    Created on: 24/Nov/22 14:06
    Start Date: 24/Nov/22 14:06
    Worklog Time Spent: 10m
    Work Description: tarak271 opened a new pull request, #3806:
URL: https://github.com/apache/hive/pull/3806

…moving duplicates in it

### What changes were proposed in this pull request?
Implement array_distinct function in Hive

### Why are the changes needed?
This enhancement is already implemented in Spark

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Created Junit tests as well as qtests as part of this change

Issue Time Tracking
---

    Worklog Id: (was: 828723)
    Time Spent: 1h (was: 50m)

> Implement array_distinct UDF to return an array after removing duplicates in it
> ---
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Labels: pull-request-available
> Time Spent: 1h
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['a', 'b', 'c']

--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=828725&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-828725 ]

ASF GitHub Bot logged work on HIVE-26754:
---

    Author: ASF GitHub Bot
    Created on: 24/Nov/22 14:06
    Start Date: 24/Nov/22 14:06
    Worklog Time Spent: 10m
    Work Description: tarak271 closed pull request #3777: HIVE-26754 - Implement array_distinct UDF to return an array after removing duplicates in it
URL: https://github.com/apache/hive/pull/3777

Issue Time Tracking
---

    Worklog Id: (was: 828725)
    Time Spent: 1h 20m (was: 1h 10m)

> Implement array_distinct UDF to return an array after removing duplicates in it
> ---
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Labels: pull-request-available
> Time Spent: 1h 20m
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['a', 'b', 'c']

--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=827569&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-827569 ]

ASF GitHub Bot logged work on HIVE-26754:
---

    Author: ASF GitHub Bot
    Created on: 21/Nov/22 13:01
    Start Date: 21/Nov/22 13:01
    Worklog Time Spent: 10m
    Work Description: SourabhBadhya commented on code in PR #3777:
URL: https://github.com/apache/hive/pull/3777#discussion_r1028009527

## ql/src/java/org/apache/hadoop/hive/ql/udf/generic/AbstractGenericUDFArrayBase.java: ##
@@ -0,0 +1,160 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements. See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership. The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License. You may obtain a copy of the License at
+ *
+ *     http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.hadoop.hive.ql.udf.generic;
+
+import org.apache.hadoop.hive.ql.exec.UDFArgumentException;
+import org.apache.hadoop.hive.ql.exec.UDFArgumentTypeException;
+import org.apache.hadoop.hive.serde.serdeConstants;
+
+import org.apache.hadoop.hive.serde2.objectinspector.ListObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorFactory;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters.Converter;
+import org.apache.hadoop.hive.serde2.objectinspector.PrimitiveObjectInspector;
+
+import java.util.ArrayList;
+import java.util.List;
+
+/**
+ * Abstract GenericUDF for array functions
+ */
+
+public abstract class AbstractGenericUDFArrayBase extends GenericUDF {
+
+  static final int ARRAY_IDX = 0;
+  static final int ARRAY2_IDX = 1;
+  static final int START_IDX = 1;
+  static final int LENGTH_IDX = 2;
+  static final int SEPARATOR_IDX = 1;
+  static final int REPLACE_NULL_IDX = 2;
+
+  int MIN_ARG_COUNT;
+  int MAX_ARG_COUNT;
+
+  transient ListObjectInspector arrayOI;
+  transient ObjectInspector[] argumentOIs;
+
+  transient Converter converter;
+
+  enum FUNC_NAMES {
+    ARRAY_MAX, ARRAY_MIN, ARRAY_DISTINCT, ARRAY_SLICE, ARRAY_JOIN, ARRAY_EXCEPT, ARRAY_INTERSECT
+  }
+
+  FUNC_NAMES FUNC_NAME;
+
+  @Override
+  public ObjectInspector initialize(ObjectInspector[] arguments)
+      throws UDFArgumentException {
+
+    // Check if wrong number of arguments were passed
+    checkArgsSize(arguments, MIN_ARG_COUNT, MAX_ARG_COUNT);
+
+    // Check if the argument is of category LIST or not
+    checkArgCategory(arguments, ARRAY_IDX, ObjectInspector.Category.LIST, FUNC_NAME,
+        org.apache.hadoop.hive.serde.serdeConstants.LIST_TYPE_NAME);
+
+    if (FUNC_NAME == FUNC_NAMES.ARRAY_EXCEPT
+        || FUNC_NAME == FUNC_NAMES.ARRAY_INTERSECT
+        || FUNC_NAME == FUNC_NAMES.ARRAY_JOIN) {
+      checkArgCategory(arguments, ARRAY2_IDX, ObjectInspector.Category.LIST, FUNC_NAME,
+          org.apache.hadoop.hive.serde.serdeConstants.LIST_TYPE_NAME);
+    }
+
+    if (FUNC_NAME == FUNC_NAMES.ARRAY_SLICE) {
+      PrimitiveObjectInspector startIndexObjectInspector = (PrimitiveObjectInspector) arguments[START_IDX];
+      PrimitiveObjectInspector lengthObjectInspector = (PrimitiveObjectInspector) arguments[LENGTH_IDX];
+      checkArgIntPrimitiveCategory(startIndexObjectInspector, FUNC_NAME, 2);
+      checkArgIntPrimitiveCategory(lengthObjectInspector, FUNC_NAME, 3);
+    }
+
+    arrayOI = (ListObjectInspector) arguments[ARRAY_IDX];
+    argumentOIs = arguments;
+
+    //return initialize(arguments);
+    return initListOI(arguments);
+  }
+
+  @Override
+  public String getDisplayString(String[] children) {
+    assert (children.length == MIN_ARG_COUNT);
+    return FUNC_NAME.toString().toLowerCase() + "(" + children[ARRAY_IDX] + ")";
+  }
+
+  List convertArray(List objects) {
+    List ret = new ArrayList<>();
+    for (Object o : objects) {
+      ret.add(converter.convert(o));
+    }
+    return ret;
+  }
+
+  void checkArgCategory(ObjectInspector[] arguments, int idx, Enum category,
+      FUNC_NAMES
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=827246&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-827246 ]

ASF GitHub Bot logged work on HIVE-26754:
---

    Author: ASF GitHub Bot
    Created on: 18/Nov/22 18:46
    Start Date: 18/Nov/22 18:46
    Worklog Time Spent: 10m
    Work Description: sonarcloud[bot] commented on PR #3777:
URL: https://github.com/apache/hive/pull/3777#issuecomment-1320402712

Kudos, SonarCloud Quality Gate passed!

2 Bugs (rating C)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
15 Code Smells (rating A)
No Coverage information
No Duplication information

Issue Time Tracking
---

    Worklog Id: (was: 827246)
    Time Spent: 40m (was: 0.5h)

> Implement array_distinct UDF to return an array after removing duplicates in it
> ---
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Labels: pull-request-available
> Time Spent: 40m
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['a', 'b', 'c']

--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=826946=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-826946 ] ASF GitHub Bot logged work on HIVE-26754: - Author: ASF GitHub Bot Created on: 17/Nov/22 19:05 Start Date: 17/Nov/22 19:05 Worklog Time Spent: 10m Work Description: sonarcloud[bot] commented on PR #3777: URL: https://github.com/apache/hive/pull/3777#issuecomment-1319077349 Kudos, SonarCloud Quality Gate passed! [![Quality Gate passed](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/checks/QualityGateBadge/passed-16px.png 'Quality Gate passed')](https://sonarcloud.io/dashboard?id=apache_hive=3777) [![Bug](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/common/bug-16px.png 'Bug')](https://sonarcloud.io/project/issues?id=apache_hive=3777=false=BUG) [![C](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/checks/RatingBadge/C-16px.png 'C')](https://sonarcloud.io/project/issues?id=apache_hive=3777=false=BUG) [2 Bugs](https://sonarcloud.io/project/issues?id=apache_hive=3777=false=BUG) [![Vulnerability](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/common/vulnerability-16px.png 'Vulnerability')](https://sonarcloud.io/project/issues?id=apache_hive=3777=false=VULNERABILITY) [![A](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/checks/RatingBadge/A-16px.png 'A')](https://sonarcloud.io/project/issues?id=apache_hive=3777=false=VULNERABILITY) [0 Vulnerabilities](https://sonarcloud.io/project/issues?id=apache_hive=3777=false=VULNERABILITY) [![Security Hotspot](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/common/security_hotspot-16px.png 'Security Hotspot')](https://sonarcloud.io/project/security_hotspots?id=apache_hive=3777=false=SECURITY_HOTSPOT) [![A](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/checks/RatingBadge/A-16px.png 
'A')](https://sonarcloud.io/project/security_hotspots?id=apache_hive=3777=false=SECURITY_HOTSPOT) [0 Security Hotspots](https://sonarcloud.io/project/security_hotspots?id=apache_hive=3777=false=SECURITY_HOTSPOT) [![Code Smell](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/common/code_smell-16px.png 'Code Smell')](https://sonarcloud.io/project/issues?id=apache_hive=3777=false=CODE_SMELL) [![A](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/checks/RatingBadge/A-16px.png 'A')](https://sonarcloud.io/project/issues?id=apache_hive=3777=false=CODE_SMELL) [15 Code Smells](https://sonarcloud.io/project/issues?id=apache_hive=3777=false=CODE_SMELL) [![No Coverage information](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/checks/CoverageChart/NoCoverageInfo-16px.png 'No Coverage information')](https://sonarcloud.io/component_measures?id=apache_hive=3777=coverage=list) No Coverage information [![No Duplication information](https://sonarsource.github.io/sonarcloud-github-static-resources/v2/checks/Duplications/NoDuplicationInfo-16px.png 'No Duplication information')](https://sonarcloud.io/component_measures?id=apache_hive=3777=duplicated_lines_density=list) No Duplication information Issue Time Tracking --- Worklog Id: (was: 826946) Time Spent: 0.5h (was: 20m) > Implement array_distinct UDF to return an array after removing duplicates in > it > --- > > Key: HIVE-26754 > URL: https://issues.apache.org/jira/browse/HIVE-26754 > Project: Hive > Issue Type: Sub-task > Components: Hive >Reporter: Taraka Rama Rao Lethavadla >Assignee: Taraka Rama Rao Lethavadla >Priority: Major > Labels: pull-request-available > Time Spent: 0.5h > Remaining Estimate: 0h > > *array_distinct(array(obj1, obj2,...))* - The function returns an array of > the same type as the input argument where all duplicate values have been > removed. 
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['b', 'd', 'a']
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=826917=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-826917 ]

ASF GitHub Bot logged work on HIVE-26754:
-
Author: ASF GitHub Bot
Created on: 17/Nov/22 17:25
Start Date: 17/Nov/22 17:25
Worklog Time Spent: 10m
Work Description: scarlin-cloudera commented on code in PR #3777:
URL: https://github.com/apache/hive/pull/3777#discussion_r1025468514

## ql/src/java/org/apache/hadoop/hive/ql/udf/generic/AbstractGenericUDFArrayBase.java: ##
@@ -0,0 +1,160 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements. See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership. The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License. You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.hadoop.hive.ql.udf.generic;
+
+import org.apache.hadoop.hive.ql.exec.UDFArgumentException;
+import org.apache.hadoop.hive.ql.exec.UDFArgumentTypeException;
+import org.apache.hadoop.hive.serde.serdeConstants;
+
+import org.apache.hadoop.hive.serde2.objectinspector.ListObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorFactory;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters.Converter;
+import org.apache.hadoop.hive.serde2.objectinspector.PrimitiveObjectInspector;
+
+import java.util.ArrayList;
+import java.util.List;
+
+/**
+ * Abstract GenericUDF for array functions
+ */
+
+public abstract class AbstractGenericUDFArrayBase extends GenericUDF {
+
+  static final int ARRAY_IDX = 0;
+  static final int ARRAY2_IDX = 1;
+  static final int START_IDX = 1;
+  static final int LENGTH_IDX = 2;
+  static final int SEPARATOR_IDX = 1;
+  static final int REPLACE_NULL_IDX = 2;
+
+  int MIN_ARG_COUNT;

Review Comment:
Variable names should be lower camel case. Also, in cases like this, my first preference is to declare these as private and have derived classes retrieve them through getter methods. I don't oppose giving child classes direct access, but when I do that I usually go with "protected", which doesn't carry as much meaning in Java, but it still shows me as a developer that child classes are going to use it.

## ql/src/java/org/apache/hadoop/hive/ql/udf/generic/AbstractGenericUDFArrayBase.java: ##
@@ -0,0 +1,160 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements. See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership. The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License. You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.hadoop.hive.ql.udf.generic;
+
+import org.apache.hadoop.hive.ql.exec.UDFArgumentException;
+import org.apache.hadoop.hive.ql.exec.UDFArgumentTypeException;
+import org.apache.hadoop.hive.serde.serdeConstants;
+
+import org.apache.hadoop.hive.serde2.objectinspector.ListObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorFactory;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters;
+import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorConverters.Converter;
+import org.apache.hadoop.hive.serde2.objectinspector.PrimitiveObjectInspector;
+
+import java.util.ArrayList;
+import java.util.List;
+
+/**
+ * Abstract GenericUDF for array functions
+ */
+
+public abstract class AbstractGenericUDFArrayBase extends GenericUDF {
+
+  static final int ARRAY_IDX = 0;
+  static final int ARRAY2_IDX =
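The reviewer's preference in the comment above (a private, camel-case field read through a getter instead of a package-visible field named like a constant) can be sketched roughly as follows. This is only an illustration: `ArrayBaseSketch`, `ArrayUdfBase`, `ArrayDistinctUdf`, and `checkArgCount` are hypothetical names, the classes do not extend Hive's `GenericUDF`, and nothing here is taken from the actual patch.

```java
public class ArrayBaseSketch {

  // Hypothetical base class: the minimum argument count is private and
  // camel-cased, and subclasses read it through a protected getter.
  abstract static class ArrayUdfBase {
    private final int minArgCount;

    protected ArrayUdfBase(int minArgCount) {
      this.minArgCount = minArgCount;
    }

    protected int getMinArgCount() {
      return minArgCount;
    }

    // Shared check inherited by every subclass (an assumed helper).
    void checkArgCount(int actual) {
      if (actual < minArgCount) {
        throw new IllegalArgumentException(
            "expected at least " + minArgCount + " argument(s), got " + actual);
      }
    }
  }

  // Hypothetical subclass: supplies its own limit through the constructor.
  static class ArrayDistinctUdf extends ArrayUdfBase {
    ArrayDistinctUdf() {
      super(1);
    }
  }

  public static void main(String[] args) {
    ArrayDistinctUdf udf = new ArrayDistinctUdf();
    udf.checkArgCount(1); // one argument satisfies the minimum
    System.out.println("minArgCount = " + udf.getMinArgCount());
  }
}
```

Passing the value through the constructor keeps the field `final`, so a subclass cannot leave it unset, which a bare `int MIN_ARG_COUNT;` field would allow.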
[jira] [Work logged] (HIVE-26754) Implement array_distinct UDF to return an array after removing duplicates in it
[ https://issues.apache.org/jira/browse/HIVE-26754?focusedWorklogId=826881=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-826881 ]

ASF GitHub Bot logged work on HIVE-26754:
-
Author: ASF GitHub Bot
Created on: 17/Nov/22 15:08
Start Date: 17/Nov/22 15:08
Worklog Time Spent: 10m
Work Description: tarak271 opened a new pull request, #3777:
URL: https://github.com/apache/hive/pull/3777

### What changes were proposed in this pull request?
Implement the array_distinct function in Hive.

### Why are the changes needed?
Spark already provides this function; this change adds the equivalent to Hive.

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Added JUnit tests and qtests as part of this change.

Issue Time Tracking
---
Worklog Id: (was: 826881)
Remaining Estimate: 0h
Time Spent: 10m

> Implement array_distinct UDF to return an array after removing duplicates in it
> ---
>
> Key: HIVE-26754
> URL: https://issues.apache.org/jira/browse/HIVE-26754
> Project: Hive
> Issue Type: Sub-task
> Components: Hive
> Reporter: Taraka Rama Rao Lethavadla
> Assignee: Taraka Rama Rao Lethavadla
> Priority: Major
> Time Spent: 10m
> Remaining Estimate: 0h
>
> *array_distinct(array(obj1, obj2,...))* - The function returns an array of
> the same type as the input argument where all duplicate values have been
> removed.
> Example:
> > SELECT array_distinct(array('b', 'd', 'd', 'a')) FROM src LIMIT 1;
> ['b', 'd', 'a']
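The behaviour described in the issue can be sketched in plain Java, without Hive's ObjectInspector machinery. This is a minimal illustration and not the code from the pull request: `ArrayDistinctSketch` and `arrayDistinct` are hypothetical names, and both keeping first-occurrence order and returning null for a null array are assumptions that the issue text does not spell out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;

public class ArrayDistinctSketch {

  // Hypothetical helper: returns a new list of the same element type with
  // duplicates removed. LinkedHashSet keeps the first occurrence of each
  // value in its original position (an assumed ordering).
  static <T> List<T> arrayDistinct(List<T> input) {
    if (input == null) {
      return null; // assumption: a NULL array yields NULL
    }
    return new ArrayList<>(new LinkedHashSet<>(input));
  }

  public static void main(String[] args) {
    List<String> result = arrayDistinct(Arrays.asList("b", "d", "d", "a"));
    System.out.println(result); // prints [b, d, a]
  }
}
```

The same input as in the issue's example, `array('b', 'd', 'd', 'a')`, loses only the repeated `'d'`; no value that was absent from the input can appear in the result.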