[
https://issues.apache.org/jira/browse/TAJO-691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13950814#comment-13950814
]
Hudson commented on TAJO-691:
-----------------------------
SUCCESS: Integrated in Tajo-0.8.0-build #48 (See
[https://builds.apache.org/job/Tajo-0.8.0-build/48/])
TAJO-691: HashJoin or HashAggregation is too slow if there is many unique keys.
(hyoungjunkim via hyunsik) (hyunsik: rev
ebc60c51e819432fda8a19618cb4ff9323168ddd)
*
tajo-core/tajo-core-backend/src/test/resources/results/TestBuiltinFunctions/testRandom.result
* tajo-storage/src/main/java/org/apache/tajo/storage/VTuple.java
*
tajo-core/tajo-core-backend/src/test/resources/results/TestBuiltinFunctions/testAvgDouble.result
* tajo-rpc/src/main/java/org/apache/tajo/rpc/NettyClientBase.java
*
tajo-core/tajo-core-backend/src/test/resources/queries/TestGroupByQuery/testGroupBy4.sql
*
tajo-core/tajo-core-backend/src/test/resources/queries/TestJoinQuery/testJoinCoReferredEvalsWithSameExprs2.sql
*
tajo-core/tajo-core-backend/src/test/resources/queries/TestBuiltinFunctions/testAvgDouble.sql
*
tajo-core/tajo-core-backend/src/test/resources/results/TestCaseByCases/testTAJO415Case.result
*
tajo-core/tajo-core-backend/src/test/resources/results/TestJoinQuery/testJoinCoReferredEvalsWithSameExprs2.result
*
tajo-core/tajo-core-backend/src/test/resources/queries/TestNetTypes/testGroupby.sql
*
tajo-core/tajo-core-backend/src/test/resources/results/TestGroupByQuery/testHavingWithNamedTarget.result
*
tajo-core/tajo-core-backend/src/test/resources/results/TestNetTypes/testGroupby.result
*
tajo-core/tajo-core-backend/src/test/resources/results/TestJoinQuery/testJoinCoReferredEvalsWithSameExprs1.result
*
tajo-core/tajo-core-backend/src/test/resources/results/TestNetTypes/testGroupby2.result
*
tajo-core/tajo-core-backend/src/test/resources/queries/TestJoinQuery/testJoinCoReferredEvalsWithSameExprs1.sql
*
tajo-core/tajo-core-backend/src/test/resources/queries/TestNetTypes/testGroupby2.sql
* tajo-storage/src/main/java/org/apache/tajo/storage/LazyTuple.java
* CHANGES.txt
*
tajo-core/tajo-core-backend/src/test/resources/queries/TestCaseByCases/testTAJO415Case.sql
*
tajo-core/tajo-core-backend/src/test/resources/results/TestGroupByQuery/testGroupBy4.result
> HashJoin or HashAggregation is too slow if there is many unique keys
> --------------------------------------------------------------------
>
> Key: TAJO-691
> URL: https://issues.apache.org/jira/browse/TAJO-691
> Project: Tajo
> Issue Type: Improvement
> Reporter: hyoungjunkim
> Assignee: hyoungjunkim
> Fix For: 0.8-incubating, 1.0-incubating
>
> Attachments: TAJO-691.patch, TAJO-691_2.patch
>
>
> HashJoin or HashAggregation is too slow if there is many unique keys.
> Java's native Map is inefficient to handle many items. In case more than 1
> million items in HashMap, Adding 10000 items takes more than 7 ~ 10 seconds.
>
> This should be improved.
--
This message was sent by Atlassian JIRA
(v6.2#6252)