[ https://issues.apache.org/jira/browse/TAJO-879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14036870#comment-14036870 ]
ASF GitHub Bot commented on TAJO-879:
-------------------------------------
GitHub user babokim opened a pull request:
https://github.com/apache/tajo/pull/43
TAJO-879: Some data is missing in the case of BROADCAST JOIN and
multi-column partition.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/babokim/tajo TAJO-879
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/tajo/pull/43.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #43
----
commit 81d811a8549ac8e63c43a72a179a78efe5e694f2
Author: 김형준 <[email protected]>
Date: 2014-06-19T02:32:59Z
TAJO-879: Some data is missing in the case of BROADCAST JOIN and
multi-column partition.
----
> Some data is missing in the case of BROADCAST JOIN and multi-column partition.
> ------------------------------------------------------------------------------
>
> Key: TAJO-879
> URL: https://issues.apache.org/jira/browse/TAJO-879
> Project: Tajo
> Issue Type: Bug
> Reporter: Hyoungjun Kim
> Assignee: Hyoungjun Kim
> Priority: Trivial
>
> If the table's data directory is laid out as follows, some data is missing from the query result.
> {noformat}
> /tajo/warehouse/table1/year=2014/month=01/hour=12/part-01-00000
> /tajo/warehouse/table1/year=2014/month=02/hour=12/part-01-00000
> {noformat}
> SeqScanExec uses the name of the last partition column's directory as the
> broadcast table's cache key.
> In this case the table is partitioned by year, month, and hour, so the cache
> key is "hour=12" for both directories, which is not unique.
> The cache key should be unique for each partition directory.
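
The collision described above can be illustrated with a small Java sketch. This is not Tajo's SeqScanExec code: the class and method names (CacheKeyDemo, lastColumnKey, fullPartitionKey) are hypothetical, and keying on the full partition path is only one assumed way to make the key unique, not necessarily what the patch does.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; none of these names come from Tajo.
class CacheKeyDemo {
    // Key built from the last partition directory only, e.g. "hour=12".
    static String lastColumnKey(String filePath) {
        String dir = filePath.substring(0, filePath.lastIndexOf('/'));
        return dir.substring(dir.lastIndexOf('/') + 1);
    }

    // Key built from every partition directory below the table path,
    // e.g. "year=2014/month=01/hour=12" (assumed alternative).
    static String fullPartitionKey(String tablePath, String filePath) {
        String dir = filePath.substring(0, filePath.lastIndexOf('/'));
        return dir.substring(tablePath.length() + 1);
    }

    public static void main(String[] args) {
        String table = "/tajo/warehouse/table1";
        List<String> files = Arrays.asList(
            table + "/year=2014/month=01/hour=12/part-01-00000",
            table + "/year=2014/month=02/hour=12/part-01-00000");

        Set<String> shortKeys = new HashSet<>();
        Set<String> fullKeys = new HashSet<>();
        for (String f : files) {
            shortKeys.add(lastColumnKey(f));
            fullKeys.add(fullPartitionKey(table, f));
        }
        // Two partitions share one short key but have two distinct full keys.
        System.out.println("distinct last-column keys: " + shortKeys.size());
        System.out.println("distinct full-path keys: " + fullKeys.size());
    }
}
```

With the two directories from the report, both files map to the same last-column key, so a cache keyed that way would hold an entry for only one of the two partitions.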