[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907805#action_12907805
]
Olga Natkovich commented on PIG-1518:
-
Hi Justin, thanks for the patch!
I don't think we
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12903528#action_12903528
]
Yan Zhou commented on PIG-1518:
---
All other functionalities except for the two mentioned in the
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12903525#action_12903525
]
Yan Zhou commented on PIG-1518:
---
In summary, the following functionalities won't see splits com
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12903501#action_12903501
]
Olga Natkovich commented on PIG-1518:
-
After discussion with Ashutosh and Yan tha agreeme
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12903423#action_12903423
]
Yan Zhou commented on PIG-1518:
---
MergeJoinIndexer and IndexableLoadFunc are both not combinable
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12903283#action_12903283
]
Ashutosh Chauhan commented on PIG-1518:
---
Yan,
Sorry for being late on this now thats i
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12903102#action_12903102
]
Yan Zhou commented on PIG-1518:
---
It is not combinable if the loader is a CollectableLoadFunc AN
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12903031#action_12903031
]
Dmitriy V. Ryaboy commented on PIG-1518:
This is a great feature, thanks Yan.
Could
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12902350#action_12902350
]
Mridul Muralidharan commented on PIG-1518:
--
Might be a good idea to contact aruniyer
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12901600#action_12901600
]
Richard Ding commented on PIG-1518:
---
+1. The patch looks good.
A few of minor points:
* I
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12900123#action_12900123
]
Yan Zhou commented on PIG-1518:
---
No. It does not work inside an optimizer as logical/physical p
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1292#action_1292
]
Mridul Muralidharan commented on PIG-1518:
--
if optimizer is turned off, does this al
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12899888#action_12899888
]
Yan Zhou commented on PIG-1518:
---
In summary, the split combination's controllables are through
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12899609#action_12899609
]
Yan Zhou commented on PIG-1518:
---
The formatting of the table of the last comment is a bit off:
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12899605#action_12899605
]
Yan Zhou commented on PIG-1518:
---
One experimental result on a 15-node cluster of 2 x Xeon L5420
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12899445#action_12899445
]
Yan Zhou commented on PIG-1518:
---
Another approach is to mark splits as uncombinable only when n
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12898648#action_12898648
]
Ashutosh Chauhan commented on PIG-1518:
---
This feature of combining multiple splits shou
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12898490#action_12898490
]
Yan Zhou commented on PIG-1518:
---
There is a bigger question at hand. The semantics of OrderedLo
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12897887#action_12897887
]
Yan Zhou commented on PIG-1518:
---
During the merge process, any empty splits will be skipped. Cu
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12897493#action_12897493
]
Yan Zhou commented on PIG-1518:
---
Right, map side cogroup needs the sortness of the input, but j
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12897368#action_12897368
]
Alan Gates commented on PIG-1518:
-
bq. For mapside cogroup or mapside group by, though, the s
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12897085#action_12897085
]
Yan Zhou commented on PIG-1518:
---
The pseudo code of the combination op is as follows:
for each
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12895338#action_12895338
]
Yan Zhou commented on PIG-1518:
---
To provide a safe valve for any input fomats that might dislik
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12895335#action_12895335
]
Yan Zhou commented on PIG-1518:
---
The combination algorithm currently does not consider rack-loc
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12894778#action_12894778
]
Yan Zhou commented on PIG-1518:
---
In contrast with Hive, where the CombineFileInputFormat is use
[
https://issues.apache.org/jira/browse/PIG-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12894205#action_12894205
]
Yan Zhou commented on PIG-1518:
---
CombinedInputFormat, in lieu of the deprecated MultiFileInputF
26 matches
Mail list logo