[
https://issues.apache.org/jira/browse/HADOOP-17042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17109240#comment-17109240
]
Aki Tanaka commented on HADOOP-17042:
-
Thank you, [~aajisaka]
> In addition, can
[
https://issues.apache.org/jira/browse/HADOOP-17042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-17042:
Attachment: HADOOP-17042.patch
> Hadoop distcp throws "ERROR: Tools helper
> ///usr/lib/hadoop/l
Aki Tanaka created HADOOP-17042:
---
Summary: Hadoop distcp throws "ERROR: Tools helper
///usr/lib/hadoop/libexec/tools/hadoop-distcp.sh was not found"
Key: HADOOP-17042
URL: https://issues.apache.org/jira/browse/HADOO
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16391568#comment-16391568
]
Aki Tanaka commented on HADOOP-15206:
-
In my understanding, skipBytes-- will not be
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-15206:
Attachment: HADOOP-15206.008.patch
> BZip2 drops and duplicates records when input split size is s
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16366061#comment-16366061
]
Aki Tanaka commented on HADOOP-15206:
-
Thank you. Since this is the first time to ap
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16365134#comment-16365134
]
Aki Tanaka commented on HADOOP-15206:
-
Thank you!! I updated the patch.
* Changed t
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-15206:
Attachment: HADOOP-15206.007.patch
> BZip2 drops and duplicates records when input split size is s
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-15206:
Attachment: HADOOP-15206.006.patch
> BZip2 drops and duplicates records when input split size is s
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16364611#comment-16364611
]
Aki Tanaka commented on HADOOP-15206:
-
Thank you very much for the review!
I've add
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16362627#comment-16362627
]
Aki Tanaka commented on HADOOP-15206:
-
Thank you, I updated the patch.
Implement
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-15206:
Attachment: HADOOP-15206.005.patch
> BZip2 drops and duplicates records when input split size is s
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16359803#comment-16359803
]
Aki Tanaka commented on HADOOP-15206:
-
Hi Jason,
I really appreciate your code revi
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-15206:
Attachment: HADOOP-15206.004.patch
> BZip2 drops and duplicates records when input split size is s
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16357510#comment-16357510
]
Aki Tanaka commented on HADOOP-15206:
-
Added the updated patch. Please let me know i
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-15206:
Attachment: HADOOP-15206.003.patch
> BZip2 drops and duplicates records when input split size is s
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16354819#comment-16354819
]
Aki Tanaka commented on HADOOP-15206:
-
Thank you very much for the comments!
{quote}
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16354317#comment-16354317
]
Aki Tanaka commented on HADOOP-15206:
-
Thank you for the comment! I've updated the p
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-15206:
Attachment: HADOOP-15206.002.patch
> BZip2 drops and duplicates records when input split size is s
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16353510#comment-16353510
]
Aki Tanaka commented on HADOOP-15206:
-
[~jlowe]
Thank you for your insights. I have
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-15206:
Attachment: HADOOP-15206.001.patch
> BZip2 drops and duplicates records when input split size is s
[
https://issues.apache.org/jira/browse/HADOOP-14919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16349239#comment-16349239
]
Aki Tanaka commented on HADOOP-14919:
-
Hello,
Regarding this issue, I found another
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16349234#comment-16349234
]
Aki Tanaka commented on HADOOP-15206:
-
Added a unit test that can reproduce the prob
[
https://issues.apache.org/jira/browse/HADOOP-15206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-15206:
Attachment: HADOOP-15206-test.patch
> BZip2 drops and duplicates records when input split size is
Aki Tanaka created HADOOP-15206:
---
Summary: BZip2 drops and duplicates records when input split size
is small
Key: HADOOP-15206
URL: https://issues.apache.org/jira/browse/HADOOP-15206
Project: Hadoop Com
[
https://issues.apache.org/jira/browse/HADOOP-14919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16193075#comment-16193075
]
Aki Tanaka commented on HADOOP-14919:
-
Thank you for the patch. I tested the patch a
[
https://issues.apache.org/jira/browse/HADOOP-14919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-14919:
Description:
BZip2 can drop records when reading data in splits. This problem was already
discuss
[
https://issues.apache.org/jira/browse/HADOOP-14919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-14919:
Attachment: 25.bz2
Adding the test bz2 file (The bz2 file that the attached unit test generate
[
https://issues.apache.org/jira/browse/HADOOP-14919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Tanaka updated HADOOP-14919:
Attachment: HADOOP-14919-test.patch
Add patch for the unit test.
> BZip2 drops records when readi
Aki Tanaka created HADOOP-14919:
---
Summary: BZip2 drops records when reading data in splits
Key: HADOOP-14919
URL: https://issues.apache.org/jira/browse/HADOOP-14919
Project: Hadoop Common
Issue
30 matches
Mail list logo