[jira] [Updated] (HADOOP-12474) Fix data race when allocating server port in MiniKMS

2015-10-12 Thread Mingliang Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-12474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mingliang Liu updated HADOOP-12474:
---
Status: Patch Available  (was: Open)

> Fix data race when allocating server port in MiniKMS 
> -
>
> Key: HADOOP-12474
> URL: https://issues.apache.org/jira/browse/HADOOP-12474
> Project: Hadoop Common
>  Issue Type: Bug
>Reporter: Mingliang Liu
>Assignee: Mingliang Liu
> Attachments: HADOOP-12474.000.patch
>
>
> In {{MiniKMS}}, there is potential data race when create jetty server in 
> {{createJettyServer}}. It looks like the code searches for a free port and 
> then starts jetty, but maybe there's enough of a race condition between port 
> location and jetty.start to cause intermittent failures.
> {code}
>   ServerSocket ss = new ServerSocket((inPort < 0) ? 0 : inPort, 50, 
> localhost);
>   int port = ss.getLocalPort();
>   Server server = new Server(0);
>   if (!ssl) {
> server.getConnectors()[0].setHost(host);
> server.getConnectors()[0].setPort(port);
>   } else {
> ...
> c.setPort(port);
> {code}
> We've seen test failures saying {{java.net.BindException: Address already in 
> use}}, e.g. 
> [https://builds.apache.org/job/PreCommit-HDFS-Build/12942/testReport/]
> As in [HADOOP-12417], we should always bind port 0 which gives us an 
> ephemeral port, instead of searching a free port before starting jetty.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-12474) Fix data race when allocating server port in MiniKMS

2015-10-12 Thread Mingliang Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-12474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mingliang Liu updated HADOOP-12474:
---
Attachment: HADOOP-12474.000.patch

The v0 patch address this issue as we addressed the issue in [HADOOP-12417].

> Fix data race when allocating server port in MiniKMS 
> -
>
> Key: HADOOP-12474
> URL: https://issues.apache.org/jira/browse/HADOOP-12474
> Project: Hadoop Common
>  Issue Type: Bug
>Reporter: Mingliang Liu
>Assignee: Mingliang Liu
> Attachments: HADOOP-12474.000.patch
>
>
> In {{MiniKMS}}, there is potential data race when create jetty server in 
> {{createJettyServer}}. It looks like the code searches for a free port and 
> then starts jetty, but maybe there's enough of a race condition between port 
> location and jetty.start to cause intermittent failures.
> {code}
>   ServerSocket ss = new ServerSocket((inPort < 0) ? 0 : inPort, 50, 
> localhost);
>   int port = ss.getLocalPort();
>   Server server = new Server(0);
>   if (!ssl) {
> server.getConnectors()[0].setHost(host);
> server.getConnectors()[0].setPort(port);
>   } else {
> ...
> c.setPort(port);
> {code}
> We've seen test failures saying {{java.net.BindException: Address already in 
> use}}, e.g. 
> [https://builds.apache.org/job/PreCommit-HDFS-Build/12942/testReport/]
> As in [HADOOP-12417], we should always bind port 0 which gives us an 
> ephemeral port, instead of searching a free port before starting jetty.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HADOOP-12474) Fix data race when allocating server port in MiniKMS

2015-10-12 Thread Mingliang Liu (JIRA)
Mingliang Liu created HADOOP-12474:
--

 Summary: Fix data race when allocating server port in MiniKMS 
 Key: HADOOP-12474
 URL: https://issues.apache.org/jira/browse/HADOOP-12474
 Project: Hadoop Common
  Issue Type: Bug
Reporter: Mingliang Liu
Assignee: Mingliang Liu


In {{MiniKMS}}, there is potential data race when create jetty server in 
{{createJettyServer}}. It looks like the code searches for a free port and then 
starts jetty, but maybe there's enough of a race condition between port 
location and jetty.start to cause intermittent failures.
{code}
  ServerSocket ss = new ServerSocket((inPort < 0) ? 0 : inPort, 50, 
localhost);
  int port = ss.getLocalPort();
  Server server = new Server(0);
  if (!ssl) {
server.getConnectors()[0].setHost(host);
server.getConnectors()[0].setPort(port);
  } else {
...
c.setPort(port);
{code}

We've seen test failures saying {{java.net.BindException: Address already in 
use}}, e.g. 
[https://builds.apache.org/job/PreCommit-HDFS-Build/12942/testReport/]

As in [HADOOP-12417], we should always bind port 0 which gives us an ephemeral 
port, instead of searching a free port before starting jetty.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-11684) S3a to use thread pool that blocks clients

2015-10-12 Thread Aaron Fabbri (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-11684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14954431#comment-14954431
 ] 

Aaron Fabbri commented on HADOOP-11684:
---

+1 on the 002 patch.

I'm attaching a 004 patch that is identical, except it adds an extra unit test 
on {{BlockingThreadPoolExecutorService}}.  It is optional, but demonstrates 
some testing I've done.  These blobstore tests don't run by default, but my 
test does depend on timing, so it could theoretically fail on a heavily loaded 
system. (I detect whether or not the threadpool blocks via elapsed system time 
across {{submit()}} calls, and these aren't realtime systems.}

Reviewers: The idea behind the 002 approach pretty simple.  The meat of the 
change is just a semaphore around {{ExecutorService}} functions {{submit()}} 
etc..  e.g.

{code}
+  @Override
+  public ListenableFuture submit(Runnable task) {
+try {
+  queueingPermits.acquire();
+} catch (InterruptedException e) {
+  Thread.currentThread().interrupt();
+  return Futures.immediateFailedCheckedFuture(e);
+}
+return super.submit(new RunnableWithPermitRelease(task));
+  }
{code}

> S3a to use thread pool that blocks clients
> --
>
> Key: HADOOP-11684
> URL: https://issues.apache.org/jira/browse/HADOOP-11684
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: fs/s3
>Affects Versions: 2.7.0
>Reporter: Thomas Demoor
>Assignee: Thomas Demoor
> Attachments: HADOOP-11684-001.patch, HADOOP-11684-002.patch, 
> HADOOP-11684-003.patch, HADOOP-11684-004.patch
>
>
> Currently, if fs.s3a.max.total.tasks are queued and another (part)upload 
> wants to start, a RejectedExecutionException is thrown. 
> We should use a threadpool that blocks clients, nicely throtthling them, 
> rather than throwing an exception. F.i. something similar to 
> https://github.com/apache/incubator-s4/blob/master/subprojects/s4-comm/src/main/java/org/apache/s4/comm/staging/BlockingThreadPoolExecutorService.java



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-11684) S3a to use thread pool that blocks clients

2015-10-12 Thread Aaron Fabbri (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-11684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Fabbri updated HADOOP-11684:
--
Attachment: HADOOP-11684-004.patch

> S3a to use thread pool that blocks clients
> --
>
> Key: HADOOP-11684
> URL: https://issues.apache.org/jira/browse/HADOOP-11684
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: fs/s3
>Affects Versions: 2.7.0
>Reporter: Thomas Demoor
>Assignee: Thomas Demoor
> Attachments: HADOOP-11684-001.patch, HADOOP-11684-002.patch, 
> HADOOP-11684-003.patch, HADOOP-11684-004.patch
>
>
> Currently, if fs.s3a.max.total.tasks are queued and another (part)upload 
> wants to start, a RejectedExecutionException is thrown. 
> We should use a threadpool that blocks clients, nicely throtthling them, 
> rather than throwing an exception. F.i. something similar to 
> https://github.com/apache/incubator-s4/blob/master/subprojects/s4-comm/src/main/java/org/apache/s4/comm/staging/BlockingThreadPoolExecutorService.java



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-12469) distcp should not ignore the ignoreFailures option

2015-10-12 Thread Gera Shegalov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-12469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gera Shegalov updated HADOOP-12469:
---
Summary: distcp should not ignore the ignoreFailures option  (was: distcp 
shout not ignore the ignoreFailures option)

> distcp should not ignore the ignoreFailures option
> --
>
> Key: HADOOP-12469
> URL: https://issues.apache.org/jira/browse/HADOOP-12469
> Project: Hadoop Common
>  Issue Type: Bug
>  Components: tools/distcp
>Affects Versions: 2.7.1
>Reporter: Gera Shegalov
>Assignee: Mingliang Liu
>Priority: Critical
> Fix For: 2.8.0
>
> Attachments: HADOOP-12469.000.patch
>
>
> {{RetriableFileCopyCommand.CopyReadException}} is double-wrapped via
> # via {{RetriableCommand::execute}}
> # via {{CopyMapper#copyFileWithRetry}}
> before {{CopyMapper::handleFailure}} tests 
> {code}
> if (ignoreFailures && exception.getCause() instanceof
> RetriableFileCopyCommand.CopyReadException
> {code}
> which is always false.
> Orthogonally, ignoring failures should be mutually exclusive with the atomic 
> option otherwise an incomplete dir is eligible for commit defeating the 
> purpose.
>  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-10775) Shell operations to fail with meaningful errors on windows if winutils.exe not found

2015-10-12 Thread Chris Nauroth (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-10775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Nauroth updated HADOOP-10775:
---
Hadoop Flags: Reviewed

+1 for patch v009.  I confirmed a full test run on Windows.

Whitespace issues can be fixed on commit.

I have one more very minor nitpick.

{code}
-if (Shell.WINDOWS && !Shell.isJava7OrAbove()) {
-  // On Java6 on Windows, we copied the file
-  Assert.assertEquals(data.length, link.length());
-} else {
-  // Otherwise, the target file size is zero
-  Assert.assertEquals(0, link.length());
-}
+// Otherwise, the target file size is zero
+Assert.assertEquals(0, link.length());
{code}

Now that the conditional logic is gone, the "otherwise" phrasing of the comment 
doesn't make sense.  I think the whole comment can be removed at this point.  
Feel free to drop it during the final commit.  IMO, there is no need to go 
through full review of another patch revision for this minor tweak.

[~ste...@apache.org], thank you for contributing the patch.

> Shell operations to fail with meaningful errors on windows if winutils.exe 
> not found
> 
>
> Key: HADOOP-10775
> URL: https://issues.apache.org/jira/browse/HADOOP-10775
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: util
>Affects Versions: trunk-win, 2.7.1
> Environment: windows
>Reporter: Steve Loughran
>Assignee: Steve Loughran
>Priority: Minor
> Attachments: HADOOP-10775-002.patch, HADOOP-10775-003.patch, 
> HADOOP-10775-004.patch, HADOOP-10775-005.patch, HADOOP-10775-006.patch, 
> HADOOP-10775-007.patch, HADOOP-10775-008.patch, HADOOP-10775-009.patch, 
> HADOOP-10775.patch
>
>
> If {{winutils.exe}} can't be found {{HADOOP_HOME}} wrong/unset or other 
> causes, then an error is logged -but when any of the {{Shell}} operations are 
> used, an NPE is raised rather than something meaningful.
> The error message at setup time should be preserved and then raised before 
> any attempt to invoke a winutils-driven process made



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-11683) Need a plugin API to translate long principal names to local OS user names arbitrarily

2015-10-12 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-11683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14953794#comment-14953794
 ] 

Hadoop QA commented on HADOOP-11683:


\\
\\
| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:red}-1{color} | patch |   0m  1s | The patch command could not apply 
the patch during dryrun. |
\\
\\
|| Subsystem || Report/Notes ||
| Patch URL | 
http://issues.apache.org/jira/secure/attachment/12766168/HADOOP-11683.002.patch 
|
| Optional Tests | javadoc javac unit findbugs checkstyle |
| git revision | trunk / 9849c8b |
| Console output | 
https://builds.apache.org/job/PreCommit-HADOOP-Build/7796/console |


This message was automatically generated.

> Need a plugin API to translate long principal names to local OS user names 
> arbitrarily
> --
>
> Key: HADOOP-11683
> URL: https://issues.apache.org/jira/browse/HADOOP-11683
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: security
>Affects Versions: 2.6.0
>Reporter: Sunny Cheung
>Assignee: roger mak
> Attachments: HADOOP-11683.001.patch, HADOOP-11683.002.patch
>
>
> We need a plugin API to translate long principal names (e.g. 
> john@example.com) to local OS user names (e.g. user123456) arbitrarily.
> For some organizations the name translation is straightforward (e.g. 
> john@example.com to john_doe), and the hadoop.security.auth_to_local 
> configurable mapping is sufficient to resolve this (see HADOOP-6526). 
> However, in some other cases the name translation is arbitrary and cannot be 
> generalized by a set of translation rules easily.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-11683) Need a plugin API to translate long principal names to local OS user names arbitrarily

2015-10-12 Thread roger mak (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-11683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

roger mak updated HADOOP-11683:
---
Target Version/s: 2.6.2
  Status: Patch Available  (was: Open)

submit for review.

> Need a plugin API to translate long principal names to local OS user names 
> arbitrarily
> --
>
> Key: HADOOP-11683
> URL: https://issues.apache.org/jira/browse/HADOOP-11683
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: security
>Affects Versions: 2.6.0
>Reporter: Sunny Cheung
>Assignee: roger mak
> Attachments: HADOOP-11683.001.patch, HADOOP-11683.002.patch
>
>
> We need a plugin API to translate long principal names (e.g. 
> john@example.com) to local OS user names (e.g. user123456) arbitrarily.
> For some organizations the name translation is straightforward (e.g. 
> john@example.com to john_doe), and the hadoop.security.auth_to_local 
> configurable mapping is sufficient to resolve this (see HADOOP-6526). 
> However, in some other cases the name translation is arbitrary and cannot be 
> generalized by a set of translation rules easily.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-11683) Need a plugin API to translate long principal names to local OS user names arbitrarily

2015-10-12 Thread roger mak (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-11683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

roger mak updated HADOOP-11683:
---
Attachment: HADOOP-11683.002.patch

Enhanced test logic.

> Need a plugin API to translate long principal names to local OS user names 
> arbitrarily
> --
>
> Key: HADOOP-11683
> URL: https://issues.apache.org/jira/browse/HADOOP-11683
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: security
>Affects Versions: 2.6.0
>Reporter: Sunny Cheung
>Assignee: roger mak
> Attachments: HADOOP-11683.001.patch, HADOOP-11683.002.patch
>
>
> We need a plugin API to translate long principal names (e.g. 
> john@example.com) to local OS user names (e.g. user123456) arbitrarily.
> For some organizations the name translation is straightforward (e.g. 
> john@example.com to john_doe), and the hadoop.security.auth_to_local 
> configurable mapping is sufficient to resolve this (see HADOOP-6526). 
> However, in some other cases the name translation is arbitrary and cannot be 
> generalized by a set of translation rules easily.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-11683) Need a plugin API to translate long principal names to local OS user names arbitrarily

2015-10-12 Thread roger mak (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-11683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14953702#comment-14953702
 ] 

roger mak commented on HADOOP-11683:


Update patch for trunk.

> Need a plugin API to translate long principal names to local OS user names 
> arbitrarily
> --
>
> Key: HADOOP-11683
> URL: https://issues.apache.org/jira/browse/HADOOP-11683
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: security
>Affects Versions: 2.6.0
>Reporter: Sunny Cheung
>Assignee: roger mak
> Attachments: HADOOP-11683.001.patch
>
>
> We need a plugin API to translate long principal names (e.g. 
> john@example.com) to local OS user names (e.g. user123456) arbitrarily.
> For some organizations the name translation is straightforward (e.g. 
> john@example.com to john_doe), and the hadoop.security.auth_to_local 
> configurable mapping is sufficient to resolve this (see HADOOP-6526). 
> However, in some other cases the name translation is arbitrary and cannot be 
> generalized by a set of translation rules easily.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-11683) Need a plugin API to translate long principal names to local OS user names arbitrarily

2015-10-12 Thread roger mak (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-11683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

roger mak updated HADOOP-11683:
---
Status: Open  (was: Patch Available)

> Need a plugin API to translate long principal names to local OS user names 
> arbitrarily
> --
>
> Key: HADOOP-11683
> URL: https://issues.apache.org/jira/browse/HADOOP-11683
> Project: Hadoop Common
>  Issue Type: Improvement
>  Components: security
>Affects Versions: 2.6.0
>Reporter: Sunny Cheung
>Assignee: roger mak
> Attachments: HADOOP-11683.001.patch
>
>
> We need a plugin API to translate long principal names (e.g. 
> john@example.com) to local OS user names (e.g. user123456) arbitrarily.
> For some organizations the name translation is straightforward (e.g. 
> john@example.com to john_doe), and the hadoop.security.auth_to_local 
> configurable mapping is sufficient to resolve this (see HADOOP-6526). 
> However, in some other cases the name translation is arbitrary and cannot be 
> generalized by a set of translation rules easily.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-12364) Deleting pid file after stop is causing the daemons to keep restarting

2015-10-12 Thread Siqi Li (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-12364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14953684#comment-14953684
 ] 

Siqi Li commented on HADOOP-12364:
--

Hi [~aw], can you take a look at the latest patch, I have fixed the shell check 
and change double bracket to single bracket as you mentioned above

> Deleting pid file after stop is causing the daemons to keep restarting 
> ---
>
> Key: HADOOP-12364
> URL: https://issues.apache.org/jira/browse/HADOOP-12364
> Project: Hadoop Common
>  Issue Type: Bug
>Affects Versions: 2.6.0
>Reporter: Siqi Li
>Assignee: Siqi Li
>Priority: Minor
> Attachments: HADOOP-12364.v1.patch, HADOOP-12364.v2.patch, 
> HADOOP-12364.v3.patch, HADOOP-12364.v4.patch
>
>
> pid files are deleting in 5 seconds after we stop the daemons. If a start 
> command were executed within the 5 seconds, the pid file will be overwrite 
> with a new pid. However, this pid file is going to be deleted by the former 
> stop command. This is causing the monitoring service to lose track of the 
> daemons, hence keep rebooting them



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-12469) distcp shout not ignore the ignoreFailures option

2015-10-12 Thread Mingliang Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-12469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14953458#comment-14953458
 ] 

Mingliang Liu commented on HADOOP-12469:


Thanks [~wheat9] for reviewing this patch. I should make it clear that it was 
not to address the whole issue when I upload the patch. Sorry for the confusion.

As [~jira.shegalov] suggested, I'll update the patch with tests enabled. I 
filed a new issue [HADOOP-12473] about the atomic option for ignoring failures. 

> distcp shout not ignore the ignoreFailures option
> -
>
> Key: HADOOP-12469
> URL: https://issues.apache.org/jira/browse/HADOOP-12469
> Project: Hadoop Common
>  Issue Type: Bug
>  Components: tools/distcp
>Affects Versions: 2.7.1
>Reporter: Gera Shegalov
>Assignee: Mingliang Liu
>Priority: Critical
> Fix For: 2.8.0
>
> Attachments: HADOOP-12469.000.patch
>
>
> {{RetriableFileCopyCommand.CopyReadException}} is double-wrapped via
> # via {{RetriableCommand::execute}}
> # via {{CopyMapper#copyFileWithRetry}}
> before {{CopyMapper::handleFailure}} tests 
> {code}
> if (ignoreFailures && exception.getCause() instanceof
> RetriableFileCopyCommand.CopyReadException
> {code}
> which is always false.
> Orthogonally, ignoring failures should be mutually exclusive with the atomic 
> option otherwise an incomplete dir is eligible for commit defeating the 
> purpose.
>  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-12473) distcp's ignoring failures should be mutually exclusive with the atomic option

2015-10-12 Thread Mingliang Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-12473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mingliang Liu updated HADOOP-12473:
---
Description: In {{CopyMapper::handleFailure}}, the mapper handles failure 
and will ignore it if no it's config key is on. Ignoring failures should be 
mutually exclusive with the atomic option otherwise an incomplete dir is 
eligible for commit defeating the purpose.  (was: In 
{{CopyMapper::handleFailure}}, the mapper handles failures  Ignoring failures 
should be mutually exclusive with the atomic option otherwise an incomplete dir 
is eligible for commit defeating the purpose.)

> distcp's ignoring failures should be mutually exclusive with the atomic option
> --
>
> Key: HADOOP-12473
> URL: https://issues.apache.org/jira/browse/HADOOP-12473
> Project: Hadoop Common
>  Issue Type: Bug
>  Components: tools/distcp
>Affects Versions: 2.7.1
>Reporter: Mingliang Liu
>Priority: Critical
> Fix For: 2.8.0
>
>
> In {{CopyMapper::handleFailure}}, the mapper handles failure and will ignore 
> it if no it's config key is on. Ignoring failures should be mutually 
> exclusive with the atomic option otherwise an incomplete dir is eligible for 
> commit defeating the purpose.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-12473) distcp's ignoring failures should be mutually exclusive with the atomic option

2015-10-12 Thread Mingliang Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-12473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mingliang Liu updated HADOOP-12473:
---
Assignee: (was: Mingliang Liu)

> distcp's ignoring failures should be mutually exclusive with the atomic option
> --
>
> Key: HADOOP-12473
> URL: https://issues.apache.org/jira/browse/HADOOP-12473
> Project: Hadoop Common
>  Issue Type: Bug
>  Components: tools/distcp
>Affects Versions: 2.7.1
>Reporter: Mingliang Liu
>Priority: Critical
> Fix For: 2.8.0
>
>
> In {{CopyMapper::handleFailure}}, the mapper handles failures  Ignoring 
> failures should be mutually exclusive with the atomic option otherwise an 
> incomplete dir is eligible for commit defeating the purpose.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HADOOP-12473) distcp's ignoring failures should be mutually exclusive with the atomic option

2015-10-12 Thread Mingliang Liu (JIRA)
Mingliang Liu created HADOOP-12473:
--

 Summary: distcp's ignoring failures should be mutually exclusive 
with the atomic option
 Key: HADOOP-12473
 URL: https://issues.apache.org/jira/browse/HADOOP-12473
 Project: Hadoop Common
  Issue Type: Bug
  Components: tools/distcp
Affects Versions: 2.7.1
Reporter: Mingliang Liu
Assignee: Mingliang Liu
Priority: Critical
 Fix For: 2.8.0


{{RetriableFileCopyCommand.CopyReadException}} is double-wrapped via

# via {{RetriableCommand::execute}}
# via {{CopyMapper#copyFileWithRetry}}

before {{CopyMapper::handleFailure}} tests 
{code}
if (ignoreFailures && exception.getCause() instanceof
RetriableFileCopyCommand.CopyReadException
{code}
which is always false.

Orthogonally, ignoring failures should be mutually exclusive with the atomic 
option otherwise an incomplete dir is eligible for commit defeating the purpose.
 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-12473) distcp's ignoring failures should be mutually exclusive with the atomic option

2015-10-12 Thread Mingliang Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-12473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mingliang Liu updated HADOOP-12473:
---
Description: In {{CopyMapper::handleFailure}}, the mapper handles failures  
Ignoring failures should be mutually exclusive with the atomic option otherwise 
an incomplete dir is eligible for commit defeating the purpose.  (was: 
{{RetriableFileCopyCommand.CopyReadException}} is double-wrapped via

# via {{RetriableCommand::execute}}
# via {{CopyMapper#copyFileWithRetry}}

before {{CopyMapper::handleFailure}} tests 
{code}
if (ignoreFailures && exception.getCause() instanceof
RetriableFileCopyCommand.CopyReadException
{code}
which is always false.

Orthogonally, ignoring failures should be mutually exclusive with the atomic 
option otherwise an incomplete dir is eligible for commit defeating the purpose.
 )

> distcp's ignoring failures should be mutually exclusive with the atomic option
> --
>
> Key: HADOOP-12473
> URL: https://issues.apache.org/jira/browse/HADOOP-12473
> Project: Hadoop Common
>  Issue Type: Bug
>  Components: tools/distcp
>Affects Versions: 2.7.1
>Reporter: Mingliang Liu
>Assignee: Mingliang Liu
>Priority: Critical
> Fix For: 2.8.0
>
>
> In {{CopyMapper::handleFailure}}, the mapper handles failures  Ignoring 
> failures should be mutually exclusive with the atomic option otherwise an 
> incomplete dir is eligible for commit defeating the purpose.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-10300) Allowed deferred sending of call responses

2015-10-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14953115#comment-14953115
 ] 

Hudson commented on HADOOP-10300:
-

FAILURE: Integrated in Hadoop-Hdfs-trunk-Java8 #484 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk-Java8/484/])
HADOOP-10300. Allowed deferred sending of call responses. (Daryn Sharp (yliu: 
rev e617cf6dd13f2bb5d7cbb15ee2cdb260ecd46cd3)
* 
hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/ipc/TestIPCServerResponder.java
* 
hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/Server.java
* hadoop-common-project/hadoop-common/CHANGES.txt


> Allowed deferred sending of call responses
> --
>
> Key: HADOOP-10300
> URL: https://issues.apache.org/jira/browse/HADOOP-10300
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: ipc
>Affects Versions: 2.0.0-alpha, 3.0.0
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>  Labels: BB2015-05-TBR
> Fix For: 2.8.0
>
> Attachments: HADOOP-10300.patch, HADOOP-10300.patch, 
> HADOOP-10300.patch
>
>
> RPC handlers currently do not return until the RPC call completes and 
> response is sent, or a partially sent response has been queued for the 
> responder.  It would be useful for a proxy method to notify the handler to 
> not yet the send the call's response.
> An potential use case is a namespace handler in the NN might want to return 
> before the edit log is synced so it can service more requests and allow 
> increased batching of edits per sync.  Background syncing could later trigger 
> the sending of the call response to the client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-10300) Allowed deferred sending of call responses

2015-10-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14953105#comment-14953105
 ] 

Hudson commented on HADOOP-10300:
-

FAILURE: Integrated in Hadoop-Hdfs-trunk #2422 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk/2422/])
HADOOP-10300. Allowed deferred sending of call responses. (Daryn Sharp (yliu: 
rev e617cf6dd13f2bb5d7cbb15ee2cdb260ecd46cd3)
* 
hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/Server.java
* 
hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/ipc/TestIPCServerResponder.java
* hadoop-common-project/hadoop-common/CHANGES.txt


> Allowed deferred sending of call responses
> --
>
> Key: HADOOP-10300
> URL: https://issues.apache.org/jira/browse/HADOOP-10300
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: ipc
>Affects Versions: 2.0.0-alpha, 3.0.0
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>  Labels: BB2015-05-TBR
> Fix For: 2.8.0
>
> Attachments: HADOOP-10300.patch, HADOOP-10300.patch, 
> HADOOP-10300.patch
>
>
> RPC handlers currently do not return until the RPC call completes and 
> response is sent, or a partially sent response has been queued for the 
> responder.  It would be useful for a proxy method to notify the handler to 
> not yet the send the call's response.
> An potential use case is a namespace handler in the NN might want to return 
> before the edit log is synced so it can service more requests and allow 
> increased batching of edits per sync.  Background syncing could later trigger 
> the sending of the call response to the client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-10300) Allowed deferred sending of call responses

2015-10-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14952910#comment-14952910
 ] 

Hudson commented on HADOOP-10300:
-

FAILURE: Integrated in Hadoop-Yarn-trunk-Java8 #524 (See 
[https://builds.apache.org/job/Hadoop-Yarn-trunk-Java8/524/])
HADOOP-10300. Allowed deferred sending of call responses. (Daryn Sharp (yliu: 
rev e617cf6dd13f2bb5d7cbb15ee2cdb260ecd46cd3)
* 
hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/Server.java
* hadoop-common-project/hadoop-common/CHANGES.txt
* 
hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/ipc/TestIPCServerResponder.java


> Allowed deferred sending of call responses
> --
>
> Key: HADOOP-10300
> URL: https://issues.apache.org/jira/browse/HADOOP-10300
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: ipc
>Affects Versions: 2.0.0-alpha, 3.0.0
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>  Labels: BB2015-05-TBR
> Fix For: 2.8.0
>
> Attachments: HADOOP-10300.patch, HADOOP-10300.patch, 
> HADOOP-10300.patch
>
>
> RPC handlers currently do not return until the RPC call completes and 
> response is sent, or a partially sent response has been queued for the 
> responder.  It would be useful for a proxy method to notify the handler to 
> not yet the send the call's response.
> An potential use case is a namespace handler in the NN might want to return 
> before the edit log is synced so it can service more requests and allow 
> increased batching of edits per sync.  Background syncing could later trigger 
> the sending of the call response to the client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-10296) Incorrect null check in SwiftRestClient#buildException()

2015-10-12 Thread Kanaka Kumar Avvaru (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14952892#comment-14952892
 ] 

Kanaka Kumar Avvaru commented on HADOOP-10296:
--

Thanks [~ajisakaa] for the review & commit 

> Incorrect null check in SwiftRestClient#buildException()
> 
>
> Key: HADOOP-10296
> URL: https://issues.apache.org/jira/browse/HADOOP-10296
> Project: Hadoop Common
>  Issue Type: Bug
>Affects Versions: 2.3.0
>Reporter: Ted Yu
>Assignee: Kanaka Kumar Avvaru
>Priority: Minor
>  Labels: newbie
> Fix For: 2.8.0
>
> Attachments: HADOOP-10296.1.patch, HADOOP-10296.2.patch
>
>
> {code}
> if (requestContentLen!=null) {
>   errorText.append(" available 
> ").append(availableContentRange.getValue());
> }
> {code}
> The null check should be for availableContentRange



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-10300) Allowed deferred sending of call responses

2015-10-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14952875#comment-14952875
 ] 

Hudson commented on HADOOP-10300:
-

FAILURE: Integrated in Hadoop-Yarn-trunk #1249 (See 
[https://builds.apache.org/job/Hadoop-Yarn-trunk/1249/])
HADOOP-10300. Allowed deferred sending of call responses. (Daryn Sharp (yliu: 
rev e617cf6dd13f2bb5d7cbb15ee2cdb260ecd46cd3)
* 
hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/ipc/TestIPCServerResponder.java
* 
hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/Server.java
* hadoop-common-project/hadoop-common/CHANGES.txt


> Allowed deferred sending of call responses
> --
>
> Key: HADOOP-10300
> URL: https://issues.apache.org/jira/browse/HADOOP-10300
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: ipc
>Affects Versions: 2.0.0-alpha, 3.0.0
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>  Labels: BB2015-05-TBR
> Fix For: 2.8.0
>
> Attachments: HADOOP-10300.patch, HADOOP-10300.patch, 
> HADOOP-10300.patch
>
>
> RPC handlers currently do not return until the RPC call completes and 
> response is sent, or a partially sent response has been queued for the 
> responder.  It would be useful for a proxy method to notify the handler to 
> not yet the send the call's response.
> An potential use case is a namespace handler in the NN might want to return 
> before the edit log is synced so it can service more requests and allow 
> increased batching of edits per sync.  Background syncing could later trigger 
> the sending of the call response to the client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-10300) Allowed deferred sending of call responses

2015-10-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14952857#comment-14952857
 ] 

Hudson commented on HADOOP-10300:
-

FAILURE: Integrated in Hadoop-Mapreduce-trunk-Java8 #512 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Java8/512/])
HADOOP-10300. Allowed deferred sending of call responses. (Daryn Sharp (yliu: 
rev e617cf6dd13f2bb5d7cbb15ee2cdb260ecd46cd3)
* 
hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/Server.java
* hadoop-common-project/hadoop-common/CHANGES.txt
* 
hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/ipc/TestIPCServerResponder.java


> Allowed deferred sending of call responses
> --
>
> Key: HADOOP-10300
> URL: https://issues.apache.org/jira/browse/HADOOP-10300
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: ipc
>Affects Versions: 2.0.0-alpha, 3.0.0
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>  Labels: BB2015-05-TBR
> Fix For: 2.8.0
>
> Attachments: HADOOP-10300.patch, HADOOP-10300.patch, 
> HADOOP-10300.patch
>
>
> RPC handlers currently do not return until the RPC call completes and 
> response is sent, or a partially sent response has been queued for the 
> responder.  It would be useful for a proxy method to notify the handler to 
> not yet the send the call's response.
> An potential use case is a namespace handler in the NN might want to return 
> before the edit log is synced so it can service more requests and allow 
> increased batching of edits per sync.  Background syncing could later trigger 
> the sending of the call response to the client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-10300) Allowed deferred sending of call responses

2015-10-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14952835#comment-14952835
 ] 

Hudson commented on HADOOP-10300:
-

FAILURE: Integrated in Hadoop-Mapreduce-trunk #2458 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk/2458/])
HADOOP-10300. Allowed deferred sending of call responses. (Daryn Sharp (yliu: 
rev e617cf6dd13f2bb5d7cbb15ee2cdb260ecd46cd3)
* 
hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/ipc/TestIPCServerResponder.java
* hadoop-common-project/hadoop-common/CHANGES.txt
* 
hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/Server.java


> Allowed deferred sending of call responses
> --
>
> Key: HADOOP-10300
> URL: https://issues.apache.org/jira/browse/HADOOP-10300
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: ipc
>Affects Versions: 2.0.0-alpha, 3.0.0
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>  Labels: BB2015-05-TBR
> Fix For: 2.8.0
>
> Attachments: HADOOP-10300.patch, HADOOP-10300.patch, 
> HADOOP-10300.patch
>
>
> RPC handlers currently do not return until the RPC call completes and 
> response is sent, or a partially sent response has been queued for the 
> responder.  It would be useful for a proxy method to notify the handler to 
> not yet the send the call's response.
> An potential use case is a namespace handler in the NN might want to return 
> before the edit log is synced so it can service more requests and allow 
> increased batching of edits per sync.  Background syncing could later trigger 
> the sending of the call response to the client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-10300) Allowed deferred sending of call responses

2015-10-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14952803#comment-14952803
 ] 

Hudson commented on HADOOP-10300:
-

FAILURE: Integrated in Hadoop-trunk-Commit #8611 (See 
[https://builds.apache.org/job/Hadoop-trunk-Commit/8611/])
HADOOP-10300. Allowed deferred sending of call responses. (Daryn Sharp (yliu: 
rev e617cf6dd13f2bb5d7cbb15ee2cdb260ecd46cd3)
* 
hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/Server.java
* hadoop-common-project/hadoop-common/CHANGES.txt
* 
hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/ipc/TestIPCServerResponder.java


> Allowed deferred sending of call responses
> --
>
> Key: HADOOP-10300
> URL: https://issues.apache.org/jira/browse/HADOOP-10300
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: ipc
>Affects Versions: 2.0.0-alpha, 3.0.0
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>  Labels: BB2015-05-TBR
> Fix For: 2.8.0
>
> Attachments: HADOOP-10300.patch, HADOOP-10300.patch, 
> HADOOP-10300.patch
>
>
> RPC handlers currently do not return until the RPC call completes and 
> response is sent, or a partially sent response has been queued for the 
> responder.  It would be useful for a proxy method to notify the handler to 
> not yet the send the call's response.
> An potential use case is a namespace handler in the NN might want to return 
> before the edit log is synced so it can service more requests and allow 
> increased batching of edits per sync.  Background syncing could later trigger 
> the sending of the call response to the client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HADOOP-10300) Allowed deferred sending of call responses

2015-10-12 Thread Yi Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yi Liu updated HADOOP-10300:

   Resolution: Fixed
 Hadoop Flags: Reviewed
Fix Version/s: 2.8.0
   Status: Resolved  (was: Patch Available)

Committed to trunk and branch-2.

> Allowed deferred sending of call responses
> --
>
> Key: HADOOP-10300
> URL: https://issues.apache.org/jira/browse/HADOOP-10300
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: ipc
>Affects Versions: 2.0.0-alpha, 3.0.0
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>  Labels: BB2015-05-TBR
> Fix For: 2.8.0
>
> Attachments: HADOOP-10300.patch, HADOOP-10300.patch, 
> HADOOP-10300.patch
>
>
> RPC handlers currently do not return until the RPC call completes and 
> response is sent, or a partially sent response has been queued for the 
> responder.  It would be useful for a proxy method to notify the handler to 
> not yet the send the call's response.
> An potential use case is a namespace handler in the NN might want to return 
> before the edit log is synced so it can service more requests and allow 
> increased batching of edits per sync.  Background syncing could later trigger 
> the sending of the call response to the client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HADOOP-10300) Allowed deferred sending of call responses

2015-10-12 Thread Yi Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14952778#comment-14952778
 ] 

Yi Liu commented on HADOOP-10300:
-

+1, thanks [~daryn]. 

> Allowed deferred sending of call responses
> --
>
> Key: HADOOP-10300
> URL: https://issues.apache.org/jira/browse/HADOOP-10300
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: ipc
>Affects Versions: 2.0.0-alpha, 3.0.0
>Reporter: Daryn Sharp
>Assignee: Daryn Sharp
>  Labels: BB2015-05-TBR
> Attachments: HADOOP-10300.patch, HADOOP-10300.patch, 
> HADOOP-10300.patch
>
>
> RPC handlers currently do not return until the RPC call completes and 
> response is sent, or a partially sent response has been queued for the 
> responder.  It would be useful for a proxy method to notify the handler to 
> not yet the send the call's response.
> An potential use case is a namespace handler in the NN might want to return 
> before the edit log is synced so it can service more requests and allow 
> increased batching of edits per sync.  Background syncing could later trigger 
> the sending of the call response to the client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)