[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-29 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Status: Open  (was: Patch Available)

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch, 
> HDFS-9100.002.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-29 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Attachment: HDFS-9100.003.patch

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch, 
> HDFS-9100.002.patch, HDFS-9100.003.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-29 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Status: Patch Available  (was: Open)

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch, 
> HDFS-9100.002.patch, HDFS-9100.003.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-29 Thread Casey Brotherton (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14935653#comment-14935653
 ] 

Casey Brotherton commented on HDFS-9100:


Patch is limited to Dispatcher.java 

pre-patch concerns shows something with:
org.apache.hadoop.hdfs.server.datanode.DirectoryScanner

Test log failure was with:
org.apache.hadoop.hdfs.server.datanode.TestDirectoryScanner

Not sure what the issue is.  



> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch, 
> HDFS-9100.002.patch, HDFS-9100.003.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-28 Thread Casey Brotherton (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14934593#comment-14934593
 ] 

Casey Brotherton commented on HDFS-9100:


I am sorry, the failure appears to be due to changes with HDFS-8053.

Will reapply the patch, and run tests tonight.

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch, 
> HDFS-9100.002.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-28 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Status: Open  (was: Patch Available)

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-28 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Attachment: HDFS-9100.002.patch

Thank you for the review, [~yzhangal]

I have made the changes and uploaded a new patch file.

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch, 
> HDFS-9100.002.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-28 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Status: Patch Available  (was: Open)

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch, 
> HDFS-9100.002.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-25 Thread Casey Brotherton (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14908125#comment-14908125
 ] 

Casey Brotherton commented on HDFS-9100:


Hello
hdfs tests were related to hflush, and secondary namenode webui.  Both of which 
I didn't change.

Should I re-publish the patch to see if those tests resolve?

Thanks,
Casey

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-23 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Status: Patch Available  (was: Open)

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-23 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Attachment: HDFS-9100.000.patch

Still testing in my lab.  A unit test with multiple networks is difficult.
Will change to patch-available once I finished testing

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-23 Thread Casey Brotherton (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904479#comment-14904479
 ] 

Casey Brotherton commented on HDFS-9100:


Created a multi-node cluster with multi-homed networks.

Created one of the nodes with only one ip address, on the subnet
that was not the primary network of the other hosts in the cluster.

Verified that without appropriate configuration, the lone node 
could not send data to datanodes.

Confirmed that changing dfs.client.use.datanode.hostname allowed the 
lone node to put and read from the datanodes.

Placed data within the HDFS cluster in an unbalanced way.  ( dfs.replication=1 
on one datanode, 
with minimal data spread across two datanodes with dfs.replication=2)

Running the balancer from the lone node causes these errors:
15/09/23 06:07:04 WARN balancer.Dispatcher: Failed to move blk_1073742600_1776 
with size=4128368 from 10.17.74.156:50010:DISK to 10.17.74.158:50010:DISK 
through 10.17.74.156:50010: Network is unreachable

After the changes, the balancer worked.  The logging still showed the IP 
address.
15/09/23 06:05:38 INFO balancer.Dispatcher: Successfully moved 
blk_1073742139_1315 with size=4128368 from 10.17.74.156:50010:DISK to 
10.17.74.158:50010:DISK through 10.17.74.156:50010

This did not test a large environment, where blocks were moved by proxy, 
although I don't believe that would cause any problems.


> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-23 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Attachment: HDFS-9100.001.patch

Fixed the whitespace, and line length, and deprecation change

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-23 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Status: Patch Available  (was: Open)

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch, HDFS-9100.001.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-9100) HDFS Balancer does not respect dfs.client.use.datanode.hostname

2015-09-23 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-9100:
---
Status: Open  (was: Patch Available)

> HDFS Balancer does not respect dfs.client.use.datanode.hostname
> ---
>
> Key: HDFS-9100
> URL: https://issues.apache.org/jira/browse/HDFS-9100
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: balancer & mover, HDFS
>Reporter: Yongjun Zhang
>Assignee: Casey Brotherton
> Attachments: HDFS-9100.000.patch
>
>
> In Balancer Dispatch.java:
> {code}
>private void dispatch() {
>   LOG.info("Start moving " + this);
>   Socket sock = new Socket();
>   DataOutputStream out = null;
>   DataInputStream in = null;
>   try {
> sock.connect(
> NetUtils.createSocketAddr(target.getDatanodeInfo().getXferAddr()),
> HdfsConstants.READ_TIMEOUT);
> {code}
> getXferAddr() is called without taking into consideration of 
> dfs.client.use.datanode.hostname setting, this would possibly fail balancer 
> run issued from outside a cluster.
> Thanks [~caseyjbrotherton] for reporting the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-8118) Delay in checkpointing Trash can leave trash for 2 intervals before deleting

2015-08-24 Thread Casey Brotherton (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-8118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14709816#comment-14709816
 ] 

Casey Brotherton commented on HDFS-8118:


Issue in the common tests is a bind issue in the unit test:  
TestWebDelegationToken was a bind issue:

java.net.BindException: Address already in use

Is there a way to trigger the tests again to see if they clear independently?  
That would provide some indication whether there is a timing issue with the 
TestWebDelegationToken test.

Thanks,
Casey

 Delay in checkpointing Trash can leave trash for 2 intervals before deleting
 

 Key: HDFS-8118
 URL: https://issues.apache.org/jira/browse/HDFS-8118
 Project: Hadoop HDFS
  Issue Type: Bug
Affects Versions: 2.7.1
Reporter: Casey Brotherton
Assignee: Casey Brotherton
Priority: Trivial
 Attachments: HDFS-8118.001.patch, HDFS-8118.patch


 When the fs.trash.checkpoint.interval and the fs.trash.interval are set 
 non-zero and the same, it is possible for trash to be left for two intervals.
 The TrashPolicyDefault will use a floor and ceiling function to ensure that 
 the Trash will be checkpointed every interval of minutes.
 Each user's trash is checkpointed individually.  The time resolution of the 
 checkpoint timestamp is to the second.
 If the seconds switch while one user is checkpointing, then the next user's 
 timestamp will be later.
 This will cause the next user's checkpoint to not be deleted at the next 
 interval.
 I have recreated this in a lab cluster 
 I also have a suggestion for a patch that I can upload later tonight after 
 testing it further.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-8118) Delay in checkpointing Trash can leave trash for 2 intervals before deleting

2015-08-11 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-8118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-8118:
---
Attachment: HDFS-8118.001.patch

This is a simplified patch addressing only the defect, and not the testcases.

 Delay in checkpointing Trash can leave trash for 2 intervals before deleting
 

 Key: HDFS-8118
 URL: https://issues.apache.org/jira/browse/HDFS-8118
 Project: Hadoop HDFS
  Issue Type: Bug
Reporter: Casey Brotherton
Assignee: Casey Brotherton
Priority: Trivial
 Attachments: HDFS-8118.001.patch, HDFS-8118.patch


 When the fs.trash.checkpoint.interval and the fs.trash.interval are set 
 non-zero and the same, it is possible for trash to be left for two intervals.
 The TrashPolicyDefault will use a floor and ceiling function to ensure that 
 the Trash will be checkpointed every interval of minutes.
 Each user's trash is checkpointed individually.  The time resolution of the 
 checkpoint timestamp is to the second.
 If the seconds switch while one user is checkpointing, then the next user's 
 timestamp will be later.
 This will cause the next user's checkpoint to not be deleted at the next 
 interval.
 I have recreated this in a lab cluster 
 I also have a suggestion for a patch that I can upload later tonight after 
 testing it further.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-8118) Delay in checkpointing Trash can leave trash for 2 intervals before deleting

2015-08-06 Thread Casey Brotherton (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-8118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14660213#comment-14660213
 ] 

Casey Brotherton commented on HDFS-8118:


Talked with Harsh about this offline.

Still working through a testcase.  We have talked about a method for delaying 
between the different checkpoints.
There are still difficulties, though , as the problem occurs when two different 
users delete files, and there is a delay between checkpointing each user's 
deletes.

However, TestTrash operates under the assumption that there is only one user 
that is creating files, and removing them for the trash.
( For example, the shell is used to getCurrentTrashDirectory, and that will 
only return one location.  For a test, I will either need to 
break Object Oriented walls, and create a path with users/d/.Trash/Current, or 
create a test that doesn't really test for the issue. )

Still trying to work on this.


 Delay in checkpointing Trash can leave trash for 2 intervals before deleting
 

 Key: HDFS-8118
 URL: https://issues.apache.org/jira/browse/HDFS-8118
 Project: Hadoop HDFS
  Issue Type: Bug
Reporter: Casey Brotherton
Assignee: Casey Brotherton
Priority: Trivial
 Attachments: HDFS-8118.patch


 When the fs.trash.checkpoint.interval and the fs.trash.interval are set 
 non-zero and the same, it is possible for trash to be left for two intervals.
 The TrashPolicyDefault will use a floor and ceiling function to ensure that 
 the Trash will be checkpointed every interval of minutes.
 Each user's trash is checkpointed individually.  The time resolution of the 
 checkpoint timestamp is to the second.
 If the seconds switch while one user is checkpointing, then the next user's 
 timestamp will be later.
 This will cause the next user's checkpoint to not be deleted at the next 
 interval.
 I have recreated this in a lab cluster 
 I also have a suggestion for a patch that I can upload later tonight after 
 testing it further.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HDFS-5696) Examples for httpfs REST API incorrect on apache.org

2013-12-20 Thread Casey Brotherton (JIRA)
Casey Brotherton created HDFS-5696:
--

 Summary: Examples for httpfs REST API incorrect on apache.org
 Key: HDFS-5696
 URL: https://issues.apache.org/jira/browse/HDFS-5696
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.2.0
 Environment: NA
Reporter: Casey Brotherton
Priority: Minor


The examples provided for the httpfs REST API are incorrect.

http://hadoop.apache.org/docs/r2.2.0/hadoop-hdfs-httpfs/index.html
http://hadoop.apache.org/docs/r2.0.5-alpha/hadoop-hdfs-httpfs/index.html
From the documentation:
*
HttpFS is a separate service from Hadoop NameNode.
HttpFS itself is Java web-application and it runs using a preconfigured Tomcat 
bundled with HttpFS binary distribution.
HttpFS HTTP web-service API calls are HTTP REST calls that map to a HDFS file 
system operation. For example, using the curl Unix command:
$ curl http://httpfs-host:14000/webhdfs/v1/user/foo/README.txt returns the 
contents of the HDFS /user/foo/README.txt file.
$ curl http://httpfs-host:14000/webhdfs/v1/user/foo?op=list returns the 
contents of the HDFS /user/foo directory in JSON format.
$ curl -X POST http://httpfs-host:14000/webhdfs/v1/user/foo/bar?op=mkdirs 
creates the HDFS /user/foo.bar directory.
***
The commands have incorrect operations. ( Verified through source code in 
HttpFSFileSystem.java )
In addition, although the webhdfs documentation specifies user.name as 
optional, on my cluster, each action required a user.name
It should be included in the short examples to allow for the greatest chance of 
success.
Three examples rewritten:
curl -i -L 
http://httpfs-host:14000/webhdfs/v1/user/foo/README.txt?op=openuser.name=hdfsuser;
curl -i 
http://httpfs-host:14000/webhdfs/v1/user/foo/?op=liststatususer.name=hdfsuser;
curl -i -X PUT 
http://httpfs-host:14000/webhdfs/v1/user/foo/bar?op=mkdirsuser.name=hdfsuser;
Not sure what the convention should be for specifying the user.name. Use hdfs? 
or a name that is obviously an example?
It would also be beneficial if the HTTPfs page linked to the webhdfs 
documentation page in the text instead of just on the menu sidebar.

http://hadoop.apache.org/docs/r2.2.0/hadoop-project-dist/hadoop-hdfs/WebHDFS.html




--
This message was sent by Atlassian JIRA
(v6.1.4#6159)


[jira] [Updated] (HDFS-5696) Examples for httpfs REST API incorrect on apache.org

2013-12-20 Thread Casey Brotherton (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-5696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Casey Brotherton updated HDFS-5696:
---

Priority: Trivial  (was: Minor)

 Examples for httpfs REST API incorrect on apache.org
 

 Key: HDFS-5696
 URL: https://issues.apache.org/jira/browse/HDFS-5696
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: documentation
Affects Versions: 2.2.0
 Environment: NA
Reporter: Casey Brotherton
Priority: Trivial

 The examples provided for the httpfs REST API are incorrect.
 http://hadoop.apache.org/docs/r2.2.0/hadoop-hdfs-httpfs/index.html
 http://hadoop.apache.org/docs/r2.0.5-alpha/hadoop-hdfs-httpfs/index.html
 From the documentation:
 *
 HttpFS is a separate service from Hadoop NameNode.
 HttpFS itself is Java web-application and it runs using a preconfigured 
 Tomcat bundled with HttpFS binary distribution.
 HttpFS HTTP web-service API calls are HTTP REST calls that map to a HDFS file 
 system operation. For example, using the curl Unix command:
 $ curl http://httpfs-host:14000/webhdfs/v1/user/foo/README.txt returns the 
 contents of the HDFS /user/foo/README.txt file.
 $ curl http://httpfs-host:14000/webhdfs/v1/user/foo?op=list returns the 
 contents of the HDFS /user/foo directory in JSON format.
 $ curl -X POST http://httpfs-host:14000/webhdfs/v1/user/foo/bar?op=mkdirs 
 creates the HDFS /user/foo.bar directory.
 ***
 The commands have incorrect operations. ( Verified through source code in 
 HttpFSFileSystem.java )
 In addition, although the webhdfs documentation specifies user.name as 
 optional, on my cluster, each action required a user.name
 It should be included in the short examples to allow for the greatest chance 
 of success.
 Three examples rewritten:
 curl -i -L 
 http://httpfs-host:14000/webhdfs/v1/user/foo/README.txt?op=openuser.name=hdfsuser;
 curl -i 
 http://httpfs-host:14000/webhdfs/v1/user/foo/?op=liststatususer.name=hdfsuser;
 curl -i -X PUT 
 http://httpfs-host:14000/webhdfs/v1/user/foo/bar?op=mkdirsuser.name=hdfsuser;
 Not sure what the convention should be for specifying the user.name. Use 
 hdfs? or a name that is obviously an example?
 It would also be beneficial if the HTTPfs page linked to the webhdfs 
 documentation page in the text instead of just on the menu sidebar.
 http://hadoop.apache.org/docs/r2.2.0/hadoop-project-dist/hadoop-hdfs/WebHDFS.html



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)