[jira] [Created] (HDFS-8936) Simplify Erasure Coding Zone DiskSpace quota exceeded exception error message

2015-08-21 Thread GAO Rui (JIRA)
GAO Rui created HDFS-8936:
-

 Summary: Simplify Erasure Coding Zone DiskSpace quota exceeded 
exception error message
 Key: HDFS-8936
 URL: https://issues.apache.org/jira/browse/HDFS-8936
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: GAO Rui
Assignee: GAO Rui


When a EC directory exceed DiskSpace quota, the error message is along with 
DFSStripedOutputStream inner exception message. Error messages should be as 
simple and clear as normal hdfs directory. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-8388) Time and Date format need to be in sync in Namenode UI page

2015-08-21 Thread Surendra Singh Lilhore (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-8388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706512#comment-14706512
 ] 

Surendra Singh Lilhore commented on HDFS-8388:
--

Thanks [~vinayrpet] for review

bq. I feel, if we are unifying, then unify everywhere. agree?
yes 

I will update the patch.

> Time and Date format need to be in sync in Namenode UI page
> ---
>
> Key: HDFS-8388
> URL: https://issues.apache.org/jira/browse/HDFS-8388
> Project: Hadoop HDFS
>  Issue Type: Bug
>Reporter: Archana T
>Assignee: Surendra Singh Lilhore
>Priority: Minor
> Attachments: HDFS-8388-002.patch, HDFS-8388-003.patch, 
> HDFS-8388-004.patch, HDFS-8388.patch, HDFS-8388_1.patch, 
> ScreenShot-InvalidDate.png
>
>
> In NameNode UI Page, Date and Time FORMAT  displayed on the page are not in 
> sync currently.
> Started:Wed May 13 12:28:02 IST 2015
> Compiled:23 Apr 2015 12:22:59 
> Block Deletion Start Time   13 May 2015 12:28:02
> We can keep a common format in all the above places.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-8932) NPE thrown in NameNode when try to get "TotalSyncCount" metric before editLogStream initialization

2015-08-21 Thread Surendra Singh Lilhore (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-8932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706506#comment-14706506
 ] 

Surendra Singh Lilhore commented on HDFS-8932:
--

Thanks [~vinayrpet] for review

> NPE thrown in NameNode when try to get "TotalSyncCount" metric before 
> editLogStream initialization
> --
>
> Key: HDFS-8932
> URL: https://issues.apache.org/jira/browse/HDFS-8932
> Project: Hadoop HDFS
>  Issue Type: Bug
>Reporter: Surendra Singh Lilhore
>Assignee: Surendra Singh Lilhore
> Attachments: HDFS-8932.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-6939) Support path-based filtering of inotify events

2015-08-21 Thread Surendra Singh Lilhore (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-6939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Surendra Singh Lilhore updated HDFS-6939:
-
Status: Patch Available  (was: Open)

> Support path-based filtering of inotify events
> --
>
> Key: HDFS-6939
> URL: https://issues.apache.org/jira/browse/HDFS-6939
> Project: Hadoop HDFS
>  Issue Type: Sub-task
>  Components: hdfs-client, namenode, qjm
>Reporter: James Thomas
>Assignee: Surendra Singh Lilhore
> Attachments: HDFS-6939-001.patch
>
>
> Users should be able to specify that they only want events involving 
> particular paths.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-6939) Support path-based filtering of inotify events

2015-08-21 Thread Surendra Singh Lilhore (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-6939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706503#comment-14706503
 ] 

Surendra Singh Lilhore commented on HDFS-6939:
--

>From inotify design doc.

bq. 1. Allow clients to watch events from specific directories rather than from 
the entirety of HDFS. We will need to decide whether undesired events should be 
filtered out on the NameNode or client side.

I think its good to add filtering on the Namenode side. Client side user only 
can do filtering in his application.


*Patch 001:* 


1. Added *Regex* based filterting, User can pass regular expression for 
filtering. 
2. User can watch events for perticular file, by passing file path.
3. Added two new API in *DistributedFileSystem* 
{noformat}
public DFSInotifyEventInputStream 
getInotifyEventStream(InotifyFilter filter);
public DFSInotifyEventInputStream getInotifyEventStream(long 
lastReadTxid, InotifyFilter filter)
{noformat}
4. In feature we can add more filter option in *InotifyFilter* object 
without changing API, for example event based filtering HDFS-8933

Please review and give suggestions .

> Support path-based filtering of inotify events
> --
>
> Key: HDFS-6939
> URL: https://issues.apache.org/jira/browse/HDFS-6939
> Project: Hadoop HDFS
>  Issue Type: Sub-task
>  Components: hdfs-client, namenode, qjm
>Reporter: James Thomas
>Assignee: Surendra Singh Lilhore
> Attachments: HDFS-6939-001.patch
>
>
> Users should be able to specify that they only want events involving 
> particular paths.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-6939) Support path-based filtering of inotify events

2015-08-21 Thread Surendra Singh Lilhore (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-6939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Surendra Singh Lilhore updated HDFS-6939:
-
Attachment: HDFS-6939-001.patch

> Support path-based filtering of inotify events
> --
>
> Key: HDFS-6939
> URL: https://issues.apache.org/jira/browse/HDFS-6939
> Project: Hadoop HDFS
>  Issue Type: Sub-task
>  Components: hdfs-client, namenode, qjm
>Reporter: James Thomas
>Assignee: Surendra Singh Lilhore
> Attachments: HDFS-6939-001.patch
>
>
> Users should be able to specify that they only want events involving 
> particular paths.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Vinayakumar B (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706489#comment-14706489
 ] 

Vinayakumar B commented on HDFS-7116:
-

bq. 1. Would you remove the sentence starts with "The new value is not 
persistent ... "? If we add the metric, the new value does persist on the 
DataNodeMetrics.
IMO, exposing via metric is not same as persisting. It just shows out the 
current value. Here not actually meant, new value is only till lifetime of the 
datanode. On restart it will be reset. so i feel still NOTE is valid. May be 
the sentence related to exposing can be added before the note. To make it clear.

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch, HDFS-7116-04.patch, HDFS-7116-05.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Vinayakumar B (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706489#comment-14706489
 ] 

Vinayakumar B edited comment on HDFS-7116 at 8/21/15 9:57 AM:
--

bq. 1. Would you remove the sentence starts with "The new value is not 
persistent ... "? If we add the metric, the new value does persist on the 
DataNodeMetrics.
IMO, exposing via metric is not same as persisting. It just shows out the 
current value. Here NOTE actually meant, new value is only till lifetime of the 
datanode. On restart it will be reset. so i feel still NOTE is valid. May be 
the sentence related to exposing can be added before the note. To make it clear.


was (Author: vinayrpet):
bq. 1. Would you remove the sentence starts with "The new value is not 
persistent ... "? If we add the metric, the new value does persist on the 
DataNodeMetrics.
IMO, exposing via metric is not same as persisting. It just shows out the 
current value. Here not actually meant, new value is only till lifetime of the 
datanode. On restart it will be reset. so i feel still NOTE is valid. May be 
the sentence related to exposing can be added before the note. To make it clear.

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch, HDFS-7116-04.patch, HDFS-7116-05.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-8388) Time and Date format need to be in sync in Namenode UI page

2015-08-21 Thread Vinayakumar B (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-8388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706481#comment-14706481
 ] 

Vinayakumar B commented on HDFS-8388:
-

Patch looks great.

I feel there are some other places in NameNode UI where this unification could 
be done.
ex: Datanode last contact time, file last modification time, datanode volume 
failure time, etc.

I feel, if we are unifying, then unify everywhere.
agree?

> Time and Date format need to be in sync in Namenode UI page
> ---
>
> Key: HDFS-8388
> URL: https://issues.apache.org/jira/browse/HDFS-8388
> Project: Hadoop HDFS
>  Issue Type: Bug
>Reporter: Archana T
>Assignee: Surendra Singh Lilhore
>Priority: Minor
> Attachments: HDFS-8388-002.patch, HDFS-8388-003.patch, 
> HDFS-8388-004.patch, HDFS-8388.patch, HDFS-8388_1.patch, 
> ScreenShot-InvalidDate.png
>
>
> In NameNode UI Page, Date and Time FORMAT  displayed on the page are not in 
> sync currently.
> Started:Wed May 13 12:28:02 IST 2015
> Compiled:23 Apr 2015 12:22:59 
> Block Deletion Start Time   13 May 2015 12:28:02
> We can keep a common format in all the above places.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-8829) DataNode sets SO_RCVBUF explicitly is disabling tcp auto-tuning

2015-08-21 Thread He Tianyi (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-8829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

He Tianyi updated HDFS-8829:

Attachment: HDFS-8829.0002.patch

Added configuration to hdfs-default.xml

> DataNode sets SO_RCVBUF explicitly is disabling tcp auto-tuning
> ---
>
> Key: HDFS-8829
> URL: https://issues.apache.org/jira/browse/HDFS-8829
> Project: Hadoop HDFS
>  Issue Type: Improvement
>  Components: datanode
>Affects Versions: 2.3.0, 2.6.0
>Reporter: He Tianyi
>Assignee: He Tianyi
> Attachments: HDFS-8829.0001.patch, HDFS-8829.0002.patch
>
>
> {code:java}
>   private void initDataXceiver(Configuration conf) throws IOException {
> // find free port or use privileged port provided
> TcpPeerServer tcpPeerServer;
> if (secureResources != null) {
>   tcpPeerServer = new TcpPeerServer(secureResources);
> } else {
>   tcpPeerServer = new TcpPeerServer(dnConf.socketWriteTimeout,
>   DataNode.getStreamingAddr(conf));
> }
> 
> tcpPeerServer.setReceiveBufferSize(HdfsConstants.DEFAULT_DATA_SOCKET_SIZE);
> {code}
> The last line sets SO_RCVBUF explicitly, thus disabling tcp auto-tuning on 
> some system.
> Shall we make this behavior configurable?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Rakesh R (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706461#comment-14706461
 ] 

Rakesh R commented on HDFS-7116:


OK, thank you for the explanation. Attached patch addressing the comments.

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch, HDFS-7116-04.patch, HDFS-7116-05.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Rakesh R (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rakesh R updated HDFS-7116:
---
Attachment: HDFS-7116-05.patch

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch, HDFS-7116-04.patch, HDFS-7116-05.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-8892) ShortCircuitCache.CacheCleaner can add Slot.isInvalid() check too

2015-08-21 Thread Ravikumar (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-8892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706455#comment-14706455
 ] 

Ravikumar commented on HDFS-8892:
-

Iterating over all replicas is a cause of concern. Is it possible for client to 
ack a slot made invalid by DN? But my guess is that, it would needlessly 
complicate this issue.

May be the patch itself is applicable for specific use-cases…

How about exposing a boolean config 
"dfs.client.read.shortcircuit.replica.stale.cleanup" and wrapping this small 
patch in a conditional block?

> ShortCircuitCache.CacheCleaner can add Slot.isInvalid() check too
> -
>
> Key: HDFS-8892
> URL: https://issues.apache.org/jira/browse/HDFS-8892
> Project: Hadoop HDFS
>  Issue Type: Improvement
>  Components: hdfs-client
>Affects Versions: 2.7.1
>Reporter: Ravikumar
>Assignee: kanaka kumar avvaru
>Priority: Minor
>
> Currently CacheCleaner thread checks only for cache-expiry times. It would be 
> nice if it handles an invalid-slot too in an extra-pass of evictable map…
> for(ShortCircuitReplica replica:evictable.values()) {
>  if(!scr.getSlot().isValid()) {
> purge(replica);
>  }
> }
> //Existing code...
> int numDemoted = demoteOldEvictableMmaped(curMs);
> int numPurged = 0;
> Long evictionTimeNs = Long.valueOf(0);
> ….
> …..
> Apps like HBase can tweak the expiry/staleness/cache-size params in 
> DFS-Client, so that ShortCircuitReplica will never be closed except when Slot 
> is declared invalid. 
> I assume slot-invalidation will happen during block-invalidation/deletes 
> {Primarily triggered by compaction/shard-takeover etc..}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Akira AJISAKA (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706452#comment-14706452
 ] 

Akira AJISAKA commented on HDFS-7116:
-

Thanks [~rakeshr] for updating the patch.
{code}
> -| `-setBalancerBandwidth` \ | Changes the 
> network bandwidth used by each datanode during HDFS block balancing. 
> \ is the maximum number of bytes per second that will be used by 
> each datanode. This value overrides the dfs.balance.bandwidthPerSec 
> parameter.NOTE: The new value is not persistent on the DataNode. |
> +| `-setBalancerBandwidth` \ | Changes the 
> network bandwidth used by each datanode during HDFS block balancing. 
> \ is the maximum number of bytes per second that will be used by 
> each datanode. This value overrides the dfs.balance.bandwidthPerSec 
> parameter.NOTE: The new value is not persistent on the DataNode. The 
> value is exposed via BalancerBandwidth metric. |
{code}

1. Would you remove the sentence starts with "The new value is not persistent 
... "? If we add the metric, the new value does persist on the DataNodeMetrics.
2. (minor) Would you replace {{0xC2A0}} (whitespace in UTF-8) before "NOTE:" 
with whitespace as well?

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch, HDFS-7116-04.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Rakesh R (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706441#comment-14706441
 ] 

Rakesh R commented on HDFS-7116:


Attached another patch for the HDFSCommands doc changes. Thanks again!

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch, HDFS-7116-04.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Rakesh R (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rakesh R updated HDFS-7116:
---
Attachment: HDFS-7116-04.patch

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch, HDFS-7116-04.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-8853) Erasure Coding: Provide ECSchema validation when creating ECZone

2015-08-21 Thread J.Andreina (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-8853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

J.Andreina updated HDFS-8853:
-
Attachment: HDFS-8853-HDFS-7285-merge-04.patch

Thanks [~vinayrpet] and [~zhz] for the review comments. 

Updated the patch . Please review. 
Have raised a separate jira for passing policy name for creating EC zone 
(HDFS-8935)

> Erasure Coding: Provide ECSchema validation when creating ECZone
> 
>
> Key: HDFS-8853
> URL: https://issues.apache.org/jira/browse/HDFS-8853
> Project: Hadoop HDFS
>  Issue Type: Sub-task
>Reporter: Rakesh R
>Assignee: J.Andreina
> Attachments: HDFS-8853-HDFS-7285-01.patch, 
> HDFS-8853-HDFS-7285-merge-02.patch, HDFS-8853-HDFS-7285-merge-03.patch, 
> HDFS-8853-HDFS-7285-merge-04.patch
>
>
> Presently the {{DFS#createErasureCodingZone(path, ecSchema, cellSize)}} 
> doesn't have any validation that the given {{ecSchema}} is available in 
> {{ErasureCodingSchemaManager#activeSchemas}} list. Now, if it doesn't exists 
> then will create the ECZone with {{null}} schema. IMHO we could improve this 
> by doing necessary basic sanity checks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HDFS-8935) Erasure Coding: createErasureCodingZone api should accept the policyname as argument instead of ErasureCodingPolicy

2015-08-21 Thread J.Andreina (JIRA)
J.Andreina created HDFS-8935:


 Summary: Erasure Coding: createErasureCodingZone api should accept 
the policyname as argument instead of ErasureCodingPolicy
 Key: HDFS-8935
 URL: https://issues.apache.org/jira/browse/HDFS-8935
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: J.Andreina
Assignee: J.Andreina


Current behavior : User has to specify ErasureCodingPolicy as an argument for 
createErasureCodingZone api .

This can be made in sync with creation of EC zone through CLI , where user need 
to specify only the policy name.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-8894) Set SO_KEEPALIVE on DN server sockets

2015-08-21 Thread kanaka kumar avvaru (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-8894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

kanaka kumar avvaru updated HDFS-8894:
--
Status: Open  (was: Patch Available)

> Set SO_KEEPALIVE on DN server sockets
> -
>
> Key: HDFS-8894
> URL: https://issues.apache.org/jira/browse/HDFS-8894
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: datanode
>Affects Versions: 2.7.1
>Reporter: Nathan Roberts
>Assignee: kanaka kumar avvaru
> Attachments: HDFS-8894-01.patch
>
>
> SO_KEEPALIVE is not set on things like datastreamer sockets which can cause 
> lingering ESTABLISHED sockets when there is a network glitch.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-8894) Set SO_KEEPALIVE on DN server sockets

2015-08-21 Thread kanaka kumar avvaru (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-8894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

kanaka kumar avvaru updated HDFS-8894:
--
Status: Patch Available  (was: Open)

to trigger jenkins

> Set SO_KEEPALIVE on DN server sockets
> -
>
> Key: HDFS-8894
> URL: https://issues.apache.org/jira/browse/HDFS-8894
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: datanode
>Affects Versions: 2.7.1
>Reporter: Nathan Roberts
>Assignee: kanaka kumar avvaru
> Attachments: HDFS-8894-01.patch
>
>
> SO_KEEPALIVE is not set on things like datastreamer sockets which can cause 
> lingering ESTABLISHED sockets when there is a network glitch.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Akira AJISAKA (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706408#comment-14706408
 ] 

Akira AJISAKA commented on HDFS-7116:
-

Thanks [~rakeshr] for updating the patch. I built a pseudo-distributed cluster 
with your patch and the metric worked well.
One comment: Would you update the description of "dfsadmin 
-setBalancerBandwidth" in HDFSCommands document?
{code}
NOTE: The new value is not persistent on the DataNode.
{code}
Now the new value is exposed via BalancerBandwidth metric.

I'm +1 if that is addressed.

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-8899) Erasure Coding: use threadpool for EC recovery tasks

2015-08-21 Thread Rakesh R (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-8899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706398#comment-14706398
 ] 

Rakesh R commented on HDFS-8899:


Test case failures and findbug warnings are not related to the patch, please 
ignore it.

> Erasure Coding: use threadpool for EC recovery tasks
> 
>
> Key: HDFS-8899
> URL: https://issues.apache.org/jira/browse/HDFS-8899
> Project: Hadoop HDFS
>  Issue Type: Sub-task
>Reporter: Rakesh R
>Assignee: Rakesh R
> Attachments: HDFS-8899-HDFS-7285-merge-00.patch
>
>
> The idea is to use threadpool for processing erasure coding recovery tasks at 
> the datanode.
> {code}
> new Daemon(new ReconstructAndTransferBlock(recoveryInfo)).start();
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-8909) Erasure coding: update BlockInfoContiguousUC and BlockInfoStripedUC to use BlockUnderConstructionFeature

2015-08-21 Thread Zhe Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-8909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706361#comment-14706361
 ] 

Zhe Zhang commented on HDFS-8909:
-

Jenkins [job | https://builds.apache.org/job/Hadoop-HDFS-8909/] started.

> Erasure coding: update BlockInfoContiguousUC and BlockInfoStripedUC to use 
> BlockUnderConstructionFeature
> 
>
> Key: HDFS-8909
> URL: https://issues.apache.org/jira/browse/HDFS-8909
> Project: Hadoop HDFS
>  Issue Type: Sub-task
>  Components: namenode
>Affects Versions: HDFS-7285
>Reporter: Zhe Zhang
>Assignee: Jing Zhao
> Attachments: HDFS-8909.000.patch, HDFS-8909.001.patch, 
> HDFS-8909.002.patch
>
>
> HDFS-8801 converts {{BlockInfoUC}} as a feature. We should consolidate 
> {{BlockInfoContiguousUC}} and {{BlockInfoStripedUC}} logics to use this 
> feature.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Vinayakumar B (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706336#comment-14706336
 ] 

Vinayakumar B commented on HDFS-7116:
-

Thanks @Akira for confirming.

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Rakesh R (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706338#comment-14706338
 ] 

Rakesh R commented on HDFS-7116:


Thanks [~ajisakaa], [~vinayrpet] for the comments. Attached another patch 
addressing the comments.

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Rakesh R (JIRA)

 [ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rakesh R updated HDFS-7116:
---
Attachment: HDFS-7116-03.patch

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch, HDFS-7116-03.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-8829) DataNode sets SO_RCVBUF explicitly is disabling tcp auto-tuning

2015-08-21 Thread Vinayakumar B (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-8829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706335#comment-14706335
 ] 

Vinayakumar B commented on HDFS-8829:
-

Changes looks great.

1. Need to add this new configuration in the {{hdfs-default.xml}} and In the 
description need to mention about AutoTuning, it would help users. I believe 
there is also a testcase which will fail, if entry not found in xml.

> DataNode sets SO_RCVBUF explicitly is disabling tcp auto-tuning
> ---
>
> Key: HDFS-8829
> URL: https://issues.apache.org/jira/browse/HDFS-8829
> Project: Hadoop HDFS
>  Issue Type: Improvement
>  Components: datanode
>Affects Versions: 2.3.0, 2.6.0
>Reporter: He Tianyi
>Assignee: He Tianyi
> Attachments: HDFS-8829.0001.patch
>
>
> {code:java}
>   private void initDataXceiver(Configuration conf) throws IOException {
> // find free port or use privileged port provided
> TcpPeerServer tcpPeerServer;
> if (secureResources != null) {
>   tcpPeerServer = new TcpPeerServer(secureResources);
> } else {
>   tcpPeerServer = new TcpPeerServer(dnConf.socketWriteTimeout,
>   DataNode.getStreamingAddr(conf));
> }
> 
> tcpPeerServer.setReceiveBufferSize(HdfsConstants.DEFAULT_DATA_SOCKET_SIZE);
> {code}
> The last line sets SO_RCVBUF explicitly, thus disabling tcp auto-tuning on 
> some system.
> Shall we make this behavior configurable?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Akira AJISAKA (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706329#comment-14706329
 ] 

Akira AJISAKA commented on HDFS-7116:
-

bq. Can someone confirm whether adding a new metric is a compatible change?
Adding a new metric is compatible. 
http://hadoop.apache.org/docs/r2.7.1/hadoop-project-dist/hadoop-common/Compatibility.html#MetricsJMX

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Vinayakumar B (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706326#comment-14706326
 ] 

Vinayakumar B commented on HDFS-7116:
-

Thanks [~rakeshr],
Patch looks great for adding the metric. This is the simple way to expose the 
bandwidth value.
+1 for the patch.

Can someone confirm whether adding a new metric is a compatible change? I am 
asking because, a new entry will show up in jmx json.
If incompatible, need to mark this jira as incompatible.



> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HDFS-7116) Add a metric to expose the bandwidth of balancer

2015-08-21 Thread Akira AJISAKA (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706325#comment-14706325
 ] 

Akira AJISAKA commented on HDFS-7116:
-

Thanks [~rakeshr] for creating the patch.
{code}
  private AtomicLong balancerBandwidth = new AtomicLong(0);
{code}
There is a class {{MutableGaugeLong}}, which is similar to 
{{MutableCounterLong}} and can set arbitrary long value. Would you use the 
class instead of {{AtomicLong}}? The rest of the patch looks good to me.

> Add a metric to expose the bandwidth of balancer
> 
>
> Key: HDFS-7116
> URL: https://issues.apache.org/jira/browse/HDFS-7116
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: balancer & mover
>Reporter: Akira AJISAKA
>Assignee: Rakesh R
> Attachments: HDFS-7116-00.patch, HDFS-7116-01.patch, 
> HDFS-7116-02.patch
>
>
> Now reading logs is the only way to check how the balancer bandwidth is set. 
> It would be useful for administrators if they can get the value of the same. 
> This jira to discuss & implement the way to access the balancer bandwidth 
> value of the datanode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


<    1   2