[jira] [Updated] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-10 Thread Caleb Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Caleb Jones updated HADOOP-9195:


Attachment: (was: HADOOP-9195.patch)

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor

 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-10 Thread Caleb Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Caleb Jones updated HADOOP-9195:


Status: Open  (was: Patch Available)

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor
 Attachments: HADOOP-9195.patch


 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-10 Thread Caleb Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Caleb Jones updated HADOOP-9195:


Attachment: HADOOP-9195.patch

Generic use date range PathFilter implementation (fixed references to 
deprecated FileStatus.isDir() and replaced with new FileStatus.isDirectory()).

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor
 Attachments: HADOOP-9195.patch


 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-10 Thread Caleb Jones (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13549471#comment-13549471
 ] 

Caleb Jones commented on HADOOP-9195:
-

I'm failing to see how TestZKFailoverController's tests are in any way related 
to the mere addition of a new PathFilter in org.apache.hadoop.fs. I've fixed 
the compiler warnings though (was referencing deprecated FileStatus.isDir()).

Unless someone can point out how TestZKFailoverController is failing due to 
these code changes, I'm going to assume that TestZKFailoverController's tests 
are having stability issues:

{noformat}
Error Message

test timed out after 15000 milliseconds

Stacktrace

java.lang.Exception: test timed out after 15000 milliseconds
at java.lang.Object.wait(Native Method)
at 
org.apache.hadoop.ha.ZKFailoverController.waitForActiveAttempt(ZKFailoverController.java:458)
at 
org.apache.hadoop.ha.ZKFailoverController.doGracefulFailover(ZKFailoverController.java:646)
...
{noformat}

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor
 Attachments: HADOOP-9195.patch


 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-10 Thread Caleb Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Caleb Jones updated HADOOP-9195:


Status: Patch Available  (was: Open)

Submitting updated patch with compiler warnings fixed.

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor
 Attachments: HADOOP-9195.patch


 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-10 Thread Caleb Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Caleb Jones updated HADOOP-9195:


Attachment: HADOOP-9195.patch

Tweaked it a bit to allow for open-ended ranges.

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor
 Attachments: HADOOP-9195.patch, HADOOP-9195.patch


 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-10 Thread Caleb Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Caleb Jones updated HADOOP-9195:


Status: Patch Available  (was: Open)

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor
 Attachments: HADOOP-9195.patch, HADOOP-9195.patch


 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-10 Thread Caleb Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Caleb Jones updated HADOOP-9195:


Status: Open  (was: Patch Available)

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor
 Attachments: HADOOP-9195.patch, HADOOP-9195.patch


 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-09 Thread Caleb Jones (JIRA)
Caleb Jones created HADOOP-9195:
---

 Summary: Generic Use Date Range PathFilter
 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor


It would be useful for Hadoop to provide a general purpose date range 
PathFilter that operates on file mtime. I have implemented one, with tests, and 
would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-09 Thread Caleb Jones (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13549367#comment-13549367
 ] 

Caleb Jones commented on HADOOP-9195:
-

I'll attach a patch.

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor

 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-09 Thread Caleb Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Caleb Jones updated HADOOP-9195:


Attachment: HADOOP-9195.patch

Attached is a patch for this implementation along with unit tests. I reviewed 
http://wiki.apache.org/hadoop/HowToContribute and structured the code 
accordingly.

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor
 Attachments: HADOOP-9195.patch


 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-8989) hadoop dfs -find feature

2013-01-09 Thread Caleb Jones (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-8989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13549411#comment-13549411
 ] 

Caleb Jones commented on HADOOP-8989:
-

I've submitted a patch for HADOOP-9195 (DateTimePathFilter based on mtime) 
which may have some functionality similar to this.

 hadoop dfs -find feature
 

 Key: HADOOP-8989
 URL: https://issues.apache.org/jira/browse/HADOOP-8989
 Project: Hadoop Common
  Issue Type: New Feature
Reporter: Marco Nicosia
Assignee: Jonathan Allen
 Attachments: HADOOP-8989.patch, HADOOP-8989.patch, HADOOP-8989.patch, 
 HADOOP-8989.patch, HADOOP-8989.patch, HADOOP-8989.patch, HADOOP-8989.patch


 Both sysadmins and users make frequent use of the unix 'find' command, but 
 Hadoop has no correlate. Without this, users are writing scripts which make 
 heavy use of hadoop dfs -lsr, and implementing find one-offs. I think hdfs 
 -lsr is somewhat taxing on the NameNode, and a really slow experience on the 
 client side. Possibly an in-NameNode find operation would be only a bit more 
 taxing on the NameNode, but significantly faster from the client's point of 
 view?
 The minimum set of options I can think of which would make a Hadoop find 
 command generally useful is (in priority order):
 * -type (file or directory, for now)
 * -atime/-ctime-mtime (... and -creationtime?) (both + and - arguments)
 * -print0 (for piping to xargs -0)
 * -depth
 * -owner/-group (and -nouser/-nogroup)
 * -name (allowing for shell pattern, or even regex?)
 * -perm
 * -size
 One possible special case, but could possibly be really cool if it ran from 
 within the NameNode:
 * -delete
 The hadoop dfs -lsr | hadoop dfs -rm cycle is really, really slow.
 Lower priority, some people do use operators, mostly to execute -or searches 
 such as:
 * find / \(-nouser -or -nogroup\)
 Finally, I thought I'd include a link to the [Posix spec for 
 find|http://www.opengroup.org/onlinepubs/009695399/utilities/find.html]

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HADOOP-9195) Generic Use Date Range PathFilter

2013-01-09 Thread Caleb Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-9195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Caleb Jones updated HADOOP-9195:


Status: Patch Available  (was: Open)

 Generic Use Date Range PathFilter
 -

 Key: HADOOP-9195
 URL: https://issues.apache.org/jira/browse/HADOOP-9195
 Project: Hadoop Common
  Issue Type: New Feature
  Components: fs
Reporter: Caleb Jones
Priority: Minor
 Attachments: HADOOP-9195.patch


 It would be useful for Hadoop to provide a general purpose date range 
 PathFilter that operates on file mtime. I have implemented one, with tests, 
 and would like to know where best to put it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira