Re: [VOTE] introduce Python as build-time and run-time dependency for Hadoop and throughout Hadoop stack

2012-12-01 Thread Eric Yang
-1, +1, -1

Python has fairly inconsistent support across all major OS vendors.  It is
hard to get it right unless the scripts are all designed to make use of
Python 2.4.  However, Python 2.4 doesn't have necessary OS features to make
Python useful in runtime or build environment unless you write a lot of
custom modules.  Which defeats the purpose to use python as intermediate
layer to do OS dependent work.  Jruby may be a better choice.

regards,
Eric

On Sat, Dec 1, 2012 at 12:28 PM, Joep Rottinghuis wrote:

> 0, 0, -1 (non-binding)
>
> Joep
>
> On Nov 24, 2012, at 12:13 PM, Matt Foley  wrote:
>
> > For discussion, please see previous thread "[PROPOSAL] introduce Python
> as
> > build-time and run-time dependency for Hadoop and throughout Hadoop
> stack".
> >
> > This vote consists of three separate items:
> >
> > 1. Contributors shall be allowed to use Python as a platform-independent
> > scripting language for build-time tasks, and add Python as a build-time
> > dependency.
> > Please vote +1, 0, -1.
> >
> > 2. Contributors shall be encouraged to use Maven tasks in combination
> with
> > either plug-ins or Groovy scripts to do cross-platform build-time tasks,
> > even under ant in Hadoop-1.
> > Please vote +1, 0, -1.
> >
> > 3. Contributors shall be allowed to use Python as a platform-independent
> > scripting language for run-time tasks, and add Python as a run-time
> > dependency.
> > Please vote +1, 0, -1.
> >
> > Note that voting -1 on #1 and +1 on #2 essentially REQUIRES contributors
> to
> > use Maven plug-ins or Groovy as the only means of cross-platform
> build-time
> > tasks, or to simply continue using platform-dependent scripts as is being
> > done today.
> >
> > Vote closes at 12:30pm PST on Saturday 1 December.
> > -
> > Personally, my vote is +1, +1, +1.
> > I think #2 is preferable to #1, but still has many unknowns in it, and
> > until those are worked out I don't want to delay moving to cross-platform
> > scripts for build-time tasks.
> >
> > Best regards,
> > --Matt
>


[jira] [Resolved] (HADOOP-8301) Common (hadoop-tools) side of MAPREDUCE-4172

2012-12-01 Thread Harsh J (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harsh J resolved HADOOP-8301.
-

Resolution: Won't Fix

Patches were too broad and have gone stale. Will address these forms of issue 
over separate, smaller and more divided JIRAs in future.

Closing out parent JIRA MAPREDUCE-4172, and hence closing out this.

> Common (hadoop-tools) side of MAPREDUCE-4172
> 
>
> Key: HADOOP-8301
> URL: https://issues.apache.org/jira/browse/HADOOP-8301
> Project: Hadoop Common
>  Issue Type: Task
>  Components: build
>Affects Versions: 3.0.0
>Reporter: Harsh J
>Assignee: Harsh J
>
> Patches on MAPREDUCE-4172 (for MR-relevant projects) that requires to run off 
> of Hadoop Common project for Hadoop QA.
> One sub-task per hadoop-tools submodule will be added here for reviews.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (HADOOP-9110) winutils ls off-by-one error indexing MONTHS array can cause access violation

2012-12-01 Thread Suresh Srinivas (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-9110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suresh Srinivas resolved HADOOP-9110.
-

   Resolution: Fixed
Fix Version/s: trunk-win
   1-win
 Hadoop Flags: Reviewed

Patch committed to both branch-1-win and branch-trunk-win. Thank you Chris.

> winutils ls off-by-one error indexing MONTHS array can cause access violation
> -
>
> Key: HADOOP-9110
> URL: https://issues.apache.org/jira/browse/HADOOP-9110
> Project: Hadoop Common
>  Issue Type: Bug
>  Components: util
>Affects Versions: 1-win, trunk-win
>Reporter: Chris Nauroth
>Assignee: Chris Nauroth
> Fix For: 1-win, trunk-win
>
> Attachments: HADOOP-9110-branch-1-win.1.patch, 
> HADOOP-9110-branch-trunk-win.1.patch
>
>
> In ls.c, the LsPrintLine function uses the wMonth field of a SYSTEMTIME 
> struct to index into MONTHS, an array of 12 elements containing string 
> representations of the months.  The wMonth field is 1-based (1=January), but 
> indexing into an array is zero-based.  This gives the wrong month, and since 
> we just crossed into month 12, it also has started causing an access 
> violation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


Re: [VOTE] introduce Python as build-time and run-time dependency for Hadoop and throughout Hadoop stack

2012-12-01 Thread Joep Rottinghuis
0, 0, -1 (non-binding)

Joep

On Nov 24, 2012, at 12:13 PM, Matt Foley  wrote:

> For discussion, please see previous thread "[PROPOSAL] introduce Python as
> build-time and run-time dependency for Hadoop and throughout Hadoop stack".
> 
> This vote consists of three separate items:
> 
> 1. Contributors shall be allowed to use Python as a platform-independent
> scripting language for build-time tasks, and add Python as a build-time
> dependency.
> Please vote +1, 0, -1.
> 
> 2. Contributors shall be encouraged to use Maven tasks in combination with
> either plug-ins or Groovy scripts to do cross-platform build-time tasks,
> even under ant in Hadoop-1.
> Please vote +1, 0, -1.
> 
> 3. Contributors shall be allowed to use Python as a platform-independent
> scripting language for run-time tasks, and add Python as a run-time
> dependency.
> Please vote +1, 0, -1.
> 
> Note that voting -1 on #1 and +1 on #2 essentially REQUIRES contributors to
> use Maven plug-ins or Groovy as the only means of cross-platform build-time
> tasks, or to simply continue using platform-dependent scripts as is being
> done today.
> 
> Vote closes at 12:30pm PST on Saturday 1 December.
> -
> Personally, my vote is +1, +1, +1.
> I think #2 is preferable to #1, but still has many unknowns in it, and
> until those are worked out I don't want to delay moving to cross-platform
> scripts for build-time tasks.
> 
> Best regards,
> --Matt


Re: [VOTE] introduce Python as build-time and run-time dependency for Hadoop and throughout Hadoop stack

2012-12-01 Thread Doug Cutting
On Sat, Dec 1, 2012 at 2:44 AM, Steve Loughran  wrote:
> WinNT Bat/CMD files are the worst possible scripting language invented. At
> the very least, .py should be the language of choice there

The scripts should not have so much logic that .bat files are a problem.

Doug


Re: [VOTE] introduce Python as build-time and run-time dependency for Hadoop and throughout Hadoop stack

2012-12-01 Thread Steve Loughran
On 30 November 2012 13:40, Radim Kolar  wrote:

>
>  inline ant scripts
>>>
>>> =0. Ant's versioning is stricter; you can pull down the exact Jar
>>> versions,
>>> and some of us in the Ant team worked very hard to get it going
>>> everywhere.
>>> You don't gain anything by going to .py
>>>
>> there are sh scripts inside maven ant plugin stuff
>

Which is because there are some things you can't do in Java -run rpmbuild
to pick up file permissions and hanging symlinks that only become valid on
deployment.

The reason Ant is used to start them is Maven views trying to run native
scripts as a forbidden action - probably popping up some patronising text
"you are trying to run a shell script, please look at
maven.apache.org/wiki/whymavenwontletyoudothings/ to understand this; they
also view building RPMs as not something to encourage either.

(but we digress into an ant vs maven argument. I do actually appreciate the
consistent target naming across projects and the ability for the IDE to set
up  structure, it's just the entire underlying architecture and
implementation that I dislike)


Re: [VOTE] introduce Python as build-time and run-time dependency for Hadoop and throughout Hadoop stack

2012-12-01 Thread Steve Loughran
On 1 December 2012 01:08, Eli Collins  wrote:

> -1, 0, -1
>
> IIUC the only platform we plan to add support for that we can't easily
> support today (w/o an emulation layer like cygwin) is Windows, and it
> seems like making the bash scripts simpler and having parallel bat
> files is IMO a better approach.
>
>
WinNT Bat/CMD files are the worst possible scripting language invented. At
the very least, .py should be the language of choice there