[VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Alan Gates
Howl is a table management system built to provide metadata and storage management across data processing tools in Hadoop (Pig, Hive, MapReduce, ...). You can learn more details at http://wiki.apache.org/pig/Howl . For the last six months the code has been hosted at github. The Howl team

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Jeff Hammerbacher
Awesome! Huge +1. On Wed, Feb 2, 2011 at 1:18 PM, Alan Gates wrote: > Howl is a table management system built to provide metadata and storage > management across data processing tools in Hadoop (Pig, Hive, MapReduce, > ...). You can learn more details at http://wiki.apache.org/pig/Howl. For >

Nondeterministic results depending on whether you have an intermediate load

2011-02-02 Thread Jonathan Coveney
I am currently running this on: https://svn.apache.org/repos/asf/pig/branches/branch-0.8 So, I have a rather long script where the results that are output depend on whether or not I have an intermediate store. Clearly this is undesirable :) I was just curious if this might be related to any known

RE: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Olga Natkovich
+1 -Original Message- From: Alan Gates [mailto:ga...@yahoo-inc.com] Sent: Wednesday, February 02, 2011 1:19 PM To: user@pig.apache.org Subject: [VOTE] Sponsoring Howl as an Apache Incubator project Howl is a table management system built to provide metadata and storage management acros

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Edward Capriolo
On Wed, Feb 2, 2011 at 5:08 PM, Jeff Hammerbacher wrote: > Awesome! Huge +1. > > On Wed, Feb 2, 2011 at 1:18 PM, Alan Gates wrote: > >> Howl is a table management system built to provide metadata and storage >> management across data processing tools in Hadoop (Pig, Hive, MapReduce, >> ...).  You

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Daniel Dai
+1 Olga Natkovich wrote: +1 -Original Message- From: Alan Gates [mailto:ga...@yahoo-inc.com] Sent: Wednesday, February 02, 2011 1:19 PM To: user@pig.apache.org Subject: [VOTE] Sponsoring Howl as an Apache Incubator project Howl is a table management system built to provide metadata an

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Benjamin Reed
+1 On 02/02/2011 03:15 PM, Daniel Dai wrote: +1 Olga Natkovich wrote: +1 -Original Message- From: Alan Gates [mailto:ga...@yahoo-inc.com] Sent: Wednesday, February 02, 2011 1:19 PM To: user@pig.apache.org Subject: [VOTE] Sponsoring Howl as an Apache Incubator project Howl is a table ma

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Thejas M Nair
+1 -Thejas On 2/2/11 1:18 PM, "Alan Gates" wrote: Howl is a table management system built to provide metadata and storage management across data processing tools in Hadoop (Pig, Hive, MapReduce, ...). You can learn more details at http://wiki.apache.org/pig/Howl . For the last six months the

Re: pig Loader UDF with nested tuples

2011-02-02 Thread Dmitriy Ryaboy
Elephant Bird does this all over the place for both 0.6 and 0.8 And even auto-generates deep schemas :) D On Tue, Feb 1, 2011 at 11:54 AM, Alex McLintock wrote: > Can anyone point me to a Loader UDF which creates nested tuples - ie tuples > with bags/other tuples within them? > > I believe you co

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Julien Le Dem
+1 On 2/2/11 2:08 PM, "Jeff Hammerbacher" wrote: Awesome! Huge +1. On Wed, Feb 2, 2011 at 1:18 PM, Alan Gates wrote: > Howl is a table management system built to provide metadata and storage > management across data processing tools in Hadoop (Pig, Hive, MapReduce, > ...). You can learn mor

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Milind Bhandarkar
I feel that Howl should start as a contrib to Hadoop, and move to be a subproject of Hadoop once there is sufficient adoption, rather than going the incubator way. My reasons are as follows: 1. Howl is aimed at providing abstractions for facilitating interoperability between various systems bui

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Richard Ding
+1 On 2/2/11 1:18 PM, "Alan Gates" wrote: Howl is a table management system built to provide metadata and storage management across data processing tools in Hadoop (Pig, Hive, MapReduce, ...). You can learn more details at http://wiki.apache.org/pig/Howl . For the last six months the code has

Re: UDF with parameterized constructor in DEFINE statement

2011-02-02 Thread Charles Gonçalves
Thank you guys, In fact, was my bad! Sorry! On Tue, Feb 1, 2011 at 4:05 PM, Santhosh Srinivasan wrote: > The error message is misleading. The user expected 'day' to be the alias > used for the UDF and not an alias in the schema. > > -Original Message- > From: Jonathan Coveney [mailto:jcov

read concatened gzip files in Pig 0.8.0

2011-02-02 Thread Charles Gonçalves
Hi Guys, I noted that concatenated gziped files not work on Hadoop https://issues.apache.org/jira/browse/HADOOP-6335 So, have anyone passed by this problem ? There is a workaround that I could do in my Load Function? I will appreciate any help!

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Alan Gates
I see a couple blockers that prevent this from being a contrib project of Hadoop: 1) The Hadoop project is actively trying to remove the contrib projects it has, see http://tinyurl.com/6yl25jz. I doubt it's interested in any new ones. 2) The Hadoop project is producing a release every 2

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Alan Gates
Edward, I understand your concern with having a copy of the metastore code in Howl. However, let's separate code from governance. The reason Howl has a copy of Hive's metastore is not because we're proposing it for the Incubator, it is because in the course of developing it over the las

RE: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Santhosh Srinivasan
+1 for Howl as an incubator project. -Original Message- From: Alan Gates [mailto:ga...@yahoo-inc.com] Sent: Wednesday, February 02, 2011 9:17 PM To: user@pig.apache.org Cc: u...@hive.apache.org Subject: Re: [VOTE] Sponsoring Howl as an Apache Incubator project Edward, I understand your