Re: [VOTE] Abandon hdfsproxy HDFS contrib

2011-02-17 Thread Bernd Fondermann
On Sat, Feb 12, 2011 at 09:18, Roy T. Fielding field...@gbiv.com wrote: On Feb 11, 2011, at 2:28 AM, Bernd Fondermann wrote: On Fri, Feb 11, 2011 at 07:33, Ian Holsman had...@holsman.net wrote: They probably have patched it, and mistakenly forgot to submit them.. any chance of doing a diff on

Re: [VOTE] Abandon mrunit MapReduce contrib

2011-02-17 Thread Bernd Fondermann
On Fri, Feb 11, 2011 at 23:10, Aaron Kimball akimbal...@gmail.com wrote: The main reason I am interested in removing MRUnit from Hadoop is that I believe that MRUnit deserves its own release cycle. I think this is in the best interest of its users. Not in mine, at least. (I'm writing MR unit

Re: Hadoop testing project

2011-02-17 Thread Ian Holsman
I'm not sure it makes sense to all the testing packages under a different umbrella that covers the code they test. While there might be commonalities building a test harness, I would think that each testing tool would need to have deep knowledge of the tool's internals that it is testing. as

Re: [VOTE] Abandon hdfsproxy HDFS contrib

2011-02-17 Thread Ian Holsman
Hi Bernd. On Feb 17, 2011, at 7:43 AM, Bernd Fondermann wrote: We have the very unfortunate situation here at Hadoop where Apache Hadoop is not the primary and foremost place of Hadoop development. Instead, code is developed internally at Yahoo and then contributed in (smaller or larger)

Re: [VOTE] Abandon hdfsproxy HDFS contrib

2011-02-17 Thread Bernd Fondermann
On Thu, Feb 17, 2011 at 14:58, Ian Holsman had...@holsman.net wrote: Hi Bernd. On Feb 17, 2011, at 7:43 AM, Bernd Fondermann wrote: We have the very unfortunate situation here at Hadoop where Apache Hadoop is not the primary and foremost place of Hadoop development. Instead, code is

Re: Hadoop testing project

2011-02-17 Thread Allen Wittenauer
On Feb 16, 2011, at 11:50 AM, Konstantin Boudnik wrote: As Joep said this ...will reduce the effort to take any (set of ) changes from development into production. Take it one step further: when your cluster is 'assembled' you need to validate it (on top of a concrete OS, etc.); is it

Re: [VOTE] Abandon hdfsproxy HDFS contrib

2011-02-17 Thread Allen Wittenauer
On Feb 17, 2011, at 4:43 AM, Bernd Fondermann wrote: To be honest: Hadoop is in the process of falling apart. We can thank the Apache Board for helping there as well. Their high handed interference basically set the project back 6 mos to a year; we're still recovering from the

Re: Hadoop testing project

2011-02-17 Thread Eric Yang
The biggest hurtle in hadoop adoption is that there is no easy way to setup a pseudo cluster on developer's machine. People are steering off course to build additional simulation tools and validation tools. In practice, those tools don't provide nearly enough insight in things that could go

simple map reduce

2011-02-17 Thread Alessandro Binhara
Helo all.. i create a simple map reduce .. to sum a valeu from a input file like : key, value 993, 3 993, 2 333, 2 etc .. like this.. public void map(LongWritable key, Text value, OutputCollectorText, IntWritable output, Reporter reporter) throws IOException { String line =

Re: [VOTE] Abandon mrunit MapReduce contrib

2011-02-17 Thread Aaron Kimball
The MRUnit community is a specific subset of the Hadoop community: Engineers writing Java code running on Hadoop. The Hadoop community also includes IT/ops staff who maintain Hadoop clusters, data scientists who use tools such as Pig Hive, as well as those written by the aforementioned engineers,

Re: Hadoop testing project

2011-02-17 Thread Konstantin Boudnik
Eric. I am sure that packaging of Hadoop and other application working directly with Hadoop is a highly needed thing (although there's always a tricky question how many platforms you plan to provide packaging for, etc.). What we are discussing here is software testing, not packaging nor

Re: Hadoop testing project [Was: [VOTE] Abandon mrunit MapReduce contrib]

2011-02-17 Thread Aaron Kimball
Working to develop code as a client of Hadoop is a path full of landmines. The more tools we can provide to users to improve the quality of their code, the better. I think it is important, though, to draw a clear distinction between tools intended for different audiences. Talking about system

Re: [VOTE] Abandon mrunit MapReduce contrib

2011-02-17 Thread Mattmann, Chris A (388J)
Hey Guys, FYI on this: Eric has mentioned he is going to start the Incubator proposal for MRUnit. Let's start small and then grow big (as needed). It seems like we've achieved enough consensus for the required mentors and critical mass to make an MRUnit Incubator proposal and then to have the

Re: Hadoop testing project [Was: [VOTE] Abandon mrunit MapReduce contrib]

2011-02-17 Thread Konstantin Boudnik
On Thu, Feb 17, 2011 at 11:27, Aaron Kimball akimbal...@gmail.com wrote: Working to develop code as a client of Hadoop is a path full of landmines. The more tools we can provide to users to improve the quality of their code, the better. I think it is important, though, to draw a clear

Re: [VOTE] Abandon mrunit MapReduce contrib

2011-02-17 Thread Patrick Hunt
Chris a page is up (still being created by Eric afaict): http://wiki.apache.org/incubator/MRUnitProposal I took the liberty of listing us as mentors. Patrick On Thu, Feb 17, 2011 at 11:31 AM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Hey Guys, FYI on this: Eric has

Re: [VOTE] Abandon hdfsproxy HDFS contrib

2011-02-17 Thread Konstantin Shvachko
hdfsproxy is a wrapper around hftpFileSystem (in its current state). So you can always replace hdfsproxy with hftpFileSystem. Also it uses pure FileSystem api, so it can successfully be maintained outside of hdfs. Therefore I am +1 removing it from hdfs/contrib. What is the use case for

Re: [VOTE] Abandon hdfsproxy HDFS contrib

2011-02-17 Thread Allen Wittenauer
On Feb 17, 2011, at 1:21 PM, Konstantin Shvachko wrote: hdfsproxy is a wrapper around hftpFileSystem (in its current state). So you can always replace hdfsproxy with hftpFileSystem. Also it uses pure FileSystem api, so it can successfully be maintained outside of hdfs. Therefore I am +1

Re: simple map reduce

2011-02-17 Thread Harsh J
Hello, On Fri, Feb 18, 2011 at 12:17 AM, Alessandro Binhara binh...@gmail.com wrote: I had a another structure .. of input file key, value1, value2, value3 993, 3,2,3 993, 2,1,1 333, 2,2,1 How i can sendo to reduce a list of values, ? To process a list numers and not only one number?

[ANN] HBase 0.90.1 available for download

2011-02-17 Thread Todd Lipcon
The Apache HBase team is happy to announce the general availability of HBase 0.90.1, available from your Apache mirror of choice: http://www.apache.org/dyn/closer.cgi/hbase/ [at the time of this writing, not all mirrors have updated yet -- please pick a different mirror if your first choice does