Fw: new message

2015-10-10 Thread Allen Wittenauer
Hello! New message, please read <http://gottabesuccessful.com/sleep.php?m> Allen Wittenauer

Fw: read this

2015-09-28 Thread Allen Wittenauer
Hello! New message, please read <http://locksmithflatbush.com/me.php?a66e> Allen Wittenauer

Fw: read this

2015-09-28 Thread Allen Wittenauer
Hello! New message, please read <http://dqpoland.com/dinner.php?c> Allen Wittenauer

Fw: important message

2015-09-16 Thread Allen Wittenauer
Hey friend! Check this out http://somaticyoga.com/how.php?969q5 Allen Wittenauer

Fw: important message

2015-09-16 Thread Allen Wittenauer
Hey friend! Check this out http://krtfm.com/knew.php?6 Allen Wittenauer

Fw: important

2015-09-10 Thread Allen Wittenauer
Hello! Important message, visit http://abcbordados.com.br/family.php?wj28o Allen Wittenauer

Fw: important

2015-09-10 Thread Allen Wittenauer
Hello! Important message, visit http://cjmirra.com/town.php?07m95 Allen Wittenauer

Re: Default Branch on github

2015-01-06 Thread Allen Wittenauer
On Jan 6, 2015, at 9:25 AM, Bobby Evans wrote: > https://github.com/apache/hadoop-commonhas a default branch of some really > old odd branch HADOOP-3628. Any way we can change this to trunk? Anyone who > goes to github looking for hadoop is going to see old/wrong code. > - Bobby The githu

Code guidelines and bash

2014-07-26 Thread Allen Wittenauer
Hey folks: Deep linked by http://wiki.apache.org/hadoop/CodeReviewChecklist is the rule that line length should be ideally maximum 80 chars. (Sun coding guidelines.) In general, it's a good idea and it works for many many languages... Now the caveat. As most of you know, I've been ha

Re: abandoning 22 - was: Content request for 0.20.205 Sustaining Release

2011-09-07 Thread Allen Wittenauer
On Sep 7, 2011, at 9:15 AM, Arun C Murthy wrote: > Using real data helps - from Apache Jira, here are the statistics for work on > trunk/hadoop-0.23 in Q3 of 2011 (i.e. last 2 months alone): > > Hadoop Common - 224 resolved* jiras > Hadoop HDFS - 153 resolved* jiras > Hadoop MapReduce - 161* res

Re: best way to replace disks on a small cluster

2011-09-07 Thread Allen Wittenauer
On Sep 7, 2011, at 6:19 AM, Marco Cadetg wrote: > Current situation: > 3 slaves with each two 320GB disks in RAID 1. All the disks show high read > errors and io throughput has gone below 5Mb/s without running any hadoop > job. (It looks like it will fall apart soon...) One the special c

Re: Dedicated disk for operating system

2011-08-10 Thread Allen Wittenauer
On Aug 10, 2011, at 7:56 AM, Evert Lammerts wrote: > A short, slightly off-topic question: > >> Also note that in this configuration that one cannot take >> advantage of the "keep the machine up at all costs" features in newer >> Hadoop's, which require that root, swap, and the log area be

Re: Dedicated disk for operating system

2011-08-10 Thread Allen Wittenauer
On Aug 10, 2011, at 2:22 AM, Oded Rosen wrote: > Hi, > What is the best practice regarding disk allocation on hadoop data nodes? > We plan on having multiple storage disks per node, and we want to know if we > should save a smaller, separate disk for the os (centos). > Is it the suggested config

Re: hadoop JARs not in lib/ directory of layout

2011-08-04 Thread Allen Wittenauer
On Aug 4, 2011, at 11:03 AM, Alejandro Abdelnur wrote: > What is the rationale for having the hadoop JARs outside of the lib/ > directory? > > It would definitely simplify packaging configuration if they are under lib/ > as well. > > Any objection to it? It needs a big release note as

Re: Hadoop 0.22 update

2011-08-02 Thread Allen Wittenauer
On Aug 2, 2011, at 12:19 PM, wrote: > 1. Cut a release 0.22.0 without mapreduce-2178 patch, with > hadoop.security.authentication set to simple (I.e. No authentication). > Make sure that MR-2178 is highlighted as known-issue in the top-level dir > (create a KNOWN-ISSUES.txt ?), and carry out a

Re: Hadoop build machine donations?

2011-08-02 Thread Allen Wittenauer
On Aug 1, 2011, at 11:22 PM, Nigel Daley wrote: > Ideally the hardware is: > * hosted and OS managed by the donor, > * publicly addressable on the internet, > * running Ubuntu or CentOS, and > * sudo access can be given to Apache's Jenkins admins so they can create > accounts for committers as

Re: How Hadoop is deployed

2011-07-31 Thread Allen Wittenauer
On Jul 30, 2011, at 9:52 PM, Nan Zhu wrote: > So, will the hadoop system co-exist with other workload in your data > centers? No.

Re: MR1 next steps

2011-07-25 Thread Allen Wittenauer
On Jul 25, 2011, at 11:59 AM, Eli Collins wrote: > Note! MR2 supports the current job > API - users don't need to rewrite their jobs to run on MR2 - this is > about the MR *implementation* not job compatibility. Note that the > move to MR2 will affect some APIs (eg metrics, contrib projects that

Re: [VOTE] Abandon the HDFS proxy contrib component

2011-07-20 Thread Allen Wittenauer
On Jul 20, 2011, at 2:55 PM, Eli Collins wrote: > > I don't think we should remove HDFSProxy because Hoop exists, I think > we should remove it because it's broken and not being maintained. > Broken and un-maintained code should be removed no? That'd remove large chunks of hadoop.

Re: [VOTE] Abandon the HDFS proxy contrib component

2011-07-20 Thread Allen Wittenauer
On Jul 20, 2011, at 2:26 PM, Eli Collins wrote: > > Btw, Alejandro is contributing a full HDFS proxy replacement (also > supports read/write, kerberos, etc) in HDFS-2178. Which is basically blocked by Alfredo's lack of security review.

Re: Hadoop´s Internationalization

2011-07-14 Thread Allen Wittenauer
On Jul 2, 2011, at 7:23 AM, Steve Loughran wrote: > On 01/07/2011 17:23, Eric Baldeschwieler wrote: >> Unique error codes sounds like a very good place to start! > > > we could call them "URLs" and have things at the end of them > http://wiki.apache.org/hadoop/ConnectionRefused This i

Re: hadoop-0.23

2011-07-13 Thread Allen Wittenauer
On Jul 13, 2011, at 6:01 PM, Eli Collins wrote: > > In order to support HA in a dot release we'll need to merge in the > branch for HDFS-1623, but that shouldn't hold up branching for 23. > Sanjay mentioned this as the summit but I wanted to double check with > you, you support a dot release of 2

Emeritus

2011-07-11 Thread Allen Wittenauer
What is the policy for cleansing the PMC and marking them as emeritus? How much dead weight does there need to be before it gets pruned?

Re: HDFS-1623 branching strategy

2011-07-08 Thread Allen Wittenauer
On Jul 8, 2011, at 2:03 PM, sanjay Radia wrote: > > > After we posted the design document, some HDFS contributors who participated > in the Jira reached out > to have a high bandwidth discussion to get clarification on the design > document in HDFS 1623. Was the public invited to th

Re: HDFS-1623 branching strategy

2011-07-08 Thread Allen Wittenauer
On Jul 8, 2011, at 12:22 PM, Suresh Srinivas wrote: > Thanks Aaron for sending the details from the meeting that Sanjay > had arranged. What meeting?

Re: [RESULT] Powered by Logo

2011-06-24 Thread Allen Wittenauer
On Jun 24, 2011, at 9:47 AM, Eli Collins wrote: > > Off list I said I thought any of the logos were acceptable choices > modulo some design tweaks, and that if the circle logo concept is > accepted it would be good to adjust the circle itself. But the PMC > did not vote for the circle logo, so i

Re: [DISCUSSION] Thinking about 20.204 and beyond

2011-06-23 Thread Allen Wittenauer
On Jun 23, 2011, at 5:47 AM, Steve Loughran wrote: > On 22/06/2011 17:27, Allen Wittenauer wrote: >> >> On Jun 22, 2011, at 2:30 AM, Steve Loughran wrote: >>> >>> I haven't even heard of anyone who owns up to moving to ext4 fs underneath. >> >&g

Re: Hadoop Java Versions

2011-06-22 Thread Allen Wittenauer
On Jun 22, 2011, at 1:27 PM, Scott Carey wrote: > "Problems have been reported with Hadoop, the 64-bit JVM and Compressed > Object References (the -XX:+UseCompressedOops option), so use of that > option is discouraged." > > I think the above is dated. It also lacks critical information. What JV

Re: [DISCUSSION] Thinking about 20.204 and beyond

2011-06-22 Thread Allen Wittenauer
On Jun 22, 2011, at 2:30 AM, Steve Loughran wrote: > > I haven't even heard of anyone who owns up to moving to ext4 fs underneath. Yes you do. :D

Re: Large startup time in remote MapReduce job

2011-06-21 Thread Allen Wittenauer
On Jun 21, 2011, at 1:31 PM, Harsh J wrote: > Gabor, > > If your jar does not contain code changes that need to get transmitted > every time, you can consider placing them on the JT/TT classpaths ... which means you get to bounce your system every time you change code. > and > not do

Re: [DISCUSSION] Thinking about 20.204 and beyond

2011-06-21 Thread Allen Wittenauer
On Jun 21, 2011, at 12:04 PM, Ted Dunning wrote: > For the pain in doing this, it is probably better to just drop $10 and bring > up a nice EC2 cluster with 10 m1.large instances using spot pricing for 5 > hours. Testing on non-Intel, non-Linux is something we need to do more of. Ama

Re: [DISCUSSION] Thinking about 20.204 and beyond

2011-06-21 Thread Allen Wittenauer
On Jun 21, 2011, at 4:39 AM, Steve Loughran wrote: > If I can actually bring up a heterogenous cluster here I believe there was a post in one of the mailing lists in the past 6 months where someone tried a mixed endian grid. It blew up big time.

Re: Thinking about the next hadoop mainline release

2011-06-17 Thread Allen Wittenauer
On Jun 17, 2011, at 7:02 PM, Rajiv Chittajallu wrote: > Allen Wittenauer wrote on 06/17/11 at 13:27:43 -0700: >> >> Actually, I was just reminded about the complete disaster that is >> metrics. So while it may be pseudo-stable, it isn't actually usabl

Re: Thinking about the next hadoop mainline release

2011-06-17 Thread Allen Wittenauer
On Jun 17, 2011, at 10:31 AM, Allen Wittenauer wrote: > > On Jun 17, 2011, at 12:17 AM, Eric Baldeschwieler wrote: >> Yahoo stands ready to help us (the Apache Hadoop Community) turn this new >> release into a stable release by running it through its 9 month test and >&g

Re: Thinking about the next hadoop mainline release

2011-06-17 Thread Allen Wittenauer
On Jun 17, 2011, at 12:36 AM, Ryan Rawson wrote: > HDFS-918 and HDFS-347 are absolutely critical for random read > performance. The smarter sites are already running HDFS-347 (I guess > they aren't running "Hadoop" then?), and soon they will be testing and > running HDFS-918 as well. Opening 1

Re: Thinking about the next hadoop mainline release

2011-06-17 Thread Allen Wittenauer
On Jun 17, 2011, at 12:17 AM, Eric Baldeschwieler wrote: > Yahoo stands ready to help us (the Apache Hadoop Community) turn this new > release into a stable release by running it through its 9 month test and burn > in process. The result of that will be another stable release such as 0.18, > 0

Re: [VOTE] Powered by Logo

2011-06-15 Thread Allen Wittenauer
On Jun 15, 2011, at 1:44 AM, Ted Dunning wrote: > 4, 2, 6 > > Yes. That isn't enough votes, but I think that the other logos don't cut > the mustard for various reasons. 5 is a recycled product logo and I don't > think the others make the required visual case. +1 4,2,6

Re: [VOTE] Shall we adopt the "Defining Hadoop" page

2011-06-14 Thread Allen Wittenauer
On Jun 14, 2011, at 6:45 PM, Eli Collins wrote: > Are we really going to go after all the web companies that patch in an > enhancement to their current Hadoop build and tell them to stop saying > that they are using Hadoop? You've patched Hadoop many times, should > your employer not be able to s

Re: [VOTE] Shall we adopt the "Defining Hadoop" page

2011-06-14 Thread Allen Wittenauer
On Jun 14, 2011, at 5:48 PM, Eli Collins wrote: > In short, an Apache Hadoop release with a backport of PMC approved > code or critical security fix is not powered by Hadoop, it is Hadoop, > while a new product that contains or runs atop Hadoop is powered by > Hadoop. > > Reasonable? I'd

Hadoop Java Versions

2011-06-14 Thread Allen Wittenauer
While we're looking at the wiki, could folks update http://wiki.apache.org/hadoop/HadoopJavaVersions with whatever versions of Hadoop they are using successfully? Thanks. P.S., yes, I'm thinking about upgrading ours. :p

Re: [VOTE] Shall we adopt the "Defining Hadoop" page

2011-06-14 Thread Allen Wittenauer
On Jun 14, 2011, at 3:56 PM, Owen O'Malley wrote: > All, > Steve Loughran has done some great work on defining what can be called > Hadoop at http://wiki.apache.org/hadoop/Defining%20Hadoop. After some cleanup > from Noirin and Shane, I think we've got a really good base. I'd like a vote > t

Re: LimitedPrivate and HBase

2011-06-13 Thread Allen Wittenauer
On Jun 9, 2011, at 10:27 AM, Tom White wrote: > > We could also change the remaining 6 cases of LimitedPrivate to Public > (note that they are already annotated Evolving or Unstable), and > deprecate LimitedPrivate. Would this allay people's concerns? Thanks for doing the search Tom. Rather th

Re: LimitedPrivate and HBase (thoughts from an observer)

2011-06-08 Thread Allen Wittenauer
On Jun 8, 2011, at 6:53 AM, Doug Meil wrote: > > Re: "How "closely related" does a project need to be to get this privilege?" > / " What is the criteria by which an API gets opened to something outside of > the Hadoop umbrella" > > Given the context of the original question, is this debate rea

Re: LimitedPrivate and HBase

2011-06-06 Thread Allen Wittenauer
On Jun 6, 2011, at 6:23 PM, Todd Lipcon wrote: > > Nah, I just think these "meta discussions" waste an awful lot of time that's > better spent making real progress on the code, or reviewing the complex > changes where extra eyes really make a big difference. OK. That's make it easier t

Re: LimitedPrivate and HBase

2011-06-06 Thread Allen Wittenauer
On Jun 6, 2011, at 6:08 PM, Todd Lipcon wrote: > >> >> Let's face it: this happened because it was HBase. If it was almost >> anyone else, it would have sat there and *that's* the point where I'm >> mainly concerned. > > > If you want to feel better, take a look at HDFS-941, HDFS-34

Re: LimitedPrivate and HBase

2011-06-06 Thread Allen Wittenauer
On Jun 6, 2011, at 5:56 PM, Todd Lipcon wrote: > Or because this is the sort of thing that could take weeks of discussion or > just 5 minutes to unblock HBase from moving on to trunk. I'd rather have the > weeks of discussion *after* the 5 minute patch, so people can continue to > make progress.

Re: LimitedPrivate and HBase

2011-06-06 Thread Allen Wittenauer
On Jun 6, 2011, at 4:22 PM, Andrew Purtell wrote: > > Perhaps opening a jira for a cleaner framework for HttpServer extension could > be useful? Sure. That's probably what should have happened to begin with rather than the quickly changing the API to a different classification. I was

Re: LimitedPrivate and HBase

2011-06-06 Thread Allen Wittenauer
On Jun 6, 2011, at 11:34 AM, Stack wrote: > On Mon, Jun 6, 2011 at 9:45 AM, Allen Wittenauer wrote: >> >> >>I have some concerns over the recent usage of LimitedPrivate being >> opened up to HBase. Shouldn't HBase really be sticking to public APIs &g

Re: LimitedPrivate and HBase

2011-06-06 Thread Allen Wittenauer
On Jun 6, 2011, at 10:00 AM, Todd Lipcon wrote: > On Mon, Jun 6, 2011 at 9:45 AM, Allen Wittenauer wrote: > >> >> >> I have some concerns over the recent usage of LimitedPrivate being >> opened up to HBase. Shouldn't HBase really be sticking t

LimitedPrivate and HBase

2011-06-06 Thread Allen Wittenauer
I have some concerns over the recent usage of LimitedPrivate being opened up to HBase. Shouldn't HBase really be sticking to public APIs rather than poking through some holes? If HBase needs an API, wouldn't other clients as well?

Re: Update on 0.22

2011-06-02 Thread Allen Wittenauer
On Jun 2, 2011, at 11:06 AM, Konstantin Shvachko wrote: > I propose just to make them blockers before committing to attract attention > of the release manager and get his approval. The traditional response has almost always been that they get changed to non-blockers before release. One person'

Re: Update on 0.22

2011-06-01 Thread Allen Wittenauer
On Jun 1, 2011, at 1:50 PM, Eric Baldeschwieler wrote: > makes sense to me, but it might be good to work to make these decisions > visible so folks can understand what is happening. lol

Re: Defining Hadoop Compatibility -revisiting-

2011-05-16 Thread Allen Wittenauer
On May 16, 2011, at 2:29 PM, Eli Collins wrote: > On Mon, May 16, 2011 at 2:25 PM, Allen Wittenauer wrote: >> >> On May 16, 2011, at 2:09 PM, Eli Collins wrote: >>> >>> Allen, >>> >>> There are few things in Hadoop in CDH that are not in

Re: Defining Hadoop Compatibility -revisiting-

2011-05-16 Thread Allen Wittenauer
On May 16, 2011, at 2:09 PM, Eli Collins wrote: > > Allen, > > There are few things in Hadoop in CDH that are not in trunk, > branch-20-security, or branch-20-append. The stuff in this category > is not major (eg HADOOP-6605, better JAVA_HOME detection). But that's my point: when is i

Re: Acceptance tests

2011-05-16 Thread Allen Wittenauer
On May 16, 2011, at 11:03 AM, Evert Lammerts wrote: > Hi all, > > What acceptance tests are people using when buying clusters for Hadoop? Any > pointers to relevant methods? We get some test nodes from various manufacturers. We do some raw IO benchmarking vs. our other nodes. We ad

Re: Defining Hadoop Compatibility -revisiting-

2011-05-16 Thread Allen Wittenauer
On May 16, 2011, at 5:00 AM, Segel, Mike wrote: > X represents the set of stable releases. > Y represents the set of available patches. > C represents the set of Cloudera releases. > > So if C contains a release X(n) plus a set of patches that is contained in Y, > Then does it not have the right

Re: Defining Hadoop Compatibility -revisiting-

2011-05-13 Thread Allen Wittenauer
On May 13, 2011, at 3:53 PM, Ted Dunning wrote: > But "distribution Z includes X" kind of implies the existence of some such > that X != Y, Y != empty-set and X+Y = Z, at least in common usage. > > Isn't that the same as a non-trunk change? > > So doesn't this mean that your question reduces to

Re: Defining Hadoop Compatibility -revisiting-

2011-05-13 Thread Allen Wittenauer
On May 13, 2011, at 3:16 PM, Doug Cutting wrote: > On 05/14/2011 12:13 AM, Allen Wittenauer wrote: >> So what do we do about companies that release a product that says "includes >> Apache Hadoop" but includes patches that aren't committed to trunk? > > We ye

Re: Defining Hadoop Compatibility -revisiting-

2011-05-13 Thread Allen Wittenauer
On May 13, 2011, at 2:55 PM, Doug Cutting wrote: > On 05/13/2011 07:28 PM, Allen Wittenauer wrote: >> If it has a modified version of Hadoop (i.e., not an actual Apache >> release or patches which have never been committed to trunk), are >> they allowed to say "includ

Re: Defining Hadoop Compatibility -revisiting-

2011-05-13 Thread Allen Wittenauer
On May 13, 2011, at 1:53 AM, Doug Cutting wrote: > Here "certified" is probably just intended to mean that the software > uses a "certified" open source license, e.g., listed at > http://www.opensource.org/licenses/. However they should say that this > "includes" or "contains" the various Apache

Re: Defining Hadoop Compatibility -revisiting-

2011-05-12 Thread Allen Wittenauer
On May 12, 2011, at 2:23 AM, Steve Loughran wrote: > I think Sun NFS might be a good example of similar defacto standard, or MS > SMB -it is up to others to show they are compatible with what is effective > the reference implementation. Being closed source, there is no option for > anyone to in

Re: Cleaning up documentation from website

2011-05-11 Thread Allen Wittenauer
On May 11, 2011, at 1:20 PM, Owen O'Malley wrote: > We haven't cleaned up the version documentation in a long time: > > lrwxrwxr-x 1 omalley hadoop 11 May 11 20:14 current -> r0.20.203.0 ^--- this is a much more interesting discussion. (What does "current" mean now?)

Re: Stability issue - dead DN's

2011-05-11 Thread Allen Wittenauer
On May 11, 2011, at 5:57 AM, Eric Fiala wrote: > > If we do the math that means [ map.tasks.max * mapred.child.java.opts ] + > [ reduce.tasks.max * mapred.child.java.opts ] => or [ 4 * 2.5G ] + [ 4 * > 2.5G ] is greater than the amount of physical RAM in the machine. > This doesn't account for t

Re: [VOTE] Release candidate 0.20.203.0-rc1

2011-05-07 Thread Allen Wittenauer
On May 6, 2011, at 11:18 PM, Milind Bhandarkar wrote: > Allen, there are per job limits, and per user limits in this branch. (So, > max capacity of -1 is for the queue, but within the queue, the per user > limits come into picture.) If I remember right, the defaults were based on > a certain assu

Re: [VOTE] Release candidate 0.20.203.0-rc1

2011-05-06 Thread Allen Wittenauer
On May 6, 2011, at 6:43 PM, Todd Papaioannou wrote: > Allen, > > Can you provide some more details into what issues you are seeing with the > capacity scheduler? Is it just the docs don't match the code, or are you > seeing real issues with job scheduling? Jobs are definitely not gettin

Re: [VOTE] Release candidate 0.20.203.0-rc1

2011-05-06 Thread Allen Wittenauer
On May 5, 2011, at 1:56 PM, Jakob Homan wrote: > +1 > > Downloaded, verified, tested on single node cluster to my > satisfaction. We've also brought this release up on a sizable cluster > and checked its basic sanity. All of you people doing single node tests are missing stuff. For e

Re: [VOTE] Release candidate 0.20.203.0-rc1

2011-05-04 Thread Allen Wittenauer
On May 4, 2011, at 10:31 AM, Owen O'Malley wrote: > Here's an updated release candidate for 0.20.203.0. I've incorporated the > feedback and included all of the patches from 0.20.2, which is the last > stable release. I also fixed the eclipse-plugin problem. > > The candidate is at: http://pe

Re: [VOTE] Abandon hod Common contrib

2011-04-21 Thread Allen Wittenauer
On Feb 11, 2011, at 11:48 AM, Allen Wittenauer wrote: > > On Feb 10, 2011, at 6:51 PM, Nigel Daley wrote: > >> I think the PMC should abandon the hod common contrib component. It's last >> meaningful contribution was February 2009: >> >> HADOOP-289

Re: [ANNOUNCE] Hadoop Common/HDFS/MapReduce Committer: Koji Noguchi

2011-04-13 Thread Allen Wittenauer
On Apr 13, 2011, at 4:16 PM, Tsz Wo (Nicholas), Sze wrote: > Hi all, > > The Hadoop PMC has elected Koji Noguchi as a committer of Hadoop > Common/HDFS/MapReduce and he has accepted. Welcome aboard Koji! Yay! Congrats! Now there are even more people who Nicolas can attack ab

Re: Proposal: Further Project Split(s)

2011-04-01 Thread Allen Wittenauer
On Apr 1, 2011, at 11:41 AM, Konstantin Boudnik wrote: > On Fri, Apr 1, 2011 at 08:26, Nigel Daley wrote: >> -1+2. This could potentially allow us to replace Jenkins with Hadoop for >> our build and test infrastructure. That would be awesome! > > Has anyone checked a calendar lately?

Re: Proposal: Further Project Split(s)

2011-04-01 Thread Allen Wittenauer
On Apr 1, 2011, at 1:57 AM, Chris Douglas wrote: > Experience developing Hadoop has shown that we not only need to > partition our projects for more active releases, but we also should > explore speculative project splits. For this, a Hadoop.next() project > should track the development of a proj

Re: Apache Sonar - http://analysis.apache.org/ does anyone want hadoop in there?

2011-03-03 Thread Allen Wittenauer
On Mar 3, 2011, at 10:13 AM, Allen Wittenauer wrote: > > On Mar 3, 2011, at 5:59 AM, Ian Holsman wrote: > >> I just discovered this Apache site, >> Apache Sonar performs source code analysis on the java code, and it looks >> pretty, and I'm sure

Re: Apache Sonar - http://analysis.apache.org/ does anyone want hadoop in there?

2011-03-03 Thread Allen Wittenauer
On Mar 3, 2011, at 5:59 AM, Ian Holsman wrote: > I just discovered this Apache site, > Apache Sonar performs source code analysis on the java code, and it looks > pretty, and I'm sure some people would find it useful. Is it actually returning for anyone else? I get a time out.

Re: Build/test infrastructure

2011-02-28 Thread Allen Wittenauer
On Feb 26, 2011, at 7:10 PM, Konstantin Boudnik wrote: > BTW, Puppet and Chef recipes are very widely used by all sorts of Ops and > cluster management companies. Perhaps, Maven and shell too - I'm not in a > position to make a judgement call. I'll let Y! Grid Ops to comment on it - > they know ev

Re: [VOTE] Abandon hdfsproxy HDFS contrib

2011-02-18 Thread Allen Wittenauer
On Feb 18, 2011, at 2:11 AM, Bernd Fondermann wrote: > I don't know how many Y-employees are working on H internally. Only > the contributors can sort that out. Did Carol Bartz run over your puppy or something? You don't appear to realize that pretty much all the major companies that ho

Re: [VOTE] Abandon hdfsproxy HDFS contrib

2011-02-17 Thread Allen Wittenauer
On Feb 17, 2011, at 1:21 PM, Konstantin Shvachko wrote: > hdfsproxy is a wrapper around hftpFileSystem (in its current state). > So you can always replace hdfsproxy with hftpFileSystem. > Also it uses pure FileSystem api, so it can successfully be maintained > outside of hdfs. > > Therefore I am

Re: [VOTE] Abandon hdfsproxy HDFS contrib

2011-02-17 Thread Allen Wittenauer
On Feb 17, 2011, at 4:43 AM, Bernd Fondermann wrote: > To be honest: Hadoop is in the process of falling apart. We can thank the Apache Board for helping there as well. Their high handed interference basically set the project back 6 mos to a year; we're still recovering from the genera

Re: Hadoop testing project

2011-02-17 Thread Allen Wittenauer
On Feb 16, 2011, at 11:50 AM, Konstantin Boudnik wrote: > As Joep said this "...will reduce the effort to take any (set of ) changes > from development into production." Take it one step further: when your cluster > is 'assembled' you need to validate it (on top of a concrete OS, etc.); is it > de

Re: Hadoop 0.22 Blockers

2011-02-12 Thread Allen Wittenauer
On Feb 11, 2011, at 9:44 AM, Nigel Daley wrote: > > On Feb 11, 2011, at 9:41 AM, Allen Wittenauer wrote: > >> >> On Feb 10, 2011, at 11:33 PM, Nigel Daley wrote: >> >>> Tom has created a public Jira filter for 0.22 blockers (thanks Tom!): >&

Re: [VOTE] Abandon hod Common contrib

2011-02-11 Thread Allen Wittenauer
On Feb 10, 2011, at 6:51 PM, Nigel Daley wrote: > I think the PMC should abandon the hod common contrib component. It's last > meaningful contribution was February 2009: > > HADOOP-2898. Provide an option to specify a port range for Hadoop services > provisioned by HOD. Contributed by Peeyush

Re: Hadoop 0.22 Blockers

2011-02-11 Thread Allen Wittenauer
On Feb 10, 2011, at 11:33 PM, Nigel Daley wrote: > Tom has created a public Jira filter for 0.22 blockers (thanks Tom!): > https://issues.apache.org/jira/secure/IssueNavigator.jspa?mode=hide&requestId=12313687 > With the new version of Jira, this single query across projects & versions is > now

Re: [VOTE] Abandon failmon Common contrib

2011-02-09 Thread Allen Wittenauer
On Feb 9, 2011, at 9:18 AM, Nigel Daley wrote: > I think the PMC should abandon the failmon common contrib component. It's > last meaningful contribution was it's original commit in August of 2008: Are there any patches in the patch queue? When it comes to contrib, last commit

Re: blog.hadoop.com

2011-02-07 Thread Allen Wittenauer
On Feb 7, 2011, at 1:06 PM, Doug Cutting wrote: > On 02/07/2011 12:37 PM, Chris Douglas wrote: >> Is the implication that blog.hadoop.com would point there? -C > > Sure, I'd be happy to redirect that CNAME. The other redirects for > hadoop.com already point to hadoop.apache.org. I grabbed the

Re: [DISCUSS] Move common, hdfs, mapreduce contrib components to apache-extras.org or elsewhere

2011-02-01 Thread Allen Wittenauer
On Jan 31, 2011, at 3:23 PM, Todd Lipcon wrote: > On Sun, Jan 30, 2011 at 11:19 PM, Owen O'Malley wrote: > >> >> Also note that pushing code out of Hadoop has a high cost. There are at >> least 3 forks of the hadoop-gpl-compression code. That creates a lot of >> confusion for the users. A lot

Re: MRUnit

2011-01-31 Thread Allen Wittenauer
On Jan 30, 2011, at 7:59 PM, Nigel Daley wrote: > +1. I just started a thread on moving all components out of contrib. Pushing them randomly to the web 2.0 version of Sourceforge (where they will never be seen from again) doesn't sound like a decent, long term strategy.

Re: [DISCUSS] Hadoop Security Release off Yahoo! patchset

2011-01-18 Thread Allen Wittenauer
On Jan 17, 2011, at 2:56 PM, Eric Baldeschwieler wrote: > > Right now you don't have the choice of an Apache release if you are looking > for a stabilized modern version of Hadoop. Can we ratchet down the hyperbole to at least a point where I don't want to vomit? Thanks. [F

Re: bringing the codebases back in line

2010-10-23 Thread Allen Wittenauer
On Oct 23, 2010, at 2:28 AM, Bernd Fondermann wrote: > On 23.10.10 05:40, Owen O'Malley wrote: >> >> The current plan of record is to make >> cut a branch next month, stabilize it, and release it. > > I'd like to revisit the mailing list thread where this decision was made. Can > you point me

Re: bringing the codebases back in line

2010-10-21 Thread Allen Wittenauer
On Oct 21, 2010, at 5:51 PM, Eli Collins wrote: > > - The packaging is Linux specific, we've gotten push back when trying > to contribute modifications upstream with Linuxisms since Apache > supports non-Linux platforms (namely Solaris). Oh come now Eli. Just say it: I push everyone rea

Re: bringing the codebases back in line

2010-10-21 Thread Allen Wittenauer
On Oct 21, 2010, at 2:53 PM, Ian Holsman wrote: > yep.. I've heard it's a source of contention... Sure. Maybe like 8 months ago to anyone who was paying attention. > In discussing it with people, I've heard that a major issue (not the only > one i'm sure) is lack of resources to actuall

Re: bringing the codebases back in line

2010-10-21 Thread Allen Wittenauer
On Oct 21, 2010, at 12:13 PM, Ian Holsman wrote: > Hi guys. > > I wanted to start a conversation about how we could merge the the cloudera + > yahoo distribtutions of hadoop into our codebase, > and what would be required. *grabs popcorn*

Re: Checking current configuration values?

2010-10-18 Thread Allen Wittenauer
If you just want a quick visual inspection, you can use the jobconf.jsp by clicking on the xml at the top of a job page in the jobtracker. On Oct 18, 2010, at 3:21 PM, Philip Zeyliger wrote: > Trunk has a configuration servlet (/conf) that exposes configuration. I > think it's in CDH3b3 (Clou

Re: name node and secondary name node configuration query

2010-10-16 Thread Allen Wittenauer
On Oct 16, 2010, at 5:54 AM, Mag Gam wrote: > Hello, > I am trying to trim down my configuration files to avoid confusion. > > If I were to setup something like this: 1 name node (192.168.0.1) , 1 > secondary name node (192.168.0.2), and 10 clients. The name node and > secondary namenode will no

Re: apache commons configuration

2010-09-02 Thread Allen Wittenauer
On Sep 2, 2010, at 11:53 AM, Bob Li wrote: > Hi experts, > > For hadoop 0.20.2, we have following task tracker configuration > >mapred.local.dir >/data/mapred/local > > > It seems not what I expected -- the directory /tmp/hadoop-user/mapred/local > was used. > > Any suggestion? Co

Re: [DISCUSS] Hadoop Security Release off Yahoo! patchset

2010-08-25 Thread Allen Wittenauer
On Aug 25, 2010, at 10:46 AM, Hemanth Yamijala wrote: > I do agree that this would be very useful for folks who want security > sooner. And the fact that Yahoo! have been running it at scale for a > good while now is also assuring. As has been mentioned a few times, part of the security features

Re: Branching and testing strategy for 0.22

2010-08-23 Thread Allen Wittenauer
On Aug 23, 2010, at 3:19 PM, Owen O'Malley wrote: > Are there any concerns? Just that 0.21 isn't even out of RC yet and that patches to fix it may get missed.

Re: Data Block Size ?

2010-07-15 Thread Allen Wittenauer
On Jul 15, 2010, at 11:40 AM, Syed Wasti wrote: > Will it matter what the data block size is ? Yes. > It is recommended to have a block size of 64 MB, but if we want to have the > data block size to 128 MB, should this effect the performance ? Yes. FWIW, we run with 128MB. > Does the size

Re: jni files

2010-07-09 Thread Allen Wittenauer
On Jul 9, 2010, at 2:09 AM, amit kumar verma wrote: > I think I got a solution. As I read more about hadoop and JNI, I learned that > I need to copy jni files to HADOOP_INSTALLATION_DIR//lib/native/Linux-xxx-xxx. lib/native/xxx are for the native compression libraries. They are not for user-le

Re: jni files

2010-07-08 Thread Allen Wittenauer
On Jul 8, 2010, at 1:08 AM, amit kumar verma wrote: > DistributedCache.addCacheFile("hdfs://* > /192.168.0.153:50075*/libraries/mylib.so.1#mylib.so", conf); Do you actually have asterisks in this? If so, that's the problem.

Re: Why single thread for HDFS?

2010-07-05 Thread Allen Wittenauer
On Jul 5, 2010, at 5:01 PM, elton sky wrote: > Well, this sounds good when you have many small files, you concat() them > into a big one. I am talking about split a big file into blocks and copy all > a few blocks in parallel. Basically, your point is that hadoop dfs -cp is relatively slow and co

Re: What are uses of taskTracker and JobTracker services?

2010-06-29 Thread Allen Wittenauer
On Jun 29, 2010, at 8:07 AM, Sarah kho wrote: > Hi, > > Can you please let me know what are tasks that the taskTracker and > JobTracker performs? Pretty much the entirety of the MapReduce framework. You can think of it this way: HDFS <--> MR NameNode <--> JobTracker DataNode <--> TaskTracke

  1   2   >