Nigel Daley wrote:
So shouldn't fixing this test to conform to the new model in HADOOP-1134 be the concern of the patch for HADOOP-1134?

Yes, but, as it stands, this patch would silently stop working correctly once HADOOP-1134 is committed. It should instead be written in a more robust way, one that can survive expected changes. Relying on HDFS being wrapped in a ChecksumFileSystem isn't as reliable as an explicit constructor that says "I want an unchecksummed FileSystem."
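For concreteness, the fragile pattern in question looks roughly like this (a sketch only; it assumes HDFS currently comes back wrapped in a ChecksumFileSystem and uses its getRawFileSystem() accessor):

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.ChecksumFileSystem;
    import org.apache.hadoop.fs.FileSystem;

    public class RawFsSketch {
      public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);
        // Fragile: depends on HDFS happening to be wrapped in a
        // ChecksumFileSystem. Once HADOOP-1134 moves checksums into
        // DFS itself, the instanceof test fails and the benchmark
        // silently runs against the checksummed path again.
        if (fs instanceof ChecksumFileSystem) {
          fs = ((ChecksumFileSystem) fs).getRawFileSystem();
        }
        System.out.println("Benchmarking against " + fs.getClass().getName());
      }
    }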

As it stands, I can't run NNBench at scale without using a raw file system, which is what this patch is intended to allow.

It seems strange to disable things in an undocumented and unsupported way in order to get a benchmark to complete. How does that prove scalability? Rather, leaving NNBench alone seems like a strong argument for implementing HADOOP-1134 sooner.

Still, if you want to be able to disable checksums, for benchmarks or whatever, we can permit that, but should do so explicitly.
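As a sketch of what "explicitly" could look like, a documented configuration switch consulted when the client opens the filesystem (the property name below is hypothetical, not an existing Hadoop option):

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileSystem;

    public class NoChecksumSketch {
      public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Hypothetical key: an explicit, supported way to request an
        // unchecksummed FileSystem, instead of casting to an
        // implementation detail like ChecksumFileSystem.
        conf.setBoolean("fs.checksum.enabled", false);
        FileSystem fs = FileSystem.get(conf);
        // fs would then skip checksums per the setting, regardless of
        // how checksumming is implemented underneath.
      }
    }

That would keep the benchmark's intent ("no checksums, please") stable across implementation changes like HADOOP-1134.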

HADOOP-928 caused this test to use a ChecksumFileSystem and subsequently we saw our "read" TPS metric plummet from 20,000 to a couple hundred.

Ah, NNBench used the 'raw' methods before, which was kind of sneaky on its part, since it didn't benchmark the typical user experience. Although namenode performance should only halve at worst with checksums as currently implemented (each data file gains a companion checksum file, so namespace operations at most double), no?

Let's get our current benchmark back on track before we commit HADOOP-1134 (which will likely take a while before it is "Patch Available").

I'd argue that we should fix the benchmark to accurately reflect what users see, so that we see real improvement when HADOOP-1134 is committed. That would make it a more useful and realistic benchmark. However, if you believe that a checksum-free benchmark is still useful, I think it should be made more future-proof.

Doug
