Hi Hao,

You may want to setup a full Hadoop 'one box' install and start running the 
TestDFSIO, Teragen and Terasort (with a small amount of data) scenarios using 
the injected jar to run hdfs.  That may be the fastest and best way to get up 
and running.

When it comes to fault injection the valuable issues that you find may be found 
using the end to end tests versus targeted api calls.  It's much more 
realistic. It's a much stronger case to get something fixed / fix something 
when you can point to the end to end scenario, showing how it can fail.   
That's where you find the good ones.

You may also want to consider what a data validation oracle would look like 
that would allow you to do data validation to catch data corruption issues. If 
you do go further and create a scenario that, say generates a non-random / 
repeatable sequence that reads/writes to HDFS, you could use the fault 
injection and the oracle would validate data correctness. 

You'd get extra bonus points if you packaged things up and contributed your 
work to Bigtop so others could benefit long term.

~Brad

-----Original Message-----
From: Hao Yang [mailto:hao.yang0...@gmail.com] 
Sent: Monday, November 28, 2011 9:44 PM
To: common-dev@hadoop.apache.org
Subject: how to write hadoop client to test HDFS with fault injection

Hi, all:

I am a graduate student, now working on HDFS fault injection course project. I 
used FI framework to create aspects, and hdfs-fi.jar was generated. How can I 
write Hadoop Client code to test the injection? Thank you.


Thank you very much for your time.


Best regards
Hao Yang

Reply via email to