Hi Joshua,

In theory there's nothing special for you to do. Just issue your query to HBase with tracing enabled. The active span will go through HBase, down into HDFS, and back again. You'll need both systems collecting spans into the same place so that you can report on the complete trace tree.
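
To make the "same place" point concrete, here is a toy model in plain Python. It does not use the HTrace API: Span, Collector, start_span, hbase_get, hdfs_read_block and trace_request are all names made up for illustration, and the real span receivers are configured per system rather than passed around like this. It only shows that a child span reported to a different sink drops out of the tree you build from the first sink.

```python
import itertools
from dataclasses import dataclass
from typing import List, Optional

_ids = itertools.count(1)  # simple id generator for this toy model


@dataclass
class Span:
    trace_id: int
    span_id: int
    parent_id: Optional[int]
    system: str
    name: str


class Collector:
    """Hypothetical stand-in for wherever a system reports its spans."""

    def __init__(self) -> None:
        self.spans: List[Span] = []

    def receive(self, span: Span) -> None:
        self.spans.append(span)


def start_span(sink: Collector, system: str, name: str,
               parent: Optional[Span] = None) -> Span:
    # A child inherits the trace id and records its parent's span id.
    trace_id = parent.trace_id if parent else next(_ids)
    parent_id = parent.span_id if parent else None
    span = Span(trace_id, next(_ids), parent_id, system, name)
    sink.receive(span)
    return span


def hdfs_read_block(hdfs_sink: Collector, parent: Span) -> Span:
    return start_span(hdfs_sink, "hdfs", "readBlock", parent)


def hbase_get(hbase_sink: Collector, hdfs_sink: Collector, parent: Span) -> Span:
    span = start_span(hbase_sink, "hbase", "Get", parent)
    hdfs_read_block(hdfs_sink, span)  # the active span is passed down to HDFS
    return span


def render_tree(spans: List[Span], parent_id: Optional[int] = None,
                depth: int = 0) -> List[str]:
    lines: List[str] = []
    for s in spans:
        if s.parent_id == parent_id:
            lines.append("  " * depth + f"{s.system}:{s.name}")
            lines.extend(render_tree(spans, s.span_id, depth + 1))
    return lines


def trace_request(shared_sink: bool) -> List[str]:
    hbase_sink = Collector()
    hdfs_sink = hbase_sink if shared_sink else Collector()
    root = start_span(hbase_sink, "client", "request")
    hbase_get(hbase_sink, hdfs_sink, root)
    # Report only from the sink the HBase side writes to.
    return render_tree(hbase_sink.spans)


if __name__ == "__main__":
    print("\n".join(trace_request(shared_sink=True)))
    print("---")
    print("\n".join(trace_request(shared_sink=False)))
```

With a shared sink the rendered tree includes the hdfs:readBlock child under hbase:Get. With separate sinks the HDFS span still exists, but it sits in a different collector and is missing from the report.
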
I've not recently tested this end-to-end, but I believe it's all there. If not, it's a bug -- this is an intended use case. Can you give it a try and let us know how it goes?

FYI, 0.99.x are preview releases of HBase and not for production use. Just so you know :)

-n

On Wednesday, February 11, 2015, Chunxu Tang <chunxut...@gmail.com> wrote:

> Hi all,
>
> Now I’m exploiting HTrace to trace request level data flows in HBase and
> HDFS. I have successfully traced HBase and HDFS by using HTrace,
> respectively.
>
> After that, I combine HBase and HDFS together and I want to just send a
> PUT/GET request to HBase, but to trace the whole data flow in both HBase
> and HDFS. In my opinion, when I send a request such as Get to HBase, it
> will at last try to read the blocks on HDFS, so I can construct a whole
> data flow tracing through HBase and HDFS. While, the fact is that I can
> only get tracing data of HBase, with no data of HDFS.
>
> Could you give me any suggestions on how to trace the data flow in both
> HBase and HDFS? Does anyone have similar experience? Do I need to modify
> the source code? And maybe which part(s) should I touch? If I need to
> modify the code, I will try to create a patch for that.
>
> Thank you.
>
> My Configurations:
> Hadoop version: 2.6.0
> HBase version: 0.99.2
> HTrace version: htrace-master
> OS: Ubuntu 12.04
>
>
> Joshua
>