Hi folks 

As part of the Tez APIs, every object (Input / Output / Processor / 
EdgeManagerPlugin / VertexManagerPlugin) can be associated with its own 
user-provided payload to set itself up. The format of this payload is not known 
to Tez, as it could be a Java serialized object, protobuf, XML, etc. In a lot of 
cases, this payload is a serialized configuration object. To make it easy to 
expose these configuration aspects in the Tez UI, a setHistoryText API was added 
to the UserPayload class. 

This history text was meant to be something which could be displayed in the UI. 
To make this more meaningful, I am proposing that we define a common standard 
that everyone using this API can conform to. This will allow the UI to make 
some assumptions about the structure and display the data in a more consumable 
manner. 

A possible standard could be:
   - history text should be a JSON blob representing a JSON object containing 
     key-value pairs.
   - the keys that should be used: 
      - description: value should be a simple string
      - config: value should be an object in the format: 
        { "key1": "value1", "key2": "value2", … }
     
The above is not a final version or the recommended one. Folks like Jonathan 
and Prakash, who are working on TEZ-8, may have stronger recommendations on the 
JSON structure. 
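
To make the discussion concrete, here is a rough sketch of how a history text 
following the structure above could be built. The class and method names 
(HistoryTextSketch, quote, build) are made up for illustration and are not part 
of Tez; the sketch only produces the string and makes no Tez calls. It hand-rolls 
the JSON to stay dependency-free; a real implementation would more likely use a 
JSON library.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Tez: builds a history text string in the
// proposed { "description": ..., "config": { ... } } layout.
public class HistoryTextSketch {

    // Wrap a string in double quotes, escaping what JSON requires.
    public static String quote(String s) {
        StringBuilder sb = new StringBuilder("\"");
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) {
                        // Remaining control characters as \\uXXXX escapes.
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.append('"').toString();
    }

    // Serialize the description and the flat key/value config map.
    public static String build(String description, Map<String, String> config) {
        StringBuilder sb = new StringBuilder();
        sb.append("{\"description\":").append(quote(description));
        sb.append(",\"config\":{");
        boolean first = true;
        for (Map.Entry<String, String> e : config.entrySet()) {
            if (!first) {
                sb.append(',');
            }
            sb.append(quote(e.getKey())).append(':').append(quote(e.getValue()));
            first = false;
        }
        return sb.append("}}").toString();
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps insertion order, so the output is stable.
        Map<String, String> config = new LinkedHashMap<>();
        config.put("key1", "value1");
        config.put("key2", "value2");
        // The resulting string is what would be handed to setHistoryText.
        System.out.println(build("example processor", config));
    }
}
```

The config is kept as a flat string-to-string map here because that matches how 
a serialized configuration object is usually laid out; nested values would need 
a different structure and are left open.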

@Gunther, @Rohini - any comments from the user point of view for exposing 
Hive/Pig information in the Tez UI?

Comments/Questions/Suggestions? 

thanks
-- Hitesh
