[ https://issues.apache.org/jira/browse/TEZ-1379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14086952#comment-14086952 ]
Bikas Saha commented on TEZ-1379: --------------------------------- We might be trying to do 2 things that are potentially at odds here. 1) Making the API easy to use 2) Reducing the amount of junk that comes in via configuration files. If we simply accept Configuration then we make the API short and easy to use. We can figure out how remove the junk from it later on. Accepting a conf per comparator/partitioner/etc is making the API more cumbersome to use without preventing the users from dumping that same junk in each one of those configuration. If the user is motivated to write extra code to filter out the params per entity then they would be motivated to do that for the global conf too. For someone who is not motivated, there is nothing we can do to avoid the junk being passed in every parameter. So might as well opt for simplicity given the worst cast scenarios for getting junk are the same in both cases. I would prefer having "setComparator(String comparator), setComparator(String comparator, Configuration conf)" etc. and recommend the non-configuration method in javadoc. So that users dont dump configuration on us on every API unless they hit an issue which makes them have to pass the configuration. At a later point, if we decide to allow a filtered global config via setConfiguration() then the individual set configurations can be easily deprecated without affecting everyones code. Exposing the DAG configuration via TezUserPayload may be an efficient way to ease the configuration pain at a later point. > EdgeConfigurers should accept a Partitioner configuration, accept parameters > for compression and secure shuffle > --------------------------------------------------------------------------------------------------------------- > > Key: TEZ-1379 > URL: https://issues.apache.org/jira/browse/TEZ-1379 > Project: Apache Tez > Issue Type: Sub-task > Reporter: Siddharth Seth > Assignee: Siddharth Seth > Priority: Blocker > Attachments: TEZ-1379.1.txt, TEZ-1379.2.txt > > -- This message was sent by Atlassian JIRA (v6.2#6252)