alamb commented on issue #16886:
URL: https://github.com/apache/datafusion/issues/16886#issuecomment-3152096527

   > Those are noble goals, but i would also add the following
   > 
   > * it's need to be "executable" (each step of the process needs to be 
feasible)
   > * it's need to be decisive (reaching a decision)
   > * it's need to be finite (running to an end in bounded time)
   
   When I ran an engineering team at a company, I would 100% agree with all 
these items and insisted on them myself. 
   
   However, after several years in an open source project like Arrow and 
DataFusion I am not sure they are a great match for this community. 
   
   I have seen several projects that start as vague statements of goal and then 
various people provide parts of the implementation / writeups / tickets, etc 
and it ends up being completed (utf8view being one, porting all functions to be 
UDFs is another, the most recent Parquet metadata cache from @nuno-faria is 
shaping up to be similar as well)
   
   The other question is who decides if a particular proposal meets those 
criteria 🤔  Maybe if we say they are best practice that would be enough
   
   > * it needs to involve relevant stakeholders (subject matter experts and 
heavy users of given functional area)
   >   
   >   * of course everyone on the Internet should be able to provide input, 
but some people are simply more likely to be interested in a topic
   >
   > Finiteness is expectedly hard to achieve in a distributed community where 
everyone follows their own timelines, but this should not come at the cost of 
missing involving relevant people, i.e. people which input can be reasonably 
expected to be useful.
   
   In general I agree -- I think the key challenges are:
   1. Identifying *who* the key people are (some of the core contributors 
probably know, but figuring this out is a high barrier)
   2. Ensuring stakeholders have enough time to provide input if they want, but 
do not hold up the process indefinitely due to lack of bandwidth
   
   Since all communications are public, I think any stakeholder who is 
interested in a feature already has the opportunity to discover and 
participate. However, I think we could do a better job making it easier for 
relevant stakeholders to find proposals
   
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to