alamb commented on issue #16886: URL: https://github.com/apache/datafusion/issues/16886#issuecomment-3152096527
> Those are noble goals, but i would also add the following > > * it's need to be "executable" (each step of the process needs to be feasible) > * it's need to be decisive (reaching a decision) > * it's need to be finite (running to an end in bounded time) When I ran an engineering team at a company, I would 100% agree with all these items and insisted on them myself. However, after several years in an open source project like Arrow and DataFusion I am not sure they are a great match for this community. I have seen several projects that start as vague statements of goal and then various people provide parts of the implementation / writeups / tickets, etc and it ends up being completed (utf8view being one, porting all functions to be UDFs is another, the most recent Parquet metadata cache from @nuno-faria is shaping up to be similar as well) The other question is who decides if a particular proposal meets those criteria 🤔 Maybe if we say they are best practice that would be enough > * it needs to involve relevant stakeholders (subject matter experts and heavy users of given functional area) > > * of course everyone on the Internet should be able to provide input, but some people are simply more likely to be interested in a topic > > Finiteness is expectedly hard to achieve in a distributed community where everyone follows their own timelines, but this should not come at the cost of missing involving relevant people, i.e. people which input can be reasonably expected to be useful. In general I agree -- I think the key challenges are: 1. Identifying *who* the key people are (some of the core contributors probably know, but figuring this out is a high barrier) 2. Ensuring stakeholders have enough time to provide input if they want, but do not hold up the process indefinitely due to lack of bandwidth Since all communications are public, I think any stakeholder who is interested in a feature already has the opportunity to discover and participate. However, I think we could do a better job making it easier for relevant stakeholders to find proposals -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org