timsaucer opened a new issue, #1278: URL: https://github.com/apache/datafusion-python/issues/1278
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** This is a follow on to https://github.com/apache/datafusion-python/issues/1273 There are a number of places where it would be convenient to pass SQL strings as expressions. For example, it would be nice to do ``` df.select( "a", "a - b", col("c"), ) ``` This should intuitively know that we are getting column `a`, followed by `col("a") - col("b")` followed by column 'c'. **Describe the solution you'd like** Using the sql parsing on the DataFrame make the following functions handle SQL strings. We must be *very* careful that we do not break things like cases where users have a column name that is not SQL parseable. **Describe alternatives you've considered** Status quo **Additional context** DataFrame functions to update: - [ ] select - [ ] remove `select_exprs` - [ ] with_column - [ ] with_columns - [ ] aggregate - [ ] repartition_by_hash We do **not** want to apply this treatment to joins because there is no easy way to know *which* DataFrame to perform the SQL parsing against. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
