[GitHub] [spark] EnricoMi commented on pull request #35965: [SPARK-38647][SQL] Add SupportsReportOrdering mix in interface for Scan (DataSourceV2)

2022-06-21 Thread GitBox
EnricoMi commented on PR #35965: URL: https://github.com/apache/spark/pull/35965#issuecomment-1162082945 Thanks @sunchao -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[GitHub] [spark] EnricoMi commented on pull request #35965: [SPARK-38647][SQL] Add SupportsReportOrdering mix in interface for Scan (DataSourceV2)

2022-06-09 Thread GitBox
EnricoMi commented on PR #35965: URL: https://github.com/apache/spark/pull/35965#issuecomment-1151468995 @sunchao @cloud-fan all comments addressed, all tests green -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the U

[GitHub] [spark] EnricoMi commented on pull request #35965: [SPARK-38647][SQL] Add SupportsReportOrdering mix in interface for Scan (DataSourceV2)

2022-05-16 Thread GitBox
EnricoMi commented on PR #35965: URL: https://github.com/apache/spark/pull/35965#issuecomment-1127488590 @sunchao @HyukjinKwon @aokolnychyi @cloud-fan I have addressed comments and rebased. -- This is an automated message from the Apache Git Service. To respond to the message, please log

[GitHub] [spark] EnricoMi commented on pull request #35965: [SPARK-38647][SQL] Add SupportsReportOrdering mix in interface for Scan (DataSourceV2)

2022-05-09 Thread GitBox
EnricoMi commented on PR #35965: URL: https://github.com/apache/spark/pull/35965#issuecomment-1120977044 @sunchao I have pushed all my changes -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [spark] EnricoMi commented on pull request #35965: [SPARK-38647][SQL] Add SupportsReportOrdering mix in interface for Scan (DataSourceV2)

2022-04-26 Thread GitBox
EnricoMi commented on PR #35965: URL: https://github.com/apache/spark/pull/35965#issuecomment-1110217254 @cloud-fan who would be best to review this? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] EnricoMi commented on pull request #35965: [SPARK-38647][SQL] Add SupportsReportOrdering mix in interface for Scan (DataSourceV2)

2022-04-19 Thread GitBox
EnricoMi commented on PR #35965: URL: https://github.com/apache/spark/pull/35965#issuecomment-1102712930 @cloud-fan thanks for clarification, I have incorporated this distinction into the documentation of `SupportsReportOrdering`. -- This is an automated message from the Apache Git Servic

[GitHub] [spark] EnricoMi commented on pull request #35965: [SPARK-38647][SQL] Add SupportsReportOrdering mix in interface for Scan (DataSourceV2)

2022-04-12 Thread GitBox
EnricoMi commented on PR #35965: URL: https://github.com/apache/spark/pull/35965#issuecomment-1097249916 So you are saying ordering is seen as being part of partitioning. This kind of makes sense. Are you foreseeing that partitioning can transport in-partition order, rather than glob

[GitHub] [spark] EnricoMi commented on pull request #35965: [SPARK-38647][SQL] Add SupportsReportOrdering mix in interface for Scan (DataSourceV2)

2022-04-12 Thread GitBox
EnricoMi commented on PR #35965: URL: https://github.com/apache/spark/pull/35965#issuecomment-1096834886 @sunchao #35657 says `"have V2 data sources to report distribution and ordering to Spark on read path"`: How do V2 sources report their ordering? -- This is an automated message from t

[GitHub] [spark] EnricoMi commented on pull request #35965: [SPARK-38647][SQL] Add SupportsReportOrdering mix in interface for Scan (DataSourceV2)

2022-04-12 Thread GitBox
EnricoMi commented on PR #35965: URL: https://github.com/apache/spark/pull/35965#issuecomment-1096810002 I have found the issue, option `spark.sql.sources.v2.bucketing.enabled` moved to default `false`, so I have to enable that to have order and partitioning being reused. -- This is an a

[GitHub] [spark] EnricoMi commented on pull request #35965: [SPARK-38647][SQL] Add SupportsReportOrdering mix in interface for Scan (DataSourceV2)

2022-04-12 Thread GitBox
EnricoMi commented on PR #35965: URL: https://github.com/apache/spark/pull/35965#issuecomment-1096802477 @sunchao @dongjoon-hyun looks like #35657 (20ffbf7b) broke reusing **partitioning and order** provided by DS V2 in my tests, which worked before rebase. The query planner now introduces