parthchandra commented on issue #2035: URL: https://github.com/apache/datafusion-comet/issues/2035#issuecomment-3090665542
I can reproduce the problem above only after removing the `EventDate >= '2013-07-01' AND EventDate <= '2013-07-31'` predicate from the query. (With that predicate in place, all records are filtered out.)

To reproduce, download the data from https://datasets.clickhouse.com/hits_compatible/hits.parquet, then run the following in spark-shell:

```
scala> spark.read.parquet("file:///Users/parth/data/clickbench/hits.parquet").createOrReplaceTempView("hits");
scala> spark.sql("SELECT TraficSourceID, SearchEngineID, AdvEngineID, CASE WHEN (SearchEngineID = 0 AND AdvEngineID = 0) THEN Referer ELSE '' END AS Src, URL AS Dst, COUNT(*) AS PageViews FROM hits WHERE CounterID = 62 AND IsRefresh = 0 GROUP BY TraficSourceID, SearchEngineID, AdvEngineID, Src, Dst ORDER BY PageViews DESC LIMIT 10 OFFSET 1000").show()
```