Romi Kuntsman created SPARK-10135:
-------------------------------------

             Summary: Percent of pruned partitions is shown wrong
                 Key: SPARK-10135
                 URL: https://issues.apache.org/jira/browse/SPARK-10135
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 1.4.0
            Reporter: Romi Kuntsman
            Priority: Trivial


When reading partitioned Parquet data in Spark SQL, an info message reports how
many partitions were selected and what percentage was pruned. The percentage
shown is wrong.

Actual:
"Selected 15 partitions out of 181, pruned -1106.6666666666667% partitions."

Expected:
"Selected 15 partitions out of 181, pruned 91.71270718232044% partitions."

Fix (I'm new here, so please help me make a patch, thanks!):
In DataSourceStrategy.scala, in method apply(),

instead of:
val percentPruned = (1 - total.toDouble / selected.toDouble) * 100
it should be:
val percentPruned = (1 - selected.toDouble / total.toDouble) * 100
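
To check the arithmetic outside of Spark, here is a minimal standalone sketch. The object name PrunedPercent and the helper percentPruned are hypothetical, made up for illustration; this is not the actual code in DataSourceStrategy.scala, only the two formulas from above applied to the numbers in the log message.

```scala
// Hypothetical standalone check; not the real Spark source.
object PrunedPercent {
  // Proposed formula: the fraction of partitions NOT selected, as a percentage.
  def percentPruned(selected: Int, total: Int): Double =
    (1 - selected.toDouble / total.toDouble) * 100

  // Formula as reported in the bug: the ratio is inverted.
  def buggyPercentPruned(selected: Int, total: Int): Double =
    (1 - total.toDouble / selected.toDouble) * 100

  def main(args: Array[String]): Unit = {
    val selected = 15  // values taken from the log message in this report
    val total = 181
    println(s"buggy: ${buggyPercentPruned(selected, total)}")
    println(s"fixed: ${percentPruned(selected, total)}")
  }
}
```

With selected = 15 and total = 181, the inverted ratio comes out at roughly -1106.67 and the corrected one at roughly 91.71, which matches the Actual and Expected messages above.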




