[jira] [Commented] (SPARK-17310) Disable Parquet's record-by-record filter in normal parquet reader and do it in Spark-side

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15481044#comment-15481044 ] Apache Spark commented on SPARK-17310: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-17310) Disable Parquet's record-by-record filter in normal parquet reader and do it in Spark-side

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17310: Assignee: Apache Spark > Disable Parquet's record-by-record filter in normal parquet

[jira] [Assigned] (SPARK-17310) Disable Parquet's record-by-record filter in normal parquet reader and do it in Spark-side

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17310: Assignee: (was: Apache Spark) > Disable Parquet's record-by-record filter in normal

[jira] [Assigned] (SPARK-17409) Query in CTAS is Optimized Twice

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17409: Assignee: Apache Spark (was: Xiao Li) > Query in CTAS is Optimized Twice >

[jira] [Commented] (SPARK-17409) Query in CTAS is Optimized Twice

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15481029#comment-15481029 ] Apache Spark commented on SPARK-17409: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17409) Query in CTAS is Optimized Twice

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17409: Assignee: Xiao Li (was: Apache Spark) > Query in CTAS is Optimized Twice >

[jira] [Commented] (SPARK-4563) Allow spark driver to bind to different ip then advertise ip

2016-09-10 Thread Liam Fisk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15480816#comment-15480816 ] Liam Fisk commented on SPARK-4563: -- There was a proposed patch in SPARK-11638, unfortunately the PR (and

[jira] [Updated] (SPARK-14469) Remove mllib-local from mima project exclusion

2016-09-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-14469: --- Assignee: (was: DB Tsai) > Remove mllib-local from mima project exclusion >

[jira] [Commented] (SPARK-14469) Remove mllib-local from mima project exclusion

2016-09-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15480803#comment-15480803 ] Josh Rosen commented on SPARK-14469: Resolving as a duplicate of SPARK-14818, which I'm submitting a

[jira] [Resolved] (SPARK-14469) Remove mllib-local from mima project exclusion

2016-09-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-14469. Resolution: Duplicate > Remove mllib-local from mima project exclusion >

[jira] [Updated] (SPARK-17483) Minor refactoring and cleanup in BlockManager block status reporting and block removal

2016-09-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-17483: --- Component/s: (was: Spark Core) Block Manager > Minor refactoring and cleanup in

[jira] [Commented] (SPARK-17445) Reference an ASF page as the main place to find third-party packages

2016-09-10 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15480419#comment-15480419 ] Matei Zaharia commented on SPARK-17445: --- Sounds good, but IMO just keep the current supplemental

[jira] [Commented] (SPARK-17445) Reference an ASF page as the main place to find third-party packages

2016-09-10 Thread Josh Elser (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15480342#comment-15480342 ] Josh Elser commented on SPARK-17445: +1 to this plan. > Reference an ASF page as the main place to

[jira] [Assigned] (SPARK-17495) Hive hash implementation

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17495: Assignee: Apache Spark > Hive hash implementation > > >

[jira] [Assigned] (SPARK-17495) Hive hash implementation

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17495: Assignee: (was: Apache Spark) > Hive hash implementation > >

[jira] [Commented] (SPARK-17495) Hive hash implementation

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15480323#comment-15480323 ] Apache Spark commented on SPARK-17495: -- User 'tejasapatil' has created a pull request for this

[jira] [Created] (SPARK-17495) Hive hash implementation

2016-09-10 Thread Tejas Patil (JIRA)
Tejas Patil created SPARK-17495: --- Summary: Hive hash implementation Key: SPARK-17495 URL: https://issues.apache.org/jira/browse/SPARK-17495 Project: Spark Issue Type: Bug Components:

[jira] [Created] (SPARK-17494) Floor function rounds up during join

2016-09-10 Thread Gokhan Civan (JIRA)
Gokhan Civan created SPARK-17494: Summary: Floor function rounds up during join Key: SPARK-17494 URL: https://issues.apache.org/jira/browse/SPARK-17494 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-17493) Spark Job hangs while DataFrame writing to HDFS path with parquet mode

2016-09-10 Thread Gautam Solanki (JIRA)
Gautam Solanki created SPARK-17493: -- Summary: Spark Job hangs while DataFrame writing to HDFS path with parquet mode Key: SPARK-17493 URL: https://issues.apache.org/jira/browse/SPARK-17493 Project:

[jira] [Commented] (SPARK-17492) Reading Cataloged Data Sources without Extending SchemaRelationProvider

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15480056#comment-15480056 ] Apache Spark commented on SPARK-17492: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17492) Reading Cataloged Data Sources without Extending SchemaRelationProvider

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17492: Assignee: (was: Apache Spark) > Reading Cataloged Data Sources without Extending

[jira] [Assigned] (SPARK-17492) Reading Cataloged Data Sources without Extending SchemaRelationProvider

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17492: Assignee: Apache Spark > Reading Cataloged Data Sources without Extending

[jira] [Created] (SPARK-17492) Reading Cataloged Data Sources without Extending SchemaRelationProvider

2016-09-10 Thread Xiao Li (JIRA)
Xiao Li created SPARK-17492: --- Summary: Reading Cataloged Data Sources without Extending SchemaRelationProvider Key: SPARK-17492 URL: https://issues.apache.org/jira/browse/SPARK-17492 Project: Spark

[jira] [Commented] (SPARK-17477) SparkSQL cannot handle schema evolution from Int -> Long when parquet files have Int as its type while hive metastore has Long as its type

2016-09-10 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15479917#comment-15479917 ] Hyukjin Kwon commented on SPARK-17477: -- Is this subset of SPARK-16544? Also, I remember I was told

[jira] [Commented] (SPARK-16460) Spark 2.0 CSV ignores NULL value in Date format

2016-09-10 Thread Marcel Boldt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15479721#comment-15479721 ] Marcel Boldt commented on SPARK-16460: -- Hi [~proflin]:Some time has passed, but I see that your pull

[jira] [Commented] (SPARK-16599) java.util.NoSuchElementException: None.get at at org.apache.spark.storage.BlockInfoManager.releaseAllLocksForTask(BlockInfoManager.scala:343)

2016-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15479567#comment-15479567 ] Sean Owen commented on SPARK-16599: --- The only thing I can see is that somehow a task starts and is

[jira] [Commented] (SPARK-17445) Reference an ASF page as the main place to find third-party packages

2016-09-10 Thread Jagadeesan A S (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15479544#comment-15479544 ] Jagadeesan A S commented on SPARK-17445: If the above changes is feasible means, i would like to

[jira] [Commented] (SPARK-17310) Disable Parquet's record-by-record filter in normal parquet reader and do it in Spark-side

2016-09-10 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15479530#comment-15479530 ] Hyukjin Kwon commented on SPARK-17310: -- [~andrew_duffy] Thanks Andrew. I will work on this. >

[jira] [Updated] (SPARK-17437) uiWebUrl is not accessible to JavaSparkContext or pyspark.SparkContext

2016-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17437: -- Assignee: Adrian Petrescu > uiWebUrl is not accessible to JavaSparkContext or pyspark.SparkContext >

[jira] [Updated] (SPARK-17437) uiWebUrl is not accessible to JavaSparkContext or pyspark.SparkContext

2016-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17437: -- Priority: Minor (was: Major) > uiWebUrl is not accessible to JavaSparkContext or pyspark.SparkContext

[jira] [Commented] (SPARK-17437) uiWebUrl is not accessible to JavaSparkContext or pyspark.SparkContext

2016-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15479526#comment-15479526 ] Sean Owen commented on SPARK-17437: --- No, but unless there's some hurry I'd not generally merge

[jira] [Commented] (SPARK-17445) Reference an ASF page as the main place to find third-party packages

2016-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15479523#comment-15479523 ] Sean Owen commented on SPARK-17445: --- How about this net set of changes: - Rename "Supplemental Spark

[jira] [Resolved] (SPARK-17340) .sparkStaging not cleaned if application exited incorrectly

2016-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17340. --- Resolution: Won't Fix > .sparkStaging not cleaned if application exited incorrectly >

[jira] [Updated] (SPARK-17396) Threads number keep increasing when query on external CSV partitioned table

2016-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-17396: -- Assignee: Ryan Blue > Threads number keep increasing when query on external CSV partitioned table >

[jira] [Resolved] (SPARK-17396) Threads number keep increasing when query on external CSV partitioned table

2016-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17396. --- Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull

[jira] [Assigned] (SPARK-17447) performance improvement in Partitioner.DefaultPartitioner

2016-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-17447: - Assignee: Sean Owen > performance improvement in Partitioner.DefaultPartitioner >

[jira] [Assigned] (SPARK-17490) Optimize SerializeFromObject for primitive array

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17490: Assignee: (was: Apache Spark) > Optimize SerializeFromObject for primitive array >

[jira] [Assigned] (SPARK-17490) Optimize SerializeFromObject for primitive array

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17490: Assignee: Apache Spark > Optimize SerializeFromObject for primitive array >

[jira] [Commented] (SPARK-17490) Optimize SerializeFromObject for primitive array

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15479418#comment-15479418 ] Apache Spark commented on SPARK-17490: -- User 'kiszk' has created a pull request for this issue:

[jira] [Resolved] (SPARK-11496) Parallel implementation of personalized pagerank

2016-09-10 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-11496. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14998

[jira] [Updated] (SPARK-17491) MemoryStore.putIteratorAsBytes() may silently lose values when KryoSerializer is used

2016-09-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-17491: --- Labels: correctness (was: ) > MemoryStore.putIteratorAsBytes() may silently lose values when

[jira] [Assigned] (SPARK-17491) MemoryStore.putIteratorAsBytes() may silently lose values when KryoSerializer is used

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17491: Assignee: Josh Rosen (was: Apache Spark) > MemoryStore.putIteratorAsBytes() may silently

[jira] [Commented] (SPARK-17491) MemoryStore.putIteratorAsBytes() may silently lose values when KryoSerializer is used

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15479312#comment-15479312 ] Apache Spark commented on SPARK-17491: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17491) MemoryStore.putIteratorAsBytes() may silently lose values when KryoSerializer is used

2016-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17491: Assignee: Apache Spark (was: Josh Rosen) > MemoryStore.putIteratorAsBytes() may silently

[jira] [Created] (SPARK-17491) MemoryStore.putIteratorAsBytes() may silently lose values when KryoSerializer is used

2016-09-10 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-17491: -- Summary: MemoryStore.putIteratorAsBytes() may silently lose values when KryoSerializer is used Key: SPARK-17491 URL: https://issues.apache.org/jira/browse/SPARK-17491

[jira] [Created] (SPARK-17490) Optimize SerializeFromObject for primitive array

2016-09-10 Thread Kazuaki Ishizaki (JIRA)
Kazuaki Ishizaki created SPARK-17490: Summary: Optimize SerializeFromObject for primitive array Key: SPARK-17490 URL: https://issues.apache.org/jira/browse/SPARK-17490 Project: Spark

[jira] [Commented] (SPARK-17479) Fix LDA example in docs

2016-09-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15479243#comment-15479243 ] zhengruifeng commented on SPARK-17479: -- Because the paths in examples are relative path, make sure