[jira] [Updated] (SPARK-18949) Add recoverPartitions API to Catalog

2016-12-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18949: Fix Version/s: (was: 2.11) 2.1.1 > Add recoverPartitions API to Catalog >

[jira] [Updated] (SPARK-18949) Add recoverPartitions API to Catalog

2016-12-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18949: Fix Version/s: (was: 2.2.0) > Add recoverPartitions API to Catalog >

[jira] [Resolved] (SPARK-18949) Add recoverPartitions API to Catalog

2016-12-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18949. - Resolution: Fixed Fix Version/s: 2.2.0 2.11 > Add recoverPartitions

[jira] [Commented] (SPARK-18932) Partial aggregation for collect_set / collect_list

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15766342#comment-15766342 ] Apache Spark commented on SPARK-18932: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18932) Partial aggregation for collect_set / collect_list

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18932: Assignee: Apache Spark > Partial aggregation for collect_set / collect_list >

[jira] [Assigned] (SPARK-18932) Partial aggregation for collect_set / collect_list

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18932: Assignee: (was: Apache Spark) > Partial aggregation for collect_set / collect_list >

[jira] [Closed] (SPARK-18946) treeAggregate will be low effficiency when aggregate high dimension vectors in ML algorithm

2016-12-20 Thread zunwen you (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zunwen you closed SPARK-18946. -- Resolution: Duplicate > treeAggregate will be low effficiency when aggregate high dimension vectors >

[jira] [Commented] (SPARK-18821) Bisecting k-means wrapper in SparkR

2016-12-20 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15766299#comment-15766299 ] Miao Wang commented on SPARK-18821: --- I can work on this one, if it is not urgent. Thanks! > Bisecting

[jira] [Assigned] (SPARK-18960) Avoid double reading file which is being copied.

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18960: Assignee: Apache Spark > Avoid double reading file which is being copied. >

[jira] [Commented] (SPARK-18960) Avoid double reading file which is being copied.

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15766274#comment-15766274 ] Apache Spark commented on SPARK-18960: -- User 'uncleGen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18960) Avoid double reading file which is being copied.

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18960: Assignee: (was: Apache Spark) > Avoid double reading file which is being copied. >

[jira] [Created] (SPARK-18960) Avoid double reading file which is being copied.

2016-12-20 Thread Genmao Yu (JIRA)
Genmao Yu created SPARK-18960: - Summary: Avoid double reading file which is being copied. Key: SPARK-18960 URL: https://issues.apache.org/jira/browse/SPARK-18960 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18036) Decision Trees do not handle edge cases

2016-12-20 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15766243#comment-15766243 ] Weichen Xu commented on SPARK-18036: Oh, I'm too busy recently to work on it, it would be great if

[jira] [Issue Comment Deleted] (SPARK-18036) Decision Trees do not handle edge cases

2016-12-20 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weichen Xu updated SPARK-18036: --- Comment: was deleted (was: i am working on this... ) > Decision Trees do not handle edge cases >

[jira] [Assigned] (SPARK-18956) Python API should reuse existing SparkSession while creating new SQLContext instances

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18956: Assignee: (was: Apache Spark) > Python API should reuse existing SparkSession while

[jira] [Commented] (SPARK-18956) Python API should reuse existing SparkSession while creating new SQLContext instances

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15766118#comment-15766118 ] Apache Spark commented on SPARK-18956: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18956) Python API should reuse existing SparkSession while creating new SQLContext instances

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18956: Assignee: Apache Spark > Python API should reuse existing SparkSession while creating new

[jira] [Commented] (SPARK-18800) Correct the assert in UnsafeKVExternalSorter which ensures array size

2016-12-20 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15766039#comment-15766039 ] Liang-Chi Hsieh commented on SPARK-18800: - Note: this jia is motivated by the issue reported on

[jira] [Updated] (SPARK-18959) invalid resource statistics for standalone cluster

2016-12-20 Thread hustfxj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] hustfxj updated SPARK-18959: Attachment: 屏幕快照 2016-12-21 11.49.12.png The attachment is the master page > invalid resource statistics

[jira] [Commented] (SPARK-18959) invalid resource statistics for standalone cluster

2016-12-20 Thread hustfxj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15766020#comment-15766020 ] hustfxj commented on SPARK-18959: - the attachment is the master page > invalid resource statistics for

[jira] [Created] (SPARK-18959) invalid resource statistics for standalone cluster

2016-12-20 Thread hustfxj (JIRA)
hustfxj created SPARK-18959: --- Summary: invalid resource statistics for standalone cluster Key: SPARK-18959 URL: https://issues.apache.org/jira/browse/SPARK-18959 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-18900) Flaky Test: StateStoreSuite.maintenance

2016-12-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-18900. --- Resolution: Fixed Assignee: Burak Yavuz Fix Version/s: 2.1.1 > Flaky Test:

[jira] [Updated] (SPARK-18900) Flaky Test: StateStoreSuite.maintenance

2016-12-20 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-18900: -- Fix Version/s: 2.2.0 > Flaky Test: StateStoreSuite.maintenance >

[jira] [Commented] (SPARK-18958) SparkR should support toJSON on DataFrame

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765965#comment-15765965 ] Apache Spark commented on SPARK-18958: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-18958) SparkR should support toJSON on DataFrame

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18958: Assignee: Felix Cheung (was: Apache Spark) > SparkR should support toJSON on DataFrame >

[jira] [Assigned] (SPARK-18958) SparkR should support toJSON on DataFrame

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18958: Assignee: Apache Spark (was: Felix Cheung) > SparkR should support toJSON on DataFrame >

[jira] [Created] (SPARK-18958) SparkR should support toJSON on DataFrame

2016-12-20 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18958: Summary: SparkR should support toJSON on DataFrame Key: SPARK-18958 URL: https://issues.apache.org/jira/browse/SPARK-18958 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-18941) Spark thrift server, Spark 2.0.2, The "drop table" command doesn't delete the directory associated with the Hive table (not EXTERNAL table) from the HDFS file system

2016-12-20 Thread luat (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] luat updated SPARK-18941: - Description: Spark thrift server, Spark 2.0.2, The "drop table" command doesn't delete the directory associated

[jira] [Commented] (SPARK-18903) uiWebUrl is not accessible to SparkR

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765900#comment-15765900 ] Apache Spark commented on SPARK-18903: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-18903) uiWebUrl is not accessible to SparkR

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18903: Assignee: (was: Apache Spark) > uiWebUrl is not accessible to SparkR >

[jira] [Assigned] (SPARK-18903) uiWebUrl is not accessible to SparkR

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18903: Assignee: Apache Spark > uiWebUrl is not accessible to SparkR >

[jira] [Created] (SPARK-18957) when WAL time out, loss data

2016-12-20 Thread xy7 (JIRA)
xy7 created SPARK-18957: --- Summary: when WAL time out, loss data Key: SPARK-18957 URL: https://issues.apache.org/jira/browse/SPARK-18957 Project: Spark Issue Type: Bug Components: Block

[jira] [Created] (SPARK-18956) Python API should reuse existing SparkSession while creating new SQLContext instances

2016-12-20 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-18956: -- Summary: Python API should reuse existing SparkSession while creating new SQLContext instances Key: SPARK-18956 URL: https://issues.apache.org/jira/browse/SPARK-18956

[jira] [Commented] (SPARK-18931) Create empty staging directory in partitioned table on insert

2016-12-20 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765755#comment-15765755 ] Dongjoon Hyun commented on SPARK-18931: --- Hi, [~smilegator]. This seems to be related to SPARK-18931

[jira] [Created] (SPARK-18955) Add ability to emit kafka events to DStream or KafkaDStream

2016-12-20 Thread Russell Jurney (JIRA)
Russell Jurney created SPARK-18955: -- Summary: Add ability to emit kafka events to DStream or KafkaDStream Key: SPARK-18955 URL: https://issues.apache.org/jira/browse/SPARK-18955 Project: Spark

[jira] [Commented] (SPARK-18710) Add offset to GeneralizedLinearRegression models

2016-12-20 Thread Wayne Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765733#comment-15765733 ] Wayne Zhang commented on SPARK-18710: - [~yanboliang] Thanks for the suggestion. I think the issue is

[jira] [Assigned] (SPARK-18953) Do not show the link to a dead worker on the master page

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18953: Assignee: (was: Apache Spark) > Do not show the link to a dead worker on the master

[jira] [Assigned] (SPARK-18953) Do not show the link to a dead worker on the master page

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18953: Assignee: Apache Spark > Do not show the link to a dead worker on the master page >

[jira] [Commented] (SPARK-18953) Do not show the link to a dead worker on the master page

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765726#comment-15765726 ] Apache Spark commented on SPARK-18953: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-18953) Do not show the link to a dead worker on the master page

2016-12-20 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765690#comment-15765690 ] Dongjoon Hyun commented on SPARK-18953: --- Hi, [~yhuai]. If you didn't start this yet, may I make a

[jira] [Updated] (SPARK-18928) FileScanRDD, JDBCRDD, and UnsafeSorter should support task cancellation

2016-12-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-18928: - Fix Version/s: 2.0.3 > FileScanRDD, JDBCRDD, and UnsafeSorter should support task cancellation >

[jira] [Assigned] (SPARK-18950) Report conflicting fields when merging two StructTypes.

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18950: Assignee: (was: Apache Spark) > Report conflicting fields when merging two

[jira] [Commented] (SPARK-18950) Report conflicting fields when merging two StructTypes.

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765624#comment-15765624 ] Apache Spark commented on SPARK-18950: -- User 'bravo-zhang' has created a pull request for this

[jira] [Assigned] (SPARK-18950) Report conflicting fields when merging two StructTypes.

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18950: Assignee: Apache Spark > Report conflicting fields when merging two StructTypes. >

[jira] [Updated] (SPARK-18761) Uncancellable / unkillable tasks may starve jobs of resoures

2016-12-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-18761: - Fix Version/s: 2.1.1 2.0.3 > Uncancellable / unkillable tasks may starve jobs of

[jira] [Commented] (SPARK-18301) VectorAssembler does not support StructTypes

2016-12-20 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765611#comment-15765611 ] Ilya Matiach commented on SPARK-18301: -- For example, when I use HashingTF I get the same sort of

[jira] [Resolved] (SPARK-18576) Expose basic TaskContext info in PySpark

2016-12-20 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18576. - Resolution: Fixed Assignee: holdenk Fix Version/s: 2.2.0 > Expose basic

[jira] [Commented] (SPARK-18301) VectorAssembler does not support StructTypes

2016-12-20 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765596#comment-15765596 ] Ilya Matiach commented on SPARK-18301: -- I am able to reproduce this, but I'm not sure if this is

[jira] [Assigned] (SPARK-18954) Fix flaky test: o.a.s.streaming.BasicOperationsSuite rdd cleanup - map and window

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18954: Assignee: Shixiong Zhu (was: Apache Spark) > Fix flaky test:

[jira] [Commented] (SPARK-18954) Fix flaky test: o.a.s.streaming.BasicOperationsSuite rdd cleanup - map and window

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765581#comment-15765581 ] Apache Spark commented on SPARK-18954: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18954) Fix flaky test: o.a.s.streaming.BasicOperationsSuite rdd cleanup - map and window

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18954: Assignee: Apache Spark (was: Shixiong Zhu) > Fix flaky test:

[jira] [Created] (SPARK-18954) Fix flaky test: o.a.s.streaming.BasicOperationsSuite rdd cleanup - map and window

2016-12-20 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-18954: Summary: Fix flaky test: o.a.s.streaming.BasicOperationsSuite rdd cleanup - map and window Key: SPARK-18954 URL: https://issues.apache.org/jira/browse/SPARK-18954

[jira] [Commented] (SPARK-5632) not able to resolve dot('.') in field name

2016-12-20 Thread William Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765558#comment-15765558 ] William Shen commented on SPARK-5632: - [~marmbrus], Found another weirdness with dot in field name,

[jira] [Assigned] (SPARK-18952) regex strings not properly escaped in codegen for aggregations

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18952: Assignee: Apache Spark > regex strings not properly escaped in codegen for aggregations >

[jira] [Commented] (SPARK-18952) regex strings not properly escaped in codegen for aggregations

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765539#comment-15765539 ] Apache Spark commented on SPARK-18952: -- User 'brkyvz' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18952) regex strings not properly escaped in codegen for aggregations

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18952: Assignee: (was: Apache Spark) > regex strings not properly escaped in codegen for

[jira] [Updated] (SPARK-18952) regex strings not properly escaped in codegen for aggregations

2016-12-20 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-18952: Summary: regex strings not properly escaped in codegen for aggregations (was: regex strings not

[jira] [Commented] (SPARK-18832) Spark SQL: Incorrect error message on calling registered UDF.

2016-12-20 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765511#comment-15765511 ] Dongjoon Hyun commented on SPARK-18832: --- Sorry, it turns out that the case I met yesterday only

[jira] [Created] (SPARK-18953) Do not show the link to a dead worker on the master page

2016-12-20 Thread Yin Huai (JIRA)
Yin Huai created SPARK-18953: Summary: Do not show the link to a dead worker on the master page Key: SPARK-18953 URL: https://issues.apache.org/jira/browse/SPARK-18953 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-18234) Update mode in structured streaming

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18234: Assignee: (was: Apache Spark) > Update mode in structured streaming >

[jira] [Assigned] (SPARK-18234) Update mode in structured streaming

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18234: Assignee: Apache Spark > Update mode in structured streaming >

[jira] [Commented] (SPARK-18234) Update mode in structured streaming

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765449#comment-15765449 ] Apache Spark commented on SPARK-18234: -- User 'tdas' has created a pull request for this issue:

[jira] [Resolved] (SPARK-18927) MemorySink for StructuredStreaming can't recover from checkpoint if location is provided in conf

2016-12-20 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18927. -- Resolution: Fixed Assignee: Burak Yavuz Fix Version/s: 2.2.0

[jira] [Updated] (SPARK-18951) Upgrade com.thoughtworks.paranamer/paranamer to 2.6

2016-12-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-18951: - Description: I recently hit a bug of com.thoughtworks.paranamer/paranamer, which causes jackson fail to

[jira] [Commented] (SPARK-18951) Upgrade com.thoughtworks.paranamer/paranamer to 2.6

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765366#comment-15765366 ] Apache Spark commented on SPARK-18951: -- User 'yhuai' has created a pull request for this issue:

[jira] [Created] (SPARK-18952) regex strings not properly escaped in codegen

2016-12-20 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-18952: --- Summary: regex strings not properly escaped in codegen Key: SPARK-18952 URL: https://issues.apache.org/jira/browse/SPARK-18952 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18950) Report conflicting fields when merging two StructTypes.

2016-12-20 Thread Bravo Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765361#comment-15765361 ] Bravo Zhang commented on SPARK-18950: - I'll work on this, thanks > Report conflicting fields when

[jira] [Updated] (SPARK-18950) Report conflicting fields when merging two StructTypes.

2016-12-20 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-18950: --- Labels: starter (was: ) > Report conflicting fields when merging two StructTypes. >

[jira] [Commented] (SPARK-18886) Delay scheduling should not delay some executors indefinitely if one task is scheduled before delay timeout

2016-12-20 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765322#comment-15765322 ] Imran Rashid commented on SPARK-18886: -- You understand correctly -- that is precisely what I'm

[jira] [Assigned] (SPARK-18951) Upgrade com.thoughtworks.paranamer/paranamer to 2.6

2016-12-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai reassigned SPARK-18951: Assignee: Yin Huai > Upgrade com.thoughtworks.paranamer/paranamer to 2.6 >

[jira] [Created] (SPARK-18951) Upgrade com.thoughtworks.paranamer/paranamer

2016-12-20 Thread Yin Huai (JIRA)
Yin Huai created SPARK-18951: Summary: Upgrade com.thoughtworks.paranamer/paranamer Key: SPARK-18951 URL: https://issues.apache.org/jira/browse/SPARK-18951 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-18951) Upgrade com.thoughtworks.paranamer/paranamer to 2.6

2016-12-20 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-18951: - Summary: Upgrade com.thoughtworks.paranamer/paranamer to 2.6 (was: Upgrade

[jira] [Created] (SPARK-18950) Report conflicting fields when merging two StructTypes.

2016-12-20 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-18950: -- Summary: Report conflicting fields when merging two StructTypes. Key: SPARK-18950 URL: https://issues.apache.org/jira/browse/SPARK-18950 Project: Spark Issue

[jira] [Comment Edited] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-12-20 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15762805#comment-15762805 ] Barry Becker edited comment on SPARK-16845 at 12/20/16 9:24 PM: I found a

[jira] [Resolved] (SPARK-18281) toLocalIterator yields time out error on pyspark2

2016-12-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-18281. Resolution: Fixed Fix Version/s: 2.0.3 2.1.1 Issue resolved by pull

[jira] [Commented] (SPARK-18761) Uncancellable / unkillable tasks may starve jobs of resoures

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765200#comment-15765200 ] Apache Spark commented on SPARK-18761: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Commented] (SPARK-12965) Indexer setInputCol() doesn't resolve column names like DataFrame.col()

2016-12-20 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765124#comment-15765124 ] Ilya Matiach commented on SPARK-12965: -- Can the ML component be removed from this Jira? It looks

[jira] [Updated] (SPARK-18949) Add recoverPartitions API to Catalog

2016-12-20 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-18949: Description: Currently, we only have a SQL interface for recovering all the partitions in the directory

[jira] [Updated] (SPARK-18949) Add recoverPartitions API to Catalog

2016-12-20 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-18949: Summary: Add recoverPartitions API to Catalog (was: Add repairTable API to Catalog) > Add

[jira] [Commented] (SPARK-11293) ExternalSorter and ExternalAppendOnlyMap should free shuffle memory in their stop() methods

2016-12-20 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765096#comment-15765096 ] Barry Becker commented on SPARK-11293: -- Not sure if this is related, but I am running on spark 2.0.2

[jira] [Commented] (SPARK-18928) FileScanRDD, JDBCRDD, and UnsafeSorter should support task cancellation

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765087#comment-15765087 ] Apache Spark commented on SPARK-18928: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Commented] (SPARK-18949) Add repairTable API to Catalog

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765054#comment-15765054 ] Apache Spark commented on SPARK-18949: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18949) Add repairTable API to Catalog

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18949: Assignee: Xiao Li (was: Apache Spark) > Add repairTable API to Catalog >

[jira] [Assigned] (SPARK-18949) Add repairTable API to Catalog

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18949: Assignee: Apache Spark (was: Xiao Li) > Add repairTable API to Catalog >

[jira] [Created] (SPARK-18949) Add repairTable API to Catalog

2016-12-20 Thread Xiao Li (JIRA)
Xiao Li created SPARK-18949: --- Summary: Add repairTable API to Catalog Key: SPARK-18949 URL: https://issues.apache.org/jira/browse/SPARK-18949 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-18036) Decision Trees do not handle edge cases

2016-12-20 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765032#comment-15765032 ] Ilya Matiach commented on SPARK-18036: -- Weichen Xu, are you working on this issue or have you

[jira] [Commented] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-12-20 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765027#comment-15765027 ] Ilya Matiach commented on SPARK-16473: -- Do you have a smaller dataset than the one in the

[jira] [Commented] (SPARK-18941) Spark thrift server, Spark 2.0.2, The "drop table" command doesn't delete the directory associated with the table (not EXTERNAL table) from the file system

2016-12-20 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765024#comment-15765024 ] Dongjoon Hyun commented on SPARK-18941: --- Hi, [~luatnc]. For me, 2.0.2 and the current master

[jira] [Commented] (SPARK-18820) Driver may send "LaunchTask" before executor receive "RegisteredExecutor"

2016-12-20 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15764995#comment-15764995 ] Shixiong Zhu commented on SPARK-18820: -- This was fixed in SPARK-13112. It's for 2.0.0 not backported

[jira] [Resolved] (SPARK-18820) Driver may send "LaunchTask" before executor receive "RegisteredExecutor"

2016-12-20 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18820. -- Resolution: Duplicate > Driver may send "LaunchTask" before executor receive

[jira] [Commented] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-12-20 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15764964#comment-15764964 ] Ilya Matiach commented on SPARK-16473: -- If you could put the sample dataset on google drive or one

[jira] [Commented] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-12-20 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15764909#comment-15764909 ] Ilya Matiach commented on SPARK-16473: -- I've added a pull request here:

[jira] [Assigned] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16473: Assignee: (was: Apache Spark) > BisectingKMeans Algorithm failing with

[jira] [Commented] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15764905#comment-15764905 ] Apache Spark commented on SPARK-16473: -- User 'imatiach-msft' has created a pull request for this

[jira] [Assigned] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16473: Assignee: Apache Spark > BisectingKMeans Algorithm failing with

[jira] [Commented] (SPARK-16473) BisectingKMeans Algorithm failing with java.util.NoSuchElementException: key not found

2016-12-20 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15764892#comment-15764892 ] Ilya Matiach commented on SPARK-16473: -- I will start a pull request for the change. I would like to

[jira] [Commented] (SPARK-18886) Delay scheduling should not delay some executors indefinitely if one task is scheduled before delay timeout

2016-12-20 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15764876#comment-15764876 ] Mridul Muralidharan commented on SPARK-18886: - I am not sure what is described will work as

[jira] [Updated] (SPARK-18836) Serialize Task Metrics once per stage

2016-12-20 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-18836: -- Fix Version/s: (was: 1.3.0) 2.2.0 > Serialize Task

[jira] [Commented] (SPARK-18886) Delay scheduling should not delay some executors indefinitely if one task is scheduled before delay timeout

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15764795#comment-15764795 ] Apache Spark commented on SPARK-18886: -- User 'squito' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18886) Delay scheduling should not delay some executors indefinitely if one task is scheduled before delay timeout

2016-12-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18886: Assignee: Apache Spark > Delay scheduling should not delay some executors indefinitely if

  1   2   >