[jira] [Commented] (SPARK-23098) Migrate Kafka batch source to v2

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525919#comment-16525919 ] Richard Yu commented on SPARK-23098: Wasn't this issue resolved by SPARK-23362? > Migrate Kafka

[jira] [Commented] (SPARK-13343) speculative tasks that didn't commit shouldn't be marked as success

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525898#comment-16525898 ] Apache Spark commented on SPARK-13343: -- User 'hthuynh2' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-18258) Sinks need access to offset representation

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525883#comment-16525883 ] Richard Yu edited comment on SPARK-18258 at 6/28/18 3:56 AM: -

[jira] [Assigned] (SPARK-24551) Add Integration tests for Secrets

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24551: Assignee: Apache Spark > Add Integration tests for Secrets >

[jira] [Assigned] (SPARK-24551) Add Integration tests for Secrets

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24551: Assignee: (was: Apache Spark) > Add Integration tests for Secrets >

[jira] [Commented] (SPARK-24551) Add Integration tests for Secrets

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525884#comment-16525884 ] Apache Spark commented on SPARK-24551: -- User 'skonto' has created a pull request for this issue:

[jira] [Commented] (SPARK-18258) Sinks need access to offset representation

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525883#comment-16525883 ] Richard Yu commented on SPARK-18258: [~c...@koeninger.org] I have a bit of a concern regarding where

[jira] [Created] (SPARK-24670) How to stream only newer files from a folder in Apache Spark?

2018-06-27 Thread Mahbub Murshed (JIRA)
Mahbub Murshed created SPARK-24670: -- Summary: How to stream only newer files from a folder in Apache Spark? Key: SPARK-24670 URL: https://issues.apache.org/jira/browse/SPARK-24670 Project: Spark

[jira] [Assigned] (SPARK-24603) Typo in comments

2018-06-27 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-24603: Assignee: Fokko Driesprong > Typo in comments > > >

[jira] [Resolved] (SPARK-24603) Typo in comments

2018-06-27 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24603. -- Resolution: Fixed Fix Version/s: 2.3.2 2.4.0

[jira] [Commented] (SPARK-24667) If folders managed by DiskBlockManager are deleted manually, shell throws FileNotFoundException

2018-06-27 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525818#comment-16525818 ] Hyukjin Kwon commented on SPARK-24667: -- Why would you delete the folder intentionally? > If

[jira] [Assigned] (SPARK-24645) Skip parsing when csvColumnPruning enabled and partitions scanned only

2018-06-27 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-24645: Assignee: Takeshi Yamamuro > Skip parsing when csvColumnPruning enabled and partitions

[jira] [Resolved] (SPARK-24645) Skip parsing when csvColumnPruning enabled and partitions scanned only

2018-06-27 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24645. -- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21631

[jira] [Commented] (SPARK-18258) Sinks need access to offset representation

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525779#comment-16525779 ] Apache Spark commented on SPARK-18258: -- User 'ConcurrencyPractitioner' has created a pull request

[jira] [Assigned] (SPARK-18258) Sinks need access to offset representation

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18258: Assignee: (was: Apache Spark) > Sinks need access to offset representation >

[jira] [Assigned] (SPARK-18258) Sinks need access to offset representation

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18258: Assignee: Apache Spark > Sinks need access to offset representation >

[jira] [Assigned] (SPARK-24624) Can not mix vectorized and non-vectorized UDFs

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24624: Assignee: (was: Apache Spark) > Can not mix vectorized and non-vectorized UDFs >

[jira] [Commented] (SPARK-24624) Can not mix vectorized and non-vectorized UDFs

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525695#comment-16525695 ] Apache Spark commented on SPARK-24624: -- User 'icexelloss' has created a pull request for this

[jira] [Assigned] (SPARK-24624) Can not mix vectorized and non-vectorized UDFs

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24624: Assignee: Apache Spark > Can not mix vectorized and non-vectorized UDFs >

[jira] [Resolved] (SPARK-24553) Job UI redirect causing http 302 error

2018-06-27 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-24553. - Resolution: Fixed Assignee: Steven Kallman Fix Version/s: 2.4.0 > Job UI redirect

[jira] [Resolved] (SPARK-24451) Spark-Streaming-Kafka-1.6.3- KafkaUtils.createStream function uses the old Logging class

2018-06-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-24451. -- Resolution: Not A Problem > Spark-Streaming-Kafka-1.6.3- KafkaUtils.createStream function

[jira] [Resolved] (SPARK-24204) Verify a write schema in Json/Orc/ParquetFileFormat

2018-06-27 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-24204. - Resolution: Fixed Assignee: Takeshi Yamamuro Fix Version/s: 2.4.0 > Verify a write

[jira] [Resolved] (SPARK-24533) typesafe has rebranded to lightbend. change the build/mvn endpoint from downloads.typesafe.com to downloads.lightbend.com

2018-06-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24533. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21636

[jira] [Assigned] (SPARK-24533) typesafe has rebranded to lightbend. change the build/mvn endpoint from downloads.typesafe.com to downloads.lightbend.com

2018-06-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-24533: -- Assignee: Sanket Reddy > typesafe has rebranded to lightbend. change the build/mvn

[jira] [Resolved] (SPARK-24660) SHS is not showing properly errors when downloading logs

2018-06-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24660. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21644

[jira] [Assigned] (SPARK-24660) SHS is not showing properly errors when downloading logs

2018-06-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-24660: -- Assignee: Marco Gaido > SHS is not showing properly errors when downloading logs >

[jira] [Updated] (SPARK-20168) Enable kinesis to start stream from Initial position specified by a timestamp

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20168: -- Target Version/s: (was: 2.3.0) > Enable kinesis to start stream from Initial position specified by

[jira] [Updated] (SPARK-23197) Flaky test: spark.streaming.ReceiverSuite."receiver_life_cycle"

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-23197: -- Target Version/s: 2.4.0 (was: 2.3.0) > Flaky test:

[jira] [Updated] (SPARK-21743) top-most limit should not cause memory leak

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-21743: -- Fix Version/s: (was: 2.3.0) > top-most limit should not cause memory leak >

[jira] [Updated] (SPARK-20168) Enable kinesis to start stream from Initial position specified by a timestamp

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20168: -- Fix Version/s: (was: 2.3.0) > Enable kinesis to start stream from Initial position specified by a

[jira] [Updated] (SPARK-23899) Built-in SQL Function Improvement

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-23899: -- Fix Version/s: (was: 2.4.0) > Built-in SQL Function Improvement >

[jira] [Updated] (SPARK-24238) HadoopFsRelation can't append the same table with multi job at the same time.

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-24238: -- Target Version/s: (was: 2.4.0) Priority: Minor (was: Major) Fix Version/s:

[jira] [Updated] (SPARK-23197) Flaky test: spark.streaming.ReceiverSuite."receiver_life_cycle"

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-23197: -- Fix Version/s: (was: 2.3.0) (was: 3.0.0) > Flaky test:

[jira] [Updated] (SPARK-24194) HadoopFsRelation cannot overwrite a path that is also being read from

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-24194: -- Flags: (was: Patch) Target Version/s: (was: 2.4.0) Labels: (was:

[jira] [Updated] (SPARK-24252) DataSourceV2: Add catalog support

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-24252: -- Target Version/s: 2.4.0 Fix Version/s: (was: 2.4.0) > DataSourceV2: Add catalog support >

[jira] [Updated] (SPARK-24251) DataSourceV2: Add AppendData logical operation

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-24251: -- Target Version/s: 2.4.0 Fix Version/s: (was: 2.4.0) > DataSourceV2: Add AppendData logical

[jira] [Updated] (SPARK-24402) Optimize `In` expression when only one element in the collection or collection is empty

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-24402: -- Target Version/s: 2.4.0 Fix Version/s: (was: 2.4.0) > Optimize `In` expression when only

[jira] [Updated] (SPARK-24411) Adding native Java tests for `isInCollection`

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-24411: -- Priority: Minor (was: Major) Fix Version/s: (was: 2.4.0) > Adding native Java tests for

[jira] [Updated] (SPARK-24489) No check for invalid input type of weight data in ml.PowerIterationClustering

2018-06-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-24489: -- Target Version/s: (was: 2.4.0) Priority: Minor (was: Major) Fix Version/s:

[jira] [Updated] (SPARK-24613) Cache with UDF could not be matched with subsequent dependent caches

2018-06-27 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-24613: Fix Version/s: 2.3.2 > Cache with UDF could not be matched with subsequent dependent caches >

[jira] [Commented] (SPARK-24458) Invalid PythonUDF check_1(), requires attributes from more than one child

2018-06-27 Thread Ruben Berenguel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525556#comment-16525556 ] Ruben Berenguel commented on SPARK-24458: - Can't reproduce with Spark 2.2 either, local mode. >

[jira] [Commented] (SPARK-24642) Add a function which infers schema from a JSON column

2018-06-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525548#comment-16525548 ] Reynold Xin commented on SPARK-24642: - [~maxgekk] I think this is too complicated and unpredictable.

[jira] [Updated] (SPARK-24669) Managed table was not cleared of path after drop database cascade

2018-06-27 Thread Dong Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Jiang updated SPARK-24669: --- Description: I can do the following in sequence # Create a managed table using path options # Drop

[jira] [Assigned] (SPARK-23648) extend hint syntax to support any expression for R

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23648: Assignee: (was: Apache Spark) > extend hint syntax to support any expression for R >

[jira] [Commented] (SPARK-23648) extend hint syntax to support any expression for R

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525531#comment-16525531 ] Apache Spark commented on SPARK-23648: -- User 'huaxingao' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23648) extend hint syntax to support any expression for R

2018-06-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23648: Assignee: Apache Spark > extend hint syntax to support any expression for R >

[jira] [Updated] (SPARK-24669) Managed table was not cleared of path after drop database cascade

2018-06-27 Thread Dong Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Jiang updated SPARK-24669: --- Affects Version/s: 2.3.1 > Managed table was not cleared of path after drop database cascade >

[jira] [Updated] (SPARK-24669) Managed table was not cleared of path after drop database cascade

2018-06-27 Thread Dong Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Jiang updated SPARK-24669: --- Description: I can do the following in sequence # Create a managed table using path options # Drop

[jira] [Updated] (SPARK-24669) Managed table was not cleared of path after drop database cascade

2018-06-27 Thread Dong Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Jiang updated SPARK-24669: --- Description: I can do the following in sequence # Create a managed table using path options # Drop

[jira] [Updated] (SPARK-24669) Managed table was not cleared of path after drop database cascade

2018-06-27 Thread Dong Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Jiang updated SPARK-24669: --- Description: I can do the following in sequence # Create a managed table using path options # Drop

[jira] [Created] (SPARK-24669) Managed table was not cleared of path after drop database cascade

2018-06-27 Thread Dong Jiang (JIRA)
Dong Jiang created SPARK-24669: -- Summary: Managed table was not cleared of path after drop database cascade Key: SPARK-24669 URL: https://issues.apache.org/jira/browse/SPARK-24669 Project: Spark

[jira] [Commented] (SPARK-22666) Spark datasource for image format

2018-06-27 Thread Jayesh lalwani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525479#comment-16525479 ] Jayesh lalwani commented on SPARK-22666: Should this datasource reside in the spark-mllib module

[jira] [Resolved] (SPARK-21687) Spark SQL should set createTime for Hive partition

2018-06-27 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21687. - Resolution: Fixed Fix Version/s: 2.4.0 > Spark SQL should set createTime for Hive partition >

[jira] [Assigned] (SPARK-21687) Spark SQL should set createTime for Hive partition

2018-06-27 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-21687: --- Assignee: Chaozhong Yang > Spark SQL should set createTime for Hive partition >

[jira] [Commented] (SPARK-23648) extend hint syntax to support any expression for R

2018-06-27 Thread Huaxin Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525373#comment-16525373 ] Huaxin Gao commented on SPARK-23648: I will work on this and submit a PR soon. Thanks! > extend

[jira] [Commented] (SPARK-24208) Cannot resolve column in self join after applying Pandas UDF

2018-06-27 Thread Stu (Michael Stewart) (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525363#comment-16525363 ] Stu (Michael Stewart) commented on SPARK-24208: --- I can confirm this does not work on

[jira] [Resolved] (SPARK-24446) Library path with special characters breaks Spark on YARN

2018-06-27 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24446. Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 2.4.0 > Library

[jira] [Updated] (SPARK-24668) PySpark crashes when getting the webui url if the webui is disabled

2018-06-27 Thread Karthik Palaniappan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karthik Palaniappan updated SPARK-24668: Environment: * Spark 2.3.0 * Spark-on-YARN * Java 8 * Python 3.6.5 * Jupyter

[jira] [Created] (SPARK-24668) PySpark crashes when getting the webui url if the webui is disabled

2018-06-27 Thread Karthik Palaniappan (JIRA)
Karthik Palaniappan created SPARK-24668: --- Summary: PySpark crashes when getting the webui url if the webui is disabled Key: SPARK-24668 URL: https://issues.apache.org/jira/browse/SPARK-24668

[jira] [Resolved] (SPARK-23556) design doc for write side

2018-06-27 Thread Jose Torres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jose Torres resolved SPARK-23556. - Resolution: Fixed > design doc for write side > - > >

[jira] [Resolved] (SPARK-23557) design doc for read side

2018-06-27 Thread Jose Torres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jose Torres resolved SPARK-23557. - Resolution: Done > design doc for read side > > > Key:

[jira] [Commented] (SPARK-23557) design doc for read side

2018-06-27 Thread Jose Torres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525308#comment-16525308 ] Jose Torres commented on SPARK-23557: -

[jira] [Commented] (SPARK-23102) Migrate kafka sink

2018-06-27 Thread Jose Torres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525307#comment-16525307 ] Jose Torres commented on SPARK-23102: - Yeah, KafkaStreamWriter already took care of this issue. I

[jira] [Resolved] (SPARK-23102) Migrate kafka sink

2018-06-27 Thread Jose Torres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jose Torres resolved SPARK-23102. - Resolution: Duplicate > Migrate kafka sink > -- > > Key:

[jira] [Commented] (SPARK-23014) Migrate MemorySink fully to v2

2018-06-27 Thread Jose Torres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525305#comment-16525305 ] Jose Torres commented on SPARK-23014: - I'm not currently. I ran into a problem trying to make

[jira] [Commented] (SPARK-24630) SPIP: Support SQLStreaming in Spark

2018-06-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525290#comment-16525290 ] Shixiong Zhu commented on SPARK-24630: -- Structured Streaming supports standard SQL as the batch

[jira] [Updated] (SPARK-24630) SPIP: Support SQLStreaming in Spark

2018-06-27 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-24630: - Component/s: (was: SQL) Structured Streaming > SPIP: Support SQLStreaming

[jira] [Issue Comment Deleted] (SPARK-12009) Avoid re-allocate yarn container while driver want to stop all Executors

2018-06-27 Thread Andrei Iatsuk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Iatsuk updated SPARK-12009: -- Comment: was deleted (was: Apache Spark 2.1.2: {code:java} SparkSession sparkSession =

[jira] [Commented] (SPARK-24208) Cannot resolve column in self join after applying Pandas UDF

2018-06-27 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525210#comment-16525210 ] Marco Gaido commented on SPARK-24208: - I think this may be a duplicate of SPARK-24373. Can you try

[jira] [Commented] (SPARK-12009) Avoid re-allocate yarn container while driver want to stop all Executors

2018-06-27 Thread Andrey Yatsuk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525190#comment-16525190 ] Andrey Yatsuk commented on SPARK-12009: --- Apache Spark 2.1.2: {code:java} SparkSession

[jira] [Commented] (SPARK-18258) Sinks need access to offset representation

2018-06-27 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525137#comment-16525137 ] Cody Koeninger commented on SPARK-18258: [~Yohan123] This ticket is about giving implementors of

[jira] [Commented] (SPARK-18258) Sinks need access to offset representation

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524921#comment-16524921 ] Richard Yu commented on SPARK-18258: Just a question, I noticed that in {{KafkaSink}}'s particular

[jira] [Comment Edited] (SPARK-18258) Sinks need access to offset representation

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524921#comment-16524921 ] Richard Yu edited comment on SPARK-18258 at 6/27/18 11:12 AM: -- Just a

[jira] [Commented] (SPARK-24642) Add a function which infers schema from a JSON column

2018-06-27 Thread Maxim Gekk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524802#comment-16524802 ] Maxim Gekk commented on SPARK-24642: > Do we want this as an aggregate function? I thought of

[jira] [Created] (SPARK-24667) If folders managed by DiskBlockManager are deleted manually, shell throws FileNotFoundException

2018-06-27 Thread wuyonghua (JIRA)
wuyonghua created SPARK-24667: - Summary: If folders managed by DiskBlockManager are deleted manually, shell throws FileNotFoundException Key: SPARK-24667 URL: https://issues.apache.org/jira/browse/SPARK-24667

[jira] [Commented] (SPARK-23904) Big execution plan cause OOM

2018-06-27 Thread Izek Greenfield (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524720#comment-16524720 ] Izek Greenfield commented on SPARK-23904: - [~RBerenguel] Have you find something? BTW your

[jira] [Created] (SPARK-24666) Word2Vec generate infinity vectors when numIterations are large

2018-06-27 Thread ZhongYu (JIRA)
ZhongYu created SPARK-24666: --- Summary: Word2Vec generate infinity vectors when numIterations are large Key: SPARK-24666 URL: https://issues.apache.org/jira/browse/SPARK-24666 Project: Spark Issue

[jira] [Commented] (SPARK-23102) Migrate kafka sink

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524675#comment-16524675 ] Richard Yu commented on SPARK-23102: I took a look, and it looks like {{KafkaStreamWriter}} already

[jira] [Commented] (SPARK-23102) Migrate kafka sink

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524628#comment-16524628 ] Richard Yu commented on SPARK-23102: Hi [~joseph.torres] Mind if I take this JIRA? > Migrate kafka

[jira] [Issue Comment Deleted] (SPARK-23102) Migrate kafka sink

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Yu updated SPARK-23102: --- Comment: was deleted (was: [~joseph.torres] Just a question: I have noted that 

[jira] [Comment Edited] (SPARK-23102) Migrate kafka sink

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524615#comment-16524615 ] Richard Yu edited comment on SPARK-23102 at 6/27/18 6:04 AM: -

[jira] [Comment Edited] (SPARK-23102) Migrate kafka sink

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524615#comment-16524615 ] Richard Yu edited comment on SPARK-23102 at 6/27/18 6:03 AM: -

[jira] [Commented] (SPARK-23102) Migrate kafka sink

2018-06-27 Thread Richard Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16524615#comment-16524615 ] Richard Yu commented on SPARK-23102: Just a question: I have noted that