[jira] [Commented] (SPARK-20408) Get glob path in parallel to reduce resolve relation time

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520976#comment-16520976 ] Apache Spark commented on SPARK-20408: -- User 'xuanyuanking' has created a pull request for this

[jira] [Commented] (SPARK-23710) Upgrade Hive to 2.3.2

2018-06-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520967#comment-16520967 ] Hyukjin Kwon commented on SPARK-23710: -- Yup, agree with that should better be done first and this

[jira] [Assigned] (SPARK-24634) Add a new metric regarding number of rows later than watermark

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24634: Assignee: Apache Spark > Add a new metric regarding number of rows later than watermark

[jira] [Commented] (SPARK-24634) Add a new metric regarding number of rows later than watermark

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520950#comment-16520950 ] Apache Spark commented on SPARK-24634: -- User 'HeartSaVioR' has created a pull request for this

[jira] [Assigned] (SPARK-24634) Add a new metric regarding number of rows later than watermark

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24634: Assignee: (was: Apache Spark) > Add a new metric regarding number of rows later than

[jira] [Commented] (SPARK-24634) Add a new metric regarding number of rows later than watermark

2018-06-22 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520948#comment-16520948 ] Jungtaek Lim commented on SPARK-24634: -- Working on this. Will submit a patch soon. > Add a new

[jira] [Created] (SPARK-24634) Add a new metric regarding number of rows later than watermark

2018-06-22 Thread Jungtaek Lim (JIRA)
Jungtaek Lim created SPARK-24634: Summary: Add a new metric regarding number of rows later than watermark Key: SPARK-24634 URL: https://issues.apache.org/jira/browse/SPARK-24634 Project: Spark

[jira] [Created] (SPARK-24633) arrays_zip function's code generator splits input processing incorrectly

2018-06-22 Thread Bruce Robbins (JIRA)
Bruce Robbins created SPARK-24633: - Summary: arrays_zip function's code generator splits input processing incorrectly Key: SPARK-24633 URL: https://issues.apache.org/jira/browse/SPARK-24633 Project:

[jira] [Commented] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate

2018-06-22 Thread Sivakumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520879#comment-16520879 ] Sivakumar commented on SPARK-24631: --- Its resolved, Actually I was trying to query a view. Recreate d

[jira] [Commented] (SPARK-23704) PySpark access of individual trees in random forest is slow

2018-06-22 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520832#comment-16520832 ] Seth Hendrickson commented on SPARK-23704: -- Instead of {code:java}

[jira] [Resolved] (SPARK-24532) HiveExternalCatalogVersionSuite should be resilient to missing versions

2018-06-22 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24532. Resolution: Won't Fix I've added documentation on the release docs about this test, so

[jira] [Commented] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate

2018-06-22 Thread vaquar khan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520785#comment-16520785 ] vaquar khan commented on SPARK-24631: - You just need to cast column , issue will be resolved {{}} >

[jira] [Updated] (SPARK-22897) Expose stageAttemptId in TaskContext

2018-06-22 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-22897: --- Fix Version/s: 2.1.3 > Expose stageAttemptId in TaskContext >

[jira] [Updated] (SPARK-24589) OutputCommitCoordinator may allow duplicate commits

2018-06-22 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-24589: --- Fix Version/s: 2.1.3 > OutputCommitCoordinator may allow duplicate commits >

[jira] [Commented] (SPARK-24552) Task attempt numbers are reused when stages are retried

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520752#comment-16520752 ] Apache Spark commented on SPARK-24552: -- User 'vanzin' has created a pull request for this issue:

[jira] [Commented] (SPARK-24552) Task attempt numbers are reused when stages are retried

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520742#comment-16520742 ] Apache Spark commented on SPARK-24552: -- User 'vanzin' has created a pull request for this issue:

[jira] [Commented] (SPARK-14922) Alter Table Drop Partition Using Predicate-based Partition Spec

2018-06-22 Thread nirav patel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520715#comment-16520715 ] nirav patel commented on SPARK-14922: - Hi Any updates on this? Is there any workaround meanwhile? Is

[jira] [Assigned] (SPARK-24632) Allow 3rd-party libraries to use pyspark.ml abstractions for Java wrappers for persistence

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-24632: - Assignee: Joseph K. Bradley > Allow 3rd-party libraries to use pyspark.ml

[jira] [Updated] (SPARK-24632) Allow 3rd-party libraries to use pyspark.ml abstractions for Java wrappers for persistence

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-24632: -- Description: This is a follow-up for [SPARK-17025], which allowed users to implement

[jira] [Resolved] (SPARK-21926) Compatibility between ML Transformers and Structured Streaming

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21926. --- Resolution: Fixed Fix Version/s: 2.3.0 Marking fix version as 2.3.0 since

[jira] [Updated] (SPARK-24465) LSHModel should support Structured Streaming for transform

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-24465: -- Description: Locality Sensitive Hashing (LSH) Models

[jira] [Comment Edited] (SPARK-24465) LSHModel should support Structured Streaming for transform

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520677#comment-16520677 ] Joseph K. Bradley edited comment on SPARK-24465 at 6/22/18 6:39 PM:

[jira] [Commented] (SPARK-24465) LSHModel should support Structured Streaming for transform

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520677#comment-16520677 ] Joseph K. Bradley commented on SPARK-24465: --- Oh actually I think I made this by mistake? I

[jira] [Resolved] (SPARK-24465) LSHModel should support Structured Streaming for transform

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-24465. --- Resolution: Fixed Assignee: Joseph K. Bradley Fix Version/s: 2.3.1

[jira] [Updated] (SPARK-24465) LSHModel should support Structured Streaming for transform

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-24465: -- Description: Locality Sensitive Hashing (LSH) Models

[jira] [Commented] (SPARK-24465) LSHModel should support Structured Streaming for transform

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520671#comment-16520671 ] Joseph K. Bradley commented on SPARK-24465: --- You're right; I did not read [SPARK-12878]

[jira] [Updated] (SPARK-12878) Dataframe fails with nested User Defined Types

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12878: -- Description: Spark 1.6.0 crashes when using nested User Defined Types in a Dataframe.

[jira] [Commented] (SPARK-19498) Discussion: Making MLlib APIs extensible for 3rd party libraries

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520666#comment-16520666 ] Joseph K. Bradley commented on SPARK-19498: --- Sure, comments are welcome! Or links to JIRAs,

[jira] [Commented] (SPARK-22666) Spark datasource for image format

2018-06-22 Thread Jayesh lalwani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520657#comment-16520657 ] Jayesh lalwani commented on SPARK-22666: I'll try to take this on > Spark datasource for image

[jira] [Issue Comment Deleted] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17025: -- Comment: was deleted (was: Thank you for your e-mail. I am on businees travel until

[jira] [Assigned] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-17025: - Assignee: Ajay Saini > Cannot persist PySpark ML Pipeline model that includes

[jira] [Resolved] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-17025. --- Resolution: Fixed Fix Version/s: 2.3.0 Fixed by linked JIRAs > Cannot

[jira] [Created] (SPARK-24632) Allow 3rd-party libraries to use pyspark.ml abstractions for Java wrappers for persistence

2018-06-22 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-24632: - Summary: Allow 3rd-party libraries to use pyspark.ml abstractions for Java wrappers for persistence Key: SPARK-24632 URL:

[jira] [Commented] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2018-06-22 Thread Peter Knight (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520648#comment-16520648 ] Peter Knight commented on SPARK-17025: -- Thank you for your e-mail. I am on businees travel until

[jira] [Commented] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520646#comment-16520646 ] Joseph K. Bradley commented on SPARK-17025: --- We've tested it with Python-only implementations,

[jira] [Commented] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520635#comment-16520635 ] Joseph K. Bradley commented on SPARK-4591: -- There are still a few contained tasks which are

[jira] [Commented] (SPARK-11107) spark.ml should support more input column types: umbrella

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520634#comment-16520634 ] Joseph K. Bradley commented on SPARK-11107: --- There are still lots of Transformers and

[jira] [Resolved] (SPARK-11107) spark.ml should support more input column types: umbrella

2018-06-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11107. --- Resolution: Done > spark.ml should support more input column types: umbrella >

[jira] [Resolved] (SPARK-24372) Create script for preparing RCs

2018-06-22 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-24372. --- Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 2.4.0 > Create

[jira] [Resolved] (SPARK-24518) Using Hadoop credential provider API to store password

2018-06-22 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24518. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21548

[jira] [Assigned] (SPARK-24518) Using Hadoop credential provider API to store password

2018-06-22 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-24518: -- Assignee: Saisai Shao > Using Hadoop credential provider API to store password >

[jira] [Commented] (SPARK-24130) Data Source V2: Join Push Down

2018-06-22 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520587#comment-16520587 ] Dilip Biswal commented on SPARK-24130: -- [~Shurap1] We are currently waiting for feedback from the

[jira] [Issue Comment Deleted] (SPARK-24130) Data Source V2: Join Push Down

2018-06-22 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dilip Biswal updated SPARK-24130: - Comment: was deleted (was: [~Shurap1] We are currently waiting for feedback from the community

[jira] [Commented] (SPARK-24130) Data Source V2: Join Push Down

2018-06-22 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520581#comment-16520581 ] Dilip Biswal commented on SPARK-24130: -- [~Shurap1] We are currently waiting for feedback from the

[jira] [Commented] (SPARK-24611) Clean up OutputCommitCoordinator

2018-06-22 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520578#comment-16520578 ] Marcelo Vanzin commented on SPARK-24611: One more: adjust the test so that it ensures that state

[jira] [Comment Edited] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate

2018-06-22 Thread vaquar khan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520576#comment-16520576 ] vaquar khan edited comment on SPARK-24631 at 6/22/18 4:43 PM: -- Database

[jira] [Commented] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate

2018-06-22 Thread vaquar khan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520576#comment-16520576 ] vaquar khan commented on SPARK-24631: - Database type (MYSQL,Hbase etc ), column descriptions

[jira] [Updated] (SPARK-24130) Data Source V2: Join Push Down

2018-06-22 Thread Jia Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jia Li updated SPARK-24130: --- Attachment: Data Source V2 Join Push Down.pdf > Data Source V2: Join Push Down >

[jira] [Commented] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate

2018-06-22 Thread Sivakumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520551#comment-16520551 ] Sivakumar commented on SPARK-24631: --- Updated with some additional data. I have tried the same in

[jira] [Updated] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate

2018-06-22 Thread Sivakumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sivakumar updated SPARK-24631: -- Description: Getting the below error when executing the simple select query, Sample: Table

[jira] [Comment Edited] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate

2018-06-22 Thread vaquar khan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520533#comment-16520533 ] vaquar khan edited comment on SPARK-24631 at 6/22/18 3:57 PM: -- Can you add

[jira] [Commented] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate

2018-06-22 Thread vaquar khan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520533#comment-16520533 ] vaquar khan commented on SPARK-24631: - Can you add complete error logs and if possible smalll code

[jira] [Commented] (SPARK-23710) Upgrade Hive to 2.3.2

2018-06-22 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520522#comment-16520522 ] Marcelo Vanzin commented on SPARK-23710: There are a few places in Spark that are affected by a

[jira] [Created] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate

2018-06-22 Thread Sivakumar (JIRA)
Sivakumar created SPARK-24631: - Summary: Cannot up cast column from bigint to smallint as it may truncate Key: SPARK-24631 URL: https://issues.apache.org/jira/browse/SPARK-24631 Project: Spark

[jira] [Comment Edited] (SPARK-24130) Data Source V2: Join Push Down

2018-06-22 Thread vaquar khan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520461#comment-16520461 ] vaquar khan edited comment on SPARK-24130 at 6/22/18 3:02 PM: -- Could you

[jira] [Commented] (SPARK-24130) Data Source V2: Join Push Down

2018-06-22 Thread vaquar khan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520461#comment-16520461 ] vaquar khan commented on SPARK-24130: - Could you please update doc in Jira insted of google doc  >

[jira] [Resolved] (SPARK-24519) MapStatus has 2000 hardcoded

2018-06-22 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-24519. --- Resolution: Fixed Assignee: Hieu Tri Huynh Fix Version/s: 2.4.0 > MapStatus

[jira] [Commented] (SPARK-24130) Data Source V2: Join Push Down

2018-06-22 Thread Parshuram V Patki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520386#comment-16520386 ] Parshuram V Patki commented on SPARK-24130: --- [~jliwork] do you think this improvement will

[jira] [Updated] (SPARK-22897) Expose stageAttemptId in TaskContext

2018-06-22 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-22897: -- Fix Version/s: 2.2.2 > Expose stageAttemptId in TaskContext >

[jira] [Updated] (SPARK-24630) SPIP: Support SQLStreaming in Spark

2018-06-22 Thread Jackey Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jackey Lee updated SPARK-24630: --- Attachment: SQLStreaming SPIP.pdf > SPIP: Support SQLStreaming in Spark >

[jira] [Updated] (SPARK-24630) SPIP: Support SQLStreaming in Spark

2018-06-22 Thread Jackey Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jackey Lee updated SPARK-24630: --- Summary: SPIP: Support SQLStreaming in Spark (was: Support SQLStreaming in Spark) > SPIP: Support

[jira] [Created] (SPARK-24630) Support SQLStreaming in Spark

2018-06-22 Thread Jackey Lee (JIRA)
Jackey Lee created SPARK-24630: -- Summary: Support SQLStreaming in Spark Key: SPARK-24630 URL: https://issues.apache.org/jira/browse/SPARK-24630 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-24629) thrift server memory leak when beeline connection quits

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24629: Assignee: Apache Spark > thrift server memory leak when beeline connection quits >

[jira] [Assigned] (SPARK-24629) thrift server memory leak when beeline connection quits

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24629: Assignee: (was: Apache Spark) > thrift server memory leak when beeline connection

[jira] [Commented] (SPARK-24629) thrift server memory leak when beeline connection quits

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520219#comment-16520219 ] Apache Spark commented on SPARK-24629: -- User 'ChenjunZou' has created a pull request for this

[jira] [Updated] (SPARK-24629) thrift server memory leak when beeline connection quits

2018-06-22 Thread StephenZou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] StephenZou updated SPARK-24629: --- Description: When Beeline connection closes, spark thrift server (STS) will send a session close

[jira] [Updated] (SPARK-24629) thrift server memory leak when beeline connection quits

2018-06-22 Thread StephenZou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] StephenZou updated SPARK-24629: --- Attachment: .png > thrift server memory leak when beeline connection quits >

[jira] [Updated] (SPARK-24629) thrift server memory leak when beeline connection quits

2018-06-22 Thread StephenZou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] StephenZou updated SPARK-24629: --- Attachment: .png > thrift server memory leak when beeline connection quits >

[jira] [Created] (SPARK-24629) thrift server memory leak when beeline connection quits

2018-06-22 Thread StephenZou (JIRA)
StephenZou created SPARK-24629: -- Summary: thrift server memory leak when beeline connection quits Key: SPARK-24629 URL: https://issues.apache.org/jira/browse/SPARK-24629 Project: Spark Issue

[jira] [Commented] (SPARK-24458) Invalid PythonUDF check_1(), requires attributes from more than one child

2018-06-22 Thread Ruben Berenguel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520165#comment-16520165 ] Ruben Berenguel commented on SPARK-24458: - [~hyukjin.kwon] I just built 2.3.0 from the tagged

[jira] [Commented] (SPARK-24498) Add JDK compiler for runtime codegen

2018-06-22 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520161#comment-16520161 ] Takeshi Yamamuro commented on SPARK-24498: -- yea, that might be true now. But, I think we need

[jira] [Commented] (SPARK-20295) when spark.sql.adaptive.enabled is enabled, have conflict with Exchange Resue

2018-06-22 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520148#comment-16520148 ] Yuming Wang commented on SPARK-20295: - [~KevinZwx] Can you try

[jira] [Commented] (SPARK-24628) The example given to create a dense matrix using python has a mistake

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520141#comment-16520141 ] Apache Spark commented on SPARK-24628: -- User 'huangweizhe123' has created a pull request for this

[jira] [Assigned] (SPARK-24628) The example given to create a dense matrix using python has a mistake

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24628: Assignee: (was: Apache Spark) > The example given to create a dense matrix using

[jira] [Assigned] (SPARK-24628) The example given to create a dense matrix using python has a mistake

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24628: Assignee: Apache Spark > The example given to create a dense matrix using python has a

[jira] [Created] (SPARK-24628) The example given to create a dense matrix using python has a mistake

2018-06-22 Thread Weizhe Huang (JIRA)
Weizhe Huang created SPARK-24628: Summary: The example given to create a dense matrix using python has a mistake Key: SPARK-24628 URL: https://issues.apache.org/jira/browse/SPARK-24628 Project: Spark

[jira] [Commented] (SPARK-24498) Add JDK compiler for runtime codegen

2018-06-22 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520125#comment-16520125 ] Marco Gaido commented on SPARK-24498: - Thanks for your great analysis [~maropu]! Very interesting.

[jira] [Resolved] (SPARK-23603) When the length of the json is in a range,get_json_object will result in missing tail data

2018-06-22 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23603. -- Resolution: Duplicate 2.7.x has a regression we had to revert it back. See also

[jira] [Commented] (SPARK-24498) Add JDK compiler for runtime codegen

2018-06-22 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520090#comment-16520090 ] Takeshi Yamamuro commented on SPARK-24498: -- If there are javac options related to performance,

[jira] [Resolved] (SPARK-23934) High-order function: map_from_entries(array>) → map

2018-06-22 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-23934. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21282

[jira] [Assigned] (SPARK-23934) High-order function: map_from_entries(array>) → map

2018-06-22 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin reassigned SPARK-23934: - Assignee: Marek Novotny > High-order function: map_from_entries(array>) → map >

[jira] [Created] (SPARK-24627) [Spark2.3.0] After HDFS Token expire kinit not able to submit job using beeline

2018-06-22 Thread ABHISHEK KUMAR GUPTA (JIRA)
ABHISHEK KUMAR GUPTA created SPARK-24627: Summary: [Spark2.3.0] After HDFS Token expire kinit not able to submit job using beeline Key: SPARK-24627 URL: https://issues.apache.org/jira/browse/SPARK-24627

[jira] [Comment Edited] (SPARK-24498) Add JDK compiler for runtime codegen

2018-06-22 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520056#comment-16520056 ] Takeshi Yamamuro edited comment on SPARK-24498 at 6/22/18 6:59 AM: ---

[jira] [Commented] (SPARK-24498) Add JDK compiler for runtime codegen

2018-06-22 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520056#comment-16520056 ] Takeshi Yamamuro commented on SPARK-24498: -- The results of my investigation (sf=1 performance

[jira] [Commented] (SPARK-24569) Spark Aggregator with output type Option[Boolean] creates column of type Row

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16520042#comment-16520042 ] Apache Spark commented on SPARK-24569: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24569) Spark Aggregator with output type Option[Boolean] creates column of type Row

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24569: Assignee: Apache Spark > Spark Aggregator with output type Option[Boolean] creates

[jira] [Assigned] (SPARK-24569) Spark Aggregator with output type Option[Boolean] creates column of type Row

2018-06-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24569: Assignee: (was: Apache Spark) > Spark Aggregator with output type Option[Boolean]