[jira] [Commented] (SPARK-9135) Filter fails when filtering with a method reference to overloaded method

2017-01-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822742#comment-15822742 ] Hyukjin Kwon commented on SPARK-9135: - It still happens in the master branch. > Filter fails when

[jira] [Resolved] (SPARK-6645) StructField/StructType and related classes are not in the Scaladoc

2017-01-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-6645. - Resolution: Not A Problem It seems already documented in Scaladoc/Javadoc. >

[jira] [Updated] (SPARK-19222) Limit Query Performance issue

2017-01-13 Thread Sujith (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sujith updated SPARK-19222: --- Description: Performance/memory bottle neck occurs in the below mentioned query case 1: create table t1 as

[jira] [Commented] (SPARK-19223) InputFileBlockHolder doesn't work with Python UDF for datasource other than FileFormat

2017-01-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822733#comment-15822733 ] Liang-Chi Hsieh commented on SPARK-19223: - Hi [~someonehere15], For the issue on spark-xml

[jira] [Commented] (SPARK-4862) Streaming | Setting checkpoint as a local directory results in Checkpoint RDD has different partitions error

2017-01-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822731#comment-15822731 ] Hyukjin Kwon commented on SPARK-4862: - [~aniket] Would you be able to try this in 2.x or the current

[jira] [Assigned] (SPARK-19223) InputFileBlockHolder doesn't work with Python UDF for datasource other than FileFormat

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19223: Assignee: (was: Apache Spark) > InputFileBlockHolder doesn't work with Python UDF for

[jira] [Commented] (SPARK-19223) InputFileBlockHolder doesn't work with Python UDF for datasource other than FileFormat

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822729#comment-15822729 ] Apache Spark commented on SPARK-19223: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19223) InputFileBlockHolder doesn't work with Python UDF for datasource other than FileFormat

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19223: Assignee: Apache Spark > InputFileBlockHolder doesn't work with Python UDF for datasource

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2017-01-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822726#comment-15822726 ] Hyukjin Kwon commented on SPARK-2620: - ^ I can still reproduce this. > case class cannot be used as

[jira] [Commented] (SPARK-3249) Fix links in ScalaDoc that cause warning messages in `sbt/sbt unidoc`

2017-01-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822722#comment-15822722 ] Hyukjin Kwon commented on SPARK-3249: - It prints as below after building this via {{jekyll build}} :

[jira] [Created] (SPARK-19223) InputFileBlockHolder doesn't work with Python UDF for datasource other than FileFormat

2017-01-13 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-19223: --- Summary: InputFileBlockHolder doesn't work with Python UDF for datasource other than FileFormat Key: SPARK-19223 URL: https://issues.apache.org/jira/browse/SPARK-19223

[jira] [Created] (SPARK-19222) Limit Query Performance issue

2017-01-13 Thread Sujith (JIRA)
Sujith created SPARK-19222: -- Summary: Limit Query Performance issue Key: SPARK-19222 URL: https://issues.apache.org/jira/browse/SPARK-19222 Project: Spark Issue Type: Bug Components: SQL

[jira] [Commented] (SPARK-2356) Exception: Could not locate executable null\bin\winutils.exe in the Hadoop

2017-01-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822717#comment-15822717 ] Hyukjin Kwon commented on SPARK-2356: - Is this really Spark-related issue? > Exception: Could not

[jira] [Resolved] (SPARK-2153) CassandraTest fails for newer Cassandra due to case insensitive key space

2017-01-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-2153. - Resolution: Not A Problem It seems we don't have the test in the master anymore. It seems related

[jira] [Commented] (SPARK-19221) Add winutils binaries to Path in AppVeyor for Hadoop libraries to call native libraries properly

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822693#comment-15822693 ] Apache Spark commented on SPARK-19221: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-19221) Add winutils binaries to Path in AppVeyor for Hadoop libraries to call native libraries properly

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19221: Assignee: (was: Apache Spark) > Add winutils binaries to Path in AppVeyor for Hadoop

[jira] [Assigned] (SPARK-19221) Add winutils binaries to Path in AppVeyor for Hadoop libraries to call native libraries properly

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19221: Assignee: Apache Spark > Add winutils binaries to Path in AppVeyor for Hadoop libraries

[jira] [Created] (SPARK-19221) Add winutils binaries to Path in AppVeyor for Hadoop libraries to call native libraries properly

2017-01-13 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-19221: Summary: Add winutils binaries to Path in AppVeyor for Hadoop libraries to call native libraries properly Key: SPARK-19221 URL: https://issues.apache.org/jira/browse/SPARK-19221

[jira] [Updated] (SPARK-19178) convert string of large numbers to int should return null

2017-01-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-19178: Fix Version/s: 2.1.1 2.0.3 > convert string of large numbers to int should

[jira] [Commented] (SPARK-19153) DataFrameWriter.saveAsTable should work with hive format to create partitioned table

2017-01-13 Thread Shuai Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822661#comment-15822661 ] Shuai Lin commented on SPARK-19153: --- I'm working on this ticket, thanks. > DataFrameWriter.saveAsTable

[jira] [Comment Edited] (SPARK-18667) input_file_name function does not work with UDF

2017-01-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822631#comment-15822631 ] Liang-Chi Hsieh edited comment on SPARK-18667 at 1/14/17 2:00 AM: --

[jira] [Commented] (SPARK-18667) input_file_name function does not work with UDF

2017-01-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822631#comment-15822631 ] Liang-Chi Hsieh commented on SPARK-18667: - [~someonehere15], Yeah, I can reproduce that the last

[jira] [Assigned] (SPARK-19129) alter table table_name drop partition with a empty string will drop the whole table

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19129: Assignee: Xiao Li (was: Apache Spark) > alter table table_name drop partition with a

[jira] [Commented] (SPARK-19129) alter table table_name drop partition with a empty string will drop the whole table

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822575#comment-15822575 ] Apache Spark commented on SPARK-19129: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19129) alter table table_name drop partition with a empty string will drop the whole table

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19129: Assignee: Apache Spark (was: Xiao Li) > alter table table_name drop partition with a

[jira] [Commented] (SPARK-11520) RegressionMetrics should support instance weights

2017-01-13 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822556#comment-15822556 ] Ilya Matiach commented on SPARK-11520: -- I've sent a pull request that includes this JIRA and

[jira] [Commented] (SPARK-19208) MaxAbsScaler and MinMaxScaler are very inefficient

2017-01-13 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822549#comment-15822549 ] Ilya Matiach commented on SPARK-19208: -- [~srowen] isn't feature hashing (eg HashingTF) to large bit

[jira] [Commented] (SPARK-18821) Bisecting k-means wrapper in SparkR

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822530#comment-15822530 ] Apache Spark commented on SPARK-18821: -- User 'wangmiao1981' has created a pull request for this

[jira] [Assigned] (SPARK-18821) Bisecting k-means wrapper in SparkR

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18821: Assignee: (was: Apache Spark) > Bisecting k-means wrapper in SparkR >

[jira] [Assigned] (SPARK-18821) Bisecting k-means wrapper in SparkR

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18821: Assignee: Apache Spark > Bisecting k-means wrapper in SparkR >

[jira] [Commented] (SPARK-18739) Models in pyspark.classification and regression support setXXXCol methods

2017-01-13 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822495#comment-15822495 ] Bryan Cutler commented on SPARK-18739: -- What about other missing methods from models, like param

[jira] [Assigned] (SPARK-19220) SSL redirect handler only redirects the server's root

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19220: Assignee: Apache Spark > SSL redirect handler only redirects the server's root >

[jira] [Assigned] (SPARK-19220) SSL redirect handler only redirects the server's root

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19220: Assignee: (was: Apache Spark) > SSL redirect handler only redirects the server's root

[jira] [Commented] (SPARK-19220) SSL redirect handler only redirects the server's root

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822478#comment-15822478 ] Apache Spark commented on SPARK-19220: -- User 'vanzin' has created a pull request for this issue:

[jira] [Commented] (SPARK-19129) alter table table_name drop partition with a empty string will drop the whole table

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822445#comment-15822445 ] Xiao Li commented on SPARK-19129: - This is actually a bug in Hive. Anyway, Spark can detect it and block

[jira] [Updated] (SPARK-19213) FileSourceScanExec uses SparkSession from HadoopFsRelation creation time instead of the active session at execution time

2017-01-13 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Ash updated SPARK-19213: --- Summary: FileSourceScanExec uses SparkSession from HadoopFsRelation creation time instead of the

[jira] [Resolved] (SPARK-18568) vertex attributes in the edge triplet not getting updated in super steps for Pregel API

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18568. --- Resolution: Not A Problem Yes, the result of updating a mutable object in an RDD is undefined. If

[jira] [Commented] (SPARK-18568) vertex attributes in the edge triplet not getting updated in super steps for Pregel API

2017-01-13 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822397#comment-15822397 ] Andrew Ray commented on SPARK-18568: RDD's have the same problem for cached collections of mutable

[jira] [Resolved] (SPARK-19180) the offset of short is 4 in OffHeapColumnVector's putShorts

2017-01-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-19180. Resolution: Fixed Fix Version/s: 2.0.3 2.1.1 Issue resolved by pull

[jira] [Updated] (SPARK-18682) Batch Source for Kafka

2017-01-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18682: - Assignee: Tyson Condie > Batch Source for Kafka > -- > >

[jira] [Assigned] (SPARK-19113) Fix flaky test: o.a.s.sql.streaming.StreamSuite fatal errors from a source should be sent to the user

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19113: Assignee: Shixiong Zhu (was: Apache Spark) > Fix flaky test:

[jira] [Created] (SPARK-19220) SSL redirect handler only redirects the server's root

2017-01-13 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-19220: -- Summary: SSL redirect handler only redirects the server's root Key: SPARK-19220 URL: https://issues.apache.org/jira/browse/SPARK-19220 Project: Spark

[jira] [Assigned] (SPARK-19129) alter table table_name drop partition with a empty string will drop the whole table

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-19129: --- Assignee: Xiao Li > alter table table_name drop partition with a empty string will drop the whole

[jira] [Commented] (SPARK-19129) alter table table_name drop partition with a empty string will drop the whole table

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822330#comment-15822330 ] Xiao Li commented on SPARK-19129: - This is a bug we need to fix. Let me try it. Thanks! > alter table

[jira] [Updated] (SPARK-19129) alter table table_name drop partition with a empty string will drop the whole table

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-19129: Priority: Critical (was: Major) > alter table table_name drop partition with a empty string will drop the

[jira] [Updated] (SPARK-19129) alter table table_name drop partition with a empty string will drop the whole table

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-19129: Labels: correctness (was: ) > alter table table_name drop partition with a empty string will drop the

[jira] [Assigned] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18589: Assignee: Davies Liu (was: Apache Spark) > persist() resolves

[jira] [Assigned] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18589: Assignee: Apache Spark (was: Davies Liu) > persist() resolves

[jira] [Commented] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822278#comment-15822278 ] Apache Spark commented on SPARK-18589: -- User 'davies' has created a pull request for this issue:

[jira] [Closed] (SPARK-18475) Be able to provide higher parallelization for StructuredStreaming Kafka Source

2017-01-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust closed SPARK-18475. Resolution: Won't Fix > Be able to provide higher parallelization for StructuredStreaming

[jira] [Updated] (SPARK-18970) FileSource failure during file list refresh doesn't cause an application to fail, but stops further processing

2017-01-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18970: - Description: Spark streaming application uses S3 files as streaming sources. After

[jira] [Updated] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2017-01-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-18589: --- Priority: Critical (was: Minor) > persist() resolves "java.lang.RuntimeException: Invalid PythonUDF

[jira] [Assigned] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2017-01-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-18589: -- Assignee: Davies Liu > persist() resolves "java.lang.RuntimeException: Invalid PythonUDF >

[jira] [Commented] (SPARK-17993) Spark prints an avalanche of warning messages from Parquet when reading parquet files written by older versions of Parquet-mr

2017-01-13 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822248#comment-15822248 ] Michael Allman commented on SPARK-17993: [~emre.colak] FYI

[jira] [Updated] (SPARK-19213) FileSourceScanExec usese sparksession from hadoopfsrelation creation time instead of the one active at time of execution

2017-01-13 Thread Robert Kruszewski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kruszewski updated SPARK-19213: -- Description: If you look at

[jira] [Updated] (SPARK-19213) FileSourceScanExec usese sparksession from hadoopfsrelation creation time instead of the one active at time of execution

2017-01-13 Thread Robert Kruszewski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kruszewski updated SPARK-19213: -- Description: If you look at

[jira] [Updated] (SPARK-19213) FileSourceScanExec usese sparksession from hadoopfsrelation creation time instead of the one active at time of execution

2017-01-13 Thread Robert Kruszewski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kruszewski updated SPARK-19213: -- Description: If you look at

[jira] [Commented] (SPARK-19219) Parquet log output overly verbose by default

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822238#comment-15822238 ] Apache Spark commented on SPARK-19219: -- User 'nicklavers' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19219) Parquet log output overly verbose by default

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19219: Assignee: Apache Spark > Parquet log output overly verbose by default >

[jira] [Assigned] (SPARK-19219) Parquet log output overly verbose by default

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19219: Assignee: (was: Apache Spark) > Parquet log output overly verbose by default >

[jira] [Commented] (SPARK-19217) Offer easy cast from vector to array

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1585#comment-1585 ] Sean Owen commented on SPARK-19217: --- It makes some sense to me, as I also find I write a UDF to do this

[jira] [Commented] (SPARK-19209) "No suitable driver" on first try

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1584#comment-1584 ] Xiao Li commented on SPARK-19209: - This could be caused by the classLoader issue. Anyway, let me first

[jira] [Updated] (SPARK-19213) FileSourceScanExec usese sparksession from hadoopfsrelation creation time instead of the one active at time of execution

2017-01-13 Thread Robert Kruszewski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kruszewski updated SPARK-19213: -- Description: If you look at

[jira] [Created] (SPARK-19219) Parquet log output overly verbose by default

2017-01-13 Thread Nicholas (JIRA)
Nicholas created SPARK-19219: Summary: Parquet log output overly verbose by default Key: SPARK-19219 URL: https://issues.apache.org/jira/browse/SPARK-19219 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-19209) "No suitable driver" on first try

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822201#comment-15822201 ] Xiao Li commented on SPARK-19209: - I am trying to find a workaround for your case. Could you add an extra

[jira] [Comment Edited] (SPARK-19209) "No suitable driver" on first try

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822186#comment-15822186 ] Xiao Li edited comment on SPARK-19209 at 1/13/17 7:10 PM: -- Do you also hit the

[jira] [Commented] (SPARK-19209) "No suitable driver" on first try

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822186#comment-15822186 ] Xiao Li commented on SPARK-19209: - Did you also hit the same exception `java.sql.SQLException: No

[jira] [Commented] (SPARK-19218) SET command should show a sorted result

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822181#comment-15822181 ] Apache Spark commented on SPARK-19218: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-19218) SET command should show a sorted result

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19218: Assignee: (was: Apache Spark) > SET command should show a sorted result >

[jira] [Assigned] (SPARK-19218) SET command should show a sorted result

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19218: Assignee: Apache Spark > SET command should show a sorted result >

[jira] [Created] (SPARK-19218) SET command should show a sorted result

2017-01-13 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-19218: - Summary: SET command should show a sorted result Key: SPARK-19218 URL: https://issues.apache.org/jira/browse/SPARK-19218 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-19209) "No suitable driver" on first try

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822154#comment-15822154 ] Xiao Li commented on SPARK-19209: - Thanks for reporting the regression. Let me take a look at this. >

[jira] [Deleted] (SPARK-19205) "No suitable driver" on first try

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19205: -- > "No suitable driver" on first try > - > > Key:

[jira] [Deleted] (SPARK-19204) "No suitable driver" on first try

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19204: -- > "No suitable driver" on first try > - > > Key:

[jira] [Comment Edited] (SPARK-19209) "No suitable driver" on first try

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822145#comment-15822145 ] Xiao Li edited comment on SPARK-19209 at 1/13/17 6:50 PM: -- It sounds like you

[jira] [Reopened] (SPARK-19209) "No suitable driver" on first try

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reopened SPARK-19209: - > "No suitable driver" on first try > - > > Key: SPARK-19209

[jira] [Commented] (SPARK-19209) "No suitable driver" on first try

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822145#comment-15822145 ] Xiao Li commented on SPARK-19209: - It sounds like you create multiple duplicate JIRAs: SPARK-19204,

[jira] [Updated] (SPARK-19113) Fix flaky test: o.a.s.sql.streaming.StreamSuite fatal errors from a source should be sent to the user

2017-01-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19113: - Fix Version/s: (was: 2.1.1) (was: 2.2.0) > Fix flaky test:

[jira] [Reopened] (SPARK-19113) Fix flaky test: o.a.s.sql.streaming.StreamSuite fatal errors from a source should be sent to the user

2017-01-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reopened SPARK-19113: -- Reopened it as it's still flaky > Fix flaky test: o.a.s.sql.streaming.StreamSuite fatal errors

[jira] [Closed] (SPARK-19209) "No suitable driver" on first try

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li closed SPARK-19209. --- Resolution: Duplicate > "No suitable driver" on first try > - > >

[jira] [Created] (SPARK-19217) Offer easy cast from vector to array

2017-01-13 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-19217: Summary: Offer easy cast from vector to array Key: SPARK-19217 URL: https://issues.apache.org/jira/browse/SPARK-19217 Project: Spark Issue Type:

[jira] [Closed] (SPARK-19131) Support "alter table drop partition [if exists]"

2017-01-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-19131. - Resolution: Invalid Hi, [~licl]. I'm closing this issue because it's already supported feature.

[jira] [Commented] (SPARK-4502) Spark SQL reads unneccesary nested fields from Parquet

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822120#comment-15822120 ] Apache Spark commented on SPARK-4502: - User 'mallman' has created a pull request for this issue:

[jira] [Commented] (SPARK-4502) Spark SQL reads unneccesary nested fields from Parquet

2017-01-13 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822111#comment-15822111 ] Michael Allman commented on SPARK-4502: --- Hi Guys, I'm going to submit a PR for this shortly. We've

[jira] [Commented] (SPARK-19216) LogisticRegressionModel is missing getThreshold()

2017-01-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822094#comment-15822094 ] Nicholas Chammas commented on SPARK-19216: -- cc [~josephkb] - Is this a valid gap in Python's

[jira] [Created] (SPARK-19216) LogisticRegressionModel is missing getThreshold()

2017-01-13 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-19216: Summary: LogisticRegressionModel is missing getThreshold() Key: SPARK-19216 URL: https://issues.apache.org/jira/browse/SPARK-19216 Project: Spark

[jira] [Resolved] (SPARK-18335) Add a numSlices parameter to SparkR's createDataFrame

2017-01-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-18335. --- Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 Issue

[jira] [Commented] (SPARK-19186) Hash symbol in middle of Sybase database table name causes Spark Exception

2017-01-13 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822053#comment-15822053 ] Dongjoon Hyun commented on SPARK-19186: --- Hi, [~schulewa]. It looks like

[jira] [Resolved] (SPARK-19092) Save() API of DataFrameWriter should not scan all the saved files

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-19092. - Resolution: Fixed Fix Version/s: 2.2.0 > Save() API of DataFrameWriter should not scan all the

[jira] [Assigned] (SPARK-19092) Save() API of DataFrameWriter should not scan all the saved files

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-19092: --- Assignee: Xiao Li > Save() API of DataFrameWriter should not scan all the saved files >

[jira] [Resolved] (SPARK-17237) DataFrame fill after pivot causing org.apache.spark.sql.AnalysisException

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-17237. - Resolution: Fixed Assignee: Takeshi Yamamuro Fix Version/s: 2.2.0

[jira] [Resolved] (SPARK-19142) spark.kmeans should take seed, initSteps, and tol as parameters

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-19142. - Resolution: Fixed Assignee: Miao Wang Fix Version/s: 2.2.0 > spark.kmeans should take

[jira] [Resolved] (SPARK-19178) convert string of large numbers to int should return null

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-19178. - Resolution: Fixed Fix Version/s: 2.2.0 > convert string of large numbers to int should return

[jira] [Commented] (SPARK-13857) Feature parity for ALS ML with MLLIB

2017-01-13 Thread Danilo Ascione (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822021#comment-15822021 ] Danilo Ascione commented on SPARK-13857: I have a pipeline similar to [~abudd2014]'s one. I have

[jira] [Resolved] (SPARK-18687) Backward compatibility - creating a Dataframe on a new SQLContext object fails with a Derby error

2017-01-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-18687. - Resolution: Fixed Assignee: Vinayak Joshi Fix Version/s: 2.2.0 2.1.1

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

  1   2   3   >