[jira] [Commented] (SPARK-14734) Add conversions between mllib and ml Vector, Matrix types

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248462#comment-15248462 ] Apache Spark commented on SPARK-14734: -- User 'jkbradley' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14734) Add conversions between mllib and ml Vector, Matrix types

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14734: Assignee: Apache Spark (was: Joseph K. Bradley) > Add conversions between mllib and ml

[jira] [Assigned] (SPARK-14734) Add conversions between mllib and ml Vector, Matrix types

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14734: Assignee: Joseph K. Bradley (was: Apache Spark) > Add conversions between mllib and ml

[jira] [Commented] (SPARK-14037) count(df) is very slow for dataframe constructed using SparkR::createDataFrame

2016-04-19 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248416#comment-15248416 ] Shivaram Venkataraman commented on SPARK-14037: --- Thanks [~samalexg] and [~sunrui] for

[jira] [Resolved] (SPARK-13252) Bump up Kafka to 0.9.0.0

2016-04-19 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Grover resolved SPARK-13252. - Resolution: Won't Fix Marking this as won't fix, and taking the focus back on the discussion in

[jira] [Commented] (SPARK-14594) Improve error messages for RDD API

2016-04-19 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248392#comment-15248392 ] Shivaram Venkataraman commented on SPARK-14594: --- I think this would be very useful to have

[jira] [Commented] (SPARK-14325) some strange name conflicts in `group_by`

2016-04-19 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248374#comment-15248374 ] Shivaram Venkataraman commented on SPARK-14325: --- I think the problem might be related to

[jira] [Commented] (SPARK-14692) Error While Setting the path for R front end

2016-04-19 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248363#comment-15248363 ] Shivaram Venkataraman commented on SPARK-14692: --- I think SparkR might not have been built?

[jira] [Assigned] (SPARK-14733) Allow custom timing control in microbenchmarks

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14733: Assignee: Apache Spark > Allow custom timing control in microbenchmarks >

[jira] [Commented] (SPARK-14733) Allow custom timing control in microbenchmarks

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248359#comment-15248359 ] Apache Spark commented on SPARK-14733: -- User 'ericl' has created a pull request for this issue:

[jira] [Created] (SPARK-14734) Add conversions between mllib and ml Vector, Matrix types

2016-04-19 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14734: - Summary: Add conversions between mllib and ml Vector, Matrix types Key: SPARK-14734 URL: https://issues.apache.org/jira/browse/SPARK-14734 Project: Spark

[jira] [Assigned] (SPARK-14733) Allow custom timing control in microbenchmarks

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14733: Assignee: (was: Apache Spark) > Allow custom timing control in microbenchmarks >

[jira] [Created] (SPARK-14733) Allow custom timing control in microbenchmarks

2016-04-19 Thread Eric Liang (JIRA)
Eric Liang created SPARK-14733: -- Summary: Allow custom timing control in microbenchmarks Key: SPARK-14733 URL: https://issues.apache.org/jira/browse/SPARK-14733 Project: Spark Issue Type:

[jira] [Commented] (SPARK-14326) Can't specify "long" type in structField

2016-04-19 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248335#comment-15248335 ] Shivaram Venkataraman commented on SPARK-14326: --- I think it's fine to support this in

[jira] [Resolved] (SPARK-14675) ClassFormatError in codegen when using Aggregator

2016-04-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14675. -- Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 2.0.0 This issue has been

[jira] [Commented] (SPARK-14051) Implement `Double.NaN==Float.NaN` for consistency

2016-04-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248284#comment-15248284 ] Dongjoon Hyun commented on SPARK-14051: --- Hi, [~joshrosen]. Could you take a look at this PR? Sorry

[jira] [Updated] (SPARK-14730) Expose ColumnPruner as feature transformer

2016-04-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14730: -- Issue Type: New Feature (was: Improvement) > Expose ColumnPruner as feature

[jira] [Updated] (SPARK-14730) Expose ColumnPruner as feature transformer

2016-04-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14730: -- Affects Version/s: (was: 2.0.0) > Expose ColumnPruner as feature transformer >

[jira] [Commented] (SPARK-12148) SparkR: rename DataFrame to SparkDataFrame

2016-04-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248272#comment-15248272 ] Felix Cheung commented on SPARK-12148: -- I'm up for this if [~sunrui] you haven't started > SparkR:

[jira] [Commented] (SPARK-14414) Make error messages consistent across DDLs

2016-04-19 Thread Bo Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248265#comment-15248265 ] Bo Meng commented on SPARK-14414: - Can anyone update the 'Assignee' for this one, since my code was

[jira] [Resolved] (SPARK-14458) Wrong data schema is passed to FileFormat data sources that can't infer schema

2016-04-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14458. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12179

[jira] [Resolved] (SPARK-14566) When appending to partitioned persisted table, we should apply a projection over input query plan using existing metastore schema

2016-04-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14566. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12179

[jira] [Commented] (SPARK-14676) Catch, wrap, and re-throw exceptions from Await.result in order to capture full stacktrace

2016-04-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248257#comment-15248257 ] Reynold Xin commented on SPARK-14676: - [~joshrosen] I didn't merge this in 1.6.2 because it is a bit

[jira] [Resolved] (SPARK-14676) Catch, wrap, and re-throw exceptions from Await.result in order to capture full stacktrace

2016-04-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14676. - Resolution: Fixed Fix Version/s: 2.0.0 > Catch, wrap, and re-throw exceptions from

[jira] [Resolved] (SPARK-12457) Add ExpressionDescription to collection functions

2016-04-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12457. - Resolution: Fixed Assignee: Xiao Li Fix Version/s: 2.0.0 > Add

[jira] [Comment Edited] (SPARK-13662) [SQL][Hive] Have SHOW TABLES return additional fields from Hive MetaStore

2016-04-19 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248241#comment-15248241 ] Vijay Parmar edited comment on SPARK-13662 at 4/19/16 5:27 PM: --- Thank you

[jira] [Updated] (SPARK-14732) spark.ml GaussianMixture should not use spark.mllib MultivariateGaussian

2016-04-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14732: -- Description: {{org.apache.spark.ml.clustering.GaussianMixtureModel.gaussians}}

[jira] [Commented] (SPARK-13662) [SQL][Hive] Have SHOW TABLES return additional fields from Hive MetaStore

2016-04-19 Thread Vijay Parmar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248241#comment-15248241 ] Vijay Parmar commented on SPARK-13662: -- Thank you Evan! If possible then you please guide me to

[jira] [Commented] (SPARK-14564) Python Word2Vec missing setWindowSize method

2016-04-19 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248230#comment-15248230 ] Brad Willard commented on SPARK-14564: -- Do you guys think it's possible to get this in 1.6.2 release

[jira] [Commented] (SPARK-13745) Support columnar in memory representation on Big Endian platforms

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248221#comment-15248221 ] Apache Spark commented on SPARK-13745: -- User 'robbinspg' has created a pull request for this issue:

[jira] [Created] (SPARK-14732) spark.ml GaussianMixture should not use spark.mllib MultivariateGaussian

2016-04-19 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14732: - Summary: spark.ml GaussianMixture should not use spark.mllib MultivariateGaussian Key: SPARK-14732 URL: https://issues.apache.org/jira/browse/SPARK-14732

[jira] [Closed] (SPARK-13179) pyspark row name collision 'count'

2016-04-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-13179. -- Resolution: Won't Fix > pyspark row name collision 'count' > -- > >

[jira] [Commented] (SPARK-14709) spark.ml API for linear SVM

2016-04-19 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248186#comment-15248186 ] yuhao yang commented on SPARK-14709: I put the prototype in

[jira] [Resolved] (SPARK-14491) refactor object operator framework to make it easy to eliminate serializations

2016-04-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14491. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12260

[jira] [Created] (SPARK-14731) Revert SPARK-12130 to make 2.0 shuffle service compatible with 1.x

2016-04-19 Thread Mark Grover (JIRA)
Mark Grover created SPARK-14731: --- Summary: Revert SPARK-12130 to make 2.0 shuffle service compatible with 1.x Key: SPARK-14731 URL: https://issues.apache.org/jira/browse/SPARK-14731 Project: Spark

[jira] [Commented] (SPARK-14725) Remove HttpServer

2016-04-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248167#comment-15248167 ] Marcelo Vanzin commented on SPARK-14725: I don't remember if there's an option to use the HTTP

[jira] [Resolved] (SPARK-13681) Reimplement CommitFailureTestRelationSuite

2016-04-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-13681. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12179

[jira] [Updated] (SPARK-13681) Reimplement CommitFailureTestRelationSuite

2016-04-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-13681: - Assignee: Cheng Lian > Reimplement CommitFailureTestRelationSuite >

[jira] [Commented] (SPARK-13962) spark.ml Evaluators should support other numeric types for label

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248120#comment-15248120 ] Apache Spark commented on SPARK-13962: -- User 'BenFradet' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13962) spark.ml Evaluators should support other numeric types for label

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13962: Assignee: Apache Spark (was: Benjamin Fradet) > spark.ml Evaluators should support other

[jira] [Assigned] (SPARK-13962) spark.ml Evaluators should support other numeric types for label

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13962: Assignee: Benjamin Fradet (was: Apache Spark) > spark.ml Evaluators should support other

[jira] [Comment Edited] (SPARK-14034) Converting to Dataset causes wrong order and values in nested array of documents

2016-04-19 Thread Barry Jones (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247988#comment-15247988 ] Barry Jones edited comment on SPARK-14034 at 4/19/16 3:42 PM: -- I have the

[jira] [Commented] (SPARK-14034) Converting to Dataset causes wrong order and values in nested array of documents

2016-04-19 Thread Barry Jones (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247988#comment-15247988 ] Barry Jones commented on SPARK-14034: - I have the same issue. Nested data values are associated with

[jira] [Commented] (SPARK-10574) HashingTF should use MurmurHash3

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247794#comment-15247794 ] Apache Spark commented on SPARK-10574: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10574) HashingTF should use MurmurHash3

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10574: Assignee: Apache Spark (was: Yanbo Liang) > HashingTF should use MurmurHash3 >

[jira] [Assigned] (SPARK-10574) HashingTF should use MurmurHash3

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10574: Assignee: Yanbo Liang (was: Apache Spark) > HashingTF should use MurmurHash3 >

[jira] [Updated] (SPARK-14577) spark.sql.codegen.maxCaseBranches config option

2016-04-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-14577: Assignee: Dongjoon Hyun > spark.sql.codegen.maxCaseBranches config option >

[jira] [Resolved] (SPARK-14577) spark.sql.codegen.maxCaseBranches config option

2016-04-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-14577. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12353

[jira] [Commented] (SPARK-14600) Push predicates through Expand

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247750#comment-15247750 ] Apache Spark commented on SPARK-14600: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14600) Push predicates through Expand

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14600: Assignee: (was: Apache Spark) > Push predicates through Expand >

[jira] [Assigned] (SPARK-14600) Push predicates through Expand

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14600: Assignee: Apache Spark > Push predicates through Expand > --

[jira] [Commented] (SPARK-14489) RegressionEvaluator returns NaN for ALS in Spark ml

2016-04-19 Thread Abou Haydar Elias (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247710#comment-15247710 ] Abou Haydar Elias commented on SPARK-14489: --- I totally agree with [~sethah]. I have stumbled on

[jira] [Commented] (SPARK-928) Add support for Unsafe-based serializer in Kryo 2.22

2016-04-19 Thread Kai Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247672#comment-15247672 ] Kai Jiang commented on SPARK-928: - Since it is labeled as Starter, I would like to take a try on this one.

[jira] [Created] (SPARK-14730) Expose ColumnPruner as feature transformer

2016-04-19 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-14730: --- Summary: Expose ColumnPruner as feature transformer Key: SPARK-14730 URL: https://issues.apache.org/jira/browse/SPARK-14730 Project: Spark Issue Type:

[jira] [Commented] (SPARK-14326) Can't specify "long" type in structField

2016-04-19 Thread Dmitriy Selivanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247602#comment-15247602 ] Dmitriy Selivanov commented on SPARK-14326: --- This particular case was related to reading csv

[jira] [Commented] (SPARK-14727) NullPointerException while trying to launch local spark job

2016-04-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247600#comment-15247600 ] Sean Owen commented on SPARK-14727: --- (Was that the problem? since you re-resolved it) >

[jira] [Commented] (SPARK-14703) Spark uses SLF4J, but actually relies quite heavily on Log4J

2016-04-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247599#comment-15247599 ] Sean Owen commented on SPARK-14703: --- Oh hey [~ceki]. In Spark, it's almost entirely about settings log

[jira] [Commented] (SPARK-14703) Spark uses SLF4J, but actually relies quite heavily on Log4J

2016-04-19 Thread Ceki Gulcu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247592#comment-15247592 ] Ceki Gulcu commented on SPARK-14703: @srowen Being able to configure loggers has been an oft-requested

[jira] [Commented] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-04-19 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247576#comment-15247576 ] Takeshi Yamamuro commented on SPARK-14525: -- Yeah, +1. I'm not sure we have a special handling

[jira] [Closed] (SPARK-14727) NullPointerException while trying to launch local spark job

2016-04-19 Thread Darshan Mehta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darshan Mehta closed SPARK-14727. - Resolution: Fixed > NullPointerException while trying to launch local spark job >

[jira] [Created] (SPARK-14729) Implement an existing cluster manager with New ExternalClusterManager interface

2016-04-19 Thread Hemant Bhanawat (JIRA)
Hemant Bhanawat created SPARK-14729: --- Summary: Implement an existing cluster manager with New ExternalClusterManager interface Key: SPARK-14729 URL: https://issues.apache.org/jira/browse/SPARK-14729

[jira] [Commented] (SPARK-14729) Implement an existing cluster manager with New ExternalClusterManager interface

2016-04-19 Thread Hemant Bhanawat (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247543#comment-15247543 ] Hemant Bhanawat commented on SPARK-14729: - I am looking into this. > Implement an existing

[jira] [Commented] (SPARK-14727) NullPointerException while trying to launch local spark job

2016-04-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247533#comment-15247533 ] Sean Owen commented on SPARK-14727: --- It's almost certainly the same class of problem. I don't know that

[jira] [Reopened] (SPARK-14727) NullPointerException while trying to launch local spark job

2016-04-19 Thread Darshan Mehta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darshan Mehta reopened SPARK-14727: --- Not a duplicate > NullPointerException while trying to launch local spark job >

[jira] [Commented] (SPARK-14727) NullPointerException while trying to launch local spark job

2016-04-19 Thread Darshan Mehta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247523#comment-15247523 ] Darshan Mehta commented on SPARK-14727: --- Stacktrace in SPARK-2356 says the following: "Could not

[jira] [Commented] (SPARK-13904) Add support for pluggable cluster manager

2016-04-19 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247498#comment-15247498 ] Kazuaki Ishizaki commented on SPARK-13904: -- I agree with you since SPARK-14689 addresses. > Add

[jira] [Closed] (SPARK-14728) Add a rule to block the use of getOrElse(null) which can simply be orNull.

2016-04-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon closed SPARK-14728. Resolution: Invalid > Add a rule to block the use of getOrElse(null) which can simply be orNull. >

[jira] [Commented] (SPARK-14723) A new way to support dynamic allocation in Spark Streaming

2016-04-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247470#comment-15247470 ] Sean Owen commented on SPARK-14723: --- Generally, your solution resembles the "no dynamic allocation"

[jira] [Commented] (SPARK-14728) Add a rule to block the use of getOrElse(null) which can simply be orNull.

2016-04-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247459#comment-15247459 ] Hyukjin Kwon commented on SPARK-14728: -- Oh, sorry, I noticed not all classes having {{getOrElse}}

[jira] [Commented] (SPARK-14728) Add a rule to block the use of getOrElse(null) which can simply be orNull.

2016-04-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247451#comment-15247451 ] Hyukjin Kwon commented on SPARK-14728: -- [~rxin] Do you think it is okay to add a rule? If you are

[jira] [Created] (SPARK-14728) Add a rule to block the use of getOrElse(null) which can simply be orNull.

2016-04-19 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-14728: Summary: Add a rule to block the use of getOrElse(null) which can simply be orNull. Key: SPARK-14728 URL: https://issues.apache.org/jira/browse/SPARK-14728 Project:

[jira] [Commented] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-04-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247448#comment-15247448 ] Hyukjin Kwon commented on SPARK-14525: -- Shouldn't we then deprecate the support for

[jira] [Comment Edited] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-04-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247448#comment-15247448 ] Hyukjin Kwon edited comment on SPARK-14525 at 4/19/16 9:52 AM: --- Shouldn't

[jira] [Resolved] (SPARK-14727) NullPointerException while trying to launch local spark job

2016-04-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14727. --- Resolution: Duplicate No, it's trying to execute local winutils binaries. I'm all but certain this

[jira] [Commented] (SPARK-14723) A new way to support dynamic allocation in Spark Streaming

2016-04-19 Thread WilliamZhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247435#comment-15247435 ] WilliamZhu commented on SPARK-14723: I think it would be better to give extra executors the first

[jira] [Commented] (SPARK-14703) Spark uses SLF4J, but actually relies quite heavily on Log4J

2016-04-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247434#comment-15247434 ] Sean Owen commented on SPARK-14703: --- Oh, logback tries to reimplement some log4j API methods? It sounds

[jira] [Commented] (SPARK-13944) Separate out local linear algebra as a standalone module without Spark dependency

2016-04-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247431#comment-15247431 ] Sean Owen commented on SPARK-13944: --- I take the point about some models not being representable in

[jira] [Commented] (SPARK-14600) Push predicates through Expand

2016-04-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247427#comment-15247427 ] Wenchen Fan commented on SPARK-14600: - working on it > Push predicates through Expand >

[jira] [Commented] (SPARK-14703) Spark uses SLF4J, but actually relies quite heavily on Log4J

2016-04-19 Thread Matthew Byng-Maddick (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247424#comment-15247424 ] Matthew Byng-Maddick commented on SPARK-14703: -- Leaving log4j in the build and then routing

[jira] [Updated] (SPARK-14727) NullPointerException while trying to launch local spark job

2016-04-19 Thread Darshan Mehta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darshan Mehta updated SPARK-14727: -- Attachment: Logs.log > NullPointerException while trying to launch local spark job >

[jira] [Updated] (SPARK-14727) NullPointerException while trying to launch local spark job

2016-04-19 Thread Darshan Mehta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darshan Mehta updated SPARK-14727: -- Attachment: SparkCrud.java > NullPointerException while trying to launch local spark job >

[jira] [Created] (SPARK-14727) NullPointerException while trying to launch local spark job

2016-04-19 Thread Darshan Mehta (JIRA)
Darshan Mehta created SPARK-14727: - Summary: NullPointerException while trying to launch local spark job Key: SPARK-14727 URL: https://issues.apache.org/jira/browse/SPARK-14727 Project: Spark

[jira] [Updated] (SPARK-14723) A new way to support dynamic allocation in Spark Streaming

2016-04-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-14723: -- Target Version/s: (was: 2.1.0) Labels: (was: features) Fix Version/s:

[jira] [Commented] (SPARK-13266) Python DataFrameReader converts None to "None" instead of null

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247399#comment-15247399 ] Apache Spark commented on SPARK-13266: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-14723) A new way to support dynamic allocation in Spark Streaming

2016-04-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247384#comment-15247384 ] Saisai Shao commented on SPARK-14723: - It would be better not to set fix version and target version.

[jira] [Commented] (SPARK-14726) Support for sampling when inferring schema in CSV data source

2016-04-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247378#comment-15247378 ] Hyukjin Kwon commented on SPARK-14726: -- This is currently not supported. I can work on this but I

[jira] [Created] (SPARK-14726) Support for sampling when inferring schema in CSV data source

2016-04-19 Thread Bomi Kim (JIRA)
Bomi Kim created SPARK-14726: Summary: Support for sampling when inferring schema in CSV data source Key: SPARK-14726 URL: https://issues.apache.org/jira/browse/SPARK-14726 Project: Spark Issue

[jira] [Commented] (SPARK-14326) Can't specify "long" type in structField

2016-04-19 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247360#comment-15247360 ] Sun Rui commented on SPARK-14326: - we could add support for "bigint" type in structField. The question is

[jira] [Comment Edited] (SPARK-14725) Remove HttpServer

2016-04-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247358#comment-15247358 ] Saisai Shao edited comment on SPARK-14725 at 4/19/16 8:13 AM: -- I just search

[jira] [Commented] (SPARK-14725) Remove HttpServer

2016-04-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247358#comment-15247358 ] Saisai Shao commented on SPARK-14725: - I just search the repl code, from my understanding seems

[jira] [Commented] (SPARK-14703) Spark uses SLF4J, but actually relies quite heavily on Log4J

2016-04-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247357#comment-15247357 ] Sean Owen commented on SPARK-14703: --- To be more specific, I mean leaving log4j in the build so it can

[jira] [Assigned] (SPARK-12919) Implement dapply() on DataFrame in SparkR

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12919: Assignee: Apache Spark > Implement dapply() on DataFrame in SparkR >

[jira] [Assigned] (SPARK-12919) Implement dapply() on DataFrame in SparkR

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12919: Assignee: (was: Apache Spark) > Implement dapply() on DataFrame in SparkR >

[jira] [Commented] (SPARK-12919) Implement dapply() on DataFrame in SparkR

2016-04-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247319#comment-15247319 ] Apache Spark commented on SPARK-12919: -- User 'sun-rui' has created a pull request for this issue:

[jira] [Updated] (SPARK-14725) Remove HttpServer

2016-04-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-14725: Priority: Minor (was: Major) > Remove HttpServer > - > > Key:

[jira] [Updated] (SPARK-14725) Remove HttpServer

2016-04-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-14725: Issue Type: Bug (was: Sub-task) Parent: (was: SPARK-11806) > Remove HttpServer >

[jira] [Commented] (SPARK-14725) Remove HttpServer

2016-04-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247318#comment-15247318 ] Saisai Shao commented on SPARK-14725: - OK, let me check first :). > Remove HttpServer >

[jira] [Commented] (SPARK-14725) Remove HttpServer

2016-04-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247317#comment-15247317 ] Reynold Xin commented on SPARK-14725: - Does getClassFileInputStreamFromHttpServer not use it? If we

[jira] [Commented] (SPARK-14725) Remove HttpServer

2016-04-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247316#comment-15247316 ] Saisai Shao commented on SPARK-14725: - I think now it changes to use RPC instead of Http, please see

[jira] [Commented] (SPARK-14725) Remove HttpServer

2016-04-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247298#comment-15247298 ] Reynold Xin commented on SPARK-14725: - Doesn't the REPL use it? > Remove HttpServer >
