[jira] [Created] (SPARK-6627) Clean up of shuffle code and interfaces

2015-03-31 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-6627: -- Summary: Clean up of shuffle code and interfaces Key: SPARK-6627 URL: https://issues.apache.org/jira/browse/SPARK-6627 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4514) SparkContext localProperties does not inherit property updates across thread reuse

2015-03-31 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388095#comment-14388095 ] Josh Rosen commented on SPARK-4514: --- I don't know that there's a good way to fix this

[jira] [Assigned] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4550: --- Assignee: Apache Spark (was: Sandy Ryza) In sort-based shuffle, store map outputs in

[jira] [Commented] (SPARK-6627) Clean up of shuffle code and interfaces

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388107#comment-14388107 ] Apache Spark commented on SPARK-6627: - User 'pwendell' has created a pull request for

[jira] [Assigned] (SPARK-6627) Clean up of shuffle code and interfaces

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6627: --- Assignee: Patrick Wendell (was: Apache Spark) Clean up of shuffle code and interfaces

[jira] [Assigned] (SPARK-6627) Clean up of shuffle code and interfaces

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6627: --- Assignee: Apache Spark (was: Patrick Wendell) Clean up of shuffle code and interfaces

[jira] [Assigned] (SPARK-4550) In sort-based shuffle, store map outputs in serialized form

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4550: --- Assignee: Sandy Ryza (was: Apache Spark) In sort-based shuffle, store map outputs in

[jira] [Commented] (SPARK-5564) Support sparse LDA solutions

2015-03-31 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388128#comment-14388128 ] Debasish Das commented on SPARK-5564: - [~sparks] we are trying to access the EC2

[jira] [Updated] (SPARK-6575) Add configuration to disable schema merging while converting metastore Parquet tables

2015-03-31 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6575: Priority: Blocker (was: Major) Add configuration to disable schema merging while converting metastore

[jira] [Resolved] (SPARK-6625) Add common string filters to data sources

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6625. Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Add common string filters

[jira] [Assigned] (SPARK-5738) Reuse mutable row for each record at jsonStringToRow

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5738: --- Assignee: (was: Apache Spark) Reuse mutable row for each record at jsonStringToRow

[jira] [Assigned] (SPARK-5738) Reuse mutable row for each record at jsonStringToRow

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5738: --- Assignee: Apache Spark Reuse mutable row for each record at jsonStringToRow

[jira] [Created] (SPARK-6628) Exception occurs when executing sql statement insert into on hbase table

2015-03-31 Thread meiyoula (JIRA)
meiyoula created SPARK-6628: --- Summary: Exception occurs when executing sql statement insert into on hbase table Key: SPARK-6628 URL: https://issues.apache.org/jira/browse/SPARK-6628 Project: Spark

[jira] [Created] (SPARK-6629) cancelJobGroup() may not work for jobs whose job groups are inherited from parent threads

2015-03-31 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-6629: - Summary: cancelJobGroup() may not work for jobs whose job groups are inherited from parent threads Key: SPARK-6629 URL: https://issues.apache.org/jira/browse/SPARK-6629

[jira] [Commented] (SPARK-6629) cancelJobGroup() may not work for jobs whose job groups are inherited from parent threads

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388223#comment-14388223 ] Apache Spark commented on SPARK-6629: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-6629) cancelJobGroup() may not work for jobs whose job groups are inherited from parent threads

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6629: --- Assignee: Josh Rosen (was: Apache Spark) cancelJobGroup() may not work for jobs whose job

[jira] [Commented] (SPARK-4514) SparkContext localProperties does not inherit property updates across thread reuse

2015-03-31 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388231#comment-14388231 ] Josh Rosen commented on SPARK-4514: --- I've filed SPARK-6629 to fix a related issue where

[jira] [Commented] (SPARK-6555) Override equals and hashCode in MetastoreRelation

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388267#comment-14388267 ] Apache Spark commented on SPARK-6555: - User 'liancheng' has created a pull request for

[jira] [Assigned] (SPARK-6555) Override equals and hashCode in MetastoreRelation

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6555: --- Assignee: Cheng Lian (was: Apache Spark) Override equals and hashCode in MetastoreRelation

[jira] [Assigned] (SPARK-6555) Override equals and hashCode in MetastoreRelation

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6555: --- Assignee: Apache Spark (was: Cheng Lian) Override equals and hashCode in MetastoreRelation

[jira] [Commented] (SPARK-6629) cancelJobGroup() may not work for jobs whose job groups are inherited from parent threads

2015-03-31 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388283#comment-14388283 ] Josh Rosen commented on SPARK-6629: --- This may very well be not an issue, pending

[jira] [Commented] (SPARK-6435) spark-shell --jars option does not add all jars to classpath

2015-03-31 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388155#comment-14388155 ] Masayoshi TSUZUKI commented on SPARK-6435: -- Sorry to confuse you. My PR is for

[jira] [Commented] (SPARK-6435) spark-shell --jars option does not add all jars to classpath

2015-03-31 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388164#comment-14388164 ] Masayoshi TSUZUKI commented on SPARK-6435: -- Ah, the problems occured in

[jira] [Resolved] (SPARK-6623) Alias DataFrame.na.fill/drop in Python

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6623. Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Assignee: Reynold

[jira] [Updated] (SPARK-6116) Making DataFrame API non-experimental

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6116: --- Labels: DataFrame (was: ) Making DataFrame API non-experimental

[jira] [Commented] (SPARK-6573) expect pandas null values as numpy.nan (not only as None)

2015-03-31 Thread Fabian Boehnlein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388204#comment-14388204 ] Fabian Boehnlein commented on SPARK-6573: - I don't understand. numpy.nan values

[jira] [Assigned] (SPARK-6629) cancelJobGroup() may not work for jobs whose job groups are inherited from parent threads

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6629: --- Assignee: Apache Spark (was: Josh Rosen) cancelJobGroup() may not work for jobs whose job

[jira] [Updated] (SPARK-6623) Alias DataFrame.na.fill/drop in Python

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6623: --- Labels: DataFrame (was: ) Alias DataFrame.na.fill/drop in Python

[jira] [Commented] (SPARK-765) Test suite should run Spark example programs

2015-03-31 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388246#comment-14388246 ] Yu Ishikawa commented on SPARK-765: --- Hi [~joshrosen], Should we add a maven dependency in

[jira] [Resolved] (SPARK-6618) HiveMetastoreCatalog.lookupRelation should use fine-grained lock

2015-03-31 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6618. --- Resolution: Fixed Fix Version/s: 1.4.0 1.3.1

[jira] [Assigned] (SPARK-6583) Support aggregated function in order by

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6583: --- Assignee: Apache Spark Support aggregated function in order by

[jira] [Resolved] (SPARK-6542) Add CreateStruct as an Expression

2015-03-31 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6542. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5195

[jira] [Updated] (SPARK-6628) ClassCastException occurs when executing sql statement insert into on hbase table

2015-03-31 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] meiyoula updated SPARK-6628: Summary: ClassCastException occurs when executing sql statement insert into on hbase table (was: Exception

[jira] [Assigned] (SPARK-6583) Support aggregated function in order by

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6583: --- Assignee: (was: Apache Spark) Support aggregated function in order by

[jira] [Assigned] (SPARK-3596) Support changing the yarn client monitor interval

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3596: --- Assignee: (was: Apache Spark) Support changing the yarn client monitor interval

[jira] [Assigned] (SPARK-6582) Support ssl for this AvroSink in Spark Streaming External

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6582: --- Assignee: Apache Spark Support ssl for this AvroSink in Spark Streaming External

[jira] [Assigned] (SPARK-6582) Support ssl for this AvroSink in Spark Streaming External

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6582: --- Assignee: (was: Apache Spark) Support ssl for this AvroSink in Spark Streaming External

[jira] [Commented] (SPARK-6582) Support ssl for this AvroSink in Spark Streaming External

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388385#comment-14388385 ] Apache Spark commented on SPARK-6582: - User 'SaintBacchus' has created a pull request

[jira] [Commented] (SPARK-1502) Spark on Yarn: add config option to not include yarn/mapred cluster classpath

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388390#comment-14388390 ] Apache Spark commented on SPARK-1502: - User 'Sephiroth-Lin' has created a pull request

[jira] [Assigned] (SPARK-1502) Spark on Yarn: add config option to not include yarn/mapred cluster classpath

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-1502: --- Assignee: Apache Spark (was: Thomas Graves) Spark on Yarn: add config option to not

[jira] [Updated] (SPARK-6613) Starting stream from checkpoint causes Streaming tab to throw error

2015-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6613: - Component/s: Streaming Starting stream from checkpoint causes Streaming tab to throw error

[jira] [Updated] (SPARK-6614) OutputCommitCoordinator should clear authorized committers only after authorized committer fails, not after any failure

2015-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6614: - Component/s: Scheduler OutputCommitCoordinator should clear authorized committers only after

[jira] [Updated] (SPARK-6420) Driver's Block Manager does not use spark.driver.host in Yarn-Client mode

2015-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6420: - Component/s: (was: Spark Core) Block Manager Driver's Block Manager does not use

[jira] [Updated] (SPARK-6568) spark-shell.cmd --jars option does not accept the jar that has space in its path

2015-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6568: - Component/s: (was: Spark Core) Spark Shell spark-shell.cmd --jars option does not

[jira] [Updated] (SPARK-6620) Speed up toDF() and rdd() functions by constructing converters in ScalaReflection

2015-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6620: - Component/s: SQL Speed up toDF() and rdd() functions by constructing converters in ScalaReflection

[jira] [Assigned] (SPARK-3596) Support changing the yarn client monitor interval

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3596: --- Assignee: Apache Spark Support changing the yarn client monitor interval

[jira] [Commented] (SPARK-3596) Support changing the yarn client monitor interval

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388379#comment-14388379 ] Apache Spark commented on SPARK-3596: - User 'Sephiroth-Lin' has created a pull request

[jira] [Commented] (SPARK-6626) TwitterUtils.createStream documentation error

2015-03-31 Thread Jayson Sunshine (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388389#comment-14388389 ] Jayson Sunshine commented on SPARK-6626: Okay, that sounds good. I will try to do

[jira] [Assigned] (SPARK-1502) Spark on Yarn: add config option to not include yarn/mapred cluster classpath

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-1502: --- Assignee: Thomas Graves (was: Apache Spark) Spark on Yarn: add config option to not

[jira] [Commented] (SPARK-6626) TwitterUtils.createStream documentation error

2015-03-31 Thread Jayson Sunshine (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388401#comment-14388401 ] Jayson Sunshine commented on SPARK-6626: I submitted a pull request on GitHub for

[jira] [Assigned] (SPARK-6322) CTAS should consider the case where no file format or storage handler is given

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6322: --- Assignee: (was: Apache Spark) CTAS should consider the case where no file format or

[jira] [Assigned] (SPARK-6302) GeneratedAggregate uses wrong schema on updateProjection

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6302: --- Assignee: Apache Spark GeneratedAggregate uses wrong schema on updateProjection

[jira] [Assigned] (SPARK-5692) Model import/export for Word2Vec

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5692: --- Assignee: Manoj Kumar (was: Apache Spark) Model import/export for Word2Vec

[jira] [Assigned] (SPARK-5692) Model import/export for Word2Vec

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5692: --- Assignee: Apache Spark (was: Manoj Kumar) Model import/export for Word2Vec

[jira] [Commented] (SPARK-6390) Add MatrixUDT in PySpark

2015-03-31 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388310#comment-14388310 ] Manoj Kumar commented on SPARK-6390: ping [~mengxr] Sorry for spamming, but it would

[jira] [Assigned] (SPARK-6322) CTAS should consider the case where no file format or storage handler is given

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6322: --- Assignee: Apache Spark CTAS should consider the case where no file format or storage

[jira] [Assigned] (SPARK-6303) Average should be in canBeCodeGened list

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6303: --- Assignee: (was: Apache Spark) Average should be in canBeCodeGened list

[jira] [Commented] (SPARK-5692) Model import/export for Word2Vec

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388296#comment-14388296 ] Apache Spark commented on SPARK-5692: - User 'MechCoder' has created a pull request for

[jira] [Commented] (SPARK-6626) TwitterUtils.createStream documentation error

2015-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388298#comment-14388298 ] Sean Owen commented on SPARK-6626: -- Yes, I believe you are correct. Would you like to

[jira] [Assigned] (SPARK-6302) GeneratedAggregate uses wrong schema on updateProjection

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6302: --- Assignee: (was: Apache Spark) GeneratedAggregate uses wrong schema on updateProjection

[jira] [Assigned] (SPARK-6303) Average should be in canBeCodeGened list

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6303: --- Assignee: Apache Spark Average should be in canBeCodeGened list

[jira] [Created] (SPARK-6631) I am unable to get the Maven Build file in Example 2.13 to build anything but an empty file

2015-03-31 Thread Frank Domoney (JIRA)
Frank Domoney created SPARK-6631: Summary: I am unable to get the Maven Build file in Example 2.13 to build anything but an empty file Key: SPARK-6631 URL: https://issues.apache.org/jira/browse/SPARK-6631

[jira] [Resolved] (SPARK-5524) Remove messy dependencies to log4j

2015-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5524. -- Resolution: Won't Fix I see only two instances of this in the code base, and I don't think it's worth

[jira] [Updated] (SPARK-6630) SparkConf.setIfMissing should only evaluate the assigned value if indeed missing

2015-03-31 Thread Svend Vanderveken (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Svend Vanderveken updated SPARK-6630: - Description: The method setIfMissing() in SparkConf is currently systematically

[jira] [Updated] (SPARK-6630) SparkConf.setIfMissing should only evaluate the assigned value if indeed missing

2015-03-31 Thread Svend Vanderveken (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Svend Vanderveken updated SPARK-6630: - Description: The method setIfMissing() in SparkConf is currently systematically

[jira] [Commented] (SPARK-6435) spark-shell --jars option does not add all jars to classpath

2015-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388455#comment-14388455 ] Sean Owen commented on SPARK-6435: -- [~tsudukim] I think this JIRA is a fine place to

[jira] [Assigned] (SPARK-6615) Python API for Word2Vec

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6615: --- Assignee: Apache Spark Python API for Word2Vec ---

[jira] [Commented] (SPARK-6615) Python API for Word2Vec

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388468#comment-14388468 ] Apache Spark commented on SPARK-6615: - User 'Lewuathe' has created a pull request for

[jira] [Assigned] (SPARK-6615) Python API for Word2Vec

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6615: --- Assignee: (was: Apache Spark) Python API for Word2Vec ---

[jira] [Updated] (SPARK-6613) Starting stream from checkpoint causes Streaming tab to throw error

2015-03-31 Thread Marius Soutier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marius Soutier updated SPARK-6613: -- Description: When continuing my streaming job from a checkpoint, the job runs, but the

[jira] [Updated] (SPARK-5349) Spark standalone should support dynamic resource scaling

2015-03-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5349: - Summary: Spark standalone should support dynamic resource scaling (was: Multiple spark shells should be

[jira] [Created] (SPARK-6630) SparkConf.setIfMissing should only evaluate the assigned value if indeed missing

2015-03-31 Thread Svend Vanderveken (JIRA)
Svend Vanderveken created SPARK-6630: Summary: SparkConf.setIfMissing should only evaluate the assigned value if indeed missing Key: SPARK-6630 URL: https://issues.apache.org/jira/browse/SPARK-6630

[jira] [Created] (SPARK-6642) Change the lambda weight to number of explicit ratings in implicit ALS

2015-03-31 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-6642: Summary: Change the lambda weight to number of explicit ratings in implicit ALS Key: SPARK-6642 URL: https://issues.apache.org/jira/browse/SPARK-6642 Project: Spark

[jira] [Assigned] (SPARK-5360) For CoGroupedRDD, rdds for narrow dependencies and shuffle handles are included twice in serialized task

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5360: --- Assignee: Kay Ousterhout (was: Apache Spark) For CoGroupedRDD, rdds for narrow

[jira] [Assigned] (SPARK-5360) For CoGroupedRDD, rdds for narrow dependencies and shuffle handles are included twice in serialized task

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5360: --- Assignee: Apache Spark (was: Kay Ousterhout) For CoGroupedRDD, rdds for narrow

[jira] [Created] (SPARK-6638) optimize StringType in SQL

2015-03-31 Thread Davies Liu (JIRA)
Davies Liu created SPARK-6638: - Summary: optimize StringType in SQL Key: SPARK-6638 URL: https://issues.apache.org/jira/browse/SPARK-6638 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-2808) update kafka to version 0.8.2

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2808: --- Assignee: (was: Apache Spark) update kafka to version 0.8.2

[jira] [Assigned] (SPARK-2808) update kafka to version 0.8.2

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2808: --- Assignee: Apache Spark update kafka to version 0.8.2 -

[jira] [Comment Edited] (SPARK-3066) Support recommendAll in matrix factorization model

2015-03-31 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389973#comment-14389973 ] Debasish Das edited comment on SPARK-3066 at 4/1/15 4:28 AM: -

[jira] [Created] (SPARK-6640) Executor may connect to HeartbeartReceiver before it's setup in the driver side

2015-03-31 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-6640: --- Summary: Executor may connect to HeartbeartReceiver before it's setup in the driver side Key: SPARK-6640 URL: https://issues.apache.org/jira/browse/SPARK-6640 Project:

[jira] [Assigned] (SPARK-3872) Rewrite the test for ActorInputStream.

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3872: --- Assignee: Apache Spark (was: Prashant Sharma) Rewrite the test for ActorInputStream.

[jira] [Assigned] (SPARK-3872) Rewrite the test for ActorInputStream.

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-3872: --- Assignee: Prashant Sharma (was: Apache Spark) Rewrite the test for ActorInputStream.

[jira] [Commented] (SPARK-6435) spark-shell --jars option does not add all jars to classpath

2015-03-31 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389829#comment-14389829 ] Masayoshi TSUZUKI commented on SPARK-6435: -- [~srowen] OK, thank you! Then I'm

[jira] [Assigned] (SPARK-6626) TwitterUtils.createStream documentation error

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6626: --- Assignee: Apache Spark TwitterUtils.createStream documentation error

[jira] [Commented] (SPARK-6626) TwitterUtils.createStream documentation error

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389852#comment-14389852 ] Apache Spark commented on SPARK-6626: - User 'JaysonSunshine' has created a pull

[jira] [Created] (SPARK-6641) Add config or control of accumulator on python

2015-03-31 Thread Weizhong (JIRA)
Weizhong created SPARK-6641: --- Summary: Add config or control of accumulator on python Key: SPARK-6641 URL: https://issues.apache.org/jira/browse/SPARK-6641 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-6641) Add config or control of accumulator on python

2015-03-31 Thread Weizhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weizhong updated SPARK-6641: Description: Now if we init SparkContext of Python, then will create a single Accumulator in Java and

[jira] [Commented] (SPARK-3066) Support recommendAll in matrix factorization model

2015-03-31 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389973#comment-14389973 ] Debasish Das commented on SPARK-3066: - Also unless the raw flow runs there is no way

[jira] [Updated] (SPARK-6573) Convert inbound NaN values as null

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6573: --- Target Version/s: 1.4.0 Convert inbound NaN values as null --

[jira] [Updated] (SPARK-6573) Convert inbound NaN values as null

2015-03-31 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6573: --- Summary: Convert inbound NaN values as null (was: expect pandas null values as numpy.nan (not only

[jira] [Created] (SPARK-6643) Python API for StandardScalerModel

2015-03-31 Thread Kai Sasaki (JIRA)
Kai Sasaki created SPARK-6643: - Summary: Python API for StandardScalerModel Key: SPARK-6643 URL: https://issues.apache.org/jira/browse/SPARK-6643 Project: Spark Issue Type: Task

[jira] [Assigned] (SPARK-6638) optimize StringType in SQL

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6638: --- Assignee: Davies Liu (was: Apache Spark) optimize StringType in SQL

[jira] [Assigned] (SPARK-6638) optimize StringType in SQL

2015-03-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6638: --- Assignee: Apache Spark (was: Davies Liu) optimize StringType in SQL

[jira] [Created] (SPARK-6639) Create a new script to start multiple masters

2015-03-31 Thread Tao Wang (JIRA)
Tao Wang created SPARK-6639: --- Summary: Create a new script to start multiple masters Key: SPARK-6639 URL: https://issues.apache.org/jira/browse/SPARK-6639 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-799) Windows versions of the deploy scripts

2015-03-31 Thread Masayoshi TSUZUKI (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14389853#comment-14389853 ] Masayoshi TSUZUKI commented on SPARK-799: - I tend not to think it is a good idea

[jira] [Commented] (SPARK-6631) I am unable to get the Maven Build file in Example 2.13 to build anything but an empty file

2015-03-31 Thread Frank Domoney (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388583#comment-14388583 ] Frank Domoney commented on SPARK-6631: -- Thanks Sean I will shift it to that forum.

[jira] [Updated] (SPARK-5782) Python Worker / Pyspark Daemon Memory Issue

2015-03-31 Thread Mark Khaitman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Khaitman updated SPARK-5782: - External issue URL: https://github.com/apache/spark/pull/1977 Python Worker / Pyspark Daemon

[jira] [Commented] (SPARK-5532) Repartitioning DataFrame causes saveAsParquetFile to fail with VectorUDT

2015-03-31 Thread Jao Rabary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388646#comment-14388646 ] Jao Rabary commented on SPARK-5532: --- I get the same problem with a DataFrame created

[jira] [Commented] (SPARK-2629) Improve performance of DStream.updateStateByKey

2015-03-31 Thread Vinoth Chandar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388681#comment-14388681 ] Vinoth Chandar commented on SPARK-2629: --- [~tdas] are you guys thinking along the

[jira] [Comment Edited] (SPARK-5004) PySpark does not handle SOCKS proxy

2015-03-31 Thread Eric O. LEBIGOT (EOL) (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14388680#comment-14388680 ] Eric O. LEBIGOT (EOL) edited comment on SPARK-5004 at 3/31/15 3:14 PM:

  1   2   >