[jira] [Commented] (SPARK-2313) PySpark should accept port via a command line argument rather than STDIN

2014-11-24 Thread Lv, Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222765#comment-14222765 ] Lv, Qi commented on SPARK-2313: --- I've submitted a patch to fix this issue:

[jira] [Issue Comment Deleted] (SPARK-2313) PySpark should accept port via a command line argument rather than STDIN

2014-11-24 Thread Lv, Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lv, Qi updated SPARK-2313: -- Comment: was deleted (was: I've submitted a patch to fix this issue: https://github.com/apache/spark/pull/3424

[jira] [Commented] (SPARK-4475) PySpark failed to initialize if localhost can not be resolved

2014-11-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222779#comment-14222779 ] Apache Spark commented on SPARK-4475: - User 'lvsoft' has created a pull request for

[jira] [Created] (SPARK-4570) Add broadcast join to left semi join

2014-11-24 Thread XiaoJing wang (JIRA)
XiaoJing wang created SPARK-4570: Summary: Add broadcast join to left semi join Key: SPARK-4570 URL: https://issues.apache.org/jira/browse/SPARK-4570 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-4567) Make SparkJobInfo and SparkStageInfo serializable

2014-11-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222792#comment-14222792 ] Apache Spark commented on SPARK-4567: - User 'sryza' has created a pull request for

[jira] [Resolved] (SPARK-4371) Spark crashes with JBoss Logging 3.6.1

2014-11-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4371. -- Resolution: Not a Problem OK, good to know. I think this is for the moment considered NotAProblem for

[jira] [Created] (SPARK-4571) History server shows negative time

2014-11-24 Thread Andrew Or (JIRA)
Andrew Or created SPARK-4571: Summary: History server shows negative time Key: SPARK-4571 URL: https://issues.apache.org/jira/browse/SPARK-4571 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-4571) History server shows negative time

2014-11-24 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4571: - Attachment: Screen Shot 2014-11-21 at 2.49.25 PM.png History server shows negative time

[jira] [Created] (SPARK-4572) [SQL] spark-sql exits while encountered an error

2014-11-24 Thread Fuqing Yang (JIRA)
Fuqing Yang created SPARK-4572: -- Summary: [SQL] spark-sql exits while encountered an error Key: SPARK-4572 URL: https://issues.apache.org/jira/browse/SPARK-4572 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4507) PR merge script should support closing multiple JIRA tickets

2014-11-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222903#comment-14222903 ] Apache Spark commented on SPARK-4507: - User 'hase1031' has created a pull request for

[jira] [Commented] (SPARK-4507) PR merge script should support closing multiple JIRA tickets

2014-11-24 Thread Takayuki Hasegawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222906#comment-14222906 ] Takayuki Hasegawa commented on SPARK-4507: -- This is my first pull-request for

[jira] [Commented] (SPARK-4567) Make SparkJobInfo and SparkStageInfo serializable

2014-11-24 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222929#comment-14222929 ] Xuefu Zhang commented on SPARK-4567: {quote} please don't set the FixVersion field.

[jira] [Created] (SPARK-4573) Support SettableStructObjectInspector for function wrap in HiveObjectInspectors

2014-11-24 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-4573: Summary: Support SettableStructObjectInspector for function wrap in HiveObjectInspectors Key: SPARK-4573 URL: https://issues.apache.org/jira/browse/SPARK-4573 Project: Spark

[jira] [Commented] (SPARK-4573) Support SettableStructObjectInspector for function wrap in HiveObjectInspectors

2014-11-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222959#comment-14222959 ] Apache Spark commented on SPARK-4573: - User 'chenghao-intel' has created a pull

[jira] [Commented] (SPARK-4573) Support SettableStructObjectInspector for function wrap in HiveObjectInspectors

2014-11-24 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222961#comment-14222961 ] Cheng Hao commented on SPARK-4573: -- HIVE UDAF needs SettableStructObjectInspector

[jira] [Commented] (SPARK-3628) Don't apply accumulator updates multiple times for tasks in result stages

2014-11-24 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222973#comment-14222973 ] Nan Zhu commented on SPARK-3628: hmmmOK but for this case, shall I submit individual

[jira] [Created] (SPARK-4574) Adding support for defining schema in foreign DDL commands.

2014-11-24 Thread wangfei (JIRA)
wangfei created SPARK-4574: -- Summary: Adding support for defining schema in foreign DDL commands. Key: SPARK-4574 URL: https://issues.apache.org/jira/browse/SPARK-4574 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4574) Adding support for defining schema in foreign DDL commands.

2014-11-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223041#comment-14223041 ] Apache Spark commented on SPARK-4574: - User 'scwf' has created a pull request for this

[jira] [Commented] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2014-11-24 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223066#comment-14223066 ] Thomas Graves commented on SPARK-4352: -- can you add description here? Incorporate

[jira] [Commented] (SPARK-1358) Continuous integrated test should be involved in Spark ecosystem

2014-11-24 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223195#comment-14223195 ] shane knapp commented on SPARK-1358: 8x800G SSDs sounds pretty hawt, but we're going

[jira] [Updated] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2014-11-24 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-4352: -- Description: Currently, achieving data locality in Spark is difficult u preferredNodeLocalityData

[jira] [Updated] (SPARK-4352) Incorporate locality preferences in dynamic allocation requests

2014-11-24 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-4352: -- Description: Currently, achieving data locality in Spark is difficult unless an application takes

[jira] [Commented] (SPARK-1812) Support cross-building with Scala 2.11

2014-11-24 Thread Michael Schmitz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223331#comment-14223331 ] Michael Schmitz commented on SPARK-1812: Have you created a follow-up JIRA for

[jira] [Resolved] (SPARK-4457) Document how to build for Hadoop versions greater than 2.4

2014-11-24 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-4457. -- Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Sandy Ryza Document how to

[jira] [Created] (SPARK-4575) Documentation for the pipeline features

2014-11-24 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4575: Summary: Documentation for the pipeline features Key: SPARK-4575 URL: https://issues.apache.org/jira/browse/SPARK-4575 Project: Spark Issue Type:

[jira] [Updated] (SPARK-4562) GLM testing time regressions from Spark 1.1

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4562: - Assignee: Davies Liu GLM testing time regressions from Spark 1.1

[jira] [Updated] (SPARK-4121) Master build failures after shading commons-math3

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4121: - Fix Version/s: 1.2.0 Master build failures after shading commons-math3

[jira] [Updated] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3633: --- Fix Version/s: 1.2.0 1.1.1 Fetches failure observed after SPARK-2711

[jira] [Updated] (SPARK-4385) DataSource DDL Parser can't handle table names with '_'

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4385: --- Fix Version/s: 1.2.0 DataSource DDL Parser can't handle table names with '_'

[jira] [Updated] (SPARK-4385) DataSource DDL Parser can't handle table names with '_'

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4385: --- Fix Version/s: (was: 1.2.0) DataSource DDL Parser can't handle table names with '_'

[jira] [Updated] (SPARK-3189) Add Robust Regression Algorithm with Turkey bisquare weight function (Biweight Estimates)

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3189: - Fix Version/s: (was: 1.2.0) (was: 1.1.1) Add Robust Regression

[jira] [Closed] (SPARK-3820) Specialize columnSimilarity() without any threshold

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-3820. Resolution: Won't Fix Specialize columnSimilarity() without any threshold

[jira] [Reopened] (SPARK-3820) Specialize columnSimilarity() without any threshold

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-3820: -- Specialize columnSimilarity() without any threshold

[jira] [Updated] (SPARK-3396) Change LogistricRegressionWithSGD's default regType to L2

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3396: - Fix Version/s: 1.2.0 Change LogistricRegressionWithSGD's default regType to L2

[jira] [Updated] (SPARK-3615) Kafka test should not hard code Zookeeper port

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3615: --- Fix Version/s: 1.2.0 Kafka test should not hard code Zookeeper port

[jira] [Updated] (SPARK-3686) flume.SparkSinkSuite.Success is flaky

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3686: --- Fix Version/s: 1.2.0 flume.SparkSinkSuite.Success is flaky

[jira] [Updated] (SPARK-4264) SQL HashJoin induces refCnt = 0 error in ShuffleBlockFetcherIterator

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4264: --- Fix Version/s: 1.2.0 SQL HashJoin induces refCnt = 0 error in ShuffleBlockFetcherIterator

[jira] [Updated] (SPARK-4468) Wrong Parquet filters are created for all inequality predicates with literals on the left hand side

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4468: --- Fix Version/s: 1.2.0 Wrong Parquet filters are created for all inequality predicates with

[jira] [Resolved] (SPARK-4479) Avoid unnecessary defensive copies when Sort based shuffle is on

2014-11-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4479. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3422

[jira] [Updated] (SPARK-1860) Standalone Worker cleanup should not clean up running executors

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-1860: --- Fix Version/s: 1.2.0 Standalone Worker cleanup should not clean up running executors

[jira] [Reopened] (SPARK-4515) OOM/GC errors with sort-based shuffle

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-4515: OOM/GC errors with sort-based shuffle -

[jira] [Updated] (SPARK-3452) Maven build should skip publishing artifacts people shouldn't depend on

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3452: --- Fix Version/s: 1.2.0 Maven build should skip publishing artifacts people shouldn't depend on

[jira] [Resolved] (SPARK-4515) OOM/GC errors with sort-based shuffle

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4515. Resolution: Duplicate OOM/GC errors with sort-based shuffle

[jira] [Resolved] (SPARK-4487) Fix attribute reference resolution error when using ORDER BY.

2014-11-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4487. - Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3363

[jira] [Updated] (SPARK-4293) Make Cast be able to handle complex types.

2014-11-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4293: Target Version/s: 1.3.0 (was: 1.2.0) Make Cast be able to handle complex types.

[jira] [Resolved] (SPARK-4522) Failure to read parquet schema with missing metadata.

2014-11-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4522. - Resolution: Fixed Failure to read parquet schema with missing metadata.

[jira] [Updated] (SPARK-4536) Add sqrt and abs to Spark SQL DSL

2014-11-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4536: Target Version/s: 1.3.0 (was: 1.2.0) Add sqrt and abs to Spark SQL DSL

[jira] [Updated] (SPARK-4559) Adding support for ucase and lcase

2014-11-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4559: Target Version/s: 1.3.0 (was: 1.2.0) Adding support for ucase and lcase

[jira] [Updated] (SPARK-4574) Adding support for defining schema in foreign DDL commands.

2014-11-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4574: Target Version/s: 1.3.0 (was: 1.2.0) Adding support for defining schema in foreign DDL

[jira] [Updated] (SPARK-4266) Avoid expensive JavaScript for StagePages with huge numbers of tasks

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4266: --- Affects Version/s: 1.2.0 Avoid expensive JavaScript for StagePages with huge numbers of

[jira] [Resolved] (SPARK-4145) Create jobs overview and job details pages on the web UI

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4145. Resolution: Fixed Fix Version/s: 1.2.0 Create jobs overview and job details pages

[jira] [Updated] (SPARK-4266) Avoid expensive JavaScript for StagePages with huge numbers of tasks

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4266: --- Priority: Blocker (was: Critical) Avoid expensive JavaScript for StagePages with huge

[jira] [Commented] (SPARK-2313) PySpark should accept port via a command line argument rather than STDIN

2014-11-24 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223583#comment-14223583 ] Davies Liu commented on SPARK-2313: --- [~farrellee] Thew new approach could be: 1) bind

[jira] [Resolved] (SPARK-4519) Filestream does not use hadoop configuration set within sparkContext.hadoopConfiguration

2014-11-24 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-4519. -- Resolution: Fixed Fix Version/s: 1.2.0 Filestream does not use hadoop configuration set

[jira] [Updated] (SPARK-4548) Python broadcast perf regression from Spark 1.1

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4548: --- Assignee: Davies Liu Python broadcast perf regression from Spark 1.1

[jira] [Resolved] (SPARK-4518) Filestream sometimes processes files twice

2014-11-24 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-4518. -- Resolution: Fixed Fix Version/s: 1.2.0 Filestream sometimes processes files twice

[jira] [Updated] (SPARK-4576) Add concatenation operator

2014-11-24 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-4576: -- Description: The standard SQL defines || as a concatenation operator. The operator makes

[jira] [Created] (SPARK-4576) Add concatenation operator

2014-11-24 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-4576: - Summary: Add concatenation operator Key: SPARK-4576 URL: https://issues.apache.org/jira/browse/SPARK-4576 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-4577) Python example of LBFGS for MLlib guide

2014-11-24 Thread Davies Liu (JIRA)
Davies Liu created SPARK-4577: - Summary: Python example of LBFGS for MLlib guide Key: SPARK-4577 URL: https://issues.apache.org/jira/browse/SPARK-4577 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4576) Add concatenation operator

2014-11-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223610#comment-14223610 ] Apache Spark commented on SPARK-4576: - User 'sarutak' has created a pull request for

[jira] [Created] (SPARK-4578) Row.asDict() should keep the type of values

2014-11-24 Thread Davies Liu (JIRA)
Davies Liu created SPARK-4578: - Summary: Row.asDict() should keep the type of values Key: SPARK-4578 URL: https://issues.apache.org/jira/browse/SPARK-4578 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-3633) Fetches failure observed after SPARK-2711

2014-11-24 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223645#comment-14223645 ] Stephen Haberman commented on SPARK-3633: - I just tried a job on 1.1.1-rc2 and am

[jira] [Created] (SPARK-4579) Scheduling Delay appears negative

2014-11-24 Thread Arun Ahuja (JIRA)
Arun Ahuja created SPARK-4579: - Summary: Scheduling Delay appears negative Key: SPARK-4579 URL: https://issues.apache.org/jira/browse/SPARK-4579 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-4579) Scheduling Delay appears negative

2014-11-24 Thread Arun Ahuja (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun Ahuja updated SPARK-4579: -- Description:

[jira] [Updated] (SPARK-4577) Python example of LBFGS for MLlib guide

2014-11-24 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-4577: -- Priority: Minor (was: Major) Python example of LBFGS for MLlib guide

[jira] [Created] (SPARK-4580) Document random forests and boosting in programming guide

2014-11-24 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-4580: Summary: Document random forests and boosting in programming guide Key: SPARK-4580 URL: https://issues.apache.org/jira/browse/SPARK-4580 Project: Spark

[jira] [Assigned] (SPARK-4196) Streaming + checkpointing + saveAsNewAPIHadoopFiles = NotSerializableException for Hadoop Configuration

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reassigned SPARK-4196: -- Assignee: Patrick Wendell Streaming + checkpointing + saveAsNewAPIHadoopFiles =

[jira] [Updated] (SPARK-4196) Streaming + checkpointing + saveAsNewAPIHadoopFiles = NotSerializableException for Hadoop Configuration

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4196: --- Assignee: Tathagata Das (was: Patrick Wendell) Streaming + checkpointing +

[jira] [Assigned] (SPARK-4447) Remove layers of abstraction in YARN code no longer needed after dropping yarn-alpha

2014-11-24 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reassigned SPARK-4447: - Assignee: Patrick Wendell Remove layers of abstraction in YARN code no longer needed after

[jira] [Assigned] (SPARK-4447) Remove layers of abstraction in YARN code no longer needed after dropping yarn-alpha

2014-11-24 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reassigned SPARK-4447: - Assignee: Sandy Ryza (was: Patrick Wendell) Remove layers of abstraction in YARN code no

[jira] [Commented] (SPARK-4578) Row.asDict() should keep the type of values

2014-11-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223742#comment-14223742 ] Apache Spark commented on SPARK-4578: - User 'davies' has created a pull request for

[jira] [Created] (SPARK-4581) Refactorize StandardScaler to improve the transformation performance

2014-11-24 Thread DB Tsai (JIRA)
DB Tsai created SPARK-4581: -- Summary: Refactorize StandardScaler to improve the transformation performance Key: SPARK-4581 URL: https://issues.apache.org/jira/browse/SPARK-4581 Project: Spark

[jira] [Commented] (SPARK-4581) Refactorize StandardScaler to improve the transformation performance

2014-11-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223792#comment-14223792 ] Apache Spark commented on SPARK-4581: - User 'dbtsai' has created a pull request for

[jira] [Updated] (SPARK-4180) SparkContext constructor should throw exception if another SparkContext is already running

2014-11-24 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4180: - Assignee: Josh Rosen SparkContext constructor should throw exception if another SparkContext is

[jira] [Updated] (SPARK-4578) Row.asDict() should keep the type of values

2014-11-24 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4578: - Assignee: Davies Liu Row.asDict() should keep the type of values

[jira] [Commented] (SPARK-4525) MesosSchedulerBackend.resourceOffers cannot decline unused offers from acceptedOffers

2014-11-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223841#comment-14223841 ] Apache Spark commented on SPARK-4525: - User 'pwendell' has created a pull request for

[jira] [Resolved] (SPARK-4562) GLM testing time regressions from Spark 1.1

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4562. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3420

[jira] [Updated] (SPARK-4580) Document random forests and boosting in programming guide

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4580: - Assignee: Joseph K. Bradley Document random forests and boosting in programming guide

[jira] [Resolved] (SPARK-4578) Row.asDict() should keep the type of values

2014-11-24 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-4578. Resolution: Fixed Fix Version/s: 1.2.0 Thanks davies I've resolved this.

[jira] [Commented] (SPARK-4395) Running a Spark SQL SELECT command from PySpark causes a hang for ~ 1 hour

2014-11-24 Thread Sameer Farooqui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223851#comment-14223851 ] Sameer Farooqui commented on SPARK-4395: Hi Davies and Michael, I can confirm

[jira] [Created] (SPARK-4582) Add getVectors to Word2VecModel

2014-11-24 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4582: Summary: Add getVectors to Word2VecModel Key: SPARK-4582 URL: https://issues.apache.org/jira/browse/SPARK-4582 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-4395) Running a Spark SQL SELECT command from PySpark causes a hang for ~ 1 hour

2014-11-24 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223853#comment-14223853 ] Davies Liu commented on SPARK-4395: --- [~lian cheng] Could you help to investigate the

[jira] [Commented] (SPARK-4565) Add docs about advanced spark application development

2014-11-24 Thread Joseph E. Gonzalez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223856#comment-14223856 ] Joseph E. Gonzalez commented on SPARK-4565: --- Yes! However, we might want to

[jira] [Commented] (SPARK-4582) Add getVectors to Word2VecModel

2014-11-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223861#comment-14223861 ] Apache Spark commented on SPARK-4582: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-4565) Add docs about advanced spark application development

2014-11-24 Thread Evan Sparks (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223864#comment-14223864 ] Evan Sparks commented on SPARK-4565: [~pwendell] suggested that we add this to the

[jira] [Updated] (SPARK-4582) Add getVectors to Word2VecModel

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4582: - Fix Version/s: 1.2.0 Add getVectors to Word2VecModel ---

[jira] [Commented] (SPARK-4565) Add docs about advanced spark application development

2014-11-24 Thread Joseph E. Gonzalez (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223878#comment-14223878 ] Joseph E. Gonzalez commented on SPARK-4565: --- Hmm, I wonder if it would make more

[jira] [Updated] (SPARK-927) PySpark sample() doesn't work if numpy is installed on master but not on workers

2014-11-24 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-927: - Affects Version/s: 1.1.2 1.0.2 PySpark sample() doesn't work if numpy is installed

[jira] [Updated] (SPARK-927) PySpark sample() doesn't work if numpy is installed on master but not on workers

2014-11-24 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-927: - Affects Version/s: 0.9.1 PySpark sample() doesn't work if numpy is installed on master but not on

[jira] [Resolved] (SPARK-4548) Python broadcast perf regression from Spark 1.1

2014-11-24 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4548. --- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3417

[jira] [Updated] (SPARK-3080) ArrayIndexOutOfBoundsException in ALS for Large datasets

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3080: - Target Version/s: 1.3.0 (was: 1.2.0) ArrayIndexOutOfBoundsException in ALS for Large datasets

[jira] [Resolved] (SPARK-4517) Improve memory efficiency for python broadcast

2014-11-24 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4517. --- Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Davies Liu This was fixed in

[jira] [Updated] (SPARK-2206) Automatically infer the number of classification classes in multiclass classification

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2206: - Target Version/s: 1.3.0 (was: 1.2.0) Automatically infer the number of classification classes

[jira] [Updated] (SPARK-3080) ArrayIndexOutOfBoundsException in ALS for Large datasets

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3080: - Affects Version/s: 1.2.0 ArrayIndexOutOfBoundsException in ALS for Large datasets

[jira] [Updated] (SPARK-4517) Improve memory efficiency for python broadcast

2014-11-24 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4517: -- Component/s: PySpark Improve memory efficiency for python broadcast

[jira] [Updated] (SPARK-4577) Python example of LBFGS for MLlib guide

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4577: - Target Version/s: 1.2.0 Python example of LBFGS for MLlib guide

[jira] [Updated] (SPARK-4577) Python example of LBFGS for MLlib guide

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4577: - Issue Type: Improvement (was: Bug) Python example of LBFGS for MLlib guide

[jira] [Updated] (SPARK-4547) OOM when making bins in BinaryClassificationMetrics

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4547: - Assignee: Sean Owen OOM when making bins in BinaryClassificationMetrics

[jira] [Updated] (SPARK-4581) Refactorize StandardScaler to improve the transformation performance

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4581: - Target Version/s: 1.2.0 Assignee: DB Tsai Refactorize StandardScaler to improve the

[jira] [Updated] (SPARK-4547) OOM when making bins in BinaryClassificationMetrics

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4547: - Target Version/s: 1.3.0 OOM when making bins in BinaryClassificationMetrics

  1   2   >