[jira] [Commented] (SPARK-4655) Split Stage into ShuffleMapStage and ResultStage subclasses

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328659#comment-14328659 ] Apache Spark commented on SPARK-4655: - User 'ilganeli' has created a pull request for

[jira] [Commented] (SPARK-5629) Add spark-ec2 action to return info about an existing cluster

2015-02-19 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328658#comment-14328658 ] Florian Verhein commented on SPARK-5629: Good idea. Can I suggest keeping a forma

[jira] [Commented] (SPARK-5879) spark_ec2.py should expose/return master and slave lists (e.g. write to file)

2015-02-19 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328652#comment-14328652 ] Florian Verhein commented on SPARK-5879: Just saw that while reading through your

[jira] [Commented] (SPARK-5879) spark_ec2.py should expose/return master and slave lists (e.g. write to file)

2015-02-19 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328651#comment-14328651 ] Florian Verhein commented on SPARK-5879: Yeah that's more flexible - rather than

[jira] [Commented] (SPARK-5879) spark_ec2.py should expose/return master and slave lists (e.g. write to file)

2015-02-19 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328639#comment-14328639 ] Nicholas Chammas commented on SPARK-5879: - If you just need the master address, yo

[jira] [Created] (SPARK-5919) Enable broadcast joins for Parquet files

2015-02-19 Thread Dima Zhiyanov (JIRA)
Dima Zhiyanov created SPARK-5919: Summary: Enable broadcast joins for Parquet files Key: SPARK-5919 URL: https://issues.apache.org/jira/browse/SPARK-5919 Project: Spark Issue Type: Improvemen

[jira] [Commented] (SPARK-5879) spark_ec2.py should expose/return master and slave lists (e.g. write to file)

2015-02-19 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328621#comment-14328621 ] Shivaram Venkataraman commented on SPARK-5879: -- I think [~nchammas] proposed

[jira] [Commented] (SPARK-5879) spark_ec2.py should expose/return master and slave lists (e.g. write to file)

2015-02-19 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328612#comment-14328612 ] Florian Verhein commented on SPARK-5879: cc [~shivaram], any opinions on how to be

[jira] [Commented] (SPARK-4144) Support incremental model training of Naive Bayes classifier

2015-02-19 Thread Jatinpreet Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328569#comment-14328569 ] Jatinpreet Singh commented on SPARK-4144: - Hi, I have been waiting for this feat

[jira] [Created] (SPARK-5918) Spark Thrift server reports metadata for VARCHAR column as STRING in result set schema

2015-02-19 Thread Holman Lan (JIRA)
Holman Lan created SPARK-5918: - Summary: Spark Thrift server reports metadata for VARCHAR column as STRING in result set schema Key: SPARK-5918 URL: https://issues.apache.org/jira/browse/SPARK-5918 Projec

[jira] [Created] (SPARK-5917) Distinct is broken

2015-02-19 Thread Derrick Burns (JIRA)
Derrick Burns created SPARK-5917: Summary: Distinct is broken Key: SPARK-5917 URL: https://issues.apache.org/jira/browse/SPARK-5917 Project: Spark Issue Type: Bug Components: MLlib

[jira] [Resolved] (SPARK-5900) Wrap the results returned by PIC and FPGrowth in case classes

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5900. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4695 [https://githu

[jira] [Commented] (SPARK-5860) JdbcRDD: overflow on large range with high number of partitions

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328454#comment-14328454 ] Apache Spark commented on SPARK-5860: - User 'hotou' has created a pull request for thi

[jira] [Commented] (SPARK-5753) add basic support to JDBCRDD for postgresql types: uuid, hstore, and array

2015-02-19 Thread Evan Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328449#comment-14328449 ] Evan Yu commented on SPARK-5753: Ignore this, commit under wrong ticket > add basic suppo

[jira] [Commented] (SPARK-5753) add basic support to JDBCRDD for postgresql types: uuid, hstore, and array

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328448#comment-14328448 ] Apache Spark commented on SPARK-5753: - User 'hotou' has created a pull request for thi

[jira] [Created] (SPARK-5916) $SPARK_HOME/bin/beeline conflicts with $HIVE_HOME/bin/beeline

2015-02-19 Thread Carl Steinbach (JIRA)
Carl Steinbach created SPARK-5916: - Summary: $SPARK_HOME/bin/beeline conflicts with $HIVE_HOME/bin/beeline Key: SPARK-5916 URL: https://issues.apache.org/jira/browse/SPARK-5916 Project: Spark

[jira] [Updated] (SPARK-5916) $SPARK_HOME/bin/beeline conflicts with $HIVE_HOME/bin/beeline

2015-02-19 Thread Carl Steinbach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carl Steinbach updated SPARK-5916: -- Component/s: SQL > $SPARK_HOME/bin/beeline conflicts with $HIVE_HOME/bin/beeline > -

[jira] [Updated] (SPARK-4808) Spark fails to spill with small number of large objects

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4808: - Target Version/s: 1.3.0, 1.4.0 (was: 1.2.1) > Spark fails to spill with small number of large objects > -

[jira] [Updated] (SPARK-5915) Spillable should check every N bytes rather than every 32 elements

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5915: - Affects Version/s: 1.0.0 > Spillable should check every N bytes rather than every 32 elements > --

[jira] [Updated] (SPARK-5915) Spillable should check every N bytes rather than every 32 elements

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5915: - Target Version/s: 1.4.0 > Spillable should check every N bytes rather than every 32 elements > ---

[jira] [Created] (SPARK-5915) Spillable should check every N bytes rather than every 32 elements

2015-02-19 Thread Mingyu Kim (JIRA)
Mingyu Kim created SPARK-5915: - Summary: Spillable should check every N bytes rather than every 32 elements Key: SPARK-5915 URL: https://issues.apache.org/jira/browse/SPARK-5915 Project: Spark I

[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328364#comment-14328364 ] Sean Owen commented on SPARK-5669: -- It *should* be fine on the grounds that the native li

[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328358#comment-14328358 ] Xiangrui Meng commented on SPARK-5669: -- Agree that users should maintain their depend

[jira] [Updated] (SPARK-5914) Spark-submit cannot execute without machine admin permission on windows

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5914: - Component/s: (was: Spark Core) Windows Spark Submit Yes of course yo

[jira] [Created] (SPARK-5914) Spark-submit cannot execute without machine admin permission on windows

2015-02-19 Thread Judy Nash (JIRA)
Judy Nash created SPARK-5914: Summary: Spark-submit cannot execute without machine admin permission on windows Key: SPARK-5914 URL: https://issues.apache.org/jira/browse/SPARK-5914 Project: Spark

[jira] [Updated] (SPARK-4682) Consolidate various 'Clock' classes

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4682: - Affects Version/s: 1.2.0 > Consolidate various 'Clock' classes > --- > >

[jira] [Closed] (SPARK-4682) Consolidate various 'Clock' classes

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4682. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Sean Owen Target Version/s: 1.3

[jira] [Resolved] (SPARK-5889) remove pid file in spark-daemon.sh after killing the process.

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5889. -- Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 Issue resolved by pull request 46

[jira] [Commented] (SPARK-693) Let deploy scripts set alternate conf, work directories

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328281#comment-14328281 ] Apache Spark commented on SPARK-693: User 'chu11' has created a pull request for this i

[jira] [Created] (SPARK-5913) Python API for ChiSqSelector

2015-02-19 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5913: Summary: Python API for ChiSqSelector Key: SPARK-5913 URL: https://issues.apache.org/jira/browse/SPARK-5913 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-5912) Programming guide for feature selection

2015-02-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328255#comment-14328255 ] Joseph K. Bradley commented on SPARK-5912: -- Sure, can you please follow the examp

[jira] [Commented] (SPARK-1476) 2GB limit in spark for blocks

2015-02-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328248#comment-14328248 ] Marcelo Vanzin commented on SPARK-1476: --- Hi [~irashid], Approach sounds good. It wo

[jira] [Commented] (SPARK-5912) Programming guide for feature selection

2015-02-19 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328246#comment-14328246 ] Alexander Ulanov commented on SPARK-5912: - Sure, I can. Could you point me to some

[jira] [Commented] (SPARK-5912) Programming guide for feature selection

2015-02-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328238#comment-14328238 ] Joseph K. Bradley commented on SPARK-5912: -- [~avulanov] Would you have time to m

[jira] [Commented] (SPARK-3882) JobProgressListener gets permanently out of sync with long running job

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328237#comment-14328237 ] Andrew Or commented on SPARK-3882: -- Hi [~dgshep] is this still an issue after upgrading t

[jira] [Created] (SPARK-5912) Programming guide for feature selection

2015-02-19 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5912: Summary: Programming guide for feature selection Key: SPARK-5912 URL: https://issues.apache.org/jira/browse/SPARK-5912 Project: Spark Issue Type: Doc

[jira] [Updated] (SPARK-2033) Automatically cleanup checkpoint

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2033: - Affects Version/s: 1.0.0 > Automatically cleanup checkpoint > - > >

[jira] [Updated] (SPARK-3051) Support looking-up named accumulators in a registry

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-3051: - Affects Version/s: 1.0.0 > Support looking-up named accumulators in a registry > -

[jira] [Updated] (SPARK-911) Support map pruning on sorted (K, V) RDD's

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-911: Affects Version/s: 1.0.0 > Support map pruning on sorted (K, V) RDD's > -

[jira] [Closed] (SPARK-2188) Support sbt/sbt for Windows

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-2188. Resolution: Won't Fix > Support sbt/sbt for Windows > --- > > Key: S

[jira] [Updated] (SPARK-4669) Allow users to set arbitrary akka configurations via property file

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4669: - Affects Version/s: 1.0.0 > Allow users to set arbitrary akka configurations via property file > --

[jira] [Updated] (SPARK-4721) Improve first thread to put block failed

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4721: - Affects Version/s: 1.0.0 > Improve first thread to put block failed >

[jira] [Updated] (SPARK-5814) Remove JBLAS from runtime dependencies

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5814: - Priority: Major (was: Critical) > Remove JBLAS from runtime dependencies > --

[jira] [Updated] (SPARK-4848) On a stand-alone cluster, several worker-specific variables are read only on the master

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4848: - Affects Version/s: 1.0.0 > On a stand-alone cluster, several worker-specific variables are read only on >

[jira] [Updated] (SPARK-1537) Add integration with Yarn's Application Timeline Server

2015-02-19 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-1537: -- Attachment: SPARK-1537.txt High level design doc for spark ATS integration. > Add integration with Yarn

[jira] [Updated] (SPARK-1537) Add integration with Yarn's Application Timeline Server

2015-02-19 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-1537: -- Attachment: spark-1573.patch Patch against v1.2.1 > Add integration with Yarn's Application Timeline Se

[jira] [Updated] (SPARK-4848) On a stand-alone cluster, several worker-specific variables are read only on the master

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4848: - Component/s: (was: Project Infra) Deploy > On a stand-alone cluster, several worker-s

[jira] [Commented] (SPARK-5744) RDD.isEmpty / take fails for (empty) RDD of Nothing

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328173#comment-14328173 ] Apache Spark commented on SPARK-5744: - User 'srowen' has created a pull request for th

[jira] [Created] (SPARK-5911) Make Column.cast(to: String) support fixed precision and scale decimal type

2015-02-19 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5911: --- Summary: Make Column.cast(to: String) support fixed precision and scale decimal type Key: SPARK-5911 URL: https://issues.apache.org/jira/browse/SPARK-5911 Project: Spark

[jira] [Closed] (SPARK-3545) Put HadoopRDD.getPartitions forward and put TaskScheduler.start back in SparkContext to reduce DAGScheduler.JobSubmitted processing time and shorten cluster resources occu

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-3545. Resolution: Won't Fix > Put HadoopRDD.getPartitions forward and put TaskScheduler.start back in > SparkCont

[jira] [Commented] (SPARK-3545) Put HadoopRDD.getPartitions forward and put TaskScheduler.start back in SparkContext to reduce DAGScheduler.JobSubmitted processing time and shorten cluster resources o

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328163#comment-14328163 ] Andrew Or commented on SPARK-3545: -- This should be broken down into two issues. One is al

[jira] [Updated] (SPARK-4921) TaskSetManager mistakenly returns PROCESS_LOCAL for NO_PREF tasks

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4921: - Summary: TaskSetManager mistakenly returns PROCESS_LOCAL for NO_PREF tasks (was: Performance issue caused

[jira] [Updated] (SPARK-5316) DAGScheduler may make shuffleToMapStage leak if getParentStages fails

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5316: - Affects Version/s: 1.0.0 > DAGScheduler may make shuffleToMapStage leak if getParentStages fails > --

[jira] [Updated] (SPARK-4962) Put TaskScheduler.start back in SparkContext to shorten cluster resources occupation period

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4962: - Affects Version/s: 1.0.0 > Put TaskScheduler.start back in SparkContext to shorten cluster resources > oc

[jira] [Updated] (SPARK-5316) DAGScheduler may make shuffleToMapStage leak if getParentStages fails

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5316: - Priority: Major (was: Minor) > DAGScheduler may make shuffleToMapStage leak if getParentStages fails > -

[jira] [Resolved] (SPARK-5902) PipelineStage.transformSchema should be public, not private

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5902. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4682 [https://githu

[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328092#comment-14328092 ] Sean Owen commented on SPARK-5669: -- I do find it confusing. I can see an argument that th

[jira] [Updated] (SPARK-5337) respect spark.task.cpus when launch executors

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5337: - Affects Version/s: 1.0.0 > respect spark.task.cpus when launch executors > ---

[jira] [Created] (SPARK-5910) DataFrame.selectExpr("col as newName") does not work

2015-02-19 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5910: --- Summary: DataFrame.selectExpr("col as newName") does not work Key: SPARK-5910 URL: https://issues.apache.org/jira/browse/SPARK-5910 Project: Spark Issue Type: Bug

[jira] [Closed] (SPARK-5825) Failure stopping Services while command line argument is too long

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5825. Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 > Failure stopping Services while com

[jira] [Updated] (SPARK-5825) Failure stopping Services while command line argument is too long

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5825: - Affects Version/s: 1.0.0 > Failure stopping Services while command line argument is too long > ---

[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328010#comment-14328010 ] Xiangrui Meng commented on SPARK-5669: -- Yes, we are going to remove JBLAS anyway in 1

[jira] [Comment Edited] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328010#comment-14328010 ] Xiangrui Meng edited comment on SPARK-5669 at 2/19/15 7:43 PM: -

[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327997#comment-14327997 ] Sean Owen commented on SPARK-5669: -- Aha, that's a good point. I'm still not clear if this

[jira] [Commented] (SPARK-2628) Mesos backend throwing unable to find LoginModule

2015-02-19 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327978#comment-14327978 ] Timothy Chen commented on SPARK-2628: - Seems like this is fixed post 1.0.4, somewhere

[jira] [Closed] (SPARK-2628) Mesos backend throwing unable to find LoginModule

2015-02-19 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Chen closed SPARK-2628. --- Resolution: Won't Fix > Mesos backend throwing unable to find LoginModule > -

[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327953#comment-14327953 ] Xiangrui Meng commented on SPARK-5669: -- GFortran is part of GCC (https://gcc.gnu.org/

[jira] [Commented] (SPARK-5775) GenericRow cannot be cast to SpecificMutableRow when nested data and partitioned table

2015-02-19 Thread Anselme Vignon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327938#comment-14327938 ] Anselme Vignon commented on SPARK-5775: --- This bug is due to a problem in the TableSc

[jira] [Commented] (SPARK-5775) GenericRow cannot be cast to SpecificMutableRow when nested data and partitioned table

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327936#comment-14327936 ] Apache Spark commented on SPARK-5775: - User 'anselmevignon' has created a pull request

[jira] [Closed] (SPARK-5423) ExternalAppendOnlyMap won't delete temp spilled file if some exception happens during using it

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5423. Resolution: Fixed Fix Version/s: 1.2.2 1.1.2 1.3.0

[jira] [Comment Edited] (SPARK-5887) Class not found exception com.datastax.spark.connector.rdd.partitioner.CassandraPartition

2015-02-19 Thread Vijay Pawnarkar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327921#comment-14327921 ] Vijay Pawnarkar edited comment on SPARK-5887 at 2/19/15 6:35 PM: ---

[jira] [Commented] (SPARK-5887) Class not found exception com.datastax.spark.connector.rdd.partitioner.CassandraPartition

2015-02-19 Thread Vijay Pawnarkar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327921#comment-14327921 ] Vijay Pawnarkar commented on SPARK-5887: Thanks! This could be a class loader issu

[jira] [Commented] (SPARK-2389) globally shared SparkContext / shared Spark "application"

2015-02-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327903#comment-14327903 ] Patrick Wendell commented on SPARK-2389: I've seen some variants of this question

[jira] [Updated] (SPARK-5423) ExternalAppendOnlyMap won't delete temp spilled file if some exception happens during using it

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5423: - Affects Version/s: 1.0.0 > ExternalAppendOnlyMap won't delete temp spilled file if some exception > happe

[jira] [Updated] (SPARK-5902) PipelineStage.transformSchema should be public, not private

2015-02-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5902: - Summary: PipelineStage.transformSchema should be public, not private (was: PipelineStage.

[jira] [Updated] (SPARK-5902) PipelineStage.transformSchema should be public, not private

2015-02-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5902: - Description: For users to implement their own PipelineStages, we need to make PipelineStag

[jira] [Updated] (SPARK-5423) ExternalAppendOnlyMap won't delete temp spilled file if some exception happens during using it

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5423: - Priority: Major (was: Minor) > ExternalAppendOnlyMap won't delete temp spilled file if some exception >

[jira] [Updated] (SPARK-5719) allow daemons to bind to specified host

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5719: - Affects Version/s: 1.0.0 > allow daemons to bind to specified host > -

[jira] [Commented] (SPARK-4423) Improve foreach() documentation to avoid confusion between local- and cluster-mode behavior

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327833#comment-14327833 ] Apache Spark commented on SPARK-4423: - User 'ilganeli' has created a pull request for

[jira] [Resolved] (SPARK-5887) Class not found exception com.datastax.spark.connector.rdd.partitioner.CassandraPartition

2015-02-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5887. Resolution: Invalid The Datastax connector is not part of the Apache Spark distribution, it'

[jira] [Updated] (SPARK-5863) Performance regression in Spark SQL/Parquet due to ScalaReflection.convertRowToScala

2015-02-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5863: --- Priority: Critical (was: Major) > Performance regression in Spark SQL/Parquet due to > Scala

[jira] [Closed] (SPARK-5548) Flaky test: o.a.s.util.AkkaUtilsSuite.remote fetch ssl on - untrusted server

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5548. Resolution: Fixed Fix Version/s: 1.3.0 Target Version/s: 1.3.0 Closing again https://github

[jira] [Commented] (SPARK-5900) Wrap the results returned by PIC and FPGrowth in case classes

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327787#comment-14327787 ] Apache Spark commented on SPARK-5900: - User 'mengxr' has created a pull request for th

[jira] [Closed] (SPARK-5907) Selected column from DataFrame should not re-analyze logical plan

2015-02-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-5907. -- Resolution: Duplicate > Selected column from DataFrame should not re-analyze logical plan >

[jira] [Commented] (SPARK-5909) Add a clearCache command to Spark SQL's cache manager

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327706#comment-14327706 ] Apache Spark commented on SPARK-5909: - User 'yhuai' has created a pull request for thi

[jira] [Created] (SPARK-5909) Add a clearCache command to Spark SQL's cache manager

2015-02-19 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5909: --- Summary: Add a clearCache command to Spark SQL's cache manager Key: SPARK-5909 URL: https://issues.apache.org/jira/browse/SPARK-5909 Project: Spark Issue Type: Task

[jira] [Updated] (SPARK-5881) RDD remains cached after the table gets overridden by "CACHE TABLE"

2015-02-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5881: Priority: Major (was: Blocker) > RDD remains cached after the table gets overridden by "CACHE TABLE" >

[jira] [Commented] (SPARK-5881) RDD remains cached after the table gets overridden by "CACHE TABLE"

2015-02-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327696#comment-14327696 ] Yin Huai commented on SPARK-5881: - As mentioned by [~lian cheng], we should also track the

[jira] [Commented] (SPARK-5494) SparkSqlSerializer Ignores KryoRegistrators

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327657#comment-14327657 ] Apache Spark commented on SPARK-5494: - User 'hkothari' has created a pull request for

[jira] [Commented] (SPARK-5908) Hive udtf with single alias should be resolved correctly

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327512#comment-14327512 ] Apache Spark commented on SPARK-5908: - User 'viirya' has created a pull request for th

[jira] [Created] (SPARK-5908) Hive udtf with single alias should be resolved correctly

2015-02-19 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5908: -- Summary: Hive udtf with single alias should be resolved correctly Key: SPARK-5908 URL: https://issues.apache.org/jira/browse/SPARK-5908 Project: Spark Is

[jira] [Commented] (SPARK-5907) Selected column from DataFrame should not re-analyze logical plan

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327494#comment-14327494 ] Apache Spark commented on SPARK-5907: - User 'viirya' has created a pull request for th

[jira] [Created] (SPARK-5907) Selected column from DataFrame should not re-analyze logical plan

2015-02-19 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5907: -- Summary: Selected column from DataFrame should not re-analyze logical plan Key: SPARK-5907 URL: https://issues.apache.org/jira/browse/SPARK-5907 Project: Spark

[jira] [Commented] (SPARK-1476) 2GB limit in spark for blocks

2015-02-19 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327394#comment-14327394 ] Imran Rashid commented on SPARK-1476: - Based on discussion on the dev list, [~mridulm8

[jira] [Updated] (SPARK-5825) Failure stopping Services while command line argument is too long

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5825: - Component/s: (was: Spark Submit) Deploy Target Version/s: 1.3.0, 1.2.2

[jira] [Updated] (SPARK-5889) remove pid file in spark-daemon.sh after killing the process.

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5889: - Component/s: Deploy > remove pid file in spark-daemon.sh after killing the process. >

[jira] [Updated] (SPARK-5889) remove pid file in spark-daemon.sh after killing the process.

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5889: - Priority: Minor (was: Major) Target Version/s: 1.3.0, 1.2.2 Affects Version/s: 1.2.1

[jira] [Commented] (SPARK-5889) remove pid file in spark-daemon.sh after killing the process.

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327238#comment-14327238 ] Sean Owen commented on SPARK-5889: -- Yeah I wanted to do this in the original PR, although

[jira] [Resolved] (SPARK-5899) Viewing specific stage information which contains thousands of tasks will freak out the driver and CPU cores from where it runs

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5899. -- Resolution: Duplicate > Viewing specific stage information which contains thousands of tasks will > fre

[jira] [Comment Edited] (SPARK-5837) HTTP 500 if try to access Spark UI in yarn-cluster or yarn-client mode

2015-02-19 Thread Rok Roskar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327182#comment-14327182 ] Rok Roskar edited comment on SPARK-5837 at 2/19/15 9:59 AM: th
