[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327133#comment-14327133 ] Sean Owen commented on SPARK-5669: -- [~mengxr] That just applies to GCC, right? it still

[jira] [Created] (SPARK-5910) DataFrame.selectExpr(col as newName) does not work

2015-02-19 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5910: --- Summary: DataFrame.selectExpr(col as newName) does not work Key: SPARK-5910 URL: https://issues.apache.org/jira/browse/SPARK-5910 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5337) respect spark.task.cpus when launch executors

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5337: - Affects Version/s: 1.0.0 respect spark.task.cpus when launch executors

[jira] [Closed] (SPARK-2628) Mesos backend throwing unable to find LoginModule

2015-02-19 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Chen closed SPARK-2628. --- Resolution: Won't Fix Mesos backend throwing unable to find LoginModule

[jira] [Commented] (SPARK-2628) Mesos backend throwing unable to find LoginModule

2015-02-19 Thread Timothy Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327978#comment-14327978 ] Timothy Chen commented on SPARK-2628: - Seems like this is fixed post 1.0.4, somewhere

[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328010#comment-14328010 ] Xiangrui Meng commented on SPARK-5669: -- Yes, we are going to remove JBLAS anyway in

[jira] [Comment Edited] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328010#comment-14328010 ] Xiangrui Meng edited comment on SPARK-5669 at 2/19/15 7:43 PM:

[jira] [Updated] (SPARK-5825) Failure stopping Services while command line argument is too long

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5825: - Affects Version/s: 1.0.0 Failure stopping Services while command line argument is too long

[jira] [Closed] (SPARK-5825) Failure stopping Services while command line argument is too long

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5825. Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 Failure stopping Services while

[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328092#comment-14328092 ] Sean Owen commented on SPARK-5669: -- I do find it confusing. I can see an argument that

[jira] [Resolved] (SPARK-5902) PipelineStage.transformSchema should be public, not private

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5902. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4682

[jira] [Updated] (SPARK-5825) Failure stopping Services while command line argument is too long

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5825: - Component/s: (was: Spark Submit) Deploy Target Version/s: 1.3.0, 1.2.2

[jira] [Comment Edited] (SPARK-5837) HTTP 500 if try to access Spark UI in yarn-cluster or yarn-client mode

2015-02-19 Thread Rok Roskar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327182#comment-14327182 ] Rok Roskar edited comment on SPARK-5837 at 2/19/15 9:59 AM:

[jira] [Updated] (SPARK-5889) remove pid file in spark-daemon.sh after killing the process.

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5889: - Priority: Minor (was: Major) Target Version/s: 1.3.0, 1.2.2 Affects Version/s: 1.2.1

[jira] [Commented] (SPARK-5889) remove pid file in spark-daemon.sh after killing the process.

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327238#comment-14327238 ] Sean Owen commented on SPARK-5889: -- Yeah I wanted to do this in the original PR, although

[jira] [Resolved] (SPARK-5899) Viewing specific stage information which contains thousands of tasks will freak out the driver and CPU cores from where it runs

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5899. -- Resolution: Duplicate Viewing specific stage information which contains thousands of tasks will

[jira] [Commented] (SPARK-5837) HTTP 500 if try to access Spark UI in yarn-cluster or yarn-client mode

2015-02-19 Thread Rok Roskar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327182#comment-14327182 ] Rok Roskar commented on SPARK-5837: --- this looks to be a yarn issue:

[jira] [Updated] (SPARK-5889) remove pid file in spark-daemon.sh after killing the process.

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5889: - Component/s: Deploy remove pid file in spark-daemon.sh after killing the process.

[jira] [Commented] (SPARK-1476) 2GB limit in spark for blocks

2015-02-19 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327394#comment-14327394 ] Imran Rashid commented on SPARK-1476: - Based on discussion on the dev list,

[jira] [Commented] (SPARK-5494) SparkSqlSerializer Ignores KryoRegistrators

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327657#comment-14327657 ] Apache Spark commented on SPARK-5494: - User 'hkothari' has created a pull request for

[jira] [Created] (SPARK-5907) Selected column from DataFrame should not re-analyze logical plan

2015-02-19 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5907: -- Summary: Selected column from DataFrame should not re-analyze logical plan Key: SPARK-5907 URL: https://issues.apache.org/jira/browse/SPARK-5907 Project: Spark

[jira] [Commented] (SPARK-5908) Hive udtf with single alias should be resolved correctly

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327512#comment-14327512 ] Apache Spark commented on SPARK-5908: - User 'viirya' has created a pull request for

[jira] [Updated] (SPARK-5719) allow daemons to bind to specified host

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5719: - Affects Version/s: 1.0.0 allow daemons to bind to specified host

[jira] [Closed] (SPARK-5423) ExternalAppendOnlyMap won't delete temp spilled file if some exception happens during using it

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5423. Resolution: Fixed Fix Version/s: 1.2.2 1.1.2 1.3.0

[jira] [Commented] (SPARK-5775) GenericRow cannot be cast to SpecificMutableRow when nested data and partitioned table

2015-02-19 Thread Anselme Vignon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327938#comment-14327938 ] Anselme Vignon commented on SPARK-5775: --- This bug is due to a problem in the

[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327953#comment-14327953 ] Xiangrui Meng commented on SPARK-5669: -- GFortran is part of GCC

[jira] [Commented] (SPARK-4423) Improve foreach() documentation to avoid confusion between local- and cluster-mode behavior

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327833#comment-14327833 ] Apache Spark commented on SPARK-4423: - User 'ilganeli' has created a pull request for

[jira] [Updated] (SPARK-5902) PipelineStage.transformSchema should be public, not private

2015-02-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5902: - Description: For users to implement their own PipelineStages, we need to make

[jira] [Updated] (SPARK-5423) ExternalAppendOnlyMap won't delete temp spilled file if some exception happens during using it

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5423: - Affects Version/s: 1.0.0 ExternalAppendOnlyMap won't delete temp spilled file if some exception

[jira] [Commented] (SPARK-5887) Class not found exception com.datastax.spark.connector.rdd.partitioner.CassandraPartition

2015-02-19 Thread Vijay Pawnarkar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327921#comment-14327921 ] Vijay Pawnarkar commented on SPARK-5887: Thanks! This could be a class loader

[jira] [Commented] (SPARK-5775) GenericRow cannot be cast to SpecificMutableRow when nested data and partitioned table

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327936#comment-14327936 ] Apache Spark commented on SPARK-5775: - User 'anselmevignon' has created a pull request

[jira] [Updated] (SPARK-5423) ExternalAppendOnlyMap won't delete temp spilled file if some exception happens during using it

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5423: - Priority: Major (was: Minor) ExternalAppendOnlyMap won't delete temp spilled file if some exception

[jira] [Resolved] (SPARK-5887) Class not found exception com.datastax.spark.connector.rdd.partitioner.CassandraPartition

2015-02-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5887. Resolution: Invalid The Datastax connector is not part of the Apache Spark distribution,

[jira] [Updated] (SPARK-5863) Performance regression in Spark SQL/Parquet due to ScalaReflection.convertRowToScala

2015-02-19 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5863: --- Priority: Critical (was: Major) Performance regression in Spark SQL/Parquet due to

[jira] [Comment Edited] (SPARK-5887) Class not found exception com.datastax.spark.connector.rdd.partitioner.CassandraPartition

2015-02-19 Thread Vijay Pawnarkar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327921#comment-14327921 ] Vijay Pawnarkar edited comment on SPARK-5887 at 2/19/15 6:35 PM:

[jira] [Updated] (SPARK-5316) DAGScheduler may make shuffleToMapStage leak if getParentStages failes

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5316: - Priority: Major (was: Minor) DAGScheduler may make shuffleToMapStage leak if getParentStages failes

[jira] [Updated] (SPARK-4962) Put TaskScheduler.start back in SparkContext to shorten cluster resources occupation period

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4962: - Affects Version/s: 1.0.0 Put TaskScheduler.start back in SparkContext to shorten cluster resources

[jira] [Updated] (SPARK-5316) DAGScheduler may make shuffleToMapStage leak if getParentStages failes

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5316: - Affects Version/s: 1.0.0 DAGScheduler may make shuffleToMapStage leak if getParentStages failes

[jira] [Updated] (SPARK-4921) TaskSetManager mistakenly returns PROCESS_LOCAL for NO_PREF tasks

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4921: - Summary: TaskSetManager mistakenly returns PROCESS_LOCAL for NO_PREF tasks (was: Performance issue

[jira] [Closed] (SPARK-3545) Put HadoopRDD.getPartitions forward and put TaskScheduler.start back in SparkContext to reduce DAGScheduler.JobSubmitted processing time and shorten cluster resources occu

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-3545. Resolution: Won't Fix Put HadoopRDD.getPartitions forward and put TaskScheduler.start back in

[jira] [Updated] (SPARK-1537) Add integration with Yarn's Application Timeline Server

2015-02-19 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-1537: -- Attachment: SPARK-1537.txt High level design doc for spark ATS integration. Add integration with

[jira] [Updated] (SPARK-5814) Remove JBLAS from runtime dependencies

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5814: - Priority: Major (was: Critical) Remove JBLAS from runtime dependencies

[jira] [Created] (SPARK-5911) Make Column.cast(to: String) support fixed precision and scale decimal type

2015-02-19 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5911: --- Summary: Make Column.cast(to: String) support fixed precision and scale decimal type Key: SPARK-5911 URL: https://issues.apache.org/jira/browse/SPARK-5911 Project: Spark

[jira] [Commented] (SPARK-5744) RDD.isEmpty / take fails for (empty) RDD of Nothing

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328173#comment-14328173 ] Apache Spark commented on SPARK-5744: - User 'srowen' has created a pull request for

[jira] [Updated] (SPARK-4848) On a stand-alone cluster, several worker-specific variables are read only on the master

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4848: - Component/s: (was: Project Infra) Deploy On a stand-alone cluster, several

[jira] [Updated] (SPARK-1537) Add integration with Yarn's Application Timeline Server

2015-02-19 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-1537: -- Attachment: spark-1573.patch Patch against v1.2.1 Add integration with Yarn's Application Timeline

[jira] [Updated] (SPARK-4848) On a stand-alone cluster, several worker-specific variables are read only on the master

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4848: - Affects Version/s: 1.0.0 On a stand-alone cluster, several worker-specific variables are read only on

[jira] [Updated] (SPARK-4721) Improve first thread to put block failed

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4721: - Affects Version/s: 1.0.0 Improve first thread to put block failed

[jira] [Updated] (SPARK-4669) Allow users to set arbitrary akka configurations via property file

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4669: - Affects Version/s: 1.0.0 Allow users to set arbitrary akka configurations via property file

[jira] [Closed] (SPARK-2188) Support sbt/sbt for Windows

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-2188. Resolution: Won't Fix Support sbt/sbt for Windows --- Key:

[jira] [Updated] (SPARK-911) Support map pruning on sorted (K, V) RDD's

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-911: Affects Version/s: 1.0.0 Support map pruning on sorted (K, V) RDD's

[jira] [Updated] (SPARK-3051) Support looking-up named accumulators in a registry

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-3051: - Affects Version/s: 1.0.0 Support looking-up named accumulators in a registry

[jira] [Updated] (SPARK-2033) Automatically cleanup checkpoint

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-2033: - Affects Version/s: 1.0.0 Automatically cleanup checkpoint -

[jira] [Created] (SPARK-5912) Programming guide for feature selection

2015-02-19 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5912: Summary: Programming guide for feature selection Key: SPARK-5912 URL: https://issues.apache.org/jira/browse/SPARK-5912 Project: Spark Issue Type:

[jira] [Commented] (SPARK-3882) JobProgressListener gets permanently out of sync with long running job

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328237#comment-14328237 ] Andrew Or commented on SPARK-3882: -- Hi [~dgshep] is this still an issue after upgrading

[jira] [Commented] (SPARK-5912) Programming guide for feature selection

2015-02-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328238#comment-14328238 ] Joseph K. Bradley commented on SPARK-5912: -- [~avulanov] Would you have time to

[jira] [Commented] (SPARK-5912) Programming guide for feature selection

2015-02-19 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328246#comment-14328246 ] Alexander Ulanov commented on SPARK-5912: - Sure, I can. Could you point me to some

[jira] [Commented] (SPARK-1476) 2GB limit in spark for blocks

2015-02-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328248#comment-14328248 ] Marcelo Vanzin commented on SPARK-1476: --- Hi [~irashid], Approach sounds good. It

[jira] [Created] (SPARK-5918) Spark Thrift server reports metadata for VARCHAR column as STRING in result set schema

2015-02-19 Thread Holman Lan (JIRA)
Holman Lan created SPARK-5918: - Summary: Spark Thrift server reports metadata for VARCHAR column as STRING in result set schema Key: SPARK-5918 URL: https://issues.apache.org/jira/browse/SPARK-5918

[jira] [Commented] (SPARK-5879) spary_ec2.py should expose/return master and slave lists (e.g. write to file)

2015-02-19 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328612#comment-14328612 ] Florian Verhein commented on SPARK-5879: cc [~shivaram], any opinions on how to

[jira] [Commented] (SPARK-4144) Support incremental model training of Naive Bayes classifier

2015-02-19 Thread Jatinpreet Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328569#comment-14328569 ] Jatinpreet Singh commented on SPARK-4144: - Hi, I have been waiting for this

[jira] [Commented] (SPARK-4655) Split Stage into ShuffleMapStage and ResultStage subclasses

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328659#comment-14328659 ] Apache Spark commented on SPARK-4655: - User 'ilganeli' has created a pull request for

[jira] [Commented] (SPARK-5912) Programming guide for feature selection

2015-02-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328255#comment-14328255 ] Joseph K. Bradley commented on SPARK-5912: -- Sure, can you please follow the

[jira] [Updated] (SPARK-5914) Spark-submit cannot execute without machine admin permission on windows

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5914: - Component/s: (was: Spark Core) Windows Spark Submit Yes of course

[jira] [Resolved] (SPARK-5900) Wrap the results returned by PIC and FPGrowth in case classes

2015-02-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5900. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4695

[jira] [Created] (SPARK-5909) Add a clearCache command to Spark SQL's cache manager

2015-02-19 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5909: --- Summary: Add a clearCache command to Spark SQL's cache manager Key: SPARK-5909 URL: https://issues.apache.org/jira/browse/SPARK-5909 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-5909) Add a clearCache command to Spark SQL's cache manager

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327706#comment-14327706 ] Apache Spark commented on SPARK-5909: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-5881) RDD remains cached after the table gets overridden by CACHE TABLE

2015-02-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327696#comment-14327696 ] Yin Huai commented on SPARK-5881: - As mentioned by [~lian cheng], we should also track the

[jira] [Updated] (SPARK-5881) RDD remains cached after the table gets overridden by CACHE TABLE

2015-02-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5881: Priority: Major (was: Blocker) RDD remains cached after the table gets overridden by CACHE TABLE

[jira] [Commented] (SPARK-5907) Selected column from DataFrame should not re-analyze logical plan

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327494#comment-14327494 ] Apache Spark commented on SPARK-5907: - User 'viirya' has created a pull request for

[jira] [Created] (SPARK-5908) Hive udtf with single alias should be resolved correctly

2015-02-19 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5908: -- Summary: Hive udtf with single alias should be resolved correctly Key: SPARK-5908 URL: https://issues.apache.org/jira/browse/SPARK-5908 Project: Spark

[jira] [Closed] (SPARK-5907) Selected column from DataFrame should not re-analyze logical plan

2015-02-19 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-5907. -- Resolution: Duplicate Selected column from DataFrame should not re-analyze logical plan

[jira] [Commented] (SPARK-5900) Wrap the results returned by PIC and FPGrowth in case classes

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327787#comment-14327787 ] Apache Spark commented on SPARK-5900: - User 'mengxr' has created a pull request for

[jira] [Closed] (SPARK-5548) Flaky test: o.a.s.util.AkkaUtilsSuite.remote fetch ssl on - untrusted server

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5548. Resolution: Fixed Fix Version/s: 1.3.0 Target Version/s: 1.3.0 Closing again

[jira] [Resolved] (SPARK-5889) remove pid file in spark-daemon.sh after killing the process.

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5889. -- Resolution: Fixed Fix Version/s: 1.2.2 1.3.0 Issue resolved by pull request

[jira] [Created] (SPARK-5914) Spark-submit cannot execute without machine admin permission on windows

2015-02-19 Thread Judy Nash (JIRA)
Judy Nash created SPARK-5914: Summary: Spark-submit cannot execute without machine admin permission on windows Key: SPARK-5914 URL: https://issues.apache.org/jira/browse/SPARK-5914 Project: Spark

[jira] [Created] (SPARK-5915) Spillable should check every N bytes rather than every 32 elements

2015-02-19 Thread Mingyu Kim (JIRA)
Mingyu Kim created SPARK-5915: - Summary: Spillable should check every N bytes rather than every 32 elements Key: SPARK-5915 URL: https://issues.apache.org/jira/browse/SPARK-5915 Project: Spark

[jira] [Updated] (SPARK-4808) Spark fails to spill with small number of large objects

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4808: - Target Version/s: 1.3.0, 1.4.0 (was: 1.2.1) Spark fails to spill with small number of large objects

[jira] [Updated] (SPARK-5915) Spillable should check every N bytes rather than every 32 elements

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5915: - Target Version/s: 1.4.0 Spillable should check every N bytes rather than every 32 elements

[jira] [Updated] (SPARK-5915) Spillable should check every N bytes rather than every 32 elements

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5915: - Affects Version/s: 1.0.0 Spillable should check every N bytes rather than every 32 elements

[jira] [Commented] (SPARK-5753) add basic support to JDBCRDD for postgresql types: uuid, hstore, and array

2015-02-19 Thread Evan Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328449#comment-14328449 ] Evan Yu commented on SPARK-5753: Ignore this, commit under wrong ticket add basic

[jira] [Created] (SPARK-5917) Distinct is broken

2015-02-19 Thread Derrick Burns (JIRA)
Derrick Burns created SPARK-5917: Summary: Distinct is broken Key: SPARK-5917 URL: https://issues.apache.org/jira/browse/SPARK-5917 Project: Spark Issue Type: Bug Components: MLlib

[jira] [Updated] (SPARK-4682) Consolidate various 'Clock' classes

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4682: - Affects Version/s: 1.2.0 Consolidate various 'Clock' classes ---

[jira] [Closed] (SPARK-4682) Consolidate various 'Clock' classes

2015-02-19 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4682. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Sean Owen Target Version/s:

[jira] [Commented] (SPARK-5669) Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS

2015-02-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328364#comment-14328364 ] Sean Owen commented on SPARK-5669: -- It *should* be fine on the grounds that the native

[jira] [Created] (SPARK-5913) Python API for ChiSqSelector

2015-02-19 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5913: Summary: Python API for ChiSqSelector Key: SPARK-5913 URL: https://issues.apache.org/jira/browse/SPARK-5913 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-5860) JdbcRDD: overflow on large range with high number of partitions

2015-02-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328454#comment-14328454 ] Apache Spark commented on SPARK-5860: - User 'hotou' has created a pull request for