[jira] [Updated] (SPARK-1391) BlockManager cannot transfer blocks larger than 2G in size

2014-04-02 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1391: - Assignee: Min Zhou BlockManager cannot transfer blocks larger than 2G in size

[jira] [Updated] (SPARK-1133) Add a new small files input for MLlib, which will return an RDD[(fileName, content)]

2014-04-02 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1133: - Assignee: Xusen Yin Add a new small files input for MLlib, which will return an RDD[(fileName,

[jira] [Updated] (SPARK-1162) Add top() and takeOrdered() to PySpark

2014-04-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1162: - Fix Version/s: 0.9.2 1.0.0 Add top() and takeOrdered() to PySpark

[jira] [Resolved] (SPARK-1162) Add top() and takeOrdered() to PySpark

2014-04-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1162. -- Resolution: Fixed Add top() and takeOrdered() to PySpark

[jira] [Updated] (SPARK-1162) Add top() and takeOrdered() to PySpark

2014-04-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1162: - Assignee: Prashant Sharma (was: prashant) Add top() and takeOrdered() to PySpark

[jira] [Resolved] (SPARK-1333) Java API for running SQL queries

2014-04-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1333. -- Resolution: Fixed Java API for running SQL queries

[jira] [Updated] (SPARK-1134) ipython won't run standalone python script

2014-04-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1134: - Affects Version/s: 0.9.1 ipython won't run standalone python script

[jira] [Resolved] (SPARK-1134) ipython won't run standalone python script

2014-04-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1134. -- Resolution: Fixed ipython won't run standalone python script

[jira] [Updated] (SPARK-1134) ipython won't run standalone python script

2014-04-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1134: - Fix Version/s: 0.9.2 1.0.0 ipython won't run standalone python script

[jira] [Updated] (SPARK-1296) Make RDDs Covariant

2014-04-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1296: - Fix Version/s: (was: 1.0.0) Make RDDs Covariant --- Key:

[jira] [Created] (SPARK-1413) Parquet messes up stdout and stdin when used in Spark REPL

2014-04-03 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1413: Summary: Parquet messes up stdout and stdin when used in Spark REPL Key: SPARK-1413 URL: https://issues.apache.org/jira/browse/SPARK-1413 Project: Spark

[jira] [Resolved] (SPARK-1133) Add a new small files input for MLlib, which will return an RDD[(fileName, content)]

2014-04-04 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1133. -- Resolution: Fixed Fix Version/s: 1.0.0 Add a new small files input for MLlib, which

[jira] [Assigned] (SPARK-1414) Python API for SparkContext.wholeTextFiles

2014-04-04 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-1414: Assignee: Matei Zaharia Python API for SparkContext.wholeTextFiles

[jira] [Created] (SPARK-1416) Add support for SequenceFiles in PySpark

2014-04-04 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1416: Summary: Add support for SequenceFiles in PySpark Key: SPARK-1416 URL: https://issues.apache.org/jira/browse/SPARK-1416 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-1198) Allow pipes tasks to run in different sub-directories

2014-04-04 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1198. -- Resolution: Fixed Fix Version/s: 1.0.0 Allow pipes tasks to run in different

[jira] [Created] (SPARK-1423) Add scripts for launching Spark on Windows Azure

2014-04-05 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1423: Summary: Add scripts for launching Spark on Windows Azure Key: SPARK-1423 URL: https://issues.apache.org/jira/browse/SPARK-1423 Project: Spark Issue Type:

[jira] [Created] (SPARK-1422) Add scripts for launching Spark on Google Compute Engine

2014-04-05 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1422: Summary: Add scripts for launching Spark on Google Compute Engine Key: SPARK-1422 URL: https://issues.apache.org/jira/browse/SPARK-1422 Project: Spark Issue

[jira] [Commented] (SPARK-1424) InsertInto should work on JavaSchemaRDD as well.

2014-04-05 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13961283#comment-13961283 ] Matei Zaharia commented on SPARK-1424: -- More generally we should have flags to

[jira] [Updated] (SPARK-1421) Make MLlib work on Python 2.6

2014-04-05 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1421: - Description: Currently it requires Python 2.7 because it uses some new APIs, but they should not

[jira] [Updated] (SPARK-1421) Make MLlib work on Python 2.6

2014-04-05 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1421: - Summary: Make MLlib work on Python 2.6 (was: Make MLlib work on Python 2.6 and NumPy 1.7)

[jira] [Created] (SPARK-1426) Make MLlib work with NumPy versions older than 1.7

2014-04-05 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1426: Summary: Make MLlib work with NumPy versions older than 1.7 Key: SPARK-1426 URL: https://issues.apache.org/jira/browse/SPARK-1426 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-1421) Make MLlib work on Python 2.6

2014-04-05 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-1421: Assignee: Matei Zaharia Make MLlib work on Python 2.6 -

[jira] [Resolved] (SPARK-1421) Make MLlib work on Python 2.6

2014-04-05 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1421. -- Resolution: Fixed Fix Version/s: 0.9.2 1.0.0 Make MLlib work on

[jira] [Commented] (SPARK-1021) sortByKey() launches a cluster job when it shouldn't

2014-04-07 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13962026#comment-13962026 ] Matei Zaharia commented on SPARK-1021: -- Note that if we do this, we'll need a similar

[jira] [Assigned] (SPARK-1428) MLlib should convert non-float64 NumPy arrays to float64 instead of complaining

2014-04-08 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-1428: Assignee: Matei Zaharia MLlib should convert non-float64 NumPy arrays to float64 instead

[jira] [Updated] (SPARK-1428) MLlib should convert non-float64 NumPy arrays to float64 instead of complaining

2014-04-08 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1428: - Assignee: Sandeep Singh (was: Matei Zaharia) MLlib should convert non-float64 NumPy arrays to

[jira] [Resolved] (SPARK-1428) MLlib should convert non-float64 NumPy arrays to float64 instead of complaining

2014-04-10 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1428. -- Resolution: Fixed Fix Version/s: 1.0.0 MLlib should convert non-float64 NumPy arrays

[jira] [Created] (SPARK-1467) Make StorageLevel.apply() factory methods experimental

2014-04-10 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1467: Summary: Make StorageLevel.apply() factory methods experimental Key: SPARK-1467 URL: https://issues.apache.org/jira/browse/SPARK-1467 Project: Spark Issue

[jira] [Resolved] (SPARK-1241) Support sliding in RDD

2014-04-11 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1241. -- Resolution: Fixed Fix Version/s: 1.0.0 Support sliding in RDD --

[jira] [Commented] (SPARK-1225) ROC AUC and Average Precision for Binary classification models

2014-04-11 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13966987#comment-13966987 ] Matei Zaharia commented on SPARK-1225: -- Included in

[jira] [Commented] (SPARK-1355) Switch website to the Apache CMS

2014-04-11 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13967196#comment-13967196 ] Matei Zaharia commented on SPARK-1355: -- I have to say this was pretty good, I avoided

[jira] [Created] (SPARK-1481) Add Naive Bayes to MLlib documentation

2014-04-12 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1481: Summary: Add Naive Bayes to MLlib documentation Key: SPARK-1481 URL: https://issues.apache.org/jira/browse/SPARK-1481 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-1484) MLlib should warn if you are using an iterative algorithm on non-cached data

2014-04-13 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1484: Summary: MLlib should warn if you are using an iterative algorithm on non-cached data Key: SPARK-1484 URL: https://issues.apache.org/jira/browse/SPARK-1484 Project:

[jira] [Resolved] (SPARK-1462) Examples of ML algorithms are using deprecated APIs

2014-04-16 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1462. -- Resolution: Fixed Fix Version/s: 1.0.0 Examples of ML algorithms are using deprecated

[jira] [Updated] (SPARK-1535) jblas's DoubleMatrix(double[]) ctor creates garbage; avoid

2014-04-19 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1535: - Assignee: Tor Myklebust jblas's DoubleMatrix(double[]) ctor creates garbage; avoid

[jira] [Resolved] (SPARK-1535) jblas's DoubleMatrix(double[]) ctor creates garbage; avoid

2014-04-19 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1535. -- Resolution: Fixed Fix Version/s: 1.0.0 jblas's DoubleMatrix(double[]) ctor creates

[jira] [Created] (SPARK-1540) Investigate whether we should require keys in PairRDD to be Comparable

2014-04-19 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1540: Summary: Investigate whether we should require keys in PairRDD to be Comparable Key: SPARK-1540 URL: https://issues.apache.org/jira/browse/SPARK-1540 Project: Spark

[jira] [Commented] (SPARK-1439) Aggregate Scaladocs across projects

2014-04-20 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13975226#comment-13975226 ] Matei Zaharia commented on SPARK-1439: -- Thanks for looking into this, Sean. Instead

[jira] [Updated] (SPARK-1536) Add multiclass classification support to MLlib

2014-04-20 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1536: - Assignee: Manish Amde Add multiclass classification support to MLlib

[jira] [Updated] (SPARK-1546) Add AdaBoost algorithm to Spark MLlib

2014-04-20 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1546: - Assignee: Manish Amde Add AdaBoost algorithm to Spark MLlib

[jira] [Updated] (SPARK-1547) Add gradient boosting algorithm to MLlib

2014-04-20 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1547: - Assignee: Manish Amde Add gradient boosting algorithm to MLlib

[jira] [Updated] (SPARK-1544) Add support for creating deep decision trees.

2014-04-20 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1544: - Assignee: Manish Amde Add support for creating deep decision trees.

[jira] [Updated] (SPARK-1545) Add Random Forest algorithm to MLlib

2014-04-20 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1545: - Assignee: Manish Amde Add Random Forest algorithm to MLlib

[jira] [Assigned] (SPARK-1439) Aggregate Scaladocs across projects

2014-04-21 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-1439: Assignee: Matei Zaharia Aggregate Scaladocs across projects

[jira] [Assigned] (SPARK-1440) Generate JavaDoc instead of ScalaDoc for Java API

2014-04-21 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-1440: Assignee: Matei Zaharia Generate JavaDoc instead of ScalaDoc for Java API

[jira] [Created] (SPARK-1554) Update doc overview page to not mention building if you get a pre-built distro

2014-04-21 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1554: Summary: Update doc overview page to not mention building if you get a pre-built distro Key: SPARK-1554 URL: https://issues.apache.org/jira/browse/SPARK-1554

[jira] [Created] (SPARK-1563) Add package-info.java files for all packages that appear in Javadoc

2014-04-22 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1563: Summary: Add package-info.java files for all packages that appear in Javadoc Key: SPARK-1563 URL: https://issues.apache.org/jira/browse/SPARK-1563 Project: Spark

[jira] [Created] (SPARK-1564) Add JavaScript into Javadoc to turn ::Experimental:: and such into badges

2014-04-22 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1564: Summary: Add JavaScript into Javadoc to turn ::Experimental:: and such into badges Key: SPARK-1564 URL: https://issues.apache.org/jira/browse/SPARK-1564 Project:

[jira] [Created] (SPARK-1567) Add language tabs to quick start guide

2014-04-22 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1567: Summary: Add language tabs to quick start guide Key: SPARK-1567 URL: https://issues.apache.org/jira/browse/SPARK-1567 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-1566) Consolidate the Spark Programming Guide with tabs for all languages

2014-04-22 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1566: Summary: Consolidate the Spark Programming Guide with tabs for all languages Key: SPARK-1566 URL: https://issues.apache.org/jira/browse/SPARK-1566 Project: Spark

[jira] [Updated] (SPARK-1563) Add package-info.java and package.scala files for all packages that appear in docs

2014-04-22 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1563: - Summary: Add package-info.java and package.scala files for all packages that appear in docs

[jira] [Resolved] (SPARK-1540) Investigate whether we should require keys in PairRDD to be Comparable

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1540. -- Resolution: Fixed Resolved here: https://github.com/apache/spark/pull/487/files. We were able

[jira] [Commented] (SPARK-1540) Investigate whether we should require keys in PairRDD to be Comparable

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13979393#comment-13979393 ] Matei Zaharia commented on SPARK-1540: -- Note that it will remain to add this to the

[jira] [Updated] (SPARK-1548) Add Partial Random Forest algorithm to MLlib

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1548: - Assignee: Jason Day Add Partial Random Forest algorithm to MLlib

[jira] [Updated] (SPARK-928) Add support for Unsafe-based serializer in Kryo 2.22

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-928: Priority: Major (was: Minor) Add support for Unsafe-based serializer in Kryo 2.22

[jira] [Updated] (SPARK-928) Add support for Unsafe-based serializer in Kryo 2.22

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-928: Priority: Minor (was: Major) Add support for Unsafe-based serializer in Kryo 2.22

[jira] [Assigned] (SPARK-1621) Update Chill to 0.3.6

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-1621: Assignee: Matei Zaharia Update Chill to 0.3.6 -

[jira] [Created] (SPARK-1621) Update Chill to 0.3.6

2014-04-24 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1621: Summary: Update Chill to 0.3.6 Key: SPARK-1621 URL: https://issues.apache.org/jira/browse/SPARK-1621 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-928) Add support for Unsafe-based serializer in Kryo 2.22

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13980471#comment-13980471 ] Matei Zaharia commented on SPARK-928: - This probably can't be fixed in 1.0.0 because no

[jira] [Updated] (SPARK-1438) Update RDD.sample() API to make seed parameter optional

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1438: - Assignee: Arun Ramakrishnan Update RDD.sample() API to make seed parameter optional

[jira] [Resolved] (SPARK-1438) Update RDD.sample() API to make seed parameter optional

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1438. -- Resolution: Fixed Update RDD.sample() API to make seed parameter optional

[jira] [Resolved] (SPARK-986) Add job cancellation to PySpark

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-986. - Resolution: Fixed Add job cancellation to PySpark ---

[jira] [Updated] (SPARK-986) Add job cancellation to PySpark

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-986: Affects Version/s: (was: 0.9.0) Add job cancellation to PySpark

[jira] [Updated] (SPARK-986) Add job cancellation to PySpark

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-986: Fix Version/s: 1.0.0 Add job cancellation to PySpark ---

[jira] [Updated] (SPARK-986) Add job cancellation to PySpark

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-986: Assignee: Ahir Reddy Add job cancellation to PySpark ---

[jira] [Resolved] (SPARK-1586) Fix issues with spark development under windows

2014-04-24 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1586. -- Resolution: Fixed Fix Version/s: 1.0.0 Fix issues with spark development under windows

[jira] [Updated] (SPARK-1242) Add aggregate to python API

2014-04-25 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1242: - Assignee: Holden Karau Add aggregate to python API ---

[jira] [Resolved] (SPARK-1607) Remove use of octal literals, deprecated in Scala 2.10 / removed in 2.11

2014-04-25 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1607. -- Resolution: Fixed Fix Version/s: 1.0.0 Remove use of octal literals, deprecated in

[jira] [Resolved] (SPARK-1621) Update Chill to 0.3.6

2014-04-25 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1621. -- Resolution: Fixed Update Chill to 0.3.6 - Key:

[jira] [Created] (SPARK-1637) Clean up examples for 1.0

2014-04-25 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1637: Summary: Clean up examples for 1.0 Key: SPARK-1637 URL: https://issues.apache.org/jira/browse/SPARK-1637 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-1637) Clean up examples for 1.0

2014-04-25 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1637: - Description: - Move all of them into subpackages of org.apache.spark.examples (right now some

[jira] [Updated] (SPARK-1235) DAGScheduler ignores exceptions thrown in handleTaskCompletion

2014-04-25 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1235: - Affects Version/s: (was: 1.0.0) DAGScheduler ignores exceptions thrown in

[jira] [Resolved] (SPARK-1235) DAGScheduler ignores exceptions thrown in handleTaskCompletion

2014-04-25 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1235. -- Resolution: Fixed Fix Version/s: 1.0.0 Resolved in

[jira] [Resolved] (SPARK-615) Add mapPartitionsWithIndex() to the Java API

2014-04-29 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-615. - Resolution: Fixed Fix Version/s: 1.0.0 Add mapPartitionsWithIndex() to the Java API

[jira] [Resolved] (SPARK-1268) Adding XOR and AND-NOT operations to spark.util.collection.BitSet

2014-04-29 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1268. -- Resolution: Fixed Fix Version/s: 1.0.0 Adding XOR and AND-NOT operations to

[jira] [Assigned] (SPARK-544) Provide a Configuration class in addition to system properties

2014-04-29 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-544: --- Assignee: Matei Zaharia (was: Evan Chan) Provide a Configuration class in addition to

[jira] [Resolved] (SPARK-544) Provide a Configuration class in addition to system properties

2014-04-29 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-544. - Resolution: Fixed Fix Version/s: 0.9.0 Provide a Configuration class in addition to

[jira] [Assigned] (SPARK-1549) Add python support to spark-submit script

2014-05-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-1549: Assignee: Matei Zaharia Add python support to spark-submit script

[jira] [Created] (SPARK-1709) spark-submit should use main class attribute of JAR if no --class is given

2014-05-03 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1709: Summary: spark-submit should use main class attribute of JAR if no --class is given Key: SPARK-1709 URL: https://issues.apache.org/jira/browse/SPARK-1709 Project:

[jira] [Updated] (SPARK-1710) spark-submit should print better errors than InvocationTargetException

2014-05-03 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1710: - Affects Version/s: 1.0.0 spark-submit should print better errors than InvocationTargetException

[jira] [Assigned] (SPARK-1709) spark-submit should use main class attribute of JAR if no --class is given

2014-05-04 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia reassigned SPARK-1709: Assignee: Matei Zaharia (was: Sandeep Singh) spark-submit should use main class

[jira] [Commented] (SPARK-1709) spark-submit should use main class attribute of JAR if no --class is given

2014-05-04 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13989181#comment-13989181 ] Matei Zaharia commented on SPARK-1709: -- Sorry Sandeep, I actually have a patch done

[jira] [Commented] (SPARK-1709) spark-submit should use main class attribute of JAR if no --class is given

2014-05-04 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13989182#comment-13989182 ] Matei Zaharia commented on SPARK-1709: -- Should've assigned it to myself earlier.

[jira] [Resolved] (SPARK-1732) Support for primitive nulls in SparkSQL

2014-05-06 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia resolved SPARK-1732. -- Resolution: Fixed Support for primitive nulls in SparkSQL

[jira] [Updated] (SPARK-1736) spark-submit on Windows

2014-05-06 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1736: - Component/s: Windows spark-submit on Windows --- Key:

[jira] [Created] (SPARK-1736) Update remaining Windows scripts

2014-05-06 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1736: Summary: Update remaining Windows scripts Key: SPARK-1736 URL: https://issues.apache.org/jira/browse/SPARK-1736 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-1736) spark-submit on Windows

2014-05-06 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1736: - Priority: Blocker (was: Critical) spark-submit on Windows ---

[jira] [Updated] (SPARK-1620) Uncaught exception from Akka scheduler

2014-05-06 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1620: - Assignee: Mark Hamstra Uncaught exception from Akka scheduler

[jira] [Created] (SPARK-1775) Unneeded lock in ShuffleMapTask.deserializeInfo

2014-05-10 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1775: Summary: Unneeded lock in ShuffleMapTask.deserializeInfo Key: SPARK-1775 URL: https://issues.apache.org/jira/browse/SPARK-1775 Project: Spark Issue Type:

[jira] [Updated] (SPARK-1775) Unneeded lock in ShuffleMapTask.deserializeInfo

2014-05-10 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1775: - Labels: Starter (was: ) Unneeded lock in ShuffleMapTask.deserializeInfo

[jira] [Updated] (SPARK-1775) Unneeded lock in ShuffleMapTask.deserializeInfo

2014-05-10 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1775: - Priority: Critical (was: Major) Unneeded lock in ShuffleMapTask.deserializeInfo

[jira] [Updated] (SPARK-1770) repartition and coalesce(shuffle=true) put objects with the same key in the same bucket

2014-05-14 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1770: - Fix Version/s: 1.0.0 repartition and coalesce(shuffle=true) put objects with the same key in

[jira] [Updated] (SPARK-1775) Unneeded lock in ShuffleMapTask.deserializeInfo

2014-05-15 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1775: - Fix Version/s: 0.9.2 Unneeded lock in ShuffleMapTask.deserializeInfo

[jira] [Created] (SPARK-1770) repartition and coalesce(shuffle=true) put objects with the same key in the same bucket

2014-05-15 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1770: Summary: repartition and coalesce(shuffle=true) put objects with the same key in the same bucket Key: SPARK-1770 URL: https://issues.apache.org/jira/browse/SPARK-1770

[jira] [Created] (SPARK-1858) Update third-party Hadoop distros doc to list more distros

2014-05-16 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1858: Summary: Update third-party Hadoop distros doc to list more distros Key: SPARK-1858 URL: https://issues.apache.org/jira/browse/SPARK-1858 Project: Spark

[jira] [Updated] (SPARK-1145) Memory mapping with many small blocks can cause JVM allocation failures

2014-05-17 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1145: - Fix Version/s: 0.9.2 Memory mapping with many small blocks can cause JVM allocation failures

[jira] [Created] (SPARK-1874) Clean up MLlib sample data

2014-05-18 Thread Matei Zaharia (JIRA)
Matei Zaharia created SPARK-1874: Summary: Clean up MLlib sample data Key: SPARK-1874 URL: https://issues.apache.org/jira/browse/SPARK-1874 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-1875) NoClassDefFoundError: StringUtils when building against Hadoop 1

2014-05-18 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1875: - Fix Version/s: 1.0.0 NoClassDefFoundError: StringUtils when building against Hadoop 1

[jira] [Updated] (SPARK-1875) NoClassDefFoundError: StringUtils when building against Hadoop 1

2014-05-18 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matei Zaharia updated SPARK-1875: - Priority: Blocker (was: Critical) NoClassDefFoundError: StringUtils when building against

[jira] [Commented] (SPARK-1875) NoClassDefFoundError: StringUtils when building against Hadoop 1

2014-05-18 Thread Matei Zaharia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14001297#comment-14001297 ] Matei Zaharia commented on SPARK-1875: -- This may have been broken by

  1   2   3   4   5   6   >