[jira] [Commented] (SPARK-7106) Support model save/load in Python's FPGrowth

2015-06-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581732#comment-14581732 ] Joseph K. Bradley commented on SPARK-7106: -- Not yet. You can check this yourself

[jira] [Created] (SPARK-8308) add missing save load for python doc example and tune down MatrixFactorization iterations

2015-06-11 Thread yuhao yang (JIRA)
yuhao yang created SPARK-8308: - Summary: add missing save load for python doc example and tune down MatrixFactorization iterations Key: SPARK-8308 URL: https://issues.apache.org/jira/browse/SPARK-8308

[jira] [Commented] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-06-11 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581736#comment-14581736 ] Manoj Kumar commented on SPARK-6192: [~mengxr] I have linked other ongoing issues as

[jira] [Assigned] (SPARK-8308) add missing save load for python doc example and tune down MatrixFactorization iterations

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8308: --- Assignee: Apache Spark add missing save load for python doc example and tune down

[jira] [Assigned] (SPARK-8308) add missing save load for python doc example and tune down MatrixFactorization iterations

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8308: --- Assignee: (was: Apache Spark) add missing save load for python doc example and tune

[jira] [Commented] (SPARK-8308) add missing save load for python doc example and tune down MatrixFactorization iterations

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581787#comment-14581787 ] Apache Spark commented on SPARK-8308: - User 'hhbyyh' has created a pull request for

[jira] [Issue Comment Deleted] (SPARK-7106) Support model save/load in Python's FPGrowth

2015-06-11 Thread Hrishikesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hrishikesh updated SPARK-7106: -- Comment: was deleted (was: Shouldn't save/load method be added in Scala first in order to work on

[jira] [Commented] (SPARK-7106) Support model save/load in Python's FPGrowth

2015-06-11 Thread Hrishikesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581755#comment-14581755 ] Hrishikesh commented on SPARK-7106: --- Shouldn't save/load method be added in Scala first

[jira] [Commented] (SPARK-7106) Support model save/load in Python's FPGrowth

2015-06-11 Thread Hrishikesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581754#comment-14581754 ] Hrishikesh commented on SPARK-7106: --- Shouldn't save/load method be added in Scala first

[jira] [Comment Edited] (SPARK-7106) Support model save/load in Python's FPGrowth

2015-06-11 Thread Hrishikesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14574225#comment-14574225 ] Hrishikesh edited comment on SPARK-7106 at 6/11/15 8:56 AM:

[jira] [Commented] (SPARK-8297) Scheduler backend is not notified in case node fails in YARN

2015-06-11 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581713#comment-14581713 ] Mridul Muralidharan commented on SPARK-8297: kill -9 is not sufficient -

[jira] [Comment Edited] (SPARK-8309) OpenHashMap doesn't work with more than 12M items

2015-06-11 Thread Vyacheslav Baranov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582019#comment-14582019 ] Vyacheslav Baranov edited comment on SPARK-8309 at 6/11/15 2:59 PM:

[jira] [Created] (SPARK-8309) OpenHashMap doesn't work with more than 12M items

2015-06-11 Thread Vyacheslav Baranov (JIRA)
Vyacheslav Baranov created SPARK-8309: - Summary: OpenHashMap doesn't work with more than 12M items Key: SPARK-8309 URL: https://issues.apache.org/jira/browse/SPARK-8309 Project: Spark

[jira] [Commented] (SPARK-4362) Make prediction probability available in NaiveBayesModel

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582013#comment-14582013 ] Apache Spark commented on SPARK-4362: - User 'acidghost' has created a pull request for

[jira] [Commented] (SPARK-8309) OpenHashMap doesn't work with more than 12M items

2015-06-11 Thread Vyacheslav Baranov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582019#comment-14582019 ] Vyacheslav Baranov commented on SPARK-8309: --- The problem occurs because of

[jira] [Comment Edited] (SPARK-8309) OpenHashMap doesn't work with more than 12M items

2015-06-11 Thread Vyacheslav Baranov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582019#comment-14582019 ] Vyacheslav Baranov edited comment on SPARK-8309 at 6/11/15 2:55 PM:

[jira] [Updated] (SPARK-8309) OpenHashMap doesn't work with more than 12M items

2015-06-11 Thread Vyacheslav Baranov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vyacheslav Baranov updated SPARK-8309: -- Description: The problem might be demonstrated with the following testcase: {code}

[jira] [Commented] (SPARK-4557) Spark Streaming' foreachRDD method should accept a VoidFunction..., not a Function..., Void

2015-06-11 Thread Alexis Seigneurin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581888#comment-14581888 ] Alexis Seigneurin commented on SPARK-4557: -- I'm using Java 8. Here is what the

[jira] [Commented] (SPARK-1403) Spark on Mesos does not set Thread's context class loader

2015-06-11 Thread John Omernik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581933#comment-14581933 ] John Omernik commented on SPARK-1403: - Per Kannan: Seeing this in Spark 1.2.2, 1.3.0,

[jira] [Reopened] (SPARK-1403) Spark on Mesos does not set Thread's context class loader

2015-06-11 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yana Kadiyska reopened SPARK-1403: -- Multiple users reporting this is occuring again in 1.3 Spark on Mesos does not set Thread's

[jira] [Resolved] (SPARK-3284) saveAsParquetFile not working on windows

2015-06-11 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran resolved SPARK-3284. --- Resolution: Duplicate closing as duplicate of SPARK-6961 saveAsParquetFile not working on

[jira] [Commented] (SPARK-5594) SparkException: Failed to get broadcast (TorrentBroadcast)

2015-06-11 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581912#comment-14581912 ] Shixiong Zhu commented on SPARK-5594: - [~suryasev] could you provide your full codes?

[jira] [Updated] (SPARK-1403) Spark on Mesos does not set Thread's context class loader

2015-06-11 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yana Kadiyska updated SPARK-1403: - Affects Version/s: 1.3.0 Spark on Mesos does not set Thread's context class loader

[jira] [Commented] (SPARK-6961) Cannot save data to parquet files when executing from Windows from a Maven Project

2015-06-11 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581886#comment-14581886 ] Steve Loughran commented on SPARK-6961: --- issue here is WINUTILS.EXE isn't on the

[jira] [Resolved] (SPARK-7915) Support specifying the column list for target table in CTAS

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7915. - Resolution: Fixed Fix Version/s: 1.5.0 Assignee: Cheng Hao Support

[jira] [Resolved] (SPARK-7444) Eliminate noisy css warn/error logs for UISeleniumSuite

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7444. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 5983

[jira] [Created] (SPARK-8312) Populate statistics info of hive tables if it's needed to be

2015-06-11 Thread Navis (JIRA)
Navis created SPARK-8312: Summary: Populate statistics info of hive tables if it's needed to be Key: SPARK-8312 URL: https://issues.apache.org/jira/browse/SPARK-8312 Project: Spark Issue Type:

[jira] [Created] (SPARK-8311) saveAsTextFile with Hadoop1 could lead to errors

2015-06-11 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-8311: Summary: saveAsTextFile with Hadoop1 could lead to errors Key: SPARK-8311 URL: https://issues.apache.org/jira/browse/SPARK-8311 Project: Spark

[jira] [Updated] (SPARK-8287) Filters not pushed with substitution through aggregation

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8287: Target Version/s: 1.5.0 (was: 1.4.0) Filters not pushed with substitution through

[jira] [Updated] (SPARK-7710) User guide and example code for math/stat functions in DataFrames

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7710: Target Version/s: 1.4.1 (was: 1.4.0) User guide and example code for math/stat functions

[jira] [Commented] (SPARK-8128) Dataframe Fails to Recognize Column in Schema

2015-06-11 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582543#comment-14582543 ] Brad Willard commented on SPARK-8128: - I have more logging from the job before it dies

[jira] [Updated] (SPARK-8128) Schema Merging Broken: Dataframe Fails to Recognize Column in Schema

2015-06-11 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brad Willard updated SPARK-8128: Description: I'm loading a folder of parquet files with about 600 parquet files and loading it

[jira] [Commented] (SPARK-8287) Filters not pushed with substitution through aggregation

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582530#comment-14582530 ] Michael Armbrust commented on SPARK-8287: - The problem is not with the removal of

[jira] [Updated] (SPARK-8287) Filters not pushed with substitution through aggregation

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8287: Assignee: Li Sheng Filters not pushed with substitution through aggregation

[jira] [Updated] (SPARK-7444) Eliminate noisy css warn/error logs for UISeleniumSuite

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7444: Assignee: Shixiong Zhu Eliminate noisy css warn/error logs for UISeleniumSuite

[jira] [Updated] (SPARK-8128) Schema Merging Broken: Dataframe Fails to Recognize Column in Schema

2015-06-11 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brad Willard updated SPARK-8128: Summary: Schema Merging Broken: Dataframe Fails to Recognize Column in Schema (was: Dataframe

[jira] [Assigned] (SPARK-7157) Add approximate stratified sampling to DataFrame

2015-06-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-7157: Assignee: Xiangrui Meng Add approximate stratified sampling to DataFrame

[jira] [Updated] (SPARK-7821) Hide private SQL JDBC classes from Javadoc

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7821: Target Version/s: 1.4.1 (was: 1.4.0) Hide private SQL JDBC classes from Javadoc

[jira] [Updated] (SPARK-8036) Ignores files whose name starts with . while enumerating files in HadoopFsRelation

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8036: Target Version/s: 1.4.1, 1.5.0 (was: 1.4.0) Ignores files whose name starts with . while

[jira] [Updated] (SPARK-8312) Populate statistics info of hive tables if it's needed to be

2015-06-11 Thread Navis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Navis updated SPARK-8312: - Description: Currently, spark-sql uses stats in metastore for estimating size of hive table, which means analyze

[jira] [Assigned] (SPARK-8312) Populate statistics info of hive tables if it's needed to be

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8312: --- Assignee: (was: Apache Spark) Populate statistics info of hive tables if it's needed to

[jira] [Commented] (SPARK-8312) Populate statistics info of hive tables if it's needed to be

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582551#comment-14582551 ] Apache Spark commented on SPARK-8312: - User 'navis' has created a pull request for

[jira] [Assigned] (SPARK-8312) Populate statistics info of hive tables if it's needed to be

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8312: --- Assignee: Apache Spark Populate statistics info of hive tables if it's needed to be

[jira] [Closed] (SPARK-8296) Not able to load Dataframe using Python throws py4j.protocol.Py4JJavaError

2015-06-11 Thread ABHISHEK CHOUDHARY (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ABHISHEK CHOUDHARY closed SPARK-8296. - Resolution: Done Fix Version/s: 1.3.1 When I debug I found that Spark was

[jira] [Resolved] (SPARK-8310) Spark EC2 branch in 1.4 is wrong

2015-06-11 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-8310. -- Resolution: Fixed Fix Version/s: 1.4.1 1.5.0 Issue

[jira] [Updated] (SPARK-8287) Filters not pushed with substitution through aggregation

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8287: Summary: Filters not pushed with substitution through aggregation (was: Filter not push

[jira] [Updated] (SPARK-8128) Dataframe Fails to Recognize Column in Schema

2015-06-11 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brad Willard updated SPARK-8128: Affects Version/s: 1.4.0 1.3.0 Dataframe Fails to Recognize Column in

[jira] [Comment Edited] (SPARK-8128) Schema Merging Broken: Dataframe Fails to Recognize Column in Schema

2015-06-11 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14576027#comment-14576027 ] Brad Willard edited comment on SPARK-8128 at 6/11/15 9:36 PM: --

[jira] [Updated] (SPARK-8128) Schema Merging Broken: Dataframe Fails to Recognize Column in Schema

2015-06-11 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brad Willard updated SPARK-8128: Description: I'm loading a folder of parquet files with about 600 parquet files and loading it

[jira] [Created] (SPARK-8313) Support Spark Packages containing R code with --packages

2015-06-11 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-8313: -- Summary: Support Spark Packages containing R code with --packages Key: SPARK-8313 URL: https://issues.apache.org/jira/browse/SPARK-8313 Project: Spark Issue

[jira] [Created] (SPARK-8314) improvement in performance of MLUtils.appendBias

2015-06-11 Thread Roger Menezes (JIRA)
Roger Menezes created SPARK-8314: Summary: improvement in performance of MLUtils.appendBias Key: SPARK-8314 URL: https://issues.apache.org/jira/browse/SPARK-8314 Project: Spark Issue Type:

[jira] [Commented] (SPARK-8314) improvement in performance of MLUtils.appendBias

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582629#comment-14582629 ] Apache Spark commented on SPARK-8314: - User 'rogermenezes' has created a pull request

[jira] [Resolved] (SPARK-8286) Rewrite UTF8String in Java and move it into unsafe package.

2015-06-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-8286. Resolution: Fixed Fix Version/s: 1.5.0 Rewrite UTF8String in Java and move it into unsafe

[jira] [Created] (SPARK-8315) Better error when saving to parquet with duplicate columns

2015-06-11 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-8315: --- Summary: Better error when saving to parquet with duplicate columns Key: SPARK-8315 URL: https://issues.apache.org/jira/browse/SPARK-8315 Project: Spark

[jira] [Updated] (SPARK-8315) Better error when saving to parquet with duplicate columns

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8315: Description: Parquet allows you to silently write out files with duplicate column names and

[jira] [Updated] (SPARK-8315) Better error when saving to parquet with duplicate columns

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8315: Description: Parquet allows you to silently write out files with duplicate column names and

[jira] [Comment Edited] (SPARK-8301) Improve UTF8String substring/startsWith/endsWith/contains performance

2015-06-11 Thread Tarek Auel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582691#comment-14582691 ] Tarek Auel edited comment on SPARK-8301 at 6/11/15 11:45 PM: -

[jira] [Created] (SPARK-8316) Upgrade Maven to 3.3.3

2015-06-11 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-8316: --- Summary: Upgrade Maven to 3.3.3 Key: SPARK-8316 URL: https://issues.apache.org/jira/browse/SPARK-8316 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-8301) Improve UTF8String substring/startsWith/endsWith/contains performance

2015-06-11 Thread Tarek Auel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582691#comment-14582691 ] Tarek Auel commented on SPARK-8301: --- Another approach could be: (0 until

[jira] [Created] (SPARK-8317) Do not push sort into shuffle in Exchange operator

2015-06-11 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-8317: - Summary: Do not push sort into shuffle in Exchange operator Key: SPARK-8317 URL: https://issues.apache.org/jira/browse/SPARK-8317 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-8314) improvement in performance of MLUtils.appendBias

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8314: --- Assignee: Apache Spark improvement in performance of MLUtils.appendBias

[jira] [Updated] (SPARK-2808) update kafka to version 0.8.2

2015-06-11 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-2808: - Component/s: Streaming update kafka to version 0.8.2 -

[jira] [Commented] (SPARK-8301) Improve UTF8String substring/startsWith/endsWith/contains performance

2015-06-11 Thread Tarek Auel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582609#comment-14582609 ] Tarek Auel commented on SPARK-8301: --- Hi, do you have concrete ideas how the

[jira] [Comment Edited] (SPARK-8301) Improve UTF8String substring/startsWith/endsWith/contains performance

2015-06-11 Thread Tarek Auel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582609#comment-14582609 ] Tarek Auel edited comment on SPARK-8301 at 6/11/15 10:25 PM: -

[jira] [Assigned] (SPARK-7157) Add approximate stratified sampling to DataFrame

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7157: --- Assignee: Apache Spark (was: Xiangrui Meng) Add approximate stratified sampling to

[jira] [Assigned] (SPARK-7157) Add approximate stratified sampling to DataFrame

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7157: --- Assignee: Xiangrui Meng (was: Apache Spark) Add approximate stratified sampling to

[jira] [Commented] (SPARK-8316) Upgrade Maven to 3.3.3

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582701#comment-14582701 ] Apache Spark commented on SPARK-8316: - User 'nchammas' has created a pull request for

[jira] [Assigned] (SPARK-8316) Upgrade Maven to 3.3.3

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8316: --- Assignee: (was: Apache Spark) Upgrade Maven to 3.3.3 --

[jira] [Issue Comment Deleted] (SPARK-8322) EC2 script not fully updated for 1.4.0 release

2015-06-11 Thread Mark Smith (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Smith updated SPARK-8322: -- Comment: was deleted (was: This should probably also be back-ported from master to the 1.4 branch, but

[jira] [Updated] (SPARK-7862) Query would hang when the using script has error output in SparkSQL

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7862: Assignee: zhichao-li Query would hang when the using script has error output in SparkSQL

[jira] [Commented] (SPARK-7442) Spark 1.3.1 / Hadoop 2.6 prebuilt pacakge has broken S3 filesystem access

2015-06-11 Thread Peng Cheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582965#comment-14582965 ] Peng Cheng commented on SPARK-7442: --- Still not fixed in 1.4.0 ... reverting to hadoop

[jira] [Resolved] (SPARK-7862) Query would hang when the using script has error output in SparkSQL

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7862. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6404

[jira] [Commented] (SPARK-8311) saveAsTextFile with Hadoop1 could lead to errors

2015-06-11 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582977#comment-14582977 ] Shivaram Venkataraman commented on SPARK-8311: -- Yeah it looks very similar.

[jira] [Commented] (SPARK-8322) EC2 script not fully updated for 1.4.0 release

2015-06-11 Thread Mark Smith (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582971#comment-14582971 ] Mark Smith commented on SPARK-8322: --- This is the backport to branch-1.4 EC2 script not

[jira] [Commented] (SPARK-8322) EC2 script not fully updated for 1.4.0 release

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582970#comment-14582970 ] Apache Spark commented on SPARK-8322: - User 'markmsmith' has created a pull request

[jira] [Resolved] (SPARK-6566) Update Spark to use the latest version of Parquet libraries

2015-06-11 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6566. --- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 5889

[jira] [Resolved] (SPARK-8317) Do not push sort into shuffle in Exchange operator

2015-06-11 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-8317. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6772

[jira] [Commented] (SPARK-8318) Spark Streaming Starter JIRAs

2015-06-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582929#comment-14582929 ] Sean Owen commented on SPARK-8318: -- Minor, but doesn't Component + label = starter

[jira] [Resolved] (SPARK-8311) saveAsTextFile with Hadoop1 could lead to errors

2015-06-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8311. -- Resolution: Duplicate Yes 95% sure that's a duplicate saveAsTextFile with Hadoop1 could lead to

[jira] [Created] (SPARK-8322) EC2 script not fully updated for 1.4.0 release

2015-06-11 Thread Mark Smith (JIRA)
Mark Smith created SPARK-8322: - Summary: EC2 script not fully updated for 1.4.0 release Key: SPARK-8322 URL: https://issues.apache.org/jira/browse/SPARK-8322 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-8322) EC2 script not fully updated for 1.4.0 release

2015-06-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582937#comment-14582937 ] Sean Owen commented on SPARK-8322: -- Related to SPARK-8310. You'll probably want a PR for

[jira] [Assigned] (SPARK-8322) EC2 script not fully updated for 1.4.0 release

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8322: --- Assignee: (was: Apache Spark) EC2 script not fully updated for 1.4.0 release

[jira] [Commented] (SPARK-8322) EC2 script not fully updated for 1.4.0 release

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582942#comment-14582942 ] Apache Spark commented on SPARK-8322: - User 'markmsmith' has created a pull request

[jira] [Updated] (SPARK-8322) EC2 script not fully updated for 1.4.0 release

2015-06-11 Thread Mark Smith (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Smith updated SPARK-8322: -- Target Version/s: (was: 1.4.0) Fix Version/s: (was: 1.4.0) EC2 script not fully updated

[jira] [Assigned] (SPARK-8322) EC2 script not fully updated for 1.4.0 release

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8322: --- Assignee: Apache Spark EC2 script not fully updated for 1.4.0 release

[jira] [Commented] (SPARK-8322) EC2 script not fully updated for 1.4.0 release

2015-06-11 Thread Mark Smith (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582948#comment-14582948 ] Mark Smith commented on SPARK-8322: --- This should probably also be back-ported from

[jira] [Updated] (SPARK-8307) Improve timestamp from parquet

2015-06-11 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-8307: -- Summary: Improve timestamp from parquet (was: Improve timestamp from parquet/hive) Improve timestamp

[jira] [Assigned] (SPARK-8307) Improve timestamp from parquet

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8307: --- Assignee: Apache Spark (was: Davies Liu) Improve timestamp from parquet

[jira] [Updated] (SPARK-8289) Provide a specific stack size with all Java implementations to prevent stack overflows with certain tests

2015-06-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8289: - Assignee: Adam Roberts Provide a specific stack size with all Java implementations to prevent stack

[jira] [Resolved] (SPARK-8289) Provide a specific stack size with all Java implementations to prevent stack overflows with certain tests

2015-06-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8289. -- Resolution: Fixed Fix Version/s: 1.4.1 Issue resolved by pull request 6727

[jira] [Commented] (SPARK-8307) Improve timestamp from parquet

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581594#comment-14581594 ] Apache Spark commented on SPARK-8307: - User 'davies' has created a pull request for

[jira] [Assigned] (SPARK-8307) Improve timestamp from parquet

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8307: --- Assignee: Davies Liu (was: Apache Spark) Improve timestamp from parquet

[jira] [Commented] (SPARK-8190) ExpressionEvalHelper.checkEvaluation should also run the optimizer version

2015-06-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14581602#comment-14581602 ] Apache Spark commented on SPARK-8190: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-8162) Run spark-shell cause NullPointerException

2015-06-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8162: - Assignee: Andrew Or Run spark-shell cause NullPointerException

[jira] [Updated] (SPARK-8304) Table with a large number of columns

2015-06-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8304: - Component/s: SQL Please review https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark

[jira] [Updated] (SPARK-8296) Not able to load Dataframe using Python throws py4j.protocol.Py4JJavaError

2015-06-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8296: - Component/s: SQL PySpark Not able to load Dataframe using Python throws

[jira] [Updated] (SPARK-8307) Improve timestamp from parquet

2015-06-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8307: - Component/s: SQL Improve timestamp from parquet -- Key:

[jira] [Updated] (SPARK-8286) Rewrite UTF8String in Java and move it into unsafe package.

2015-06-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8286: --- Component/s: SQL Rewrite UTF8String in Java and move it into unsafe package.

[jira] [Updated] (SPARK-8278) Remove deprecated JsonRDD functionality in Spark SQL

2015-06-11 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8278: --- Summary: Remove deprecated JsonRDD functionality in Spark SQL (was: Remove deprecated JsonRDD

[jira] [Resolved] (SPARK-8304) Table with a large number of columns

2015-06-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8304. -- Resolution: Invalid Table with a large number of columns

  1   2   3   >