[jira] [Commented] (SPARK-7751) Add @Since annotation to stable and experimental methods in MLlib

2015-08-26 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715331#comment-14715331 ] Joseph K. Bradley commented on SPARK-7751: -- I keep a JIRA dashboard which lists

[jira] [Assigned] (SPARK-10300) Use tags to control which tests to run depending on changes being tested

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10300: Assignee: Apache Spark Use tags to control which tests to run depending on changes being

[jira] [Commented] (SPARK-2991) RDD transforms for scan and scanLeft

2015-08-26 Thread Erik Erlandson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715417#comment-14715417 ] Erik Erlandson commented on SPARK-2991: --- This RFE now has a

[jira] [Resolved] (SPARK-9665) ML 1.5 QA: API: Experimental, DeveloperApi, final, sealed audit

2015-08-26 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-9665. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8452

[jira] [Assigned] (SPARK-10300) Use tags to control which tests to run depending on changes being tested

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10300: Assignee: (was: Apache Spark) Use tags to control which tests to run depending on

[jira] [Commented] (SPARK-10300) Use tags to control which tests to run depending on changes being tested

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715356#comment-14715356 ] Apache Spark commented on SPARK-10300: -- User 'vanzin' has created a pull request for

[jira] [Commented] (SPARK-10253) Remove Guava dependencies in MLlib java tests

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715175#comment-14715175 ] Feynman Liang commented on SPARK-10253: --- I believe only committers can assign

[jira] [Created] (SPARK-10300) Use tags to control which tests to run depending on changes being tested

2015-08-26 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-10300: -- Summary: Use tags to control which tests to run depending on changes being tested Key: SPARK-10300 URL: https://issues.apache.org/jira/browse/SPARK-10300

[jira] [Updated] (SPARK-10295) Dynamic allocation in Mesos does not release when RDDs are cached

2015-08-26 Thread Hans van den Bogert (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hans van den Bogert updated SPARK-10295: Summary: Dynamic allocation in Mesos does not release when RDDs are cached (was:

[jira] [Created] (SPARK-10298) PySpark can't JSON serialize a DataFrame with DecimalType columns.

2015-08-26 Thread Kevin Cox (JIRA)
Kevin Cox created SPARK-10298: - Summary: PySpark can't JSON serialize a DataFrame with DecimalType columns. Key: SPARK-10298 URL: https://issues.apache.org/jira/browse/SPARK-10298 Project: Spark

[jira] [Created] (SPARK-10299) word2vec should allow users to specify the window size

2015-08-26 Thread holdenk (JIRA)
holdenk created SPARK-10299: --- Summary: word2vec should allow users to specify the window size Key: SPARK-10299 URL: https://issues.apache.org/jira/browse/SPARK-10299 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712577#comment-14712577 ] Yanbo Liang edited comment on SPARK-9807 at 8/26/15 6:16 AM: -

[jira] [Commented] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712577#comment-14712577 ] Yanbo Liang commented on SPARK-9807: This is not a bug. {map(lambda l:

[jira] [Commented] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712592#comment-14712592 ] Yanbo Liang commented on SPARK-9807: The document is correct. It said the type of each

[jira] [Commented] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712593#comment-14712593 ] Karen Yin-Yee Ng commented on SPARK-9807: - I have an adhoc piece of python code

[jira] [Commented] (SPARK-3789) [GRAPHX] Python bindings for GraphX

2015-08-26 Thread Olivier Girardot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712612#comment-14712612 ] Olivier Girardot commented on SPARK-3789: - anyone still working on that right now

[jira] [Commented] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712597#comment-14712597 ] Karen Yin-Yee Ng commented on SPARK-9807: - It just means that the DataFrame keeps

[jira] [Commented] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712599#comment-14712599 ] Karen Yin-Yee Ng commented on SPARK-9807: - It just means that the DataFrame keeps

[jira] [Commented] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712598#comment-14712598 ] Karen Yin-Yee Ng commented on SPARK-9807: - It just means that the DataFrame keeps

[jira] [Commented] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712596#comment-14712596 ] Karen Yin-Yee Ng commented on SPARK-9807: - It just means that the DataFrame keeps

[jira] [Assigned] (SPARK-10251) Some internal spark classes are not registered with kryo

2015-08-26 Thread Ram Sriharsha (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ram Sriharsha reassigned SPARK-10251: - Assignee: Ram Sriharsha Some internal spark classes are not registered with kryo

[jira] [Commented] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712595#comment-14712595 ] Karen Yin-Yee Ng commented on SPARK-9807: - It just means that the DataFrame keeps

[jira] [Updated] (SPARK-9316) Add support for filtering using `[` (synonym for filter / select)

2015-08-26 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-9316: - Assignee: Felix Cheung Add support for filtering using `[` (synonym for filter /

[jira] [Commented] (SPARK-9803) Add transform and subset to DataFrame

2015-08-26 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712609#comment-14712609 ] Shivaram Venkataraman commented on SPARK-9803: -- [~felixcheung] I think that

[jira] [Issue Comment Deleted] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Yin-Yee Ng updated SPARK-9807: Comment: was deleted (was: It just means that the DataFrame keeps the data type from the

[jira] [Issue Comment Deleted] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Yin-Yee Ng updated SPARK-9807: Comment: was deleted (was: It just means that the DataFrame keeps the data type from the

[jira] [Issue Comment Deleted] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Yin-Yee Ng updated SPARK-9807: Comment: was deleted (was: It just means that the DataFrame keeps the data type from the

[jira] [Issue Comment Deleted] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Yin-Yee Ng updated SPARK-9807: Comment: was deleted (was: It just means that the DataFrame keeps the data type from the

[jira] [Commented] (SPARK-10251) Some internal spark classes are not registered with kryo

2015-08-26 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712586#comment-14712586 ] Reynold Xin commented on SPARK-10251: - Confirming the problem. We can also reproduce

[jira] [Comment Edited] (SPARK-10251) Some internal spark classes are not registered with kryo

2015-08-26 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712586#comment-14712586 ] Reynold Xin edited comment on SPARK-10251 at 8/26/15 6:29 AM:

[jira] [Commented] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2015-08-26 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14712583#comment-14712583 ] Karen Yin-Yee Ng commented on SPARK-9807: - According to the documentation at

[jira] [Resolved] (SPARK-10236) Update @Since annotation for mllib.feature

2015-08-26 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-10236. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8449

[jira] [Resolved] (SPARK-9316) Add support for filtering using `[` (synonym for filter / select)

2015-08-26 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-9316. -- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue

[jira] [Commented] (SPARK-7751) Add @Since annotation to stable and experimental methods in MLlib

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715157#comment-14715157 ] Feynman Liang commented on SPARK-7751: -- If we are worried about differing solutions

[jira] [Updated] (SPARK-10294) NPE when save data to parquet table

2015-08-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10294: - Priority: Major (was: Critical) NPE when save data to parquet table

[jira] [Commented] (SPARK-9986) Create a simple test framework for local operators

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14714200#comment-14714200 ] Apache Spark commented on SPARK-9986: - User 'zsxwing' has created a pull request for

[jira] [Assigned] (SPARK-9986) Create a simple test framework for local operators

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9986: --- Assignee: Shixiong Zhu (was: Apache Spark) Create a simple test framework for local

[jira] [Assigned] (SPARK-9901) User guide for RowMatrix Tall-and-skinny QR

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9901: --- Assignee: Apache Spark (was: yuhao yang) User guide for RowMatrix Tall-and-skinny QR

[jira] [Updated] (SPARK-10294) When saving a file larger than S3 size limit to S3, Parquet writer's close method is called twice and then NPE is thrown.

2015-08-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10294: - Summary: When saving a file larger than S3 size limit to S3, Parquet writer's close method is called

[jira] [Updated] (SPARK-10294) NPE when save data to parquet table

2015-08-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10294: - Description: When a task saves a large parquet file (larger than the S3 file size limit) to S3, looks

[jira] [Commented] (SPARK-9890) User guide for CountVectorizer

2015-08-26 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14714244#comment-14714244 ] yuhao yang commented on SPARK-9890: --- This takes some time. I've got a draft on this and

[jira] [Assigned] (SPARK-10248) DAGSchedulerSuite should check there were no errors in EventProcessLoop

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10248: Assignee: Apache Spark DAGSchedulerSuite should check there were no errors in

[jira] [Commented] (SPARK-10248) DAGSchedulerSuite should check there were no errors in EventProcessLoop

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14714502#comment-14714502 ] Apache Spark commented on SPARK-10248: -- User 'squito' has created a pull request for

[jira] [Assigned] (SPARK-10248) DAGSchedulerSuite should check there were no errors in EventProcessLoop

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10248: Assignee: (was: Apache Spark) DAGSchedulerSuite should check there were no errors in

[jira] [Updated] (SPARK-10202) Specify schema during KMeansModel.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10202: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10203) Specify schema during GLMClassificationModel.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10203: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10201) Specify schema during GaussianMixtureModel.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10201: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10204) Specify schema during NaiveBayes.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10204: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10200) Specify schema during GLMRegressionModel.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10200: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10294) When Parquet writer's close method throws an exception, we will call close again and trigger a NPE

2015-08-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10294: - Target Version/s: 1.5.0, 1.5.1 (was: 1.5.0) When Parquet writer's close method throws an exception, we

[jira] [Updated] (SPARK-10294) When Parquet writer's close method throws an exception, we will call close again and trigger a NPE

2015-08-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10294: - Summary: When Parquet writer's close method throws an exception, we will call close again and trigger a

[jira] [Commented] (SPARK-10199) Avoid using reflections for parquet model save

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715148#comment-14715148 ] Feynman Liang commented on SPARK-10199: --- Awesome, thanks! You can tag that PR with

[jira] [Updated] (SPARK-10146) Have an easy way to set data source reader/writer specific confs

2015-08-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10146: - Issue Type: Sub-task (was: Improvement) Parent: SPARK-9932 Have an easy way to set data source

[jira] [Assigned] (SPARK-10251) Some internal spark classes are not registered with kryo

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10251: Assignee: Ram Sriharsha (was: Apache Spark) Some internal spark classes are not

[jira] [Commented] (SPARK-10251) Some internal spark classes are not registered with kryo

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14714474#comment-14714474 ] Apache Spark commented on SPARK-10251: -- User 'harsha2010' has created a pull request

[jira] [Assigned] (SPARK-10251) Some internal spark classes are not registered with kryo

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10251: Assignee: Apache Spark (was: Ram Sriharsha) Some internal spark classes are not

[jira] [Commented] (SPARK-10207) Specify schema during Word2Vec.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715141#comment-14715141 ] Feynman Liang commented on SPARK-10207: --- [~srowen] I linked them using is required

[jira] [Updated] (SPARK-10207) Specify schema during Word2Vec.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10207: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10206) Specify schema during IsotonicRegression.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10206: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10213) Specify schema during DecisionTreeModel.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10213: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10212) Specify schema during TreeEnsembleModel.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10212: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10294) When save data to a data source table, we should bound the size of a saved file

2015-08-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10294: - Summary: When save data to a data source table, we should bound the size of a saved file (was: When

[jira] [Updated] (SPARK-10211) Specify schema during MatrixFactorizationModel.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10211: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10208) Specify schema during LocalLDAModel.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10208: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10205) Specify schema during PowerIterationClustering.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10205: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10209) Specify schema during DistributedLDAModel.save to avoid reflection

2015-08-26 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feynman Liang updated SPARK-10209: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-10199 Specify schema during

[jira] [Updated] (SPARK-10297) When save data to a data source table, we should bound the size of a saved file

2015-08-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10297: - Issue Type: Sub-task (was: Bug) Parent: SPARK-9932 When save data to a data source table, we

[jira] [Updated] (SPARK-10104) Consolidate different forms of table identifiers

2015-08-26 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10104: Assignee: Wenchen Fan Consolidate different forms of table identifiers

[jira] [Created] (SPARK-10297) When save data to a data source table, we should bound the size of a saved file

2015-08-26 Thread Yin Huai (JIRA)
Yin Huai created SPARK-10297: Summary: When save data to a data source table, we should bound the size of a saved file Key: SPARK-10297 URL: https://issues.apache.org/jira/browse/SPARK-10297 Project:

[jira] [Commented] (SPARK-10219) Error when additional options provided as variable in write.df

2015-08-26 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715186#comment-14715186 ] Shivaram Venkataraman commented on SPARK-10219: --- Thanks I see the problem

[jira] [Commented] (SPARK-9424) Document recent Parquet changes in Spark 1.5

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715204#comment-14715204 ] Apache Spark commented on SPARK-9424: - User 'liancheng' has created a pull request for

[jira] [Assigned] (SPARK-9424) Document recent Parquet changes in Spark 1.5

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9424: --- Assignee: Cheng Lian (was: Apache Spark) Document recent Parquet changes in Spark 1.5

[jira] [Updated] (SPARK-9424) Document recent Parquet changes in Spark 1.5

2015-08-26 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-9424: -- Description: Specifically, the following changes need to be documented/explained: - Metadata discovery

[jira] [Assigned] (SPARK-9424) Document recent Parquet changes in Spark 1.5

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9424: --- Assignee: Apache Spark (was: Cheng Lian) Document recent Parquet changes in Spark 1.5

[jira] [Commented] (SPARK-10219) Error when additional options provided as variable in write.df

2015-08-26 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715233#comment-14715233 ] Shivaram Venkataraman commented on SPARK-10219: --- I think I tracked this

[jira] [Commented] (SPARK-9901) User guide for RowMatrix Tall-and-skinny QR

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14714174#comment-14714174 ] Apache Spark commented on SPARK-9901: - User 'hhbyyh' has created a pull request for

[jira] [Updated] (SPARK-10294) NPE when save data to parquet table

2015-08-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10294: - Description: Unlike SPARK-7837, all attempts (even the first attempt) of a task failed with the

[jira] [Assigned] (SPARK-9901) User guide for RowMatrix Tall-and-skinny QR

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9901: --- Assignee: yuhao yang (was: Apache Spark) User guide for RowMatrix Tall-and-skinny QR

[jira] [Commented] (SPARK-9991) Create local limit operator

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14714201#comment-14714201 ] Apache Spark commented on SPARK-9991: - User 'zsxwing' has created a pull request for

[jira] [Assigned] (SPARK-9991) Create local limit operator

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9991: --- Assignee: Apache Spark Create local limit operator ---

[jira] [Commented] (SPARK-9993) Create local union operator

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14714203#comment-14714203 ] Apache Spark commented on SPARK-9993: - User 'zsxwing' has created a pull request for

[jira] [Assigned] (SPARK-9993) Create local union operator

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9993: --- Assignee: (was: Apache Spark) Create local union operator ---

[jira] [Assigned] (SPARK-9991) Create local limit operator

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9991: --- Assignee: (was: Apache Spark) Create local limit operator ---

[jira] [Assigned] (SPARK-9993) Create local union operator

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9993: --- Assignee: Apache Spark Create local union operator ---

[jira] [Assigned] (SPARK-9986) Create a simple test framework for local operators

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9986: --- Assignee: Apache Spark (was: Shixiong Zhu) Create a simple test framework for local

[jira] [Created] (SPARK-10296) add preservesParitioning parameter to RDD.map

2015-08-26 Thread Esteban Donato (JIRA)
Esteban Donato created SPARK-10296: -- Summary: add preservesParitioning parameter to RDD.map Key: SPARK-10296 URL: https://issues.apache.org/jira/browse/SPARK-10296 Project: Spark Issue

[jira] [Updated] (SPARK-10295) Dynamic reservation in Mesos does not release when RDDs are cached

2015-08-26 Thread Hans van den Bogert (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hans van den Bogert updated SPARK-10295: Description: When running spark in coarse grained mode with shuffle service and

[jira] [Comment Edited] (SPARK-9316) Add support for filtering using `[` (synonym for filter / select)

2015-08-26 Thread Deborah Siegel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715472#comment-14715472 ] Deborah Siegel edited comment on SPARK-9316 at 8/26/15 9:03 PM:

[jira] [Commented] (SPARK-9228) Combine unsafe and codegen into a single option

2015-08-26 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715512#comment-14715512 ] Davies Liu commented on SPARK-9228: --- [~jameszhouyi] unsafe.offHeap is another option

[jira] [Closed] (SPARK-10302) NPE while save a DataFrame as ORC

2015-08-26 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-10302. -- Resolution: Duplicate Fix Version/s: 1.5.0 NPE while save a DataFrame as ORC

[jira] [Created] (SPARK-10305) PySpark createDataFrame on list of LabeledPoints fails (regression)

2015-08-26 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10305: - Summary: PySpark createDataFrame on list of LabeledPoints fails (regression) Key: SPARK-10305 URL: https://issues.apache.org/jira/browse/SPARK-10305

[jira] [Assigned] (SPARK-10305) PySpark createDataFrame on list of LabeledPoints fails (regression)

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10305: Assignee: (was: Apache Spark) PySpark createDataFrame on list of LabeledPoints fails

[jira] [Assigned] (SPARK-10305) PySpark createDataFrame on list of LabeledPoints fails (regression)

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10305: Assignee: Apache Spark PySpark createDataFrame on list of LabeledPoints fails

[jira] [Commented] (SPARK-10305) PySpark createDataFrame on list of LabeledPoints fails (regression)

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715639#comment-14715639 ] Apache Spark commented on SPARK-10305: -- User 'davies' has created a pull request for

[jira] [Created] (SPARK-10306) sbt hive/update issue

2015-08-26 Thread holdenk (JIRA)
holdenk created SPARK-10306: --- Summary: sbt hive/update issue Key: SPARK-10306 URL: https://issues.apache.org/jira/browse/SPARK-10306 Project: Spark Issue Type: Bug Reporter: holdenk

[jira] [Commented] (SPARK-10306) sbt hive/update issue

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715688#comment-14715688 ] Apache Spark commented on SPARK-10306: -- User 'holdenk' has created a pull request

[jira] [Assigned] (SPARK-10306) sbt hive/update issue

2015-08-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10306: Assignee: Apache Spark sbt hive/update issue -

[jira] [Updated] (SPARK-10303) Spark SQL JSON Reader uses inefficient form of Union operation

2015-08-26 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-10303: --- Attachment: screenshot-1.png Spark SQL JSON Reader uses inefficient form of Union operation

[jira] [Updated] (SPARK-10303) Spark SQL JSON Reader uses inefficient form of Union operation

2015-08-26 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-10303: --- Description: See attached screenshot. For a job that uses SQLContext's JSON reader, the RDD DAG

[jira] [Assigned] (SPARK-10301) For struct type, if parquet's global schema has less fields than a file's schema, data reading will fail

2015-08-26 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai reassigned SPARK-10301: Assignee: Yin Huai For struct type, if parquet's global schema has less fields than a file's
