[jira] [Created] (SPARK-6603) SQLContext.registerFunction - SQLContext.udf.register

2015-03-30 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-6603: -- Summary: SQLContext.registerFunction - SQLContext.udf.register Key: SPARK-6603 URL: https://issues.apache.org/jira/browse/SPARK-6603 Project: Spark Issue Type:

[jira] [Created] (SPARK-6604) Specify ip of python server scoket

2015-03-30 Thread Weizhong (JIRA)
Weizhong created SPARK-6604: --- Summary: Specify ip of python server scoket Key: SPARK-6604 URL: https://issues.apache.org/jira/browse/SPARK-6604 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-6605) Same transformation in DStream leads to different result

2015-03-30 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386324#comment-14386324 ] SaintBacchus commented on SPARK-6605: - Hi, [~tdas] can you have a look at this

[jira] [Assigned] (SPARK-6604) Specify ip of python server scoket

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6604: --- Assignee: (was: Apache Spark) Specify ip of python server scoket

[jira] [Commented] (SPARK-5895) Add VectorSlicer

2015-03-30 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386296#comment-14386296 ] Xusen Yin commented on SPARK-5895: -- Is it possible to select by type or value? Like in

[jira] [Commented] (SPARK-6594) Spark Streaming can't receive data from kafka

2015-03-30 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386313#comment-14386313 ] Saisai Shao commented on SPARK-6594: Hi [~q79969786], would you please paste more

[jira] [Assigned] (SPARK-6600) Open ports in ec2/spark_ec2.py to allow HDFS NFS gateway

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6600: --- Assignee: (was: Apache Spark) Open ports in ec2/spark_ec2.py to allow HDFS NFS gateway

[jira] [Assigned] (SPARK-6600) Open ports in ec2/spark_ec2.py to allow HDFS NFS gateway

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6600: --- Assignee: Apache Spark Open ports in ec2/spark_ec2.py to allow HDFS NFS gateway

[jira] [Commented] (SPARK-6600) Open ports in ec2/spark_ec2.py to allow HDFS NFS gateway

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386267#comment-14386267 ] Apache Spark commented on SPARK-6600: - User 'florianverhein' has created a pull

[jira] [Commented] (SPARK-6604) Specify ip of python server scoket

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386268#comment-14386268 ] Apache Spark commented on SPARK-6604: - User 'Sephiroth-Lin' has created a pull request

[jira] [Assigned] (SPARK-6604) Specify ip of python server scoket

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6604: --- Assignee: Apache Spark Specify ip of python server scoket

[jira] [Created] (SPARK-6605) Same transformation in DStream leads to different result

2015-03-30 Thread SaintBacchus (JIRA)
SaintBacchus created SPARK-6605: --- Summary: Same transformation in DStream leads to different result Key: SPARK-6605 URL: https://issues.apache.org/jira/browse/SPARK-6605 Project: Spark Issue

[jira] [Comment Edited] (SPARK-5895) Add VectorSlicer

2015-03-30 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386365#comment-14386365 ] Xusen Yin edited comment on SPARK-5895 at 3/30/15 8:14 AM: --- I

[jira] [Commented] (SPARK-5894) Add PolynomialMapper

2015-03-30 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386346#comment-14386346 ] Xusen Yin commented on SPARK-5894: -- [~mengxr] [~josephkb] Do you have time to check it?

[jira] [Commented] (SPARK-6606) Accumulator deserialized twice because the NarrowCoGroupSplitDep contains rdd object.

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386355#comment-14386355 ] Apache Spark commented on SPARK-6606: - User 'suyanNone' has created a pull request for

[jira] [Assigned] (SPARK-6606) Accumulator deserialized twice because the NarrowCoGroupSplitDep contains rdd object.

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6606: --- Assignee: Apache Spark Accumulator deserialized twice because the NarrowCoGroupSplitDep

[jira] [Resolved] (SPARK-6594) Spark Streaming can't receive data from kafka

2015-03-30 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao resolved SPARK-6594. Resolution: Not a Problem Spark Streaming can't receive data from kafka

[jira] [Commented] (SPARK-6528) IDF transformer

2015-03-30 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386460#comment-14386460 ] Xusen Yin commented on SPARK-6528: -- [~josephkb] Pls assign it to me. IDF transformer

[jira] [Commented] (SPARK-6401) Unable to load a old API input format in Spark streaming

2015-03-30 Thread Thomas F. (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386491#comment-14386491 ] Thomas F. commented on SPARK-6401: -- How do we proceed for this issue ? As we already

[jira] [Created] (SPARK-6606) Accumulator deserialized twice because the NarrowCoGroupSplitDep contains rdd object.

2015-03-30 Thread SuYan (JIRA)
SuYan created SPARK-6606: Summary: Accumulator deserialized twice because the NarrowCoGroupSplitDep contains rdd object. Key: SPARK-6606 URL: https://issues.apache.org/jira/browse/SPARK-6606 Project: Spark

[jira] [Commented] (SPARK-6258) Python MLlib API missing items: Clustering

2015-03-30 Thread Hrishikesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386394#comment-14386394 ] Hrishikesh commented on SPARK-6258: --- Hi [~josephkb] I am a newbie to spark and I would

[jira] [Commented] (SPARK-6401) Unable to load a old API input format in Spark streaming

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386499#comment-14386499 ] Sean Owen commented on SPARK-6401: -- Since it's technically an API change to streaming I'd

[jira] [Commented] (SPARK-5895) Add VectorSlicer

2015-03-30 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386365#comment-14386365 ] Xusen Yin commented on SPARK-5895: -- I have another concern here. We can not reveal each

[jira] [Comment Edited] (SPARK-6594) Spark Streaming can't receive data from kafka

2015-03-30 Thread q79969786 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386380#comment-14386380 ] q79969786 edited comment on SPARK-6594 at 3/30/15 8:25 AM: ---

[jira] [Commented] (SPARK-6594) Spark Streaming can't receive data from kafka

2015-03-30 Thread q79969786 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386380#comment-14386380 ] q79969786 commented on SPARK-6594: -- thanks, it worked after reboot kafka Spark

[jira] [Commented] (SPARK-6605) Same transformation in DStream leads to different result

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386510#comment-14386510 ] Sean Owen commented on SPARK-6605: -- What other implementation are you referring to? I

[jira] [Assigned] (SPARK-4226) SparkSQL - Add support for subqueries in predicates

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4226: --- Assignee: Apache Spark SparkSQL - Add support for subqueries in predicates

[jira] [Assigned] (SPARK-6608) Make DataFrame.rdd a lazy val

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6608: --- Assignee: (was: Apache Spark) Make DataFrame.rdd a lazy val

[jira] [Commented] (SPARK-6608) Make DataFrame.rdd a lazy val

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386635#comment-14386635 ] Apache Spark commented on SPARK-6608: - User 'liancheng' has created a pull request for

[jira] [Assigned] (SPARK-6608) Make DataFrame.rdd a lazy val

2015-03-30 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian reassigned SPARK-6608: - Assignee: Cheng Lian Make DataFrame.rdd a lazy val -

[jira] [Commented] (SPARK-4036) Add Conditional Random Fields (CRF) algorithm to Spark MLlib

2015-03-30 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386636#comment-14386636 ] Kai Sasaki commented on SPARK-4036: --- [~mengxr] I write a design doc based on your

[jira] [Assigned] (SPARK-6528) IDF transformer

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6528: --- Assignee: Apache Spark IDF transformer --- Key: SPARK-6528

[jira] [Assigned] (SPARK-6528) IDF transformer

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6528: --- Assignee: (was: Apache Spark) IDF transformer --- Key:

[jira] [Commented] (SPARK-6528) IDF transformer

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386664#comment-14386664 ] Apache Spark commented on SPARK-6528: - User 'yinxusen' has created a pull request for

[jira] [Updated] (SPARK-6607) Aggregation attribute name including special chars '(' and ')' should be replaced before generating Parquet schema

2015-03-30 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6607: -- Assignee: Liang-Chi Hsieh Aggregation attribute name including special chars '(' and ')' should be

[jira] [Created] (SPARK-6609) explicit checkpoint does not work

2015-03-30 Thread lisendong (JIRA)
lisendong created SPARK-6609: Summary: explicit checkpoint does not work Key: SPARK-6609 URL: https://issues.apache.org/jira/browse/SPARK-6609 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-6610) explicit checkpoint does not work

2015-03-30 Thread lisendong (JIRA)
lisendong created SPARK-6610: Summary: explicit checkpoint does not work Key: SPARK-6610 URL: https://issues.apache.org/jira/browse/SPARK-6610 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-6610) mlllib explicit ALS checkpoint does not work

2015-03-30 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guoqiang Li updated SPARK-6610: --- Target Version/s: 1.3.1 (was: 1.3.0) Fix Version/s: (was: 1.3.0) mlllib explicit ALS

[jira] [Updated] (SPARK-6607) Aggregation attribute name including special chars '(' and ')' should be replaced before generating Parquet schema

2015-03-30 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6607: -- Target Version/s: 1.4.0 Affects Version/s: 1.1.1 1.2.1

[jira] [Commented] (SPARK-3530) Pipeline and Parameters

2015-03-30 Thread Jao Rabary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386747#comment-14386747 ] Jao Rabary commented on SPARK-3530: --- Yes, the scenario is to instantiate a pre-trained

[jira] [Commented] (SPARK-6517) Implement the Algorithm of Hierarchical Clustering

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386707#comment-14386707 ] Apache Spark commented on SPARK-6517: - User 'yu-iskw' has created a pull request for

[jira] [Assigned] (SPARK-6517) Implement the Algorithm of Hierarchical Clustering

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6517: --- Assignee: Apache Spark Implement the Algorithm of Hierarchical Clustering

[jira] [Assigned] (SPARK-6517) Implement the Algorithm of Hierarchical Clustering

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6517: --- Assignee: (was: Apache Spark) Implement the Algorithm of Hierarchical Clustering

[jira] [Updated] (SPARK-6610) mlllib explicit ALS checkpoint does not work

2015-03-30 Thread lisendong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lisendong updated SPARK-6610: - Summary: mlllib explicit ALS checkpoint does not work (was: explicit checkpoint does not work) mlllib

[jira] [Commented] (SPARK-6517) Implement the Algorithm of Hierarchical Clustering

2015-03-30 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386756#comment-14386756 ] Yu Ishikawa commented on SPARK-6517: Hi [~freeman-lab] and [~rnowling], Would you

[jira] [Resolved] (SPARK-6610) mlllib explicit ALS checkpoint does not work

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6610. -- Resolution: Duplicate Target Version/s: (was: 1.3.1) mlllib explicit ALS checkpoint does

[jira] [Commented] (SPARK-3276) Provide a API to specify whether the old files need to be ignored in file input text DStream

2015-03-30 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386775#comment-14386775 ] Emre Sevinç commented on SPARK-3276: Any plans to make the private val

[jira] [Commented] (SPARK-6061) File source dstream can not include the old file which timestamp is before the system time

2015-03-30 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386774#comment-14386774 ] Emre Sevinç commented on SPARK-6061: Any plans to make the private val

[jira] [Updated] (SPARK-6609) explicit checkpoint does not work

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6609: - Priority: Minor (was: Critical) Target Version/s: (was: 1.3.0) Fix Version/s:

[jira] [Commented] (SPARK-6609) explicit checkpoint does not work

2015-03-30 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386781#comment-14386781 ] Guoqiang Li commented on SPARK-6609: We should merge [the PR

[jira] [Commented] (SPARK-6568) spark-shell.cmd --jars option does not accept the jar that has space in its path

2015-03-30 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386552#comment-14386552 ] Steve Loughran commented on SPARK-6568: --- Can you show the full stack trace?

[jira] [Commented] (SPARK-6282) Strange Python import error when using random() in a lambda function

2015-03-30 Thread Dmytro Bielievtsov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386564#comment-14386564 ] Dmytro Bielievtsov commented on SPARK-6282: --- I had the same strange issue when

[jira] [Resolved] (SPARK-2348) In Windows having a enviorinment variable named 'classpath' gives error

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2348. -- Resolution: Not a Problem In Windows having a enviorinment variable named 'classpath' gives error

[jira] [Commented] (SPARK-6605) Same transformation in DStream leads to different result

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386568#comment-14386568 ] Sean Owen commented on SPARK-6605: -- Thanks, that's very useful. I think the behavior is

[jira] [Commented] (SPARK-6605) Same transformation in DStream leads to different result

2015-03-30 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386610#comment-14386610 ] SaintBacchus commented on SPARK-6605: - Yeah, [~srowen] it's not a wrong answer but

[jira] [Updated] (SPARK-6603) SQLContext.registerFunction - SQLContext.udf.register

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6603: - Component/s: SQL SQLContext.registerFunction - SQLContext.udf.register

[jira] [Updated] (SPARK-5750) Document that ordering of elements in shuffled partitions is not deterministic across runs

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5750: - Priority: Minor (was: Major) Document that ordering of elements in shuffled partitions is not

[jira] [Resolved] (SPARK-5836) Highlight in Spark documentation that by default Spark does not delete its temporary files

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5836. -- Resolution: Fixed Fix Version/s: 1.4.0 1.3.1 Assignee: Ilya Ganelin

[jira] [Commented] (SPARK-6607) Aggregation attribute name including special chars '(' and ')' should be replaced before generating Parquet schema

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386560#comment-14386560 ] Apache Spark commented on SPARK-6607: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-6607) Aggregation attribute name including special chars '(' and ')' should be replaced before generating Parquet schema

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6607: --- Assignee: (was: Apache Spark) Aggregation attribute name including special chars '('

[jira] [Assigned] (SPARK-6607) Aggregation attribute name including special chars '(' and ')' should be replaced before generating Parquet schema

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6607: --- Assignee: Apache Spark Aggregation attribute name including special chars '(' and ')'

[jira] [Commented] (SPARK-799) Windows versions of the deploy scripts

2015-03-30 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386571#comment-14386571 ] Steve Loughran commented on SPARK-799: -- Proving python versions of the launcher

[jira] [Comment Edited] (SPARK-6282) Strange Python import error when using random() in a lambda function

2015-03-30 Thread Dmytro Bielievtsov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386564#comment-14386564 ] Dmytro Bielievtsov edited comment on SPARK-6282 at 3/30/15 11:18 AM:

[jira] [Commented] (SPARK-2356) Exception: Could not locate executable null\bin\winutils.exe in the Hadoop

2015-03-30 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386585#comment-14386585 ] Steve Loughran commented on SPARK-2356: --- It's coming from {{

[jira] [Commented] (SPARK-6598) Python API for IDFModel

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386619#comment-14386619 ] Apache Spark commented on SPARK-6598: - User 'Lewuathe' has created a pull request for

[jira] [Assigned] (SPARK-6598) Python API for IDFModel

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6598: --- Assignee: (was: Apache Spark) Python API for IDFModel ---

[jira] [Assigned] (SPARK-6598) Python API for IDFModel

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6598: --- Assignee: Apache Spark Python API for IDFModel ---

[jira] [Commented] (SPARK-6189) Pandas to DataFrame conversion should check field names for periods

2015-03-30 Thread Kalle Jepsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386625#comment-14386625 ] Kalle Jepsen commented on SPARK-6189: - I do not really understand why the column names

[jira] [Commented] (SPARK-6605) Same transformation in DStream leads to different result

2015-03-30 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386540#comment-14386540 ] SaintBacchus commented on SPARK-6605: - Hi [~srowen], my test code is this :

[jira] [Commented] (SPARK-3441) Explain in docs that repartitionAndSortWithinPartitions enacts Hadoop style shuffle

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386541#comment-14386541 ] Sean Owen commented on SPARK-3441: -- This is mentioned in the change for

[jira] [Updated] (SPARK-5836) Highlight in Spark documentation that by default Spark does not delete its temporary files

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5836: - Priority: Minor (was: Major) Highlight in Spark documentation that by default Spark does not delete its

[jira] [Comment Edited] (SPARK-6282) Strange Python import error when using random() in a lambda function

2015-03-30 Thread Dmytro Bielievtsov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386564#comment-14386564 ] Dmytro Bielievtsov edited comment on SPARK-6282 at 3/30/15 11:18 AM:

[jira] [Assigned] (SPARK-4226) SparkSQL - Add support for subqueries in predicates

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4226: --- Assignee: (was: Apache Spark) SparkSQL - Add support for subqueries in predicates

[jira] [Created] (SPARK-6608) Make DataFrame.rdd a lazy val

2015-03-30 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-6608: - Summary: Make DataFrame.rdd a lazy val Key: SPARK-6608 URL: https://issues.apache.org/jira/browse/SPARK-6608 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5456) Decimal Type comparison issue

2015-03-30 Thread Karthik Gorthi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386628#comment-14386628 ] Karthik Gorthi commented on SPARK-5456: --- One workaround we followed is to convert

[jira] [Resolved] (SPARK-6596) fix the instruction on building scaladoc

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6596. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5253

[jira] [Updated] (SPARK-6596) fix the instruction on building scaladoc

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6596: - Priority: Trivial (was: Major) Assignee: Nan Zhu fix the instruction on building scaladoc

[jira] [Resolved] (SPARK-6595) DataFrame self joins with MetastoreRelations fail

2015-03-30 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6595. --- Resolution: Fixed Fixed by https://github.com/apache/spark/pull/5251 DataFrame self joins with

[jira] [Resolved] (SPARK-6609) explicit checkpoint does not work

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6609. -- Resolution: Invalid [~lisendong] Actually you were not even pointing at the 1.3 branch. SPARK-5955 did

[jira] [Assigned] (SPARK-6602) Replace direct use of Akka with Spark RPC interface

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6602: --- Assignee: Shixiong Zhu (was: Apache Spark) Replace direct use of Akka with Spark RPC

[jira] [Commented] (SPARK-6602) Replace direct use of Akka with Spark RPC interface

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386875#comment-14386875 ] Apache Spark commented on SPARK-6602: - User 'zsxwing' has created a pull request for

[jira] [Assigned] (SPARK-6602) Replace direct use of Akka with Spark RPC interface

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6602: --- Assignee: Apache Spark (was: Shixiong Zhu) Replace direct use of Akka with Spark RPC

[jira] [Commented] (SPARK-4123) Show dependency changes in pull requests

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386888#comment-14386888 ] Apache Spark commented on SPARK-4123: - User 'brennonyork' has created a pull request

[jira] [Commented] (SPARK-5990) Model import/export for IsotonicRegression

2015-03-30 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386936#comment-14386936 ] Yanbo Liang commented on SPARK-5990: [~josephkb] Could you assign this to me? Model

[jira] [Updated] (SPARK-4226) SparkSQL - Add support for subqueries in predicates

2015-03-30 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-4226: -- Description: I have a test table defined in Hive as follows: {code:sql} CREATE TABLE sparkbug ( id

[jira] [Assigned] (SPARK-2883) Spark Support for ORCFile format

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2883: --- Assignee: (was: Apache Spark) Spark Support for ORCFile format

[jira] [Assigned] (SPARK-2883) Spark Support for ORCFile format

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-2883: --- Assignee: Apache Spark Spark Support for ORCFile format

[jira] [Commented] (SPARK-6258) Python MLlib API missing items: Clustering

2015-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14387115#comment-14387115 ] Joseph K. Bradley commented on SPARK-6258: -- [~hrishikesh], glad to hear you're

[jira] [Assigned] (SPARK-6403) Launch master as spot instance on EC2

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6403: --- Assignee: (was: Apache Spark) Launch master as spot instance on EC2

[jira] [Assigned] (SPARK-6611) Add support for INTEGER as synonym of INT to DDLParser

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6611: --- Assignee: Apache Spark Add support for INTEGER as synonym of INT to DDLParser

[jira] [Assigned] (SPARK-6611) Add support for INTEGER as synonym of INT to DDLParser

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6611: --- Assignee: (was: Apache Spark) Add support for INTEGER as synonym of INT to DDLParser

[jira] [Commented] (SPARK-6611) Add support for INTEGER as synonym of INT to DDLParser

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386945#comment-14386945 ] Apache Spark commented on SPARK-6611: - User 'smola' has created a pull request for

[jira] [Assigned] (SPARK-6403) Launch master as spot instance on EC2

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6403: --- Assignee: Apache Spark Launch master as spot instance on EC2

[jira] [Commented] (SPARK-6239) Spark MLlib fpm#FPGrowth minSupport should use long instead

2015-03-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386933#comment-14386933 ] Sean Owen commented on SPARK-6239: -- Just the little API overhead for littler gain IMHO.

[jira] [Commented] (SPARK-5564) Support sparse LDA solutions

2015-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14387085#comment-14387085 ] Joseph K. Bradley commented on SPARK-5564: -- [~debasish83] I mainly used a

[jira] [Created] (SPARK-6611) Add support for INTEGER as synonym of INT to DDLParser

2015-03-30 Thread Santiago M. Mola (JIRA)
Santiago M. Mola created SPARK-6611: --- Summary: Add support for INTEGER as synonym of INT to DDLParser Key: SPARK-6611 URL: https://issues.apache.org/jira/browse/SPARK-6611 Project: Spark

[jira] [Commented] (SPARK-6603) SQLContext.registerFunction - SQLContext.udf.register

2015-03-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14387105#comment-14387105 ] Reynold Xin commented on SPARK-6603: How about not deprecating registerFunction, and

[jira] [Created] (SPARK-6612) Python KMeans parity

2015-03-30 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6612: Summary: Python KMeans parity Key: SPARK-6612 URL: https://issues.apache.org/jira/browse/SPARK-6612 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-5990) Model import/export for IsotonicRegression

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5990: --- Assignee: Apache Spark Model import/export for IsotonicRegression

[jira] [Assigned] (SPARK-5990) Model import/export for IsotonicRegression

2015-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5990: --- Assignee: (was: Apache Spark) Model import/export for IsotonicRegression

  1   2   3   >