[jira] [Commented] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-14 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743030#comment-14743030 ] Maciej Bryński commented on SPARK-10577: Unfortunatelly I'm rather poweruser than

[jira] [Updated] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10577: Labels: starter (was: ) > [PySpark, SQL] DataFrame hint for broadcast join > -

[jira] [Commented] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743033#comment-14743033 ] Reynold Xin commented on SPARK-10577: - We already have the Java API in functions.scal

[jira] [Updated] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10577: Summary: [PySpark] DataFrame hint for broadcast join (was: [PySpark, SQL] DataFrame hint for broad

[jira] [Updated] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10577: Target Version/s: 1.6.0 > [PySpark] DataFrame hint for broadcast join > ---

[jira] [Updated] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-10577: --- Description: As in https://issues.apache.org/jira/browse/SPARK-8300 there should by possibili

[jira] [Commented] (SPARK-1103) Garbage collect RDD information inside of Spark

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743060#comment-14743060 ] Apache Spark commented on SPARK-1103: - User 'tdas' has created a pull request for this

[jira] [Created] (SPARK-10586) BlockManager ca't be removed when it is re-registered, then disassociats

2015-09-14 Thread meiyoula (JIRA)
meiyoula created SPARK-10586: Summary: BlockManager ca't be removed when it is re-registered, then disassociats Key: SPARK-10586 URL: https://issues.apache.org/jira/browse/SPARK-10586 Project: Spark

[jira] [Assigned] (SPARK-10586) BlockManager ca't be removed when it is re-registered, then disassociats

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10586: Assignee: (was: Apache Spark) > BlockManager ca't be removed when it is re-registered,

[jira] [Commented] (SPARK-10586) BlockManager ca't be removed when it is re-registered, then disassociats

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743088#comment-14743088 ] Apache Spark commented on SPARK-10586: -- User 'XuTingjun' has created a pull request

[jira] [Assigned] (SPARK-10586) BlockManager ca't be removed when it is re-registered, then disassociats

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10586: Assignee: Apache Spark > BlockManager ca't be removed when it is re-registered, then disas

[jira] [Resolved] (SPARK-9720) spark.ml Identifiable types should have UID in toString methods

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-9720. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8062 [https://github.com/ap

[jira] [Resolved] (SPARK-10550) SQLListener error constructing extended SQLContext

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10550. --- Resolution: Not A Problem I'm going to provisionally close this as I don't think access to internals

[jira] [Commented] (SPARK-2960) Spark executables fail to start via symlinks

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743197#comment-14743197 ] Sean Owen commented on SPARK-2960: -- As an addendum, I think the title is misleading. It's

[jira] [Commented] (SPARK-10539) Intersection Optimization is Wrong

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743200#comment-14743200 ] Apache Spark commented on SPARK-10539: -- User 'yjshen' has created a pull request for

[jira] [Assigned] (SPARK-10539) Intersection Optimization is Wrong

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10539: Assignee: Apache Spark > Intersection Optimization is Wrong >

[jira] [Assigned] (SPARK-10539) Intersection Optimization is Wrong

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10539: Assignee: (was: Apache Spark) > Intersection Optimization is Wrong > -

[jira] [Commented] (SPARK-10458) Would like to know if a given Spark Context is stopped or currently stopping

2015-09-14 Thread Madhusudanan Kandasamy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743279#comment-14743279 ] Madhusudanan Kandasamy commented on SPARK-10458: [~srowen] I'll create a

[jira] [Created] (SPARK-10587) In pyspark, toDF() dosen't exsist in RDD object

2015-09-14 Thread SemiCoder (JIRA)
SemiCoder created SPARK-10587: - Summary: In pyspark, toDF() dosen't exsist in RDD object Key: SPARK-10587 URL: https://issues.apache.org/jira/browse/SPARK-10587 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-10587) In pyspark, toDF() dosen't exsist in RDD object

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743334#comment-14743334 ] Sean Owen commented on SPARK-10587: --- It's in {{python/pyspark/sql/context.py}}. Are you

[jira] [Commented] (SPARK-10180) JDBCRDD does not process EqualNullSafe filter.

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743339#comment-14743339 ] Apache Spark commented on SPARK-10180: -- User 'HyukjinKwon' has created a pull reques

[jira] [Created] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-10588: -- Summary: Saving a DataFrame containing only nulls to JSON doesn't work Key: SPARK-10588 URL: https://issues.apache.org/jira/browse/SPARK-10588 Project: Spark Is

[jira] [Commented] (SPARK-2960) Spark executables fail to start via symlinks

2015-09-14 Thread Danil Mironov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743415#comment-14743415 ] Danil Mironov commented on SPARK-2960: -- The title of the issue is not that misleading

[jira] [Commented] (SPARK-1537) Add integration with Yarn's Application Timeline Server

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743421#comment-14743421 ] Apache Spark commented on SPARK-1537: - User 'steveloughran' has created a pull request

[jira] [Commented] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread Jian Feng Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743426#comment-14743426 ] Jian Feng Zhang commented on SPARK-10577: - I'd like to take this to create a pull

[jira] [Created] (SPARK-10589) Add defense against external site framing

2015-09-14 Thread Sean Owen (JIRA)
Sean Owen created SPARK-10589: - Summary: Add defense against external site framing Key: SPARK-10589 URL: https://issues.apache.org/jira/browse/SPARK-10589 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-10589) Add defense against external site framing

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10589: Assignee: Apache Spark (was: Sean Owen) > Add defense against external site framing > ---

[jira] [Assigned] (SPARK-10589) Add defense against external site framing

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10589: Assignee: Sean Owen (was: Apache Spark) > Add defense against external site framing > ---

[jira] [Commented] (SPARK-10589) Add defense against external site framing

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743470#comment-14743470 ] Apache Spark commented on SPARK-10589: -- User 'srowen' has created a pull request for

[jira] [Commented] (SPARK-7012) Add support for NOT NULL modifier for column definitions on DDLParser

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743500#comment-14743500 ] Apache Spark commented on SPARK-7012: - User 'sabhyankar' has created a pull request fo

[jira] [Assigned] (SPARK-7012) Add support for NOT NULL modifier for column definitions on DDLParser

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7012: --- Assignee: Apache Spark > Add support for NOT NULL modifier for column definitions on DDLParse

[jira] [Assigned] (SPARK-7012) Add support for NOT NULL modifier for column definitions on DDLParser

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7012: --- Assignee: (was: Apache Spark) > Add support for NOT NULL modifier for column definitions

[jira] [Commented] (SPARK-2356) Exception: Could not locate executable null\bin\winutils.exe in the Hadoop

2015-09-14 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743531#comment-14743531 ] Steve Loughran commented on SPARK-2356: --- The original JIRA here is just that there's

[jira] [Commented] (SPARK-7442) Spark 1.3.1 / Hadoop 2.6 prebuilt pacakge has broken S3 filesystem access

2015-09-14 Thread Rustam Aliyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743539#comment-14743539 ] Rustam Aliyev commented on SPARK-7442: -- Hit this bug today. It basically makes Spark

[jira] [Commented] (SPARK-6961) Cannot save data to parquet files when executing from Windows from a Maven Project

2015-09-14 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743537#comment-14743537 ] Steve Loughran commented on SPARK-6961: --- Well, its an installation-side issue in tha

[jira] [Issue Comment Deleted] (SPARK-7442) Spark 1.3.1 / Hadoop 2.6 prebuilt pacakge has broken S3 filesystem access

2015-09-14 Thread Rustam Aliyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rustam Aliyev updated SPARK-7442: - Comment: was deleted (was: Hit this bug today. It basically makes Spark on AWS useless for many s

[jira] [Commented] (SPARK-4815) ThriftServer use only one SessionState to run sql using hive

2015-09-14 Thread Joseph Fourny (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743555#comment-14743555 ] Joseph Fourny commented on SPARK-4815: -- Is this really fixed? I am on Spark 1.5.0 (rc

[jira] [Commented] (SPARK-10550) SQLListener error constructing extended SQLContext

2015-09-14 Thread shao lo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743572#comment-14743572 ] shao lo commented on SPARK-10550: - There are parts that are marked as experimental. This

[jira] [Created] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Kevin Tsai (JIRA)
Kevin Tsai created SPARK-10590: -- Summary: Spark with YARN build is broken Key: SPARK-10590 URL: https://issues.apache.org/jira/browse/SPARK-10590 Project: Spark Issue Type: Bug Affects Versi

[jira] [Updated] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Kevin Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Tsai updated SPARK-10590: --- Environment: CentOS 6.5 Oracle JDK 1.7.0_75 Maven 3.3.3 Hadoop 2.6.0 Spark 1.5.0 was: CentOS 6.5

[jira] [Commented] (SPARK-10550) SQLListener error constructing extended SQLContext

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743642#comment-14743642 ] Sean Owen commented on SPARK-10550: --- It's marked {{protected[sql]}} which means it is n

[jira] [Commented] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743666#comment-14743666 ] Sean Owen commented on SPARK-10590: --- Did you run the script to set up the build for Sca

[jira] [Assigned] (SPARK-10585) only copy data once when generate unsafe projection

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10585: Assignee: Apache Spark > only copy data once when generate unsafe projection > ---

[jira] [Assigned] (SPARK-10585) only copy data once when generate unsafe projection

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10585: Assignee: (was: Apache Spark) > only copy data once when generate unsafe projection >

[jira] [Commented] (SPARK-10585) only copy data once when generate unsafe projection

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743677#comment-14743677 ] Apache Spark commented on SPARK-10585: -- User 'cloud-fan' has created a pull request

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-09-14 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743759#comment-14743759 ] Shivaram Venkataraman commented on SPARK-9325: -- Thanks [~felixcheung] for inv

[jira] [Commented] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743770#comment-14743770 ] Yin Huai commented on SPARK-10588: -- This is an expected behavior. When we write a row ou

[jira] [Updated] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10588: - Priority: Minor (was: Major) > Saving a DataFrame containing only nulls to JSON doesn't work > -

[jira] [Assigned] (SPARK-10458) Would like to know if a given Spark Context is stopped or currently stopping

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10458: Assignee: (was: Apache Spark) > Would like to know if a given Spark Context is stopped

[jira] [Commented] (SPARK-10458) Would like to know if a given Spark Context is stopped or currently stopping

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743777#comment-14743777 ] Apache Spark commented on SPARK-10458: -- User 'kmadhugit' has created a pull request

[jira] [Assigned] (SPARK-10458) Would like to know if a given Spark Context is stopped or currently stopping

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10458: Assignee: Apache Spark > Would like to know if a given Spark Context is stopped or current

[jira] [Created] (SPARK-10591) False negative in QueryTest.checkAnswer

2015-09-14 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-10591: -- Summary: False negative in QueryTest.checkAnswer Key: SPARK-10591 URL: https://issues.apache.org/jira/browse/SPARK-10591 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Kevin Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743813#comment-14743813 ] Kevin Tsai commented on SPARK-10590: Hi Sean, The result is same as previous when I b

[jira] [Closed] (SPARK-10579) Extend statistical functions: Add Cardinality/Quantiles/Quartiles/Median in Statistics , e.g. for columns

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-10579. - Resolution: Won't Fix > Extend statistical functions: Add Cardinality/Quantiles/Quartiles

[jira] [Commented] (SPARK-10579) Extend statistical functions: Add Cardinality/Quantiles/Quartiles/Median in Statistics , e.g. for columns

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743819#comment-14743819 ] Joseph K. Bradley commented on SPARK-10579: --- A lot of this functionality is bei

[jira] [Commented] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743825#comment-14743825 ] Joseph K. Bradley commented on SPARK-10573: --- I think your assessment is correct

[jira] [Resolved] (SPARK-10578) pyspark.ml.classification.RandomForestClassifer does not return `rawPrediction` column

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-10578. --- Resolution: Fixed Assignee: Joseph K. Bradley Fix Version/s: 1.5.0 [~

[jira] [Commented] (SPARK-10578) pyspark.ml.classification.RandomForestClassifer does not return `rawPrediction` column

2015-09-14 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743831#comment-14743831 ] Karen Yin-Yee Ng commented on SPARK-10578: -- Thanks [~josephkb] and [~viirya] for

[jira] [Commented] (SPARK-10574) HashingTF should use MurmurHash3

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743834#comment-14743834 ] Joseph K. Bradley commented on SPARK-10574: --- I agree that switching to MurmurHa

[jira] [Assigned] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10588: Assignee: Apache Spark > Saving a DataFrame containing only nulls to JSON doesn't work > -

[jira] [Assigned] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10588: Assignee: (was: Apache Spark) > Saving a DataFrame containing only nulls to JSON doesn

[jira] [Commented] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743840#comment-14743840 ] Apache Spark commented on SPARK-10588: -- User 'viirya' has created a pull request for

[jira] [Commented] (SPARK-6417) Add Linear Programming algorithm

2015-09-14 Thread Ehsan Mohyedin Kermani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743896#comment-14743896 ] Ehsan Mohyedin Kermani commented on SPARK-6417: --- Thank you Joseph for the ad

[jira] [Commented] (SPARK-10574) HashingTF should use MurmurHash3

2015-09-14 Thread Simeon Simeonov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743922#comment-14743922 ] Simeon Simeonov commented on SPARK-10574: - [~josephkb] this makes sense. There ar

[jira] [Comment Edited] (SPARK-10574) HashingTF should use MurmurHash3

2015-09-14 Thread Simeon Simeonov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743922#comment-14743922 ] Simeon Simeonov edited comment on SPARK-10574 at 9/14/15 5:55 PM: -

[jira] [Commented] (SPARK-6548) stddev_pop and stddev_samp aggregate functions

2015-09-14 Thread Jihong MA (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743959#comment-14743959 ] Jihong MA commented on SPARK-6548: -- [~davies]please fix the assignee to Jihong, Thanks!

[jira] [Assigned] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10573: Assignee: Apache Spark > IndexToString transformSchema adds output field as DoubleType > -

[jira] [Commented] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743958#comment-14743958 ] Apache Spark commented on SPARK-10573: -- User 'pnpritchard' has created a pull reques

[jira] [Assigned] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10573: Assignee: (was: Apache Spark) > IndexToString transformSchema adds output field as Dou

[jira] [Updated] (SPARK-6548) stddev_pop and stddev_samp aggregate functions

2015-09-14 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6548: Assignee: Jihong MA > stddev_pop and stddev_samp aggregate functions > -

[jira] [Updated] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10573: -- Target Version/s: 1.6.0, 1.5.1 > IndexToString transformSchema adds output field as DoubleType

[jira] [Updated] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10573: -- Assignee: Nick Pritchard > IndexToString transformSchema adds output field as DoubleType >

[jira] [Updated] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10573: -- Shepherd: Xiangrui Meng > IndexToString transformSchema adds output field as DoubleType > -

[jira] [Updated] (SPARK-10077) Java package doc for spark.ml.feature

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10077: -- Assignee: holdenk > Java package doc for spark.ml.feature > ---

[jira] [Updated] (SPARK-10077) Java package doc for spark.ml.feature

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10077: -- Shepherd: Xiangrui Meng Target Version/s: 1.6.0 > Java package doc for spark.ml.fea

[jira] [Updated] (SPARK-9769) Add Python API for ml.feature.CountVectorizer

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9769: - Shepherd: Xiangrui Meng > Add Python API for ml.feature.CountVectorizer >

[jira] [Commented] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14744000#comment-14744000 ] Sean Owen commented on SPARK-10590: --- No, I'm talking about {{./dev/change-scala-version

[jira] [Created] (SPARK-10592) deprecate weights and use coefficients instead in ML models

2015-09-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10592: - Summary: deprecate weights and use coefficients instead in ML models Key: SPARK-10592 URL: https://issues.apache.org/jira/browse/SPARK-10592 Project: Spark

[jira] [Updated] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__ and __hash__

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9793: - Shepherd: Xiangrui Meng Assignee: Yanbo Liang > PySpark DenseVector, SparseVector should overr

[jira] [Updated] (SPARK-9774) Add Python API for ml.regression.IsotonicRegression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9774: - Shepherd: Yanbo Liang Target Version/s: 1.6.0 Priority: Major (was: Minor)

[jira] [Updated] (SPARK-10266) Add @Since annotation to ml.tuning

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10266: -- Assignee: Ehsan Mohyedin Kermani > Add @Since annotation to ml.tuning > ---

[jira] [Updated] (SPARK-9774) Add Python API for ml.regression.IsotonicRegression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9774: - Assignee: holdenk > Add Python API for ml.regression.IsotonicRegression >

[jira] [Created] (SPARK-10593) sql lateral view same name gives wrong value

2015-09-14 Thread Davies Liu (JIRA)
Davies Liu created SPARK-10593: -- Summary: sql lateral view same name gives wrong value Key: SPARK-10593 URL: https://issues.apache.org/jira/browse/SPARK-10593 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-10266) Add @Since annotation to ml.tuning

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10266: -- Shepherd: Yu Ishikawa Target Version/s: 1.6.0 > Add @Since annotation to ml.tuning

[jira] [Updated] (SPARK-10593) sql lateral view same name gives wrong value

2015-09-14 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-10593: --- Description: This query will return wrong result: {code} select insideLayer1.json as json_insideLaye

[jira] [Updated] (SPARK-10265) Add @Since annotation to ml.regression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10265: -- Target Version/s: 1.6.0 > Add @Since annotation to ml.regression >

[jira] [Updated] (SPARK-10265) Add @Since annotation to ml.regression

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10265: -- Assignee: Ehsan Mohyedin Kermani > Add @Since annotation to ml.regression > ---

[jira] [Commented] (SPARK-10266) Add @Since annotation to ml.tuning

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14744024#comment-14744024 ] Xiangrui Meng commented on SPARK-10266: --- [~Ehsan Mohyedin Kermani] Next time please

[jira] [Updated] (SPARK-10269) Add @since annotation to pyspark.mllib.classification

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10269: -- Shepherd: Yu Ishikawa > Add @since annotation to pyspark.mllib.classification > ---

[jira] [Updated] (SPARK-10272) Add @since annotation to pyspark.mllib.evaluation

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10272: -- Shepherd: Yu Ishikawa > Add @since annotation to pyspark.mllib.evaluation > ---

[jira] [Updated] (SPARK-10271) Add @since annotation to pyspark.mllib.clustering

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10271: -- Shepherd: Yu Ishikawa > Add @since annotation to pyspark.mllib.clustering > ---

[jira] [Created] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Erick Tryzelaar (JIRA)
Erick Tryzelaar created SPARK-10594: --- Summary: ApplicationMaster "--help" references the removed "--num-executors" option Key: SPARK-10594 URL: https://issues.apache.org/jira/browse/SPARK-10594 Proj

[jira] [Updated] (SPARK-10272) Add @since annotation to pyspark.mllib.evaluation

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10272: -- Assignee: Noel Smith > Add @since annotation to pyspark.mllib.evaluation >

[jira] [Updated] (SPARK-10273) Add @since annotation to pyspark.mllib.feature

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10273: -- Assignee: Noel Smith > Add @since annotation to pyspark.mllib.feature > ---

[jira] [Updated] (SPARK-10269) Add @since annotation to pyspark.mllib.classification

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10269: -- Assignee: Noel Smith > Add @since annotation to pyspark.mllib.classification >

[jira] [Updated] (SPARK-10271) Add @since annotation to pyspark.mllib.clustering

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10271: -- Assignee: Noel Smith > Add @since annotation to pyspark.mllib.clustering >

[jira] [Updated] (SPARK-10275) Add @since annotation to pyspark.mllib.random

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10275: -- Shepherd: Noel Smith > Add @since annotation to pyspark.mllib.random >

[jira] [Updated] (SPARK-10276) Add @since annotation to pyspark.mllib.recommendation

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10276: -- Assignee: Yu Ishikawa > Add @since annotation to pyspark.mllib.recommendation > ---

[jira] [Updated] (SPARK-10274) Add @since annotation to pyspark.mllib.fpm

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10274: -- Shepherd: Noel Smith > Add @since annotation to pyspark.mllib.fpm > ---

[jira] [Updated] (SPARK-10273) Add @since annotation to pyspark.mllib.feature

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10273: -- Shepherd: Yu Ishikawa > Add @since annotation to pyspark.mllib.feature > --

  1   2   3   >