[jira] [Commented] (SPARK-10180) JDBCRDD does not process EqualNullSafe filter.

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743339#comment-14743339 ] Apache Spark commented on SPARK-10180: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Created] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-10588: -- Summary: Saving a DataFrame containing only nulls to JSON doesn't work Key: SPARK-10588 URL: https://issues.apache.org/jira/browse/SPARK-10588 Project: Spark

[jira] [Commented] (SPARK-10587) In pyspark, toDF() dosen't exsist in RDD object

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743334#comment-14743334 ] Sean Owen commented on SPARK-10587: --- It's in {{python/pyspark/sql/context.py}}. Are you sure your

[jira] [Assigned] (SPARK-10589) Add defense against external site framing

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10589: Assignee: Apache Spark (was: Sean Owen) > Add defense against external site framing >

[jira] [Commented] (SPARK-2960) Spark executables fail to start via symlinks

2015-09-14 Thread Danil Mironov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743415#comment-14743415 ] Danil Mironov commented on SPARK-2960: -- The title of the issue is not that misleading, when one

[jira] [Created] (SPARK-10589) Add defense against external site framing

2015-09-14 Thread Sean Owen (JIRA)
Sean Owen created SPARK-10589: - Summary: Add defense against external site framing Key: SPARK-10589 URL: https://issues.apache.org/jira/browse/SPARK-10589 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-1537) Add integration with Yarn's Application Timeline Server

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743421#comment-14743421 ] Apache Spark commented on SPARK-1537: - User 'steveloughran' has created a pull request for this issue:

[jira] [Commented] (SPARK-10577) [PySpark] DataFrame hint for broadcast join

2015-09-14 Thread Jian Feng Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743426#comment-14743426 ] Jian Feng Zhang commented on SPARK-10577: - I'd like to take this to create a pull request. >

[jira] [Assigned] (SPARK-10589) Add defense against external site framing

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10589: Assignee: Sean Owen (was: Apache Spark) > Add defense against external site framing >

[jira] [Commented] (SPARK-10589) Add defense against external site framing

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743470#comment-14743470 ] Apache Spark commented on SPARK-10589: -- User 'srowen' has created a pull request for this issue:

[jira] [Issue Comment Deleted] (SPARK-7442) Spark 1.3.1 / Hadoop 2.6 prebuilt pacakge has broken S3 filesystem access

2015-09-14 Thread Rustam Aliyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rustam Aliyev updated SPARK-7442: - Comment: was deleted (was: Hit this bug today. It basically makes Spark on AWS useless for many

[jira] [Commented] (SPARK-4815) ThriftServer use only one SessionState to run sql using hive

2015-09-14 Thread Joseph Fourny (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743555#comment-14743555 ] Joseph Fourny commented on SPARK-4815: -- Is this really fixed? I am on Spark 1.5.0 (rc3) and I see

[jira] [Commented] (SPARK-2356) Exception: Could not locate executable null\bin\winutils.exe in the Hadoop

2015-09-14 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743531#comment-14743531 ] Steve Loughran commented on SPARK-2356: --- The original JIRA here is just that there's an error being

[jira] [Commented] (SPARK-6961) Cannot save data to parquet files when executing from Windows from a Maven Project

2015-09-14 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743537#comment-14743537 ] Steve Loughran commented on SPARK-6961: --- Well, its an installation-side issue in that "if it isn't

[jira] [Commented] (SPARK-10550) SQLListener error constructing extended SQLContext

2015-09-14 Thread shao lo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743572#comment-14743572 ] shao lo commented on SPARK-10550: - There are parts that are marked as experimental. This is not in that

[jira] [Commented] (SPARK-7442) Spark 1.3.1 / Hadoop 2.6 prebuilt pacakge has broken S3 filesystem access

2015-09-14 Thread Rustam Aliyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743539#comment-14743539 ] Rustam Aliyev commented on SPARK-7442: -- Hit this bug today. It basically makes Spark on AWS useless

[jira] [Created] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Kevin Tsai (JIRA)
Kevin Tsai created SPARK-10590: -- Summary: Spark with YARN build is broken Key: SPARK-10590 URL: https://issues.apache.org/jira/browse/SPARK-10590 Project: Spark Issue Type: Bug Affects

[jira] [Commented] (SPARK-7012) Add support for NOT NULL modifier for column definitions on DDLParser

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743500#comment-14743500 ] Apache Spark commented on SPARK-7012: - User 'sabhyankar' has created a pull request for this issue:

[jira] [Assigned] (SPARK-7012) Add support for NOT NULL modifier for column definitions on DDLParser

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7012: --- Assignee: Apache Spark > Add support for NOT NULL modifier for column definitions on

[jira] [Assigned] (SPARK-7012) Add support for NOT NULL modifier for column definitions on DDLParser

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7012: --- Assignee: (was: Apache Spark) > Add support for NOT NULL modifier for column definitions

[jira] [Updated] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Kevin Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Tsai updated SPARK-10590: --- Environment: CentOS 6.5 Oracle JDK 1.7.0_75 Maven 3.3.3 Hadoop 2.6.0 Spark 1.5.0 was: CentOS 6.5

[jira] [Commented] (SPARK-10458) Would like to know if a given Spark Context is stopped or currently stopping

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743777#comment-14743777 ] Apache Spark commented on SPARK-10458: -- User 'kmadhugit' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10458) Would like to know if a given Spark Context is stopped or currently stopping

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10458: Assignee: Apache Spark > Would like to know if a given Spark Context is stopped or

[jira] [Assigned] (SPARK-10458) Would like to know if a given Spark Context is stopped or currently stopping

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10458: Assignee: (was: Apache Spark) > Would like to know if a given Spark Context is

[jira] [Commented] (SPARK-10550) SQLListener error constructing extended SQLContext

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743642#comment-14743642 ] Sean Owen commented on SPARK-10550: --- It's marked {{protected[sql]}} which means it is not accessible

[jira] [Commented] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743666#comment-14743666 ] Sean Owen commented on SPARK-10590: --- Did you run the script to set up the build for Scala 2.11 first?

[jira] [Commented] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743770#comment-14743770 ] Yin Huai commented on SPARK-10588: -- This is an expected behavior. When we write a row out, we skip those

[jira] [Updated] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10588: - Priority: Minor (was: Major) > Saving a DataFrame containing only nulls to JSON doesn't work >

[jira] [Commented] (SPARK-10585) only copy data once when generate unsafe projection

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743677#comment-14743677 ] Apache Spark commented on SPARK-10585: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10585) only copy data once when generate unsafe projection

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10585: Assignee: Apache Spark > only copy data once when generate unsafe projection >

[jira] [Assigned] (SPARK-10585) only copy data once when generate unsafe projection

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10585: Assignee: (was: Apache Spark) > only copy data once when generate unsafe projection >

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-09-14 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743759#comment-14743759 ] Shivaram Venkataraman commented on SPARK-9325: -- Thanks [~felixcheung] for investigating into

[jira] [Commented] (SPARK-6417) Add Linear Programming algorithm

2015-09-14 Thread Ehsan Mohyedin Kermani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743896#comment-14743896 ] Ehsan Mohyedin Kermani commented on SPARK-6417: --- Thank you Joseph for the advice! I have

[jira] [Commented] (SPARK-10590) Spark with YARN build is broken

2015-09-14 Thread Kevin Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743813#comment-14743813 ] Kevin Tsai commented on SPARK-10590: Hi Sean, The result is same as previous when I build it after

[jira] [Commented] (SPARK-10579) Extend statistical functions: Add Cardinality/Quantiles/Quartiles/Median in Statistics , e.g. for columns

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743819#comment-14743819 ] Joseph K. Bradley commented on SPARK-10579: --- A lot of this functionality is being added to

[jira] [Closed] (SPARK-10579) Extend statistical functions: Add Cardinality/Quantiles/Quartiles/Median in Statistics , e.g. for columns

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-10579. - Resolution: Won't Fix > Extend statistical functions: Add

[jira] [Commented] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743840#comment-14743840 ] Apache Spark commented on SPARK-10588: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10588: Assignee: (was: Apache Spark) > Saving a DataFrame containing only nulls to JSON

[jira] [Assigned] (SPARK-10588) Saving a DataFrame containing only nulls to JSON doesn't work

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10588: Assignee: Apache Spark > Saving a DataFrame containing only nulls to JSON doesn't work >

[jira] [Created] (SPARK-10591) False negative in QueryTest.checkAnswer

2015-09-14 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-10591: -- Summary: False negative in QueryTest.checkAnswer Key: SPARK-10591 URL: https://issues.apache.org/jira/browse/SPARK-10591 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743825#comment-14743825 ] Joseph K. Bradley commented on SPARK-10573: --- I think your assessment is correct. Would you

[jira] [Resolved] (SPARK-10578) pyspark.ml.classification.RandomForestClassifer does not return `rawPrediction` column

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-10578. --- Resolution: Fixed Assignee: Joseph K. Bradley Fix Version/s: 1.5.0

[jira] [Commented] (SPARK-10574) HashingTF should use MurmurHash3

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743834#comment-14743834 ] Joseph K. Bradley commented on SPARK-10574: --- I agree that switching to MurmurHash3 is a good

[jira] [Commented] (SPARK-10578) pyspark.ml.classification.RandomForestClassifer does not return `rawPrediction` column

2015-09-14 Thread Karen Yin-Yee Ng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14743831#comment-14743831 ] Karen Yin-Yee Ng commented on SPARK-10578: -- Thanks [~josephkb] and [~viirya] for the quick

[jira] [Updated] (SPARK-10599) Decrease communication in BlockMatrix multiply and increase performance

2015-09-14 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-10599: Description: The BlockMatrix multiply sends each block to all the corresponding columns of the

[jira] [Created] (SPARK-10600) SparkSQL - Support for Not Exists in a Correlated Subquery

2015-09-14 Thread Richard Garris (JIRA)
Richard Garris created SPARK-10600: -- Summary: SparkSQL - Support for Not Exists in a Correlated Subquery Key: SPARK-10600 URL: https://issues.apache.org/jira/browse/SPARK-10600 Project: Spark

[jira] [Updated] (SPARK-10597) MultivariateOnlineSummarizer for weighted instances

2015-09-14 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-10597: Description: MultivariateOnlineSummarizer for weighted instances is implemented as private API for

[jira] [Created] (SPARK-10597) MultivariateOnlineSummarizer for weighted instances

2015-09-14 Thread DB Tsai (JIRA)
DB Tsai created SPARK-10597: --- Summary: MultivariateOnlineSummarizer for weighted instances Key: SPARK-10597 URL: https://issues.apache.org/jira/browse/SPARK-10597 Project: Spark Issue Type: New

[jira] [Assigned] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10594: Assignee: (was: Apache Spark) > ApplicationMaster "--help" references the removed

[jira] [Assigned] (SPARK-10593) sql lateral view same name gives wrong value

2015-09-14 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-10593: -- Assignee: Davies Liu > sql lateral view same name gives wrong value >

[jira] [Updated] (SPARK-10598) RoutingTablePartition toMessage method refers to bytes instead of bits

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10598: -- Description: (was: (Have a look at

[jira] [Updated] (SPARK-10598) RoutingTablePartition toMessage method refers to bytes instead of bits

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10598: -- Affects Version/s: (was: 1.4.0) Target Version/s: (was: 1.5.0) Priority:

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-09-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744170#comment-14744170 ] Reynold Xin commented on SPARK-9325: Do you want to support collect(df$Age + 1) ? > Support

[jira] [Updated] (SPARK-10563) SparkContext's local properties should be cloned when inherited

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10563: -- Target Version/s: 1.6.0, 1.5.1 (was: 1.6.0) > SparkContext's local properties should be cloned when

[jira] [Created] (SPARK-10599) Decrease communication in BlockMatrix multiply and increase performance

2015-09-14 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-10599: --- Summary: Decrease communication in BlockMatrix multiply and increase performance Key: SPARK-10599 URL: https://issues.apache.org/jira/browse/SPARK-10599 Project: Spark

[jira] [Assigned] (SPARK-10599) Decrease communication in BlockMatrix multiply and increase performance

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10599: Assignee: Apache Spark > Decrease communication in BlockMatrix multiply and increase

[jira] [Updated] (SPARK-10598) RoutingTablePartition toMessage method refers to bytes instead of bits

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10598: -- Assignee: Robin East > RoutingTablePartition toMessage method refers to bytes instead of bits >

[jira] [Resolved] (SPARK-6981) [SQL] SparkPlanner and QueryExecution should be factored out from SQLContext

2015-09-14 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6981. - Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 6356

[jira] [Commented] (SPARK-7040) Explore receiver-less DStream for Flume

2015-09-14 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744162#comment-14744162 ] Tathagata Das commented on SPARK-7040: -- I am not sure how Direct API can be built for Flume as Flume

[jira] [Assigned] (SPARK-10593) sql lateral view same name gives wrong value

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10593: Assignee: Apache Spark > sql lateral view same name gives wrong value >

[jira] [Assigned] (SPARK-10593) sql lateral view same name gives wrong value

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10593: Assignee: (was: Apache Spark) > sql lateral view same name gives wrong value >

[jira] [Commented] (SPARK-10593) sql lateral view same name gives wrong value

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744242#comment-14744242 ] Apache Spark commented on SPARK-10593: -- User 'davies' has created a pull request for this issue:

[jira] [Created] (SPARK-10598) RoutingTablePartition toMessage method refers to bytes instead of bits

2015-09-14 Thread Robin East (JIRA)
Robin East created SPARK-10598: -- Summary: RoutingTablePartition toMessage method refers to bytes instead of bits Key: SPARK-10598 URL: https://issues.apache.org/jira/browse/SPARK-10598 Project: Spark

[jira] [Resolved] (SPARK-10522) Nanoseconds part of Timestamp should be positive in parquet

2015-09-14 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10522. Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull

[jira] [Updated] (SPARK-6981) [SQL] SparkPlanner and QueryExecution should be factored out from SQLContext

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6981: - Assignee: Edoardo Vacchi > [SQL] SparkPlanner and QueryExecution should be factored out from SQLContext >

[jira] [Resolved] (SPARK-10543) Peak Execution Memory Quantile should be Per-task Basis

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10543. --- Resolution: Fixed Assignee: Sen Fang Fix Version/s: 1.5.1

[jira] [Commented] (SPARK-10317) start-history-server.sh CLI parsing incompatible with HistoryServer's arg parsing

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744451#comment-14744451 ] Apache Spark commented on SPARK-10317: -- User 'rekhajoshm' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10317) start-history-server.sh CLI parsing incompatible with HistoryServer's arg parsing

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10317: Assignee: (was: Apache Spark) > start-history-server.sh CLI parsing incompatible with

[jira] [Assigned] (SPARK-10317) start-history-server.sh CLI parsing incompatible with HistoryServer's arg parsing

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10317: Assignee: Apache Spark > start-history-server.sh CLI parsing incompatible with

[jira] [Created] (SPARK-10603) Univariate statistics as UDAFs: multi-pass continuous stats

2015-09-14 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10603: - Summary: Univariate statistics as UDAFs: multi-pass continuous stats Key: SPARK-10603 URL: https://issues.apache.org/jira/browse/SPARK-10603 Project: Spark

[jira] [Resolved] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10573. --- Resolution: Fixed Fix Version/s: 1.6.0 > IndexToString transformSchema adds output

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-09-14 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744254#comment-14744254 ] Davies Liu commented on SPARK-9325: --- I would -1 on this. I'm worried that once we have

[jira] [Resolved] (SPARK-10587) In pyspark, toDF() dosen't exsist in RDD object

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10587. --- Resolution: Not A Problem > In pyspark, toDF() dosen't exsist in RDD object >

[jira] [Created] (SPARK-10602) Univariate statistics as UDAFs: single-pass continuous stats

2015-09-14 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10602: - Summary: Univariate statistics as UDAFs: single-pass continuous stats Key: SPARK-10602 URL: https://issues.apache.org/jira/browse/SPARK-10602 Project:

[jira] [Updated] (SPARK-10591) False negative in QueryTest.checkAnswer

2015-09-14 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-10591: --- Description: # For double and float, {{NaN == NaN}} is always {{false}} # {{checkAnswer}} doesn't

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-09-14 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744167#comment-14744167 ] Shivaram Venkataraman commented on SPARK-9325: -- Just `collect` and maybe `head`. This is just

[jira] [Assigned] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10594: Assignee: Apache Spark > ApplicationMaster "--help" references the removed

[jira] [Commented] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744209#comment-14744209 ] Apache Spark commented on SPARK-10594: -- User 'erickt' has created a pull request for this issue:

[jira] [Commented] (SPARK-10598) RoutingTablePartition toMessage method refers to bytes instead of bits

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744275#comment-14744275 ] Apache Spark commented on SPARK-10598: -- User 'insidedctm' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10598) RoutingTablePartition toMessage method refers to bytes instead of bits

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10598: Assignee: (was: Apache Spark) > RoutingTablePartition toMessage method refers to

[jira] [Assigned] (SPARK-10598) RoutingTablePartition toMessage method refers to bytes instead of bits

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10598: Assignee: Apache Spark > RoutingTablePartition toMessage method refers to bytes instead

[jira] [Updated] (SPARK-10575) Wrap RDD.takeSample with scope

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10575: -- Affects Version/s: 1.4.0 Target Version/s: 1.6.0 > Wrap RDD.takeSample with scope >

[jira] [Commented] (SPARK-10598) RoutingTablePartition toMessage method refers to bytes instead of bits

2015-09-14 Thread Robin East (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744359#comment-14744359 ] Robin East commented on SPARK-10598: Apologies - have checked it out. You're referring to Fix and

[jira] [Updated] (SPARK-10522) Nanoseconds part of Timestamp should be positive in parquet

2015-09-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10522: -- Assignee: Davies Liu > Nanoseconds part of Timestamp should be positive in parquet >

[jira] [Resolved] (SPARK-10549) scala 2.11 spark on yarn with security - Repl doesn't work

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10549. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Target Version/s:

[jira] [Resolved] (SPARK-7040) Explore receiver-less DStream for Flume

2015-09-14 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-7040. -- Resolution: Invalid > Explore receiver-less DStream for Flume >

[jira] [Updated] (SPARK-10573) IndexToString transformSchema adds output field as DoubleType

2015-09-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10573: -- Fix Version/s: 1.5.1 > IndexToString transformSchema adds output field as DoubleType >

[jira] [Commented] (SPARK-10599) Decrease communication in BlockMatrix multiply and increase performance

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744322#comment-14744322 ] Apache Spark commented on SPARK-10599: -- User 'brkyvz' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10599) Decrease communication in BlockMatrix multiply and increase performance

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10599: Assignee: (was: Apache Spark) > Decrease communication in BlockMatrix multiply and

[jira] [Resolved] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-10594. --- Resolution: Fixed Fix Version/s: 1.6.0 Target Version/s: 1.6.0 > ApplicationMaster

[jira] [Updated] (SPARK-10594) ApplicationMaster "--help" references the removed "--num-executors" option

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10594: -- Assignee: Erick Tryzelaar > ApplicationMaster "--help" references the removed "--num-executors" option

[jira] [Resolved] (SPARK-9996) Create local nested loop join operator

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-9996. -- Resolution: Fixed Fix Version/s: 1.6.0 > Create local nested loop join operator >

[jira] [Resolved] (SPARK-9997) Create local Expand operator

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-9997. -- Resolution: Fixed Fix Version/s: 1.6.0 Target Version/s: 1.6.0 > Create local Expand

[jira] [Commented] (SPARK-8418) Add single- and multi-value support to ML Transformers

2015-09-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744508#comment-14744508 ] Joseph K. Bradley commented on SPARK-8418: -- Apologies for being AWOL! I'd definitely appreciate

[jira] [Created] (SPARK-10604) Univariate statistics as UDAFs: categorical stats

2015-09-14 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10604: - Summary: Univariate statistics as UDAFs: categorical stats Key: SPARK-10604 URL: https://issues.apache.org/jira/browse/SPARK-10604 Project: Spark

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2015-09-14 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744175#comment-14744175 ] Shivaram Venkataraman commented on SPARK-9325: -- Hmm not necessarily. If `df$newAge <- df$Age

[jira] [Commented] (SPARK-10563) SparkContext's local properties should be cloned when inherited

2015-09-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744291#comment-14744291 ] Apache Spark commented on SPARK-10563: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Updated] (SPARK-10575) Wrap RDD.takeSample with scope

2015-09-14 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10575: -- Assignee: Vinod KC > Wrap RDD.takeSample with scope > -- > >

[jira] [Created] (SPARK-10601) Spark SQL - Support for MINUS

2015-09-14 Thread Richard Garris (JIRA)
Richard Garris created SPARK-10601: -- Summary: Spark SQL - Support for MINUS Key: SPARK-10601 URL: https://issues.apache.org/jira/browse/SPARK-10601 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-10587) In pyspark, toDF() dosen't exsist in RDD object

2015-09-14 Thread SemiCoder (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14744643#comment-14744643 ] SemiCoder commented on SPARK-10587: --- It's not my code, it's code in latest released version. In fact,

  1   2   3   >