[jira] [Resolved] (SPARK-20020) SparkR should support checkpointing DataFrame

2017-03-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-20020. -- Resolution: Fixed Assignee: Felix Cheung Fix Version/s: 2.2.0 Targe

[jira] [Assigned] (SPARK-19994) Wrong outputOrdering for right/full outer smj

2017-03-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-19994: --- Assignee: Zhenhua Wang > Wrong outputOrdering for right/full outer smj > ---

[jira] [Resolved] (SPARK-19994) Wrong outputOrdering for right/full outer smj

2017-03-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19994. - Resolution: Fixed Fix Version/s: 2.2.0 2.0.3 2.1.1 I

[jira] [Commented] (SPARK-20027) Compilation fixed in java docs.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932197#comment-15932197 ] Apache Spark commented on SPARK-20027: -- User 'ScrapCodes' has created a pull request

[jira] [Assigned] (SPARK-20027) Compilation fixed in java docs.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20027: Assignee: (was: Apache Spark) > Compilation fixed in java docs. >

[jira] [Assigned] (SPARK-20027) Compilation fixed in java docs.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20027: Assignee: Apache Spark > Compilation fixed in java docs. > ---

[jira] [Created] (SPARK-20027) Compilation fixed in java docs.

2017-03-19 Thread Prashant Sharma (JIRA)
Prashant Sharma created SPARK-20027: --- Summary: Compilation fixed in java docs. Key: SPARK-20027 URL: https://issues.apache.org/jira/browse/SPARK-20027 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-20015) Document R Structured Streaming (experimental) in R vignettes and R & SS programming guide

2017-03-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-20015: - Component/s: Documentation > Document R Structured Streaming (experimental) in R vignettes and R

[jira] [Updated] (SPARK-20020) SparkR should support checkpointing DataFrame

2017-03-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-20020: - Component/s: Documentation > SparkR should support checkpointing DataFrame >

[jira] [Updated] (SPARK-20026) Document R GLM Tweedie family support in programming guide and code example

2017-03-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-20026: - Component/s: Documentation > Document R GLM Tweedie family support in programming guide and code

[jira] [Assigned] (SPARK-20025) Driver fail over will not work, if SPARK_LOCAL* env is set.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20025: Assignee: Apache Spark > Driver fail over will not work, if SPARK_LOCAL* env is set. > ---

[jira] [Assigned] (SPARK-20025) Driver fail over will not work, if SPARK_LOCAL* env is set.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20025: Assignee: (was: Apache Spark) > Driver fail over will not work, if SPARK_LOCAL* env is

[jira] [Commented] (SPARK-20025) Driver fail over will not work, if SPARK_LOCAL* env is set.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932192#comment-15932192 ] Apache Spark commented on SPARK-20025: -- User 'ScrapCodes' has created a pull request

[jira] [Created] (SPARK-20026) Document R GLM Tweedie family support in programming guide and code example

2017-03-19 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-20026: Summary: Document R GLM Tweedie family support in programming guide and code example Key: SPARK-20026 URL: https://issues.apache.org/jira/browse/SPARK-20026 Project:

[jira] [Created] (SPARK-20025) Driver fail over will not work, if SPARK_LOCAL* env is set.

2017-03-19 Thread Prashant Sharma (JIRA)
Prashant Sharma created SPARK-20025: --- Summary: Driver fail over will not work, if SPARK_LOCAL* env is set. Key: SPARK-20025 URL: https://issues.apache.org/jira/browse/SPARK-20025 Project: Spark

[jira] [Commented] (SPARK-20020) SparkR should support checkpointing DataFrame

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932186#comment-15932186 ] Apache Spark commented on SPARK-20020: -- User 'felixcheung' has created a pull reques

[jira] [Resolved] (SPARK-19849) Support ArrayType in to_json function/expression

2017-03-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-19849. -- Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.2.0 Targe

[jira] [Updated] (SPARK-19968) Use a cached instance of KafkaProducer for writing to kafka via KafkaSink.

2017-03-19 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Sharma updated SPARK-19968: Description: KafkaProducer is thread safe and an instance can be reused for writing every

[jira] [Resolved] (SPARK-12124) Spark Sql MongoDB Cross Join Not Working

2017-03-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12124. -- Resolution: Invalid The error messages originate from {{MongoRecordReader}} which is a thirdpar

[jira] [Assigned] (SPARK-19955) Update run-tests to support conda

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19955: Assignee: (was: Apache Spark) > Update run-tests to support conda > --

[jira] [Commented] (SPARK-19955) Update run-tests to support conda

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932160#comment-15932160 ] Apache Spark commented on SPARK-19955: -- User 'holdenk' has created a pull request fo

[jira] [Assigned] (SPARK-19955) Update run-tests to support conda

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19955: Assignee: Apache Spark > Update run-tests to support conda > -

[jira] [Commented] (SPARK-10109) NPE when saving Parquet To HDFS

2017-03-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932154#comment-15932154 ] Hyukjin Kwon commented on SPARK-10109: -- Then, I think this should be resolvable when

[jira] [Assigned] (SPARK-20024) SessionCatalog API setCurrentDatabase need to set the current database of ExternalCatalog

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20024: Assignee: Xiao Li (was: Apache Spark) > SessionCatalog API setCurrentDatabase need to set

[jira] [Commented] (SPARK-20024) SessionCatalog API setCurrentDatabase need to set the current database of ExternalCatalog

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932152#comment-15932152 ] Apache Spark commented on SPARK-20024: -- User 'gatorsmile' has created a pull request

[jira] [Assigned] (SPARK-20024) SessionCatalog API setCurrentDatabase need to set the current database of ExternalCatalog

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20024: Assignee: Apache Spark (was: Xiao Li) > SessionCatalog API setCurrentDatabase need to set

[jira] [Created] (SPARK-20024) SessionCatalog API setCurrentDatabase need to set the current database of ExternalCatalog

2017-03-19 Thread Xiao Li (JIRA)
Xiao Li created SPARK-20024: --- Summary: SessionCatalog API setCurrentDatabase need to set the current database of ExternalCatalog Key: SPARK-20024 URL: https://issues.apache.org/jira/browse/SPARK-20024 Proje

[jira] [Commented] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932135#comment-15932135 ] Xiao Li commented on SPARK-20023: - Sure, will take a look at it soon. > Can not see tab

[jira] [Commented] (SPARK-19988) Flaky Test: OrcSourceSuite SPARK-19459/SPARK-18220: read char/varchar column written by Hive

2017-03-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932133#comment-15932133 ] Xiao Li commented on SPARK-19988: - I might need to reopen it soon. We have a bug in the c

[jira] [Resolved] (SPARK-6072) Enable hash joins for null-safe equality predicates

2017-03-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-6072. - Resolution: Duplicate I take this as a soft-yes for resolving this. It seems SPARK-1 fixes the

[jira] [Updated] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-19 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-20023: - Description: Spark 2.x implements create table by itself. https://github.com/apache/spark/commit/

[jira] [Commented] (SPARK-17080) join reorder

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932117#comment-15932117 ] Apache Spark commented on SPARK-17080: -- User 'wzhfy' has created a pull request for

[jira] [Updated] (SPARK-20022) java.lang.OutOfMemoryError: Unable to acquire 4228 bytes of memory

2017-03-19 Thread Harish (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harish updated SPARK-20022: --- Description: I am getting below error in 2.0.2. Any help? or work around? WARN TaskSetManager: Lost task 34.

[jira] [Updated] (SPARK-20022) java.lang.OutOfMemoryError: Unable to acquire 4228 bytes of memory

2017-03-19 Thread Harish (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harish updated SPARK-20022: --- Description: I am getting below error in 2.0.2. Any help? WARN TaskSetManager: Lost task 34.0 in stage 2007.

[jira] [Commented] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-19 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932096#comment-15932096 ] Zhenhua Wang commented on SPARK-20023: -- Can you please take a look at this? [~smileg

[jira] [Commented] (SPARK-14388) Create Table

2017-03-19 Thread chenerlu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932094#comment-15932094 ] chenerlu commented on SPARK-14388: -- Spark 2.x implements create table by itself. https:/

[jira] [Commented] (SPARK-19941) Spark should not schedule tasks on executors on decommissioning YARN nodes

2017-03-19 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932088#comment-15932088 ] Saisai Shao commented on SPARK-19941: - I think this scenario is quite similar to cont

[jira] [Updated] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-19 Thread chenerlu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chenerlu updated SPARK-20023: - Description: Spark 2.x implements create table by itself. https://github.com/apache/spark/commit/7d2ed8cc

[jira] [Created] (SPARK-20023) Can not see table comment when describe formatted table

2017-03-19 Thread chenerlu (JIRA)
chenerlu created SPARK-20023: Summary: Can not see table comment when describe formatted table Key: SPARK-20023 URL: https://issues.apache.org/jira/browse/SPARK-20023 Project: Spark Issue Type: B

[jira] [Created] (SPARK-20022) java.lang.OutOfMemoryError: Unable to acquire 4228 bytes of memory

2017-03-19 Thread Harish (JIRA)
Harish created SPARK-20022: -- Summary: java.lang.OutOfMemoryError: Unable to acquire 4228 bytes of memory Key: SPARK-20022 URL: https://issues.apache.org/jira/browse/SPARK-20022 Project: Spark Issue

[jira] [Commented] (SPARK-20016) SparkLauncher submit job failed after setConf with special charaters under windows

2017-03-19 Thread Vincent Sun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932070#comment-15932070 ] Vincent Sun commented on SPARK-20016: - Thanks for comments. Have updated more details

[jira] [Updated] (SPARK-20016) SparkLauncher submit job failed when setConf with special charaters under windows

2017-03-19 Thread Vincent Sun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vincent Sun updated SPARK-20016: Description: I am using sparkLauncher JAVA API to submit job to a remote spark cluster master. Co

[jira] [Updated] (SPARK-20016) SparkLauncher submit job failed after setConf with special charaters under windows

2017-03-19 Thread Vincent Sun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vincent Sun updated SPARK-20016: Summary: SparkLauncher submit job failed after setConf with special charaters under windows (was:

[jira] [Assigned] (SPARK-20021) Miss backslash in python code

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20021: Assignee: Apache Spark > Miss backslash in python code > - > >

[jira] [Assigned] (SPARK-20021) Miss backslash in python code

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20021: Assignee: (was: Apache Spark) > Miss backslash in python code > --

[jira] [Commented] (SPARK-20021) Miss backslash in python code

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932066#comment-15932066 ] Apache Spark commented on SPARK-20021: -- User 'uncleGen' has created a pull request f

[jira] [Created] (SPARK-20021) Miss backslash in python code

2017-03-19 Thread Genmao Yu (JIRA)
Genmao Yu created SPARK-20021: - Summary: Miss backslash in python code Key: SPARK-20021 URL: https://issues.apache.org/jira/browse/SPARK-20021 Project: Spark Issue Type: Bug Components:

[jira] [Commented] (SPARK-19988) Flaky Test: OrcSourceSuite SPARK-19459/SPARK-18220: read char/varchar column written by Hive

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15932043#comment-15932043 ] Apache Spark commented on SPARK-19988: -- User 'gatorsmile' has created a pull request

[jira] [Resolved] (SPARK-19988) Flaky Test: OrcSourceSuite SPARK-19459/SPARK-18220: read char/varchar column written by Hive

2017-03-19 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-19988. Resolution: Fixed Assignee: Xiao Li Fix Version/s: 2.2.0 Resolved by https:

[jira] [Resolved] (SPARK-19067) mapGroupsWithState - arbitrary stateful operations with Structured Streaming (similar to DStream.mapWithState)

2017-03-19 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-19067. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17179 [https://g

[jira] [Commented] (SPARK-18886) Delay scheduling should not delay some executors indefinitely if one task is scheduled before delay timeout

2017-03-19 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931893#comment-15931893 ] Imran Rashid commented on SPARK-18886: -- Thanks Kay for the full description (and fin

[jira] [Commented] (SPARK-20020) SparkR should support checkpointing DataFrame

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931862#comment-15931862 ] Apache Spark commented on SPARK-20020: -- User 'felixcheung' has created a pull reques

[jira] [Assigned] (SPARK-20020) SparkR should support checkpointing DataFrame

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20020: Assignee: (was: Apache Spark) > SparkR should support checkpointing DataFrame > --

[jira] [Assigned] (SPARK-20020) SparkR should support checkpointing DataFrame

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20020: Assignee: Apache Spark > SparkR should support checkpointing DataFrame > -

[jira] [Created] (SPARK-20020) SparkR should support checkpointing DataFrame

2017-03-19 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-20020: Summary: SparkR should support checkpointing DataFrame Key: SPARK-20020 URL: https://issues.apache.org/jira/browse/SPARK-20020 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-20020) SparkR should support checkpointing DataFrame

2017-03-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-20020: - Description: As a user I want to be able to checkpoint DataFrame to run complex queries, iterati

[jira] [Updated] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2017-03-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18817: - Affects Version/s: 2.0.2 2.1.0 > Ensure nothing is written outside R's tem

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2017-03-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931848#comment-15931848 ] Felix Cheung commented on SPARK-18817: -- we have more discussions on the PR thread an

[jira] [Resolved] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2017-03-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18817. -- Resolution: Fixed Assignee: Felix Cheung Fix Version/s: 2.2.0

[jira] [Assigned] (SPARK-10764) Add optional caching to Pipelines

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10764: Assignee: (was: Apache Spark) > Add optional caching to Pipelines > --

[jira] [Assigned] (SPARK-19969) Doc and examples for Imputer

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19969: Assignee: (was: Apache Spark) > Doc and examples for Imputer > ---

[jira] [Assigned] (SPARK-19991) FileSegmentManagedBuffer performance improvement.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19991: Assignee: (was: Apache Spark) > FileSegmentManagedBuffer performance improvement. > --

[jira] [Commented] (SPARK-10764) Add optional caching to Pipelines

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931815#comment-15931815 ] Apache Spark commented on SPARK-10764: -- User 'sachintyagi22' has created a pull requ

[jira] [Commented] (SPARK-19969) Doc and examples for Imputer

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931812#comment-15931812 ] Apache Spark commented on SPARK-19969: -- User 'hhbyyh' has created a pull request for

[jira] [Commented] (SPARK-19975) Add map_keys and map_values functions to Python

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931813#comment-15931813 ] Apache Spark commented on SPARK-19975: -- User 'yongtang' has created a pull request f

[jira] [Assigned] (SPARK-19995) Using real user to connect HiveMetastore in HiveClientImpl

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19995: Assignee: (was: Apache Spark) > Using real user to connect HiveMetastore in HiveClient

[jira] [Assigned] (SPARK-19803) Flaky BlockManagerProactiveReplicationSuite tests

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19803: Assignee: Apache Spark (was: Shubham Chopra) > Flaky BlockManagerProactiveReplicationSuit

[jira] [Assigned] (SPARK-19803) Flaky BlockManagerProactiveReplicationSuite tests

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19803: Assignee: Shubham Chopra (was: Apache Spark) > Flaky BlockManagerProactiveReplicationSuit

[jira] [Commented] (SPARK-19997) proxy-user failed connecting to a kerberos configured metastore

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931816#comment-15931816 ] Apache Spark commented on SPARK-19997: -- User 'yaooqinn' has created a pull request f

[jira] [Assigned] (SPARK-19979) [MLLIB] Multiple Estimators/Pipelines In CrossValidator

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19979: Assignee: Apache Spark > [MLLIB] Multiple Estimators/Pipelines In CrossValidator > ---

[jira] [Commented] (SPARK-19995) Using real user to connect HiveMetastore in HiveClientImpl

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931817#comment-15931817 ] Apache Spark commented on SPARK-19995: -- User 'jerryshao' has created a pull request

[jira] [Commented] (SPARK-20010) Sort information is lost after sort merge join

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931819#comment-15931819 ] Apache Spark commented on SPARK-20010: -- User 'wzhfy' has created a pull request for

[jira] [Commented] (SPARK-19973) StagePage should display the number of executors.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931809#comment-15931809 ] Apache Spark commented on SPARK-19973: -- User 'jinxing64' has created a pull request

[jira] [Assigned] (SPARK-20010) Sort information is lost after sort merge join

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20010: Assignee: (was: Apache Spark) > Sort information is lost after sort merge join > -

[jira] [Commented] (SPARK-20003) FPGrowthModel setMinConfidence should affect rules generation and transform

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931818#comment-15931818 ] Apache Spark commented on SPARK-20003: -- User 'hhbyyh' has created a pull request for

[jira] [Assigned] (SPARK-19973) StagePage should display the number of executors.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19973: Assignee: Apache Spark > StagePage should display the number of executors. > -

[jira] [Commented] (SPARK-19991) FileSegmentManagedBuffer performance improvement.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931814#comment-15931814 ] Apache Spark commented on SPARK-19991: -- User 'witgo' has created a pull request for

[jira] [Assigned] (SPARK-19969) Doc and examples for Imputer

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19969: Assignee: Apache Spark > Doc and examples for Imputer > > >

[jira] [Assigned] (SPARK-20003) FPGrowthModel setMinConfidence should affect rules generation and transform

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20003: Assignee: (was: Apache Spark) > FPGrowthModel setMinConfidence should affect rules gen

[jira] [Assigned] (SPARK-19985) Some ML Models error when copy or do not set parent

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19985: Assignee: Apache Spark > Some ML Models error when copy or do not set parent > ---

[jira] [Assigned] (SPARK-19995) Using real user to connect HiveMetastore in HiveClientImpl

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19995: Assignee: Apache Spark > Using real user to connect HiveMetastore in HiveClientImpl >

[jira] [Assigned] (SPARK-19973) StagePage should display the number of executors.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19973: Assignee: (was: Apache Spark) > StagePage should display the number of executors. > --

[jira] [Commented] (SPARK-19968) Use a cached instance of KafkaProducer for writing to kafka via KafkaSink.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931808#comment-15931808 ] Apache Spark commented on SPARK-19968: -- User 'ScrapCodes' has created a pull request

[jira] [Assigned] (SPARK-19899) FPGrowth input column naming

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19899: Assignee: Apache Spark > FPGrowth input column naming > > >

[jira] [Commented] (SPARK-15040) PySpark impl for ml.feature.Imputer

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931811#comment-15931811 ] Apache Spark commented on SPARK-15040: -- User 'MLnick' has created a pull request for

[jira] [Assigned] (SPARK-20010) Sort information is lost after sort merge join

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20010: Assignee: Apache Spark > Sort information is lost after sort merge join >

[jira] [Assigned] (SPARK-19899) FPGrowth input column naming

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19899: Assignee: (was: Apache Spark) > FPGrowth input column naming > ---

[jira] [Assigned] (SPARK-19991) FileSegmentManagedBuffer performance improvement.

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19991: Assignee: Apache Spark > FileSegmentManagedBuffer performance improvement. > -

[jira] [Assigned] (SPARK-19975) Add map_keys and map_values functions to Python

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19975: Assignee: (was: Apache Spark) > Add map_keys and map_values functions to Python > --

[jira] [Assigned] (SPARK-19985) Some ML Models error when copy or do not set parent

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19985: Assignee: (was: Apache Spark) > Some ML Models error when copy or do not set parent >

[jira] [Assigned] (SPARK-20003) FPGrowthModel setMinConfidence should affect rules generation and transform

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20003: Assignee: Apache Spark > FPGrowthModel setMinConfidence should affect rules generation and

[jira] [Assigned] (SPARK-10764) Add optional caching to Pipelines

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10764: Assignee: Apache Spark > Add optional caching to Pipelines > -

[jira] [Assigned] (SPARK-19975) Add map_keys and map_values functions to Python

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19975: Assignee: Apache Spark > Add map_keys and map_values functions to Python > -

[jira] [Commented] (SPARK-19979) [MLLIB] Multiple Estimators/Pipelines In CrossValidator

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931807#comment-15931807 ] Apache Spark commented on SPARK-19979: -- User 'leifker' has created a pull request fo

[jira] [Commented] (SPARK-19974) in-memory LRU for partitions of multiple RDDs testcase did a get that confuse me

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931810#comment-15931810 ] Apache Spark commented on SPARK-19974: -- User 'jianran' has created a pull request fo

[jira] [Assigned] (SPARK-19979) [MLLIB] Multiple Estimators/Pipelines In CrossValidator

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19979: Assignee: (was: Apache Spark) > [MLLIB] Multiple Estimators/Pipelines In CrossValidato

[jira] [Assigned] (SPARK-19486) Investigate using multiple threads for task serialization

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19486: Assignee: (was: Apache Spark) > Investigate using multiple threads for task serializat

[jira] [Commented] (SPARK-19486) Investigate using multiple threads for task serialization

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931806#comment-15931806 ] Apache Spark commented on SPARK-19486: -- User 'witgo' has created a pull request for

[jira] [Assigned] (SPARK-19486) Investigate using multiple threads for task serialization

2017-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19486: Assignee: Apache Spark > Investigate using multiple threads for task serialization > -

[jira] [Commented] (SPARK-20004) Spark thrift server ovewrites spark.app.name

2017-03-19 Thread Egor Pahomov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15931800#comment-15931800 ] Egor Pahomov commented on SPARK-20004: -- [~srowen], 1.6 and 2.0 allowed to specify ap
