[jira] [Created] (SPARK-19793) Use clock.getTimeMillis when mark task as finished in TaskSetManager.

2017-03-02 Thread jin xing (JIRA)
jin xing created SPARK-19793: Summary: Use clock.getTimeMillis when mark task as finished in TaskSetManager. Key: SPARK-19793 URL: https://issues.apache.org/jira/browse/SPARK-19793 Project: Spark

[jira] [Created] (SPARK-19794) Release HDFS Client after read/write checkpoint

2017-03-02 Thread darion yaphet (JIRA)
darion yaphet created SPARK-19794: - Summary: Release HDFS Client after read/write checkpoint Key: SPARK-19794 URL: https://issues.apache.org/jira/browse/SPARK-19794 Project: Spark Issue Type:

[jira] [Commented] (SPARK-19793) Use clock.getTimeMillis when mark task as finished in TaskSetManager.

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15891924#comment-15891924 ] Apache Spark commented on SPARK-19793: -- User 'jinxing64' has created a pull request

[jira] [Assigned] (SPARK-19793) Use clock.getTimeMillis when mark task as finished in TaskSetManager.

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19793: Assignee: (was: Apache Spark) > Use clock.getTimeMillis when mark task as finished in

[jira] [Assigned] (SPARK-19793) Use clock.getTimeMillis when mark task as finished in TaskSetManager.

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19793: Assignee: Apache Spark > Use clock.getTimeMillis when mark task as finished in TaskSetMana

[jira] [Created] (SPARK-19795) R should support column functions to_json, from_json

2017-03-02 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-19795: Summary: R should support column functions to_json, from_json Key: SPARK-19795 URL: https://issues.apache.org/jira/browse/SPARK-19795 Project: Spark Issue Ty

[jira] [Assigned] (SPARK-19795) R should support column functions to_json, from_json

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19795: Assignee: Felix Cheung (was: Apache Spark) > R should support column functions to_json, f

[jira] [Commented] (SPARK-19795) R should support column functions to_json, from_json

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15891956#comment-15891956 ] Apache Spark commented on SPARK-19795: -- User 'felixcheung' has created a pull reques

[jira] [Assigned] (SPARK-19794) Release HDFS Client after read/write checkpoint

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19794: Assignee: (was: Apache Spark) > Release HDFS Client after read/write checkpoint >

[jira] [Assigned] (SPARK-19795) R should support column functions to_json, from_json

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19795: Assignee: Apache Spark (was: Felix Cheung) > R should support column functions to_json, f

[jira] [Assigned] (SPARK-19794) Release HDFS Client after read/write checkpoint

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19794: Assignee: Apache Spark > Release HDFS Client after read/write checkpoint > ---

[jira] [Commented] (SPARK-19794) Release HDFS Client after read/write checkpoint

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15891957#comment-15891957 ] Apache Spark commented on SPARK-19794: -- User 'darionyaphet' has created a pull reque

[jira] [Commented] (SPARK-19659) Fetch big blocks to disk when shuffle-read

2017-03-02 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15891999#comment-15891999 ] jin xing commented on SPARK-19659: -- [~rxin] [~davies] [~andrewor14] [~joshrosen] I've u

[jira] [Comment Edited] (SPARK-19659) Fetch big blocks to disk when shuffle-read

2017-03-02 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15884789#comment-15884789 ] jin xing edited comment on SPARK-19659 at 3/2/17 10:21 AM: --- [~i

[jira] [Created] (SPARK-19796) taskScheduler fails serializing long statements received by thrift server

2017-03-02 Thread Giambattista (JIRA)
Giambattista created SPARK-19796: Summary: taskScheduler fails serializing long statements received by thrift server Key: SPARK-19796 URL: https://issues.apache.org/jira/browse/SPARK-19796 Project: Sp

[jira] [Commented] (SPARK-17931) taskScheduler has some unneeded serialization

2017-03-02 Thread Giambattista (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892016#comment-15892016 ] Giambattista commented on SPARK-17931: -- Thanks, I just opened SPARK-19796 and added

[jira] [Resolved] (SPARK-19733) ALS performs unnecessary casting on item and user ids

2017-03-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-19733. Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17059 [https:/

[jira] [Assigned] (SPARK-19733) ALS performs unnecessary casting on item and user ids

2017-03-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath reassigned SPARK-19733: -- Assignee: Vasilis Vryniotis > ALS performs unnecessary casting on item and user ids >

[jira] [Resolved] (SPARK-19704) AFTSurvivalRegression should support numeric censorCol

2017-03-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-19704. Resolution: Fixed > AFTSurvivalRegression should support numeric censorCol > --

[jira] [Assigned] (SPARK-19704) AFTSurvivalRegression should support numeric censorCol

2017-03-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath reassigned SPARK-19704: -- Assignee: zhengruifeng > AFTSurvivalRegression should support numeric censorCol >

[jira] [Updated] (SPARK-19704) AFTSurvivalRegression should support numeric censorCol

2017-03-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-19704: --- Fix Version/s: 2.2.0 > AFTSurvivalRegression should support numeric censorCol > -

[jira] [Created] (SPARK-19797) ML pipelines document error

2017-03-02 Thread Zhe Sun (JIRA)
Zhe Sun created SPARK-19797: --- Summary: ML pipelines document error Key: SPARK-19797 URL: https://issues.apache.org/jira/browse/SPARK-19797 Project: Spark Issue Type: Bug Components: ML

[jira] [Commented] (SPARK-19783) Treat shorter/longer lengths of tokens as malformed records in CSV parser

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892146#comment-15892146 ] Apache Spark commented on SPARK-19783: -- User 'maropu' has created a pull request for

[jira] [Assigned] (SPARK-19783) Treat shorter/longer lengths of tokens as malformed records in CSV parser

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19783: Assignee: (was: Apache Spark) > Treat shorter/longer lengths of tokens as malformed re

[jira] [Assigned] (SPARK-19783) Treat shorter/longer lengths of tokens as malformed records in CSV parser

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19783: Assignee: Apache Spark > Treat shorter/longer lengths of tokens as malformed records in CS

[jira] [Commented] (SPARK-19797) ML pipelines document error

2017-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892149#comment-15892149 ] Sean Owen commented on SPARK-19797: --- I don't think that's true. The resulting pipeline

[jira] [Commented] (SPARK-19797) ML pipelines document error

2017-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892157#comment-15892157 ] Sean Owen commented on SPARK-19797: --- Hm, on second look, the placement of the sentence

[jira] [Resolved] (SPARK-19778) alais cannot use in group by

2017-03-02 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19778. -- Resolution: Duplicate I am resolving this as a duplicate of SPARK-14471 Please reopen this if

[jira] [Commented] (SPARK-19503) Execution Plan Optimizer: avoid sort or shuffle when it does not change end result such as df.sort(...).count()

2017-03-02 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892169#comment-15892169 ] Takeshi Yamamuro commented on SPARK-19503: -- I'm not sure this should be fixed th

[jira] [Comment Edited] (SPARK-19503) Execution Plan Optimizer: avoid sort or shuffle when it does not change end result such as df.sort(...).count()

2017-03-02 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892169#comment-15892169 ] Takeshi Yamamuro edited comment on SPARK-19503 at 3/2/17 12:39 PM:

[jira] [Commented] (SPARK-19797) ML pipelines document error

2017-03-02 Thread Zhe Sun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892170#comment-15892170 ] Zhe Sun commented on SPARK-19797: - A pull request was created https://github.com/apache/s

[jira] [Commented] (SPARK-19797) ML pipelines document error

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892168#comment-15892168 ] Apache Spark commented on SPARK-19797: -- User 'ymwdalex' has created a pull request f

[jira] [Assigned] (SPARK-19797) ML pipelines document error

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19797: Assignee: Apache Spark > ML pipelines document error > --- > >

[jira] [Assigned] (SPARK-19797) ML pipelines document error

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19797: Assignee: (was: Apache Spark) > ML pipelines document error >

[jira] [Commented] (SPARK-19797) ML pipelines document error

2017-03-02 Thread Zhe Sun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892189#comment-15892189 ] Zhe Sun commented on SPARK-19797: - Hi Sean, thanks for your quick reply. bq. If the Pip

[jira] [Comment Edited] (SPARK-19797) ML pipelines document error

2017-03-02 Thread Zhe Sun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892189#comment-15892189 ] Zhe Sun edited comment on SPARK-19797 at 3/2/17 12:52 PM: -- Hi Se

[jira] [Commented] (SPARK-19797) ML pipelines document error

2017-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892216#comment-15892216 ] Sean Owen commented on SPARK-19797: --- Yes, it's not true of scoring though, and the diff

[jira] [Updated] (SPARK-19345) Add doc for "coldStartStrategy" usage in ALS

2017-03-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-19345: --- Priority: Minor (was: Major) > Add doc for "coldStartStrategy" usage in ALS > --

[jira] [Resolved] (SPARK-19345) Add doc for "coldStartStrategy" usage in ALS

2017-03-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-19345. Resolution: Fixed Fix Version/s: 2.2.0 > Add doc for "coldStartStrategy" usage in AL

[jira] [Commented] (SPARK-18769) Spark to be smarter about what the upper bound is and to restrict number of executor when dynamic allocation is enabled

2017-03-02 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892318#comment-15892318 ] Thomas Graves commented on SPARK-18769: --- I definitely understand there is an actual

[jira] [Created] (SPARK-19798) Query returns stale results when tables are modified on other sessions

2017-03-02 Thread Giambattista (JIRA)
Giambattista created SPARK-19798: Summary: Query returns stale results when tables are modified on other sessions Key: SPARK-19798 URL: https://issues.apache.org/jira/browse/SPARK-19798 Project: Spark

[jira] [Assigned] (SPARK-17080) join reorder

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17080: Assignee: (was: Apache Spark) > join reorder > > > Key: S

[jira] [Assigned] (SPARK-17080) join reorder

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17080: Assignee: Apache Spark > join reorder > > > Key: SPARK-17080

[jira] [Commented] (SPARK-17080) join reorder

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892325#comment-15892325 ] Apache Spark commented on SPARK-17080: -- User 'wzhfy' has created a pull request for

[jira] [Commented] (SPARK-18890) Do all task serialization in CoarseGrainedExecutorBackend thread (rather than TaskSchedulerImpl)

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892363#comment-15892363 ] Apache Spark commented on SPARK-18890: -- User 'witgo' has created a pull request for

[jira] [Updated] (SPARK-19796) taskScheduler fails serializing long statements received by thrift server

2017-03-02 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-19796: - Priority: Blocker (was: Major) > taskScheduler fails serializing long statements received by thr

[jira] [Commented] (SPARK-19796) taskScheduler fails serializing long statements received by thrift server

2017-03-02 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892390#comment-15892390 ] Imran Rashid commented on SPARK-19796: -- Since it's a regression, I'm making this a bl

[jira] [Created] (SPARK-19799) Support WITH clause in subqueries

2017-03-02 Thread Giambattista (JIRA)
Giambattista created SPARK-19799: Summary: Support WITH clause in subqueries Key: SPARK-19799 URL: https://issues.apache.org/jira/browse/SPARK-19799 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-19796) taskScheduler fails serializing long statements received by thrift server

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19796: Assignee: (was: Apache Spark) > taskScheduler fails serializing long statements receiv

[jira] [Commented] (SPARK-19796) taskScheduler fails serializing long statements received by thrift server

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892482#comment-15892482 ] Apache Spark commented on SPARK-19796: -- User 'squito' has created a pull request for

[jira] [Assigned] (SPARK-19796) taskScheduler fails serializing long statements received by thrift server

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19796: Assignee: Apache Spark > taskScheduler fails serializing long statements received by thrif

[jira] [Updated] (SPARK-19766) INNER JOIN on constant alias columns return incorrect results

2017-03-02 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-19766: Fix Version/s: 2.0.3 > INNER JOIN on constant alias columns return incorrect results >

[jira] [Commented] (SPARK-19796) taskScheduler fails serializing long statements received by thrift server

2017-03-02 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892547#comment-15892547 ] Imran Rashid commented on SPARK-19796: -- [~kayousterhout] [~shivaram] here's another

[jira] [Created] (SPARK-19800) Implement one kind of streaming sampling - reservoir sampling

2017-03-02 Thread Genmao Yu (JIRA)
Genmao Yu created SPARK-19800: - Summary: Implement one kind of streaming sampling - reservoir sampling Key: SPARK-19800 URL: https://issues.apache.org/jira/browse/SPARK-19800 Project: Spark Issu

[jira] [Assigned] (SPARK-19800) Implement one kind of streaming sampling - reservoir sampling

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19800: Assignee: Apache Spark > Implement one kind of streaming sampling - reservoir sampling > -

[jira] [Commented] (SPARK-19800) Implement one kind of streaming sampling - reservoir sampling

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892571#comment-15892571 ] Apache Spark commented on SPARK-19800: -- User 'uncleGen' has created a pull request f

[jira] [Assigned] (SPARK-19800) Implement one kind of streaming sampling - reservoir sampling

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19800: Assignee: (was: Apache Spark) > Implement one kind of streaming sampling - reservoir s

[jira] [Commented] (SPARK-18699) Spark CSV parsing types other than String throws exception when malformed

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892679#comment-15892679 ] Apache Spark commented on SPARK-18699: -- User 'HyukjinKwon' has created a pull reques

[jira] [Commented] (SPARK-11197) Run SQL query on files directly without create a table

2017-03-02 Thread Ladislav Jech (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892706#comment-15892706 ] Ladislav Jech commented on SPARK-11197: --- Great stuff! > Run SQL query on files dire

[jira] [Resolved] (SPARK-19720) Redact sensitive information from SparkSubmit console output

2017-03-02 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-19720. Resolution: Fixed Assignee: Mark Grover Fix Version/s: 2.2.0 > Redact sensi

[jira] [Created] (SPARK-19801) Remove JDK7 from Travis CI

2017-03-02 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-19801: - Summary: Remove JDK7 from Travis CI Key: SPARK-19801 URL: https://issues.apache.org/jira/browse/SPARK-19801 Project: Spark Issue Type: Bug Compon

[jira] [Assigned] (SPARK-19801) Remove JDK7 from Travis CI

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19801: Assignee: Apache Spark > Remove JDK7 from Travis CI > -- > >

[jira] [Assigned] (SPARK-19801) Remove JDK7 from Travis CI

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19801: Assignee: (was: Apache Spark) > Remove JDK7 from Travis CI > -

[jira] [Commented] (SPARK-19801) Remove JDK7 from Travis CI

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15892817#comment-15892817 ] Apache Spark commented on SPARK-19801: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Created] (SPARK-19802) Remote History Server

2017-03-02 Thread Ben Barnard (JIRA)
Ben Barnard created SPARK-19802: --- Summary: Remote History Server Key: SPARK-19802 URL: https://issues.apache.org/jira/browse/SPARK-19802 Project: Spark Issue Type: Improvement Compone

[jira] [Created] (SPARK-19803) Flaky BlockManagerProactiveReplicationSuite tests

2017-03-02 Thread Sital Kedia (JIRA)
Sital Kedia created SPARK-19803: --- Summary: Flaky BlockManagerProactiveReplicationSuite tests Key: SPARK-19803 URL: https://issues.apache.org/jira/browse/SPARK-19803 Project: Spark Issue Type: B

[jira] [Comment Edited] (SPARK-1693) Dependent on multiple versions of servlet-api jars lead to throw an SecurityException when Spark built for hadoop 2.3.0 , 2.4.0

2017-03-02 Thread Andrew Otto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893074#comment-15893074 ] Andrew Otto edited comment on SPARK-1693 at 3/2/17 9:42 PM: We

[jira] [Commented] (SPARK-1693) Dependent on multiple versions of servlet-api jars lead to throw an SecurityException when Spark built for hadoop 2.3.0 , 2.4.0

2017-03-02 Thread Andrew Otto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893074#comment-15893074 ] Andrew Otto commented on SPARK-1693: We just upgraded to CDH 5.10, which has Spark 1.6

[jira] [Commented] (SPARK-18454) Changes to improve Nearest Neighbor Search for LSH

2017-03-02 Thread Mingjie Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893087#comment-15893087 ] Mingjie Tang commented on SPARK-18454: -- [~yunn] the current multi-probe NNS can be i

[jira] [Commented] (SPARK-19771) Support OR-AND amplification in Locality Sensitive Hashing (LSH)

2017-03-02 Thread Yun Ni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893097#comment-15893097 ] Yun Ni commented on SPARK-19771: [~merlin] (1) The computation cost is NumHashFunctions

[jira] [Comment Edited] (SPARK-19771) Support OR-AND amplification in Locality Sensitive Hashing (LSH)

2017-03-02 Thread Yun Ni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893097#comment-15893097 ] Yun Ni edited comment on SPARK-19771 at 3/2/17 9:55 PM: [~merlin]

[jira] [Commented] (SPARK-19771) Support OR-AND amplification in Locality Sensitive Hashing (LSH)

2017-03-02 Thread Mingjie Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893117#comment-15893117 ] Mingjie Tang commented on SPARK-19771: -- (1) because you need to explode each tuple.

[jira] [Commented] (SPARK-18113) Sending AskPermissionToCommitOutput failed, driver enter into task deadloop

2017-03-02 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893222#comment-15893222 ] Andrew Ash commented on SPARK-18113: We discovered another bug related to committing

[jira] [Resolved] (SPARK-19631) OutputCommitCoordinator should not allow commits for already failed tasks

2017-03-02 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-19631. Resolution: Fixed Fix Version/s: 2.2.0 > OutputCommitCoordinator should not allow co

[jira] [Assigned] (SPARK-19631) OutputCommitCoordinator should not allow commits for already failed tasks

2017-03-02 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout reassigned SPARK-19631: -- Assignee: Patrick Woody > OutputCommitCoordinator should not allow commits for already

[jira] [Commented] (SPARK-19796) taskScheduler fails serializing long statements received by thrift server

2017-03-02 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893376#comment-15893376 ] Kay Ousterhout commented on SPARK-19796: Do you think we should (separately) fix

[jira] [Created] (SPARK-19804) HiveClientImpl does not work with Hive 2.2.0 metastore

2017-03-02 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-19804: -- Summary: HiveClientImpl does not work with Hive 2.2.0 metastore Key: SPARK-19804 URL: https://issues.apache.org/jira/browse/SPARK-19804 Project: Spark Is

[jira] [Resolved] (SPARK-19276) FetchFailures can be hidden by user (or sql) exception handling

2017-03-02 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-19276. Resolution: Fixed Assignee: Imran Rashid Fix Version/s: 2.2.0 > FetchFailur

[jira] [Commented] (SPARK-19771) Support OR-AND amplification in Locality Sensitive Hashing (LSH)

2017-03-02 Thread Yun Ni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893508#comment-15893508 ] Yun Ni commented on SPARK-19771: [~merlin] What you are suggesting is to hash each AND ha

[jira] [Resolved] (SPARK-19750) Spark UI http -> https redirect error

2017-03-02 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-19750. Resolution: Fixed Assignee: Saisai Shao Fix Version/s: 2.1.1

[jira] [Closed] (SPARK-19349) Check resource ready to avoid multiple receivers to be scheduled on the same node.

2017-03-02 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Genmao Yu closed SPARK-19349. - Resolution: Won't Fix > Check resource ready to avoid multiple receivers to be scheduled on the same > n

[jira] [Commented] (SPARK-14698) CREATE FUNCTION cloud not add function to hive metastore

2017-03-02 Thread poseidon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893565#comment-15893565 ] poseidon commented on SPARK-14698: -- [~azeroth2b] I think in spark 1.6.1, author do it o

[jira] [Commented] (SPARK-19804) HiveClientImpl does not work with Hive 2.2.0 metastore

2017-03-02 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893579#comment-15893579 ] Marcelo Vanzin commented on SPARK-19804: For posterity, the error you get looks l

[jira] [Commented] (SPARK-19802) Remote History Server

2017-03-02 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893580#comment-15893580 ] Saisai Shao commented on SPARK-19802: - Spark's {{ApplicationHistoryProvider}} is plug

[jira] [Commented] (SPARK-19796) taskScheduler fails serializing long statements received by thrift server

2017-03-02 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893584#comment-15893584 ] Mridul Muralidharan commented on SPARK-19796: - I would not prefer (b) - if w

[jira] [Commented] (SPARK-18608) Spark ML algorithms that check RDD cache level for internal caching double-cache data

2017-03-02 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893596#comment-15893596 ] zhengruifeng commented on SPARK-18608: -- [~mlnick] [~yuhaoyan] [~srowen] I think if w

[jira] [Commented] (SPARK-19803) Flaky BlockManagerProactiveReplicationSuite tests

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893640#comment-15893640 ] Apache Spark commented on SPARK-19803: -- User 'uncleGen' has created a pull request f

[jira] [Assigned] (SPARK-19803) Flaky BlockManagerProactiveReplicationSuite tests

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19803: Assignee: (was: Apache Spark) > Flaky BlockManagerProactiveReplicationSuite tests > --

[jira] [Assigned] (SPARK-19803) Flaky BlockManagerProactiveReplicationSuite tests

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19803: Assignee: Apache Spark > Flaky BlockManagerProactiveReplicationSuite tests > -

[jira] [Resolved] (SPARK-19745) SVCAggregator serializes coefficients

2017-03-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-19745. - Resolution: Fixed Fix Version/s: 2.2.0 > SVCAggregator serializes coefficients > -

[jira] [Commented] (SPARK-15474) ORC data source fails to write and read back empty dataframe

2017-03-02 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893691#comment-15893691 ] Hyukjin Kwon commented on SPARK-15474: -- It seems an issue related with Hive's {{OrcO

[jira] [Created] (SPARK-19805) Log the row type when query result dose match

2017-03-02 Thread Genmao Yu (JIRA)
Genmao Yu created SPARK-19805: - Summary: Log the row type when query result dose match Key: SPARK-19805 URL: https://issues.apache.org/jira/browse/SPARK-19805 Project: Spark Issue Type: Improveme

[jira] [Assigned] (SPARK-19805) Log the row type when query result dose match

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19805: Assignee: Apache Spark > Log the row type when query result dose match > -

[jira] [Commented] (SPARK-19805) Log the row type when query result dose match

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893693#comment-15893693 ] Apache Spark commented on SPARK-19805: -- User 'uncleGen' has created a pull request f

[jira] [Assigned] (SPARK-19805) Log the row type when query result dose match

2017-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19805: Assignee: (was: Apache Spark) > Log the row type when query result dose match > --

[jira] [Commented] (SPARK-15474) ORC data source fails to write and read back empty dataframe

2017-03-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893703#comment-15893703 ] Nicholas Chammas commented on SPARK-15474: -- cc [~owen.omalley] > ORC data sour

[jira] [Commented] (SPARK-10294) When Parquet writer's close method throws an exception, we will call close again and trigger a NPE

2017-03-02 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893706#comment-15893706 ] Hyukjin Kwon commented on SPARK-10294: -- Hi [~yhuai], it seems this issue refers PARQ

[jira] [Commented] (SPARK-10294) When Parquet writer's close method throws an exception, we will call close again and trigger a NPE

2017-03-02 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893709#comment-15893709 ] Hyukjin Kwon commented on SPARK-10294: -- Maybe, we could resolve this as a duplicate

[jira] [Commented] (SPARK-15474) ORC data source fails to write and read back empty dataframe

2017-03-02 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15893713#comment-15893713 ] Hyukjin Kwon commented on SPARK-15474: -- Let me leave some pointer - https://github.

[jira] [Created] (SPARK-19806) PySpark GLR supports tweedie distribution

2017-03-02 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-19806: --- Summary: PySpark GLR supports tweedie distribution Key: SPARK-19806 URL: https://issues.apache.org/jira/browse/SPARK-19806 Project: Spark Issue Type: Improveme
