[jira] [Commented] (SPARK-9499) Possible file handle leak in spilling/sort code

2015-08-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651932#comment-14651932 ] Herman van Hovell commented on SPARK-9499: -- I have also tried

[jira] [Comment Edited] (SPARK-9499) Possible file handle leak in spilling/sort code

2015-08-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651932#comment-14651932 ] Herman van Hovell edited comment on SPARK-9499 at 8/3/15 2:46 PM:

[jira] [Updated] (SPARK-9499) Possible file handle leak in spilling/sort code

2015-08-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-9499: - Attachment: open.files.II.txt {{lsof}} with {{spark.shuffle.sort.bypassMergeThreshold=0}}

[jira] [Commented] (SPARK-9499) Possible file handle leak in spilling/sort code

2015-08-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695148#comment-14695148 ] Herman van Hovell commented on SPARK-9499: -- This has been fixed by the PR for

[jira] [Commented] (SPARK-9594) Failed to get broadcast_33_piece0 while using Accumulators in UDF

2015-08-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695202#comment-14695202 ] Herman van Hovell commented on SPARK-9594: -- This is more of a question for the

[jira] [Resolved] (SPARK-9594) Failed to get broadcast_33_piece0 while using Accumulators in UDF

2015-08-17 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-9594. -- Resolution: Not A Problem Failed to get broadcast_33_piece0 while using Accumulators

[jira] [Commented] (SPARK-10043) Add window functions into SparkR

2015-08-17 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699555#comment-14699555 ] Herman van Hovell commented on SPARK-10043: --- Could you provide an example in

[jira] [Commented] (SPARK-9999) RDD-like API on top of Catalyst/DataFrame

2015-08-17 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699512#comment-14699512 ] Herman van Hovell commented on SPARK-9999: -- This sounds interesting. In order to

[jira] [Commented] (SPARK-10043) Add window functions into SparkR

2015-08-17 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699552#comment-14699552 ] Herman van Hovell commented on SPARK-10043: --- Which window functions aren't

[jira] [Commented] (SPARK-9357) Remove JoinedRow

2015-08-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14703267#comment-14703267 ] Herman van Hovell commented on SPARK-9357: -- {{JoinedRow}} adds branches and a

[jira] [Updated] (SPARK-10100) AggregateFunction2's Max is slower than AggregateExpression1's MaxFunction

2015-08-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-10100: -- Attachment: SPARK-10100.perf.test.scala [~yhuai] I did some benchmarking today

[jira] [Commented] (SPARK-10100) AggregateFunction2's Max is slower than AggregateExpression1's MaxFunction

2015-08-18 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14702344#comment-14702344 ] Herman van Hovell commented on SPARK-10100: --- PR is in. AggregateFunction2's

[jira] [Commented] (SPARK-10100) AggregateFunction2's Max is slower than AggregateExpression1's MaxFunction

2015-08-20 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14704742#comment-14704742 ] Herman van Hovell commented on SPARK-10100: --- Let's leave it for 1.6.

[jira] [Commented] (SPARK-9357) Remove JoinedRow

2015-08-20 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14704815#comment-14704815 ] Herman van Hovell commented on SPARK-9357: -- [~chenghao] This summarizes the pros

[jira] [Commented] (SPARK-10100) AggregateFunction2's Max is slower than AggregateExpression1's MaxFunction

2015-08-18 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14702252#comment-14702252 ] Herman van Hovell commented on SPARK-10100: --- Any idea why? JoinedRow?

[jira] [Commented] (SPARK-10100) Eliminate hash table lookup if there is no grouping key in aggregation.

2015-08-20 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14705161#comment-14705161 ] Herman van Hovell commented on SPARK-10100: --- You are using a grouping key in

[jira] [Commented] (SPARK-8850) Turn unsafe mode on by default

2015-07-30 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14648714#comment-14648714 ] Herman van Hovell commented on SPARK-8850: -- Hi, I am getting a Too many open

[jira] [Updated] (SPARK-8850) Turn unsafe mode on by default

2015-07-30 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-8850: - Attachment: open Dump of all open files after a {{Too Many Files Open}} error. The

[jira] [Updated] (SPARK-8641) Native Spark Window Functions

2015-08-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-8641: - Target Version/s: 1.6.0 (was: 1.5.0) Native Spark Window Functions

[jira] [Updated] (SPARK-7712) Window Function Improvements

2015-08-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-7712: - Target Version/s: 1.5.0, 1.6.0 (was: 1.5.0) Window Function Improvements

[jira] [Updated] (SPARK-9499) Possible file handle leak in spilling/sort code

2015-08-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-9499: - Attachment: perf_test4.scala [~joshrosen], I have checked out the new master, and it

[jira] [Commented] (SPARK-9357) Remove JoinedRow

2015-08-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652390#comment-14652390 ] Herman van Hovell commented on SPARK-9357: -- +1 for removing this. The

[jira] [Created] (SPARK-9740) first/last aggregate NULL behavior

2015-08-07 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-9740: Summary: first/last aggregate NULL behavior Key: SPARK-9740 URL: https://issues.apache.org/jira/browse/SPARK-9740 Project: Spark Issue Type:

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2015-08-07 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14661859#comment-14661859 ] Herman van Hovell commented on SPARK-9740: -- BTW: I encountered this while doing

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2015-08-11 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14682405#comment-14682405 ] Herman van Hovell commented on SPARK-9740: -- I think Hive uses a Flag,

[jira] [Commented] (SPARK-9813) Incorrect UNION ALL behavior

2015-08-10 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14681246#comment-14681246 ] Herman van Hovell commented on SPARK-9813: -- So I am not too sure if we want to

[jira] [Updated] (SPARK-9740) first/last aggregate NULL behavior

2015-08-07 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-9740: - Target Version/s: 1.6.0 first/last aggregate NULL behavior

[jira] [Created] (SPARK-9741) approx count distinct function

2015-08-07 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-9741: Summary: approx count distinct function Key: SPARK-9741 URL: https://issues.apache.org/jira/browse/SPARK-9741 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-9740) first/last aggregate NULL behavior

2015-08-07 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-9740: - Affects Version/s: (was: 1.6.0) first/last aggregate NULL behavior

[jira] [Created] (SPARK-9742) NullPointerException when using --packages

2015-08-07 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-9742: Summary: NullPointerException when using --packages Key: SPARK-9742 URL: https://issues.apache.org/jira/browse/SPARK-9742 Project: Spark Issue Type:

[jira] [Commented] (SPARK-9978) Window functions require partitionBy to work as expected

2015-08-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14697642#comment-14697642 ] Herman van Hovell commented on SPARK-9978: -- There is a small bug in the Python

[jira] [Updated] (SPARK-9980) SBT publishLocal error due invalid characters in doc

2015-08-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-9980: - Summary: SBT publishLocal error due invalid characters in doc (was: SBT Unsafe

[jira] [Created] (SPARK-9980) SBT Unsafe publishLocal error due to '' in scaladoc

2015-08-14 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-9980: Summary: SBT Unsafe publishLocal error due to '' in scaladoc Key: SPARK-9980 URL: https://issues.apache.org/jira/browse/SPARK-9980 Project: Spark

[jira] [Commented] (SPARK-9980) SBT publishLocal error due invalid characters in doc

2015-08-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14697429#comment-14697429 ] Herman van Hovell commented on SPARK-9980: -- Also found an issue in the launcher.

[jira] [Updated] (SPARK-9980) SBT publishLocal error due to invalid characters in doc

2015-08-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-9980: - Summary: SBT publishLocal error due to invalid characters in doc (was: SBT publishLocal

[jira] [Commented] (SPARK-9760) SparkSubmit doesn't work with --packages when --repositories is not specified

2015-08-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698402#comment-14698402 ] Herman van Hovell commented on SPARK-9760: -- This has been fixed in SPARK-9760.

[jira] [Commented] (SPARK-9874) UnionAll operation on DataFrame doesn't check for column names

2015-08-12 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14693373#comment-14693373 ] Herman van Hovell commented on SPARK-9874: -- This is a duplicate of SPARK-9813.

[jira] [Commented] (SPARK-9813) Incorrect UNION ALL behavior

2015-08-12 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14693468#comment-14693468 ] Herman van Hovell commented on SPARK-9813: -- It turns out that the columns for the

[jira] [Comment Edited] (SPARK-9813) Incorrect UNION ALL behavior

2015-08-12 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14693468#comment-14693468 ] Herman van Hovell edited comment on SPARK-9813 at 8/12/15 2:13 PM:

[jira] [Commented] (SPARK-9813) Incorrect UNION ALL behavior

2015-08-12 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14693548#comment-14693548 ] Herman van Hovell commented on SPARK-9813: -- +1 on {{UNION ALL}} working well. A

[jira] [Commented] (SPARK-9813) Incorrect UNION ALL behavior

2015-08-12 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14693501#comment-14693501 ] Herman van Hovell commented on SPARK-9813: -- To get back on your reply: * The

[jira] [Commented] (SPARK-9742) NullPointerException when using --packages

2015-08-09 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14679495#comment-14679495 ] Herman van Hovell commented on SPARK-9742: -- Seems SPARK-9760 and this ticket are

[jira] [Commented] (SPARK-4366) Aggregation Improvement

2015-07-23 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14639400#comment-14639400 ] Herman van Hovell commented on SPARK-4366: -- What is going to happen to the old

[jira] [Commented] (SPARK-8641) Native Spark Window Functions

2015-07-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14635205#comment-14635205 ] Herman van Hovell commented on SPARK-8641: -- We need to wait for the new UDAF

[jira] [Created] (SPARK-9221) Support IntervalType in Range Frame

2015-07-21 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-9221: Summary: Support IntervalType in Range Frame Key: SPARK-9221 URL: https://issues.apache.org/jira/browse/SPARK-9221 Project: Spark Issue Type:

[jira] [Updated] (SPARK-8641) Native Spark Window Functions

2015-07-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-8641: - Description: *Rationale* The window operator currently uses Hive UDAFs for all

[jira] [Comment Edited] (SPARK-8682) Range Join for Spark SQL

2015-07-16 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630449#comment-14630449 ] Herman van Hovell edited comment on SPARK-8682 at 7/16/15 10:31 PM:

[jira] [Updated] (SPARK-8638) Window Function Performance Improvements

2015-07-16 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-8638: - Attachment: perf_test2.scala Additional Performance Test. In this case we test the

[jira] [Updated] (SPARK-8682) Range Join for Spark SQL

2015-07-16 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-8682: - Attachment: perf_testing.scala Some Performance Testing code. Range Join for Spark SQL

[jira] [Commented] (SPARK-8638) Window Function Performance Improvements

2015-07-17 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14631479#comment-14631479 ] Herman van Hovell commented on SPARK-8638: -- Yes, that is correct. So I thought

[jira] [Updated] (SPARK-8638) Window Function Performance Improvements

2015-07-18 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-8638: - Attachment: perf_test3.scala Another round of Benchmarking. This test benchmarks the

[jira] [Updated] (SPARK-8640) Window Function Multiple Frame Processing in Single Processing Step

2015-07-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-8640: - Attachment: perf_test_window_collapse.scala Some benchmarking result for this ticket. The

[jira] [Updated] (SPARK-7712) Window Function Improvements

2015-07-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-7712: - External issue URL: (was: https://github.com/hvanhovell/spark-window) Window Function

[jira] [Updated] (SPARK-7712) Window Function Improvements

2015-07-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-7712: - Summary: Window Function Improvements (was: Native Spark Window Functions Performance

[jira] [Updated] (SPARK-7712) Window Function Improvements

2015-07-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-7712: - Description: This is an umbrella ticket for Window Function Improvements targeted at

[jira] [Updated] (SPARK-8641) Native Spark Window Functions

2015-07-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-8641: - Description: The current Window implementation uses Hive UDAFs for all aggregation

[jira] [Comment Edited] (SPARK-10226) Error occured in SparkSQL when using !=

2015-08-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14711410#comment-14711410 ] Herman van Hovell edited comment on SPARK-10226 at 8/25/15 3:11 PM:

[jira] [Comment Edited] (SPARK-10226) Error occured in SparkSQL when using !=

2015-08-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14711400#comment-14711400 ] Herman van Hovell edited comment on SPARK-10226 at 8/25/15 3:05 PM:

[jira] [Commented] (SPARK-10226) Error occured in SparkSQL when using !=

2015-08-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14711410#comment-14711410 ] Herman van Hovell commented on SPARK-10226: --- Apparently most databases support

[jira] [Commented] (SPARK-10226) Error occured in SparkSQL when using !=

2015-08-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14711400#comment-14711400 ] Herman van Hovell commented on SPARK-10226: --- In what SQL dialect is {{!=}} a

[jira] [Commented] (SPARK-11388) Build breaks due to the use of tags in javadoc.

2015-10-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979215#comment-14979215 ] Herman van Hovell commented on SPARK-11388: --- Fix is done. Problem now is that

[jira] [Created] (SPARK-11388) Build breaks due to the use of tags in javadoc.

2015-10-28 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-11388: - Summary: Build breaks due to the use of tags in javadoc. Key: SPARK-11388 URL: https://issues.apache.org/jira/browse/SPARK-11388 Project: Spark

[jira] [Comment Edited] (SPARK-11388) Build breaks due to the use of tags in javadoc.

2015-10-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979215#comment-14979215 ] Herman van Hovell edited comment on SPARK-11388 at 10/28/15 8:57 PM: -

[jira] [Resolved] (SPARK-11594) Cannot create UDAF in REPL

2015-11-11 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-11594. --- Resolution: Not A Problem > Cannot create UDAF in REPL > --

[jira] [Commented] (SPARK-11594) Cannot create UDAF in REPL

2015-11-11 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15000151#comment-15000151 ] Herman van Hovell commented on SPARK-11594: --- Move to scala 2.10.5 fixed this. > Cannot create

[jira] [Commented] (SPARK-11725) Let UDF to handle null value

2015-11-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15005433#comment-15005433 ] Herman van Hovell commented on SPARK-11725: --- I can reproduce the {{-1}} default values on

[jira] [Commented] (SPARK-11725) Let UDF to handle null value

2015-11-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15005391#comment-15005391 ] Herman van Hovell commented on SPARK-11725: --- I'd rather add a warning than prevent this from

[jira] [Created] (SPARK-11594) Cannot create UDAF in REPL

2015-11-09 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-11594: - Summary: Cannot create UDAF in REPL Key: SPARK-11594 URL: https://issues.apache.org/jira/browse/SPARK-11594 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-10962) DataFrame "except" method...

2015-10-30 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14982160#comment-14982160 ] Herman van Hovell commented on SPARK-10962: --- Do you want to know which row had a duplicate? If

[jira] [Updated] (SPARK-11449) PortableDataStream should be a factory

2015-11-02 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-11449: -- Summary: PortableDataStream should be a factory (was: Improve documentation on close

[jira] [Commented] (SPARK-9357) Remove JoinedRow

2015-11-02 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14985264#comment-14985264 ] Herman van Hovell commented on SPARK-9357: -- Is this ticket still relevant? > Remove JoinedRow >

[jira] [Commented] (SPARK-11405) ROW_NUMBER function does not adhere to window ORDER BY, when joining

2015-10-30 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14982272#comment-14982272 ] Herman van Hovell commented on SPARK-11405: --- Hi I have tried to reproduce this on local mode.

[jira] [Commented] (SPARK-11275) [SQL] Regression in rollup/cube

2015-11-02 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14984863#comment-14984863 ] Herman van Hovell commented on SPARK-11275: --- This is caused by the fact that the logical Expand

[jira] [Created] (SPARK-11449) Improve documentation on close behavior of PortableDataStream

2015-11-02 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-11449: - Summary: Improve documentation on close behavior of PortableDataStream Key: SPARK-11449 URL: https://issues.apache.org/jira/browse/SPARK-11449 Project:

[jira] [Created] (SPARK-11450) Add support for UnsafeRow to Expand

2015-11-02 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-11450: - Summary: Add support for UnsafeRow to Expand Key: SPARK-11450 URL: https://issues.apache.org/jira/browse/SPARK-11450 Project: Spark Issue Type:

[jira] [Created] (SPARK-11451) Support single distinct count on multiple columns

2015-11-02 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-11451: - Summary: Support single distinct count on multiple columns Key: SPARK-11451 URL: https://issues.apache.org/jira/browse/SPARK-11451 Project: Spark

[jira] [Commented] (SPARK-9241) Supporting multiple DISTINCT columns

2015-10-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14958758#comment-14958758 ] Herman van Hovell commented on SPARK-9241: -- We could implement this using GROUPING SETS. That is

[jira] [Commented] (SPARK-10893) Lag Analytic function broken

2015-10-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14958520#comment-14958520 ] Herman van Hovell commented on SPARK-10893: --- A bug was found in the Window implementation. It

[jira] [Commented] (SPARK-9241) Supporting multiple DISTINCT columns

2015-10-15 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14959604#comment-14959604 ] Herman van Hovell commented on SPARK-9241: -- It should grow linear (or am I missing something).

[jira] [Commented] (SPARK-11008) Spark window function returns inconsistent/wrong results

2015-10-08 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14949300#comment-14949300 ] Herman van Hovell commented on SPARK-11008: --- Hi, I have tried your code in local mode on the

[jira] [Comment Edited] (SPARK-11008) Spark window function returns inconsistent/wrong results

2015-10-08 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14949300#comment-14949300 ] Herman van Hovell edited comment on SPARK-11008 at 10/8/15 8:09 PM:

[jira] [Commented] (SPARK-11008) Spark window function returns inconsistent/wrong results

2015-10-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14955593#comment-14955593 ] Herman van Hovell commented on SPARK-11008: --- A fix for a bug concerning the use of Window

[jira] [Commented] (SPARK-11725) Let UDF to handle null value

2015-11-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003960#comment-15003960 ] Herman van Hovell commented on SPARK-11725: --- -1 is the default value for an Int in the code

[jira] [Commented] (SPARK-11725) Let UDF to handle null value

2015-11-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003943#comment-15003943 ] Herman van Hovell commented on SPARK-11725: --- {{Int}} is a primitive. So there is no way to

[jira] [Comment Edited] (SPARK-11725) Let UDF to handle null value

2015-11-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003943#comment-15003943 ] Herman van Hovell edited comment on SPARK-11725 at 11/13/15 12:54 PM:

[jira] [Created] (SPARK-11850) Spark StdDev/Variance defaults are incompatible with Hive

2015-11-19 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-11850: - Summary: Spark StdDev/Variance defaults are incompatible with Hive Key: SPARK-11850 URL: https://issues.apache.org/jira/browse/SPARK-11850 Project: Spark

[jira] [Commented] (SPARK-10893) Lag Analytic function broken

2015-10-01 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14939710#comment-14939710 ] Herman van Hovell commented on SPARK-10893: --- I just tried the following scala code (port from

[jira] [Commented] (SPARK-12491) UDAF result differs in SQL if alias is used

2015-12-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071454#comment-15071454 ] Herman van Hovell commented on SPARK-12491: --- I cannot reproduce this issue on 1.5.2 and the

[jira] [Commented] (SPARK-12521) DataFrame Partitions in java does not work

2015-12-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071432#comment-15071432 ] Herman van Hovell commented on SPARK-12521: --- The {{lowerBound}} and {{upperBound}} parameters

[jira] [Commented] (SPARK-12524) Group by key in a pairrdd without any shuffle

2015-12-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071473#comment-15071473 ] Herman van Hovell commented on SPARK-12524: --- I think you are looking for something similar to

[jira] [Commented] (SPARK-12491) UDAF result differs in SQL if alias is used

2015-12-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15073146#comment-15073146 ] Herman van Hovell commented on SPARK-12491: --- I just tried the latest 1.5 branch on a spark

[jira] [Commented] (SPARK-12535) Generating scaladoc using sbt fails for network-common and catalyst modules

2015-12-27 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15072525#comment-15072525 ] Herman van Hovell commented on SPARK-12535: --- This is caused by the same problem as SPARK-12530.

[jira] [Updated] (SPARK-12491) UDAF result differs in SQL if alias is used

2015-12-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-12491: -- Attachment: UDAF_GM.zip Code for the attached .jar. > UDAF result differs in SQL if

[jira] [Commented] (SPARK-12491) UDAF result differs in SQL if alias is used

2015-12-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15072999#comment-15072999 ] Herman van Hovell commented on SPARK-12491: --- The logical plans look fine. I have defined the

[jira] [Commented] (SPARK-12491) UDAF result differs in SQL if alias is used

2015-12-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15073092#comment-15073092 ] Herman van Hovell commented on SPARK-12491: --- I did some testing on a spark cluster, and it

[jira] [Commented] (SPARK-12544) Support window functions in SQLContext

2015-12-29 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15073649#comment-15073649 ] Herman van Hovell commented on SPARK-12544: --- What do we need to support exactly? > Support

[jira] [Created] (SPARK-12024) Improved multi-column counting

2015-11-27 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-12024: - Summary: Improved multi-column counting Key: SPARK-12024 URL: https://issues.apache.org/jira/browse/SPARK-12024 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11850) Spark StdDev/Variance defaults are incompatible with Hive

2015-11-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15014197#comment-15014197 ] Herman van Hovell commented on SPARK-11850: --- No problem. I'll disable those tests. This perhaps

[jira] [Commented] (SPARK-11850) Spark StdDev/Variance defaults are incompatible with Hive

2015-11-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15013285#comment-15013285 ] Herman van Hovell commented on SPARK-11850: --- [~rxin]/[~yhuai] any thoughts? > Spark

[jira] [Commented] (SPARK-3947) Support Scala/Java UDAF

2015-11-19 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15013707#comment-15013707 ] Herman van Hovell commented on SPARK-3947: -- The problem with 1.5.2 you are having, is probably
