[
https://issues.apache.org/jira/browse/SPARK-11215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16591667#comment-16591667
]
Barry Becker commented on SPARK-11215:
--
Is the main motivation for this feature per
[
https://issues.apache.org/jira/browse/SPARK-9610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16581253#comment-16581253
]
Barry Becker commented on SPARK-9610:
-
All ML models should support having and option
[
https://issues.apache.org/jira/browse/SPARK-21986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16568718#comment-16568718
]
Barry Becker commented on SPARK-21986:
--
Here are a couple more test cases that show
Barry Becker created SPARK-24394:
Summary: Nodes in decision tree sometimes have negative impurity
values
Key: SPARK-24394
URL: https://issues.apache.org/jira/browse/SPARK-24394
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-24019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16444202#comment-16444202
]
Barry Becker commented on SPARK-24019:
--
Lowering to minor because I found a way to s
[
https://issues.apache.org/jira/browse/SPARK-24019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16444202#comment-16444202
]
Barry Becker edited comment on SPARK-24019 at 4/19/18 3:07 PM:
[
https://issues.apache.org/jira/browse/SPARK-24019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-24019:
-
Priority: Minor (was: Major)
> AnalysisException for Window function expression to compute deriv
Barry Becker created SPARK-24019:
Summary: AnalysisException for Window function expression to
compute derivative
Key: SPARK-24019
URL: https://issues.apache.org/jira/browse/SPARK-24019
Project: Spark
Barry Becker created SPARK-23824:
Summary: Make inpurityStats publicly accessible in ml.tree.Node
Key: SPARK-23824
URL: https://issues.apache.org/jira/browse/SPARK-23824
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-6162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16415600#comment-16415600
]
Barry Becker commented on SPARK-6162:
-
If we all agree that is is something that would
[
https://issues.apache.org/jira/browse/SPARK-8529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16369194#comment-16369194
]
Barry Becker commented on SPARK-8529:
-
Complementing the output metadata in what way?
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16064972#comment-16064972
]
Barry Becker edited comment on SPARK-20226 at 11/7/17 6:09 PM:
[
https://issues.apache.org/jira/browse/SPARK-9610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16219005#comment-16219005
]
Barry Becker commented on SPARK-9610:
-
Frequent item sets (associations) could use it
[
https://issues.apache.org/jira/browse/SPARK-7276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16167814#comment-16167814
]
Barry Becker commented on SPARK-7276:
-
Isn't there still a problem with withColumn per
[
https://issues.apache.org/jira/browse/SPARK-21986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16163768#comment-16163768
]
Barry Becker commented on SPARK-21986:
--
But wait, the dataset I discovered the probl
Barry Becker created SPARK-21986:
Summary: QuantileDiscretizer picks wrong split point for data with
lots of 0's
Key: SPARK-21986
URL: https://issues.apache.org/jira/browse/SPARK-21986
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-14155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16155790#comment-16155790
]
Barry Becker commented on SPARK-14155:
--
Does it work with datasets now in 2.1?
> Hi
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16064972#comment-16064972
]
Barry Becker commented on SPARK-20226:
--
Calling cache() on the dataframe on the afte
[
https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16018840#comment-16018840
]
Barry Becker commented on SPARK-16845:
--
I checked out the the v2.1.1 tag of spark fr
[
https://issues.apache.org/jira/browse/SPARK-20542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16004688#comment-16004688
]
Barry Becker commented on SPARK-20542:
--
@viirya, your implementation of MultipleBuck
[
https://issues.apache.org/jira/browse/SPARK-20542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16002609#comment-16002609
]
Barry Becker commented on SPARK-20542:
--
This is a great improvement, @viirya! Accord
[
https://issues.apache.org/jira/browse/SPARK-19581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16002582#comment-16002582
]
Barry Becker commented on SPARK-19581:
--
I think its just a matter of sending a featu
[
https://issues.apache.org/jira/browse/SPARK-13747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16001388#comment-16001388
]
Barry Becker commented on SPARK-13747:
--
Good to hear that your workaround was succes
[
https://issues.apache.org/jira/browse/SPARK-13747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16001303#comment-16001303
]
Barry Becker commented on SPARK-13747:
--
@saif1988, just to clarify, did you add the
[
https://issues.apache.org/jira/browse/SPARK-13747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16001186#comment-16001186
]
Barry Becker commented on SPARK-13747:
--
I also tried the "thread-pool-executor" work
[
https://issues.apache.org/jira/browse/SPARK-13747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16000830#comment-16000830
]
Barry Becker commented on SPARK-13747:
--
There seems to be some related discussion he
[
https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15987706#comment-15987706
]
Barry Becker commented on SPARK-20392:
--
Thanks for working on a fix. Do you have any
[
https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-20392:
-
Attachment: model_9756.zip
blockbuster_fewCols.csv
attaching blockbuster_fewCols.
[
https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15981386#comment-15981386
]
Barry Becker commented on SPARK-20392:
--
[~viirya] that is correct. If I reduce the d
[
https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15979049#comment-15979049
]
Barry Becker edited comment on SPARK-20392 at 4/21/17 4:49 PM:
[
https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-20392:
-
Attachment: model_9754.zip
Attaching the parquet pipeline (as zip).
> Slow performance when call
[
https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15979049#comment-15979049
]
Barry Becker edited comment on SPARK-20392 at 4/21/17 4:46 PM:
[
https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15979049#comment-15979049
]
Barry Becker commented on SPARK-20392:
--
Yes [~kiszk], I was able to create a simple
[
https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-20392:
-
Attachment: giant_query_plan_for_fitting_pipeline.txt
Giant nested query plan using when calling
[
https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-20392:
-
Attachment: blockbuster.csv
Attaching blockbuster.csv data file with many columns, but few rows.
Barry Becker created SPARK-20392:
Summary: Slow performance when calling fit on ML pipeline for
dataset with many columns but few rows
Key: SPARK-20392
URL: https://issues.apache.org/jira/browse/SPARK-20392
[
https://issues.apache.org/jira/browse/SPARK-6509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15972744#comment-15972744
]
Barry Becker commented on SPARK-6509:
-
As further proof of relevance, I will be giving
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15960868#comment-15960868
]
Barry Becker commented on SPARK-20226:
--
Only 11 columns. I did not want to wait for
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15960806#comment-15960806
]
Barry Becker commented on SPARK-20226:
--
OK, I set the flag using
sqlContext.setConf
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15959134#comment-15959134
]
Barry Becker commented on SPARK-20226:
--
Yes. We are running through spark job-server
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15959024#comment-15959024
]
Barry Becker edited comment on SPARK-20226 at 4/6/17 2:45 PM:
-
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15959024#comment-15959024
]
Barry Becker commented on SPARK-20226:
--
I set spark.sql.constraintPropagation.enable
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-20226:
-
Attachment: profile_indexer2.PNG
A snapshot of the hotspot sampler from JVisualVM while cacheTabl
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15957732#comment-15957732
]
Barry Becker commented on SPARK-20226:
--
I did some profiling using the sampler in JV
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15957489#comment-15957489
]
Barry Becker commented on SPARK-20226:
--
I thought the problem was in the cacheTable
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15957457#comment-15957457
]
Barry Becker commented on SPARK-20226:
--
It seems like it has to do with the interact
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-20226:
-
Labels: cache (was: )
> Call to sqlContext.cacheTable takes an incredibly long time in some case
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15957296#comment-15957296
]
Barry Becker edited comment on SPARK-20226 at 4/5/17 5:36 PM:
-
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15957296#comment-15957296
]
Barry Becker commented on SPARK-20226:
--
We noticed that this is reproducible just by
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-20226:
-
Attachment: xyzzy.csv
Attaching the datafile, but I don't think it is significant. This problem c
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-20226:
-
Description:
I have a case where the call to sqlContext.cacheTable can take an arbitrarily
long
Barry Becker created SPARK-20226:
Summary: Call to sqlContext.cacheTable takes an incredibly long
time in some cases
Key: SPARK-20226
URL: https://issues.apache.org/jira/browse/SPARK-20226
Project: Sp
[
https://issues.apache.org/jira/browse/SPARK-20071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15938475#comment-15938475
]
Barry Becker commented on SPARK-20071:
--
Yes. I agree. I wanted to report the issue,
Barry Becker created SPARK-20071:
Summary: StringIndexer overflows Kryo serialization buffer when
run on column with many long distinct values
Key: SPARK-20071
URL: https://issues.apache.org/jira/browse/SPARK-2007
[
https://issues.apache.org/jira/browse/SPARK-13747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15936997#comment-15936997
]
Barry Becker commented on SPARK-13747:
--
We have hit this on rare instances in our pr
Barry Becker created SPARK-19699:
Summary: createOrReplaceTable does not always replace an existing
table of the same name
Key: SPARK-19699
URL: https://issues.apache.org/jira/browse/SPARK-19699
Proje
[
https://issues.apache.org/jira/browse/SPARK-19581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15863819#comment-15863819
]
Barry Becker commented on SPARK-19581:
--
I agree with minor prioritization, since the
Barry Becker created SPARK-19581:
Summary: running NaiveBayes model with 0 features can crash the
executor with D rorreGEMV
Key: SPARK-19581
URL: https://issues.apache.org/jira/browse/SPARK-19581
Proj
[
https://issues.apache.org/jira/browse/SPARK-4049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15838617#comment-15838617
]
Barry Becker commented on SPARK-4049:
-
I read the comments, but I'm still not really s
[
https://issues.apache.org/jira/browse/SPARK-19317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15834790#comment-15834790
]
Barry Becker commented on SPARK-19317:
--
I figured out a workaround for this problem.
[
https://issues.apache.org/jira/browse/SPARK-19317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-19317:
-
Priority: Minor (was: Major)
> UnsupportedOperationException: empty.reduceLeft in LinearSeqOptim
[
https://issues.apache.org/jira/browse/SPARK-19317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-19317:
-
Description:
I wish I had more of a simple reproducible case to give, but I got the below
except
[
https://issues.apache.org/jira/browse/SPARK-19317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15834700#comment-15834700
]
Barry Becker commented on SPARK-19317:
--
As far as I can tell, this only occurs when
[
https://issues.apache.org/jira/browse/SPARK-19317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-19317:
-
Description:
I wish I had more of a simple reproducible case to give, but I got the below
except
Barry Becker created SPARK-19317:
Summary: UnsupportedOperationException: empty.reduceLeft in
LinearSeqOptimized
Key: SPARK-19317
URL: https://issues.apache.org/jira/browse/SPARK-19317
Project: Spark
Barry Becker created SPARK-19245:
Summary: Cannot build spark-assembly jar
Key: SPARK-19245
URL: https://issues.apache.org/jira/browse/SPARK-19245
Project: Spark
Issue Type: Documentation
[
https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15762805#comment-15762805
]
Barry Becker edited comment on SPARK-16845 at 12/20/16 9:24 PM:
---
[
https://issues.apache.org/jira/browse/SPARK-11293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15765096#comment-15765096
]
Barry Becker commented on SPARK-11293:
--
Not sure if this is related, but I am runnin
[
https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15762805#comment-15762805
]
Barry Becker commented on SPARK-16845:
--
I found a workaround that allows me to avoid
[
https://issues.apache.org/jira/browse/SPARK-11215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15722696#comment-15722696
]
Barry Becker commented on SPARK-11215:
--
This would be a good feature. It might be ni
[
https://issues.apache.org/jira/browse/SPARK-18502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15706389#comment-15706389
]
Barry Becker commented on SPARK-18502:
--
Is there a way to escape the backtick when i
[
https://issues.apache.org/jira/browse/SPARK-13913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15677144#comment-15677144
]
Barry Becker edited comment on SPARK-13913 at 11/18/16 5:02 PM:
---
[
https://issues.apache.org/jira/browse/SPARK-13913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15677144#comment-15677144
]
Barry Becker commented on SPARK-13913:
--
I can still reproduce this using spark 1.6.3
Barry Becker created SPARK-18502:
Summary: Spark does not handle columns that contain backquote (`)
Key: SPARK-18502
URL: https://issues.apache.org/jira/browse/SPARK-18502
Project: Spark
Issu
[
https://issues.apache.org/jira/browse/SPARK-11977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15676856#comment-15676856
]
Barry Becker commented on SPARK-11977:
--
I would also like to know how to handle colu
[
https://issues.apache.org/jira/browse/SPARK-12965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15674129#comment-15674129
]
Barry Becker commented on SPARK-12965:
--
This is a big issue for us because we don't
[
https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15664866#comment-15664866
]
Barry Becker commented on SPARK-16845:
--
I am encountering a similar exception in spa
[
https://issues.apache.org/jira/browse/SPARK-14138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15664842#comment-15664842
]
Barry Becker commented on SPARK-14138:
--
I am using spark 1.6.3 on a DataFrame with 2
[
https://issues.apache.org/jira/browse/SPARK-8443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15664779#comment-15664779
]
Barry Becker commented on SPARK-8443:
-
I see the same error in spark 1.6.3. Is there a
[
https://issues.apache.org/jira/browse/SPARK-18181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15623018#comment-15623018
]
Barry Becker commented on SPARK-18181:
--
For this case to leak a lot of memory, I bin
Barry Becker created SPARK-18181:
Summary: Huge managed memory leak (2.7G) when running reduceByKey
Key: SPARK-18181
URL: https://issues.apache.org/jira/browse/SPARK-18181
Project: Spark
Issu
[
https://issues.apache.org/jira/browse/SPARK-14363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15622975#comment-15622975
]
Barry Becker commented on SPARK-14363:
--
I am hitting this issue in 1.6.2.
In fact, I
[
https://issues.apache.org/jira/browse/SPARK-18054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15597816#comment-15597816
]
Barry Becker commented on SPARK-18054:
--
Ah. That is quite likely the problem. I will
[
https://issues.apache.org/jira/browse/SPARK-16216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15596382#comment-15596382
]
Barry Becker commented on SPARK-16216:
--
Yes, That worked. Thanks for the workaround!
Barry Becker created SPARK-18054:
Summary: Unexpected error from UDF that gets an element of a
vector: argument 1 requires vector type, however, '`_column_`' is of vector type
Key: SPARK-18054
URL: https://issues.
[
https://issues.apache.org/jira/browse/SPARK-16216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15595619#comment-15595619
]
Barry Becker commented on SPARK-16216:
--
If timezone is not specified, the date shoul
[
https://issues.apache.org/jira/browse/SPARK-16216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15595619#comment-15595619
]
Barry Becker edited comment on SPARK-16216 at 10/21/16 4:41 PM:
---
[
https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1968#comment-1968
]
Barry Becker commented on SPARK-17219:
--
I'll make another attempt to clarify my use
[
https://issues.apache.org/jira/browse/SPARK-14234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15453414#comment-15453414
]
Barry Becker commented on SPARK-14234:
--
Is it a lot of work to backport this fix 1.6
[
https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15436819#comment-15436819
]
Barry Becker commented on SPARK-17219:
--
In my opinion, yes. It is something that app
[
https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15436767#comment-15436767
]
Barry Becker commented on SPARK-17219:
--
If you support the different strategies as R
[
https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15435651#comment-15435651
]
Barry Becker commented on SPARK-17219:
--
If the decision is to have an additional nul
[
https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15435484#comment-15435484
]
Barry Becker commented on SPARK-17219:
--
Nulls were not accepted in the column. I had
[
https://issues.apache.org/jira/browse/SPARK-17219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15435403#comment-15435403
]
Barry Becker commented on SPARK-17219:
--
There needs to be some way to handle null va
Barry Becker created SPARK-17219:
Summary: QuantileDiscretizer does strange things with NaN values
Key: SPARK-17219
URL: https://issues.apache.org/jira/browse/SPARK-17219
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-17086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Barry Becker updated SPARK-17086:
-
Attachment: titanic.csv
> QuantileDiscretizer throws InvalidArgumentException (parameter splits g
[
https://issues.apache.org/jira/browse/SPARK-17086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15435236#comment-15435236
]
Barry Becker commented on SPARK-17086:
--
Thanks.
BTW, I hope there are some test cas
[
https://issues.apache.org/jira/browse/SPARK-17086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15434805#comment-15434805
]
Barry Becker edited comment on SPARK-17086 at 8/24/16 12:18 PM:
---
[
https://issues.apache.org/jira/browse/SPARK-17086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15434805#comment-15434805
]
Barry Becker commented on SPARK-17086:
--
Is it possible to get this fix into 2.0.1? M
[
https://issues.apache.org/jira/browse/SPARK-6509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15431341#comment-15431341
]
Barry Becker commented on SPARK-6509:
-
I may have missed the reasoning somewhere, but
1 - 100 of 128 matches
Mail list logo