[
https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-2620:
--
Affects Version/s: 2.1.0
> case class cannot be used as key for reduce
>
[
https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-2620:
--
Affects Version/s: 1.6.0
2.0.0
> case class cannot be used as
Maciej Szymkiewicz created SPARK-17587:
--
Summary: SparseVector __getitem__ should follow __getitem__
contract
Key: SPARK-17587
URL: https://issues.apache.org/jira/browse/SPARK-17587
Project:
[
https://issues.apache.org/jira/browse/SPARK-17027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15418071#comment-15418071
]
Maciej Szymkiewicz commented on SPARK-17027:
Yes, this exactly the problem.
{code}
[
https://issues.apache.org/jira/browse/SPARK-17027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15418071#comment-15418071
]
Maciej Szymkiewicz edited comment on SPARK-17027 at 8/11/16 10:38 PM:
Maciej Szymkiewicz created SPARK-17027:
--
Summary: PolynomialExpansion.choose is prone to integer overflow
Key: SPARK-17027
URL: https://issues.apache.org/jira/browse/SPARK-17027
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-12157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15401190#comment-15401190
]
Maciej Szymkiewicz commented on SPARK-12157:
Well, it is alpha component (see Scala API docs
[
https://issues.apache.org/jira/browse/SPARK-14155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15401156#comment-15401156
]
Maciej Szymkiewicz commented on SPARK-14155:
[~rxin] Is there any progress on that or some
[
https://issues.apache.org/jira/browse/SPARK-12157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15401153#comment-15401153
]
Maciej Szymkiewicz commented on SPARK-12157:
[~nchammas]You're using incorrect schema.
[
https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15391108#comment-15391108
]
Maciej Szymkiewicz commented on SPARK-16589:
[~holdenk] Makes sense. I was thinking more
[
https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-16589:
---
Description:
Chaining cartesian calls in PySpark results in the number of records
[
https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-16589:
---
Affects Version/s: 1.4.0
1.5.0
> Chained cartesian produces
[
https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15384235#comment-15384235
]
Maciej Szymkiewicz commented on SPARK-16589:
Thanks [~dongjoon].
[~joshrosen] Could you
Maciej Szymkiewicz created SPARK-16626:
--
Summary: Code duplication after SPARK-14906
Key: SPARK-16626
URL: https://issues.apache.org/jira/browse/SPARK-16626
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15382614#comment-15382614
]
Maciej Szymkiewicz commented on SPARK-16589:
[~dongjoon] I'll work on that but I am not
Maciej Szymkiewicz created SPARK-16589:
--
Summary: Chained cartesian produces incorrect number of records
Key: SPARK-16589
URL: https://issues.apache.org/jira/browse/SPARK-16589
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-15559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-15559:
---
Description:
In Python 3.x any object that provides eq method requires hash method
[
https://issues.apache.org/jira/browse/SPARK-15559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-15559:
---
Description:
In Python 3.x any object that provides {{__eq__}} method requires
[
https://issues.apache.org/jira/browse/SPARK-15559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-15559:
---
Description:
In Python 3.x any object that provides `__eq__` method requires
Maciej Szymkiewicz created SPARK-15559:
--
Summary: TopicAndPartition should provide __hash__ method
Key: SPARK-15559
URL: https://issues.apache.org/jira/browse/SPARK-15559
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-14739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249061#comment-15249061
]
Maciej Szymkiewicz commented on SPARK-14739:
I extracted relevant test fixes and made PR
[
https://issues.apache.org/jira/browse/SPARK-14739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249053#comment-15249053
]
Maciej Szymkiewicz edited comment on SPARK-14739 at 4/20/16 12:47 AM:
[
https://issues.apache.org/jira/browse/SPARK-14739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249053#comment-15249053
]
Maciej Szymkiewicz commented on SPARK-14739:
Sure, but your latest PR still doesn't resolve
[
https://issues.apache.org/jira/browse/SPARK-14739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15248994#comment-15248994
]
Maciej Szymkiewicz commented on SPARK-14739:
This solves only small part of the problem.
Maciej Szymkiewicz created SPARK-14739:
--
Summary: Vectors.parse doesn't handle dense vectors of size 0 and
sparse vectros with no indices
Key: SPARK-14739
URL:
[
https://issues.apache.org/jira/browse/SPARK-14202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-14202:
---
Affects Version/s: (was: 1.3.0)
> python_full_outer_join should use generator
Maciej Szymkiewicz created SPARK-14202:
--
Summary: python_full_outer_join should use generator expression
instead of list comp
Key: SPARK-14202
URL: https://issues.apache.org/jira/browse/SPARK-14202
[
https://issues.apache.org/jira/browse/SPARK-12916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15206030#comment-15206030
]
Maciej Szymkiewicz commented on SPARK-12916:
Since PySpark `Row` is just a subclass of
Maciej Szymkiewicz created SPARK-14058:
--
Summary: Incorrect docstring in Window.orderBy
Key: SPARK-14058
URL: https://issues.apache.org/jira/browse/SPARK-14058
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-12824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15098738#comment-15098738
]
Maciej Szymkiewicz commented on SPARK-12824:
??It seems that all the keys in the dictionary
[
https://issues.apache.org/jira/browse/SPARK-12824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15098738#comment-15098738
]
Maciej Szymkiewicz edited comment on SPARK-12824 at 1/14/16 7:51 PM:
-
[
https://issues.apache.org/jira/browse/SPARK-12824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15098738#comment-15098738
]
Maciej Szymkiewicz edited comment on SPARK-12824 at 1/14/16 7:55 PM:
-
[
https://issues.apache.org/jira/browse/SPARK-12824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15098738#comment-15098738
]
Maciej Szymkiewicz edited comment on SPARK-12824 at 1/14/16 7:56 PM:
-
[
https://issues.apache.org/jira/browse/SPARK-7683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-7683:
--
Comment: was deleted
(was: [~srowen] Do you have any example how it could break
[
https://issues.apache.org/jira/browse/SPARK-7683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15076650#comment-15076650
]
Maciej Szymkiewicz commented on SPARK-7683:
---
[~srowen] Do you have any example how it could
Maciej Szymkiewicz created SPARK-12595:
--
Summary: fold should pass arguments to op in the correct order
Key: SPARK-12595
URL: https://issues.apache.org/jira/browse/SPARK-12595
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-6459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15074268#comment-15074268
]
Maciej Szymkiewicz commented on SPARK-6459:
---
[~marmbrus] Isn't this warning obsolete in 1.5+?
>
[
https://issues.apache.org/jira/browse/SPARK-6459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15074322#comment-15074322
]
Maciej Szymkiewicz commented on SPARK-6459:
---
I've been trying to reproduce the problem on 1.5.2
[
https://issues.apache.org/jira/browse/SPARK-6459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15074342#comment-15074342
]
Maciej Szymkiewicz commented on SPARK-6459:
---
Thanks for clarification.
> Warn when Column API
[
https://issues.apache.org/jira/browse/SPARK-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15029170#comment-15029170
]
Maciej Szymkiewicz commented on SPARK-9137:
---
[~josephkb] Could you take a look at [this question
[
https://issues.apache.org/jira/browse/SPARK-12006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-12006:
---
Description:
Steps to reproduce :
{code}
from pyspark.mllib.clustering import
Maciej Szymkiewicz created SPARK-12006:
--
Summary: GaussianMixture.train crashes if an itnital model is not
None
Key: SPARK-12006
URL: https://issues.apache.org/jira/browse/SPARK-12006
Project:
[
https://issues.apache.org/jira/browse/SPARK-11281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15007038#comment-15007038
]
Maciej Szymkiewicz commented on SPARK-11281:
[~shivaram] I've tested both current master and
[
https://issues.apache.org/jira/browse/SPARK-11281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15006960#comment-15006960
]
Maciej Szymkiewicz commented on SPARK-11281:
[~shivaram] No, there isn't. I removed this one
[
https://issues.apache.org/jira/browse/SPARK-11281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-11281:
---
Comment: was deleted
(was: [~sunrui], [~shivaram] I don't think it is resolved by
[
https://issues.apache.org/jira/browse/SPARK-11086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15006292#comment-15006292
]
Maciej Szymkiewicz commented on SPARK-11086:
[~shivaram] Does it resolve [SPARK-8277] as
[
https://issues.apache.org/jira/browse/SPARK-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14995580#comment-14995580
]
Maciej Szymkiewicz commented on SPARK-11569:
It looks this problem affects Scala after all:
[
https://issues.apache.org/jira/browse/SPARK-11530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14995874#comment-14995874
]
Maciej Szymkiewicz commented on SPARK-11530:
It should actually target MLlib, shouldn't it?
Maciej Szymkiewicz created SPARK-11569:
--
Summary: StringIndexer transform fails when column contains nulls
Key: SPARK-11569
URL: https://issues.apache.org/jira/browse/SPARK-11569
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14970839#comment-14970839
]
Maciej Szymkiewicz commented on SPARK-11167:
spark-csv has a much simpler job to do and
Maciej Szymkiewicz created SPARK-11283:
--
Summary: List column gets additional level of nesting when
converted to Spark DataFrame
Key: SPARK-11283
URL: https://issues.apache.org/jira/browse/SPARK-11283
[
https://issues.apache.org/jira/browse/SPARK-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14970922#comment-14970922
]
Maciej Szymkiewicz commented on SPARK-11167:
Related problem:
Maciej Szymkiewicz created SPARK-11281:
--
Summary: Issue with creating and collecting DataFrame using
environments
Key: SPARK-11281
URL: https://issues.apache.org/jira/browse/SPARK-11281
[
https://issues.apache.org/jira/browse/SPARK-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-11167:
---
Comment: was deleted
(was: Related problem:
Maciej Szymkiewicz created SPARK-11167:
--
Summary: Incorrect type resolution on heterogeneous data structures
Key: SPARK-11167
URL: https://issues.apache.org/jira/browse/SPARK-11167
Project: Spark
Maciej Szymkiewicz created SPARK-11084:
--
Summary: SparseVector.__getitem__ should check if value can be
non-zero before executing searchsorted
Key: SPARK-11084
URL:
Maciej Szymkiewicz created SPARK-11086:
--
Summary: createDataFrame should dropFactor column-wise not
cell-wise
Key: SPARK-11086
URL: https://issues.apache.org/jira/browse/SPARK-11086
Project:
[
https://issues.apache.org/jira/browse/SPARK-10973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10973:
---
External issue URL: https://github.com/apache/spark/pull/9009
> __gettitem__ method
Maciej Szymkiewicz created SPARK-10973:
--
Summary: __gettitem__ method throws IndexError exception when we
try to access index after the last non-zero entry.
Key: SPARK-10973
URL:
[
https://issues.apache.org/jira/browse/SPARK-10973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10973:
---
External issue URL: (was: https://github.com/apache/spark/pull/9009)
>
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
If we take a row from a data frame and try to extract vector element by
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
If we take a row from a data frame and try to extract vector element by
Maciej Szymkiewicz created SPARK-10467:
--
Summary: Vector is converted to tuple when extracted from Row
using __getitem__
Key: SPARK-10467
URL: https://issues.apache.org/jira/browse/SPARK-10467
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
If we take a row from a data frame and try to extract vector element by
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
{code}
from pyspark.ml.feature import HashingTF
df =
Maciej Szymkiewicz created SPARK-9978:
-
Summary: Window functions require partitionBy to work as expected
Key: SPARK-9978
URL: https://issues.apache.org/jira/browse/SPARK-9978
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-9978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-9978:
--
Description:
I am trying to reproduce following SQL query:
{code}
Maciej Szymkiewicz created SPARK-9098:
-
Summary: Inconsistent Dense Vectors hashing between PySpark and
Scala
Key: SPARK-9098
URL: https://issues.apache.org/jira/browse/SPARK-9098
Project: Spark
601 - 668 of 668 matches
Mail list logo