[jira] [Commented] (SPARK-20208) Document R fpGrowth support in vignettes, programming guide and code example

2017-04-19 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15974374#comment-15974374 ] Maciej Szymkiewicz commented on SPARK-20208: [~felixcheung] There is active PR

[jira] [Created] (SPARK-20375) R wrappers for array and map

2017-04-18 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-20375: -- Summary: R wrappers for array and map Key: SPARK-20375 URL: https://issues.apache.org/jira/browse/SPARK-20375 Project: Spark Issue Type:

[jira] [Updated] (SPARK-20371) R wrappers for collect_list and collect_set

2017-04-18 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-20371: --- Affects Version/s: 2.0.0 > R wrappers for collect_list and collect_set >

[jira] [Created] (SPARK-20371) R wrappers for collect_list and collect_set

2017-04-18 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-20371: -- Summary: R wrappers for collect_list and collect_set Key: SPARK-20371 URL: https://issues.apache.org/jira/browse/SPARK-20371 Project: Spark

[jira] [Commented] (SPARK-20361) JVM locale affects SQL type names

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15971734#comment-15971734 ] Maciej Szymkiewicz commented on SPARK-20361: Indeed. > JVM locale affects SQL type names >

[jira] [Closed] (SPARK-20361) JVM locale affects SQL type names

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz closed SPARK-20361. -- Resolution: Fixed > JVM locale affects SQL type names >

[jira] [Updated] (SPARK-20361) JVM locale affects SQL type names

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-20361: --- Description: Steps to reproduce: {code} from pyspark.sql.types import IntegerType

[jira] [Created] (SPARK-20361) JVM locale affects SQL type names

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-20361: -- Summary: JVM locale affects SQL type names Key: SPARK-20361 URL: https://issues.apache.org/jira/browse/SPARK-20361 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-20347) Provide AsyncRDDActions in Python

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970933#comment-15970933 ] Maciej Szymkiewicz edited comment on SPARK-20347 at 4/17/17 10:22 AM:

[jira] [Comment Edited] (SPARK-20347) Provide AsyncRDDActions in Python

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970933#comment-15970933 ] Maciej Szymkiewicz edited comment on SPARK-20347 at 4/17/17 10:17 AM:

[jira] [Comment Edited] (SPARK-20347) Provide AsyncRDDActions in Python

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970933#comment-15970933 ] Maciej Szymkiewicz edited comment on SPARK-20347 at 4/17/17 10:16 AM:

[jira] [Commented] (SPARK-20347) Provide AsyncRDDActions in Python

2017-04-17 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15970933#comment-15970933 ] Maciej Szymkiewicz commented on SPARK-20347: This is a nice idea but I wonder what would be

[jira] [Created] (SPARK-20290) PySpark Column should provide eqNullSafe

2017-04-11 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-20290: -- Summary: PySpark Column should provide eqNullSafe Key: SPARK-20290 URL: https://issues.apache.org/jira/browse/SPARK-20290 Project: Spark Issue

[jira] [Commented] (SPARK-10931) PySpark ML Models should contain Param values

2017-04-09 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15962141#comment-15962141 ] Maciej Szymkiewicz commented on SPARK-10931: [~vlad.feinberg] It is worth noting that without

[jira] [Commented] (SPARK-20208) Document R fpGrowth support in vignettes, programming guide and code example

2017-04-06 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960277#comment-15960277 ] Maciej Szymkiewicz commented on SPARK-20208: [~felixcheung] I am working on this but it is

[jira] [Comment Edited] (SPARK-20208) Document R fpGrowth support in vignettes, programming guide and code example

2017-04-06 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960277#comment-15960277 ] Maciej Szymkiewicz edited comment on SPARK-20208 at 4/7/17 4:52 AM:

[jira] [Closed] (SPARK-19728) PythonUDF with multiple parents shouldn't be pushed down when used as a predicate

2017-03-23 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz closed SPARK-19728. -- Resolution: Fixed Fix Version/s: 2.2.0 > PythonUDF with multiple parents

[jira] [Updated] (SPARK-19728) PythonUDF with multiple parents shouldn't be pushed down when used as a predicate

2017-03-23 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19728: --- Affects Version/s: (was: 2.2.0) > PythonUDF with multiple parents shouldn't be

[jira] [Commented] (SPARK-19475) (ML|MLlib).linalg.DenseVector method delegation fails for __neg__

2017-03-20 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15933981#comment-15933981 ] Maciej Szymkiewicz commented on SPARK-19475: Sounds good [~holdenk], though I wonder if we

[jira] [Commented] (SPARK-19019) PySpark does not work with Python 3.6.0

2017-03-18 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15931288#comment-15931288 ] Maciej Szymkiewicz commented on SPARK-19019: [~davies] Could it be backported to 1.6 and 2.0?

[jira] [Commented] (SPARK-19899) FPGrowth input column naming

2017-03-16 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15928827#comment-15928827 ] Maciej Szymkiewicz commented on SPARK-19899: [~mlnick] For some reason SparkQA recognized the

[jira] [Commented] (SPARK-19899) FPGrowth input column naming

2017-03-14 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15924435#comment-15924435 ] Maciej Szymkiewicz commented on SPARK-19899: "itemsCol" sounds good. What should we use as a

[jira] [Commented] (SPARK-19940) FPGrowthModel.transform should skip duplicated items

2017-03-13 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15923149#comment-15923149 ] Maciej Szymkiewicz commented on SPARK-19940: cc [~yuhaoyan] > FPGrowthModel.transform

[jira] [Issue Comment Deleted] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-03-13 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-14503: --- Comment: was deleted (was: [~yuhaoyan] Sure thing. I'll try to do it later today.)

[jira] [Created] (SPARK-19940) FPGrowthModel.transform should skip duplicated items

2017-03-13 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19940: -- Summary: FPGrowthModel.transform should skip duplicated items Key: SPARK-19940 URL: https://issues.apache.org/jira/browse/SPARK-19940 Project: Spark

[jira] [Commented] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-03-13 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15907589#comment-15907589 ] Maciej Szymkiewicz commented on SPARK-14503: [~yuhaoyan] Sure thing. I'll try to do it later

[jira] [Commented] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-03-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15905840#comment-15905840 ] Maciej Szymkiewicz commented on SPARK-14503: I think we should keep only unique predictions

[jira] [Commented] (SPARK-19899) FPGrowth input column naming

2017-03-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15905834#comment-15905834 ] Maciej Szymkiewicz commented on SPARK-19899: Thanks [~yuhaoyan]. > FPGrowth input column

[jira] [Commented] (SPARK-19899) FPGrowth input column naming

2017-03-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15905480#comment-15905480 ] Maciej Szymkiewicz commented on SPARK-19899: This is just an idea, but I would start with: -

[jira] [Commented] (SPARK-19899) FPGrowth input column naming

2017-03-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15905405#comment-15905405 ] Maciej Szymkiewicz commented on SPARK-19899: In my opinion a trait for each input category

[jira] [Updated] (SPARK-19899) FPGrowth input column naming

2017-03-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19899: --- Description: Current implementation extends {{HasFeaturesCol}}. Personally I find it

[jira] [Commented] (SPARK-19899) FPGrowth input column naming

2017-03-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15904999#comment-15904999 ] Maciej Szymkiewicz commented on SPARK-19899: CC [~felixcheung], [~josephkb], [~mlnick],

[jira] [Created] (SPARK-19899) FPGrowth input column naming

2017-03-10 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19899: -- Summary: FPGrowth input column naming Key: SPARK-19899 URL: https://issues.apache.org/jira/browse/SPARK-19899 Project: Spark Issue Type:

[jira] [Commented] (SPARK-13802) Fields order in Row(**kwargs) is not consistent with Schema.toInternal method

2017-02-26 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15884939#comment-15884939 ] Maciej Szymkiewicz commented on SPARK-13802: [~szymonm] Do you have anything particular in

[jira] [Updated] (SPARK-19728) PythonUDF with multiple parents shouldn't be pushed down when used as a predicate

2017-02-24 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19728: --- Summary: PythonUDF with multiple parents shouldn't be pushed down when used as a

[jira] [Created] (SPARK-19728) PythonUDF with multiple parents shouldn't be pushed down when used as a predicat

2017-02-24 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19728: -- Summary: PythonUDF with multiple parents shouldn't be pushed down when used as a predicat Key: SPARK-19728 URL: https://issues.apache.org/jira/browse/SPARK-19728

[jira] [Comment Edited] (SPARK-16931) PySpark access to data-frame bucketing api

2017-02-19 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873710#comment-15873710 ] Maciej Szymkiewicz edited comment on SPARK-16931 at 2/19/17 2:09 PM: -

[jira] [Reopened] (SPARK-16931) PySpark access to data-frame bucketing api

2017-02-19 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz reopened SPARK-16931: Should be implemented to achieve feature parity. > PySpark access to data-frame

[jira] [Commented] (SPARK-16931) PySpark access to data-frame bucketing api

2017-02-19 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873709#comment-15873709 ] Maciej Szymkiewicz commented on SPARK-16931: Thanks [~sowen]. I'll reopen this and if there

[jira] [Commented] (SPARK-16931) PySpark access to data-frame bucketing api

2017-02-18 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15873329#comment-15873329 ] Maciej Szymkiewicz commented on SPARK-16931: [~sowen] Is there any particular reason for

[jira] [Resolved] (SPARK-19163) Lazy creation of the _judf

2017-02-14 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz resolved SPARK-19163. Resolution: Fixed Fix Version/s: 2.2.0 > Lazy creation of the _judf >

[jira] [Commented] (SPARK-19163) Lazy creation of the _judf

2017-02-13 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15864110#comment-15864110 ] Maciej Szymkiewicz commented on SPARK-19163: [~holdenk] I see you've sorted out Jira

[jira] [Updated] (SPARK-19161) Improving UDF Docstrings

2017-02-08 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19161: --- Priority: Minor (was: Major) > Improving UDF Docstrings >

[jira] [Updated] (SPARK-19162) UserDefinedFunction constructor should verify that func is callable

2017-02-08 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19162: --- Priority: Minor (was: Major) > UserDefinedFunction constructor should verify that

[jira] [Updated] (SPARK-19165) UserDefinedFunction should verify call arguments and provide readable exception in case of mismatch

2017-02-08 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19165: --- Priority: Minor (was: Major) > UserDefinedFunction should verify call arguments and

[jira] [Updated] (SPARK-19161) Improving UDF Docstrings

2017-02-08 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19161: --- Priority: Major (was: Minor) > Improving UDF Docstrings >

[jira] [Created] (SPARK-19506) Missing warnings import in pyspark.ml.util

2017-02-07 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19506: -- Summary: Missing warnings import in pyspark.ml.util Key: SPARK-19506 URL: https://issues.apache.org/jira/browse/SPARK-19506 Project: Spark Issue

[jira] [Created] (SPARK-19475) (ML|MLlib).linalg.DenseVector method delegation fails for __neg__

2017-02-06 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19475: -- Summary: (ML|MLlib).linalg.DenseVector method delegation fails for __neg__ Key: SPARK-19475 URL: https://issues.apache.org/jira/browse/SPARK-19475

[jira] [Created] (SPARK-19467) PySpark ML shouldn't use circular imports

2017-02-05 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19467: -- Summary: PySpark ML shouldn't use circular imports Key: SPARK-19467 URL: https://issues.apache.org/jira/browse/SPARK-19467 Project: Spark Issue

[jira] [Comment Edited] (SPARK-13802) Fields order in Row(**kwargs) is not consistent with Schema.toInternal method

2017-02-04 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15853008#comment-15853008 ] Maciej Szymkiewicz edited comment on SPARK-13802 at 2/5/17 12:45 AM: -

[jira] [Commented] (SPARK-13802) Fields order in Row(**kwargs) is not consistent with Schema.toInternal method

2017-02-04 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15853008#comment-15853008 ] Maciej Szymkiewicz commented on SPARK-13802: [~szymonm] Realistically it is rather unlikely

[jira] [Created] (SPARK-19454) Improve DataFrame.replace API

2017-02-03 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19454: -- Summary: Improve DataFrame.replace API Key: SPARK-19454 URL: https://issues.apache.org/jira/browse/SPARK-19454 Project: Spark Issue Type:

[jira] [Updated] (SPARK-19453) Correct DataFrame.replace docs

2017-02-03 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19453: --- Summary: Correct DataFrame.replace docs (was: Correct Column.replace docs) >

[jira] [Updated] (SPARK-19453) Correct Column.replace docs

2017-02-03 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19453: --- Summary: Correct Column.replace docs (was: Correct ) > Correct Column.replace docs

[jira] [Created] (SPARK-19453) Correct

2017-02-03 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19453: -- Summary: Correct Key: SPARK-19453 URL: https://issues.apache.org/jira/browse/SPARK-19453 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-19429) Column.__getitem__ should support slice arguments

2017-02-01 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19429: -- Summary: Column.__getitem__ should support slice arguments Key: SPARK-19429 URL: https://issues.apache.org/jira/browse/SPARK-19429 Project: Spark

[jira] [Created] (SPARK-19427) UserDefinedFunction should support data types strings

2017-02-01 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19427: -- Summary: UserDefinedFunction should support data types strings Key: SPARK-19427 URL: https://issues.apache.org/jira/browse/SPARK-19427 Project: Spark

[jira] (SPARK-13802) Fields order in Row(**kwargs) is not consistent with Schema.toInternal method

2017-01-30 Thread Maciej Szymkiewicz (JIRA)
Title: Message Title Maciej Szymkiewicz commented on SPARK-13802

[jira] (SPARK-19403) pyspark.sql.column exports non-existent names

2017-01-30 Thread Maciej Szymkiewicz (JIRA)
Title: Message Title Maciej Szymkiewicz created an issue

[jira] (SPARK-15559) TopicAndPartition should provide __hash__ method

2017-01-30 Thread Maciej Szymkiewicz (JIRA)
Title: Message Title Maciej Szymkiewicz commented on SPARK-15559

[jira] (SPARK-15559) TopicAndPartition should provide __hash__ method

2017-01-30 Thread Maciej Szymkiewicz (JIRA)
Title: Message Title Maciej Szymkiewicz closed an issue as Duplicate

[jira] [Commented] (SPARK-12157) Support numpy types as return values of Python UDFs

2017-01-21 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15832907#comment-15832907 ] Maciej Szymkiewicz commented on SPARK-12157: I've been looking at this in context of

[jira] [Updated] (SPARK-19224) [PYSPARK] Python tests organization

2017-01-16 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19224: --- Target Version/s: (was: 2.2.0) Fix Version/s: (was: 2.2.0) > [PYSPARK]

[jira] [Commented] (SPARK-19164) Review of UserDefinedFunction._broadcast

2017-01-12 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15820529#comment-15820529 ] Maciej Szymkiewicz commented on SPARK-19164: [~rxin] I am particularly interested in

[jira] [Comment Edited] (SPARK-19161) Improving UDF Docstrings

2017-01-11 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15817902#comment-15817902 ] Maciej Szymkiewicz edited comment on SPARK-19161 at 1/11/17 10:20 AM:

[jira] [Commented] (SPARK-19161) Improving UDF Docstrings

2017-01-11 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15817902#comment-15817902 ] Maciej Szymkiewicz commented on SPARK-19161: If we decide to use function and not subclass

[jira] [Comment Edited] (SPARK-19159) PySpark UDF API improvements

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15816557#comment-15816557 ] Maciej Szymkiewicz edited comment on SPARK-19159 at 1/10/17 11:45 PM:

[jira] [Commented] (SPARK-19159) PySpark UDF API improvements

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15816557#comment-15816557 ] Maciej Szymkiewicz commented on SPARK-19159: [~rdblue] There are all independent. You can

[jira] [Commented] (SPARK-19164) Review of UserDefinedFunction._broadcast

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15816361#comment-15816361 ] Maciej Szymkiewicz commented on SPARK-19164: [~rxin] Could you review this? > Review of

[jira] [Updated] (SPARK-19164) Review of UserDefinedFunction._broadcast

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19164: --- Description: It doesn't look like {{UserDefinedFunction._broadcast}} is used at all.

[jira] [Created] (SPARK-19165) UserDefinedFunction should verify call arguments and provide readable exception in case of mismatch

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19165: -- Summary: UserDefinedFunction should verify call arguments and provide readable exception in case of mismatch Key: SPARK-19165 URL:

[jira] [Updated] (SPARK-19162) UserDefinedFunction constructor should verify that func is callable

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19162: --- Description: Current state Right now `UserDefinedFunctions` don't perform any input

[jira] [Issue Comment Deleted] (SPARK-19162) UserDefinedFunction constructor should verify that func is callable

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19162: --- Comment: was deleted (was: This could be further extend to basic validation of the

[jira] [Updated] (SPARK-19162) UserDefinedFunction constructor should verify that func is callable

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19162: --- Description: Current state Right now `UserDefinedFunctions` don't perform any input

[jira] [Updated] (SPARK-19162) UserDefinedFunction constructor should verify that func is callable

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19162: --- Summary: UserDefinedFunction constructor should verify that func is callable (was:

[jira] [Updated] (SPARK-19162) UDF input types validation

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19162: --- Summary: UDF input types validation (was: Input types validation) > UDF input types

[jira] [Commented] (SPARK-19162) Input types validation

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15816232#comment-15816232 ] Maciej Szymkiewicz commented on SPARK-19162: This could be further extend to basic validation

[jira] [Updated] (SPARK-19161) Improving UDF Docstrings

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19161: --- Affects Version/s: (was: 1.5.0) > Improving UDF Docstrings >

[jira] [Updated] (SPARK-19163) Lazy creation of the _judf

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19163: --- Affects Version/s: (was: 1.5.0) > Lazy creation of the _judf >

[jira] [Updated] (SPARK-19162) Input types validation

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19162: --- Affects Version/s: (was: 1.5.0) > Input types validation >

[jira] [Updated] (SPARK-19160) Decorator for UDF creation.

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19160: --- Affects Version/s: (was: 1.5.0) > Decorator for UDF creation. >

[jira] [Updated] (SPARK-19159) PySpark UDF API improvements

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19159: --- Affects Version/s: (was: 1.5.0) > PySpark UDF API improvements >

[jira] [Created] (SPARK-19164) Review of UserDefinedFunction._broadcast

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19164: -- Summary: Review of UserDefinedFunction._broadcast Key: SPARK-19164 URL: https://issues.apache.org/jira/browse/SPARK-19164 Project: Spark Issue

[jira] [Updated] (SPARK-19160) Decorator for UDF creation.

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19160: --- Affects Version/s: 2.2.0 > Decorator for UDF creation. > ---

[jira] [Updated] (SPARK-19159) PySpark UDF API improvements

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19159: --- Affects Version/s: 2.2.0 > PySpark UDF API improvements >

[jira] [Created] (SPARK-19163) Lazy creation of the _judf

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19163: -- Summary: Lazy creation of the _judf Key: SPARK-19163 URL: https://issues.apache.org/jira/browse/SPARK-19163 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-19162) Input types validation

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19162: -- Summary: Input types validation Key: SPARK-19162 URL: https://issues.apache.org/jira/browse/SPARK-19162 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-19160) Decorator for UDF creation.

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-19160: --- Summary: Decorator for UDF creation. (was: UDF creation) > Decorator for UDF

[jira] [Created] (SPARK-19161) Improving UDF Docstrings

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19161: -- Summary: Improving UDF Docstrings Key: SPARK-19161 URL: https://issues.apache.org/jira/browse/SPARK-19161 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-19160) UDF creation

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19160: -- Summary: UDF creation Key: SPARK-19160 URL: https://issues.apache.org/jira/browse/SPARK-19160 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-19159) PySpark UDF API improvements

2017-01-10 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-19159: -- Summary: PySpark UDF API improvements Key: SPARK-19159 URL: https://issues.apache.org/jira/browse/SPARK-19159 Project: Spark Issue Type:

[jira] [Created] (SPARK-18690) Backward compatibility of unbounded frames

2016-12-02 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-18690: -- Summary: Backward compatibility of unbounded frames Key: SPARK-18690 URL: https://issues.apache.org/jira/browse/SPARK-18690 Project: Spark Issue

[jira] [Commented] (SPARK-16626) Code duplication after SPARK-14906

2016-10-07 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15554978#comment-15554978 ] Maciej Szymkiewicz commented on SPARK-16626: Oh well. Let's mark it a won't fix. No reason to

[jira] [Commented] (SPARK-17587) SparseVector __getitem__ should follow __getitem__ contract

2016-10-04 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15545262#comment-15545262 ] Maciej Szymkiewicz commented on SPARK-17587: I would probably go with 2.1.0 alone and

[jira] [Commented] (SPARK-16589) Chained cartesian produces incorrect number of records

2016-10-04 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15545239#comment-15545239 ] Maciej Szymkiewicz commented on SPARK-16589: Not actively so if you want to give it a shot go

[jira] [Updated] (SPARK-17756) java.lang.ClassCastException when using cartesian with DStream.transform

2016-10-01 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-17756: --- External issue URL: http://stackoverflow.com/q/39804337/1560062 >

[jira] [Created] (SPARK-17756) java.lang.ClassCastException when using cartesian with DStream.transform

2016-10-01 Thread Maciej Szymkiewicz (JIRA)
Maciej Szymkiewicz created SPARK-17756: -- Summary: java.lang.ClassCastException when using cartesian with DStream.transform Key: SPARK-17756 URL: https://issues.apache.org/jira/browse/SPARK-17756

[jira] [Issue Comment Deleted] (SPARK-2620) case class cannot be used as key for reduce

2016-10-01 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-2620: -- Comment: was deleted (was: With Scala 2.11 we get around by creating a package: {code}

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2016-10-01 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15538581#comment-15538581 ] Maciej Szymkiewicz commented on SPARK-2620: --- With Scala 2.11 we get around by creating a

[jira] [Updated] (SPARK-2620) case class cannot be used as key for reduce

2016-10-01 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz updated SPARK-2620: -- Affects Version/s: 1.3.0 1.4.0 1.5.0 >

<    1   2   3   4   5   6   7   >