yu peng created SPARK-34825:
---
Summary: pyspark.sql.function.lit is treating '1' the same as 1
Key: SPARK-34825
URL: https://issues.apache.org/jira/browse/SPARK-34825
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-22237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yu Peng updated SPARK-22237:
Description:
SPARK-10643 is added to allow spark-submit script to download jars/files from
remote hadoop f
Yu Peng created SPARK-22237:
---
Summary: Spark submit script should use downloaded files in
standalone/local client mode
Key: SPARK-22237
URL: https://issues.apache.org/jira/browse/SPARK-22237
Project: Spark
yu peng created SPARK-21540:
---
Summary: add spark.sql.functions.map_keys and
spark.sql.functions.map_values
Key: SPARK-21540
URL: https://issues.apache.org/jira/browse/SPARK-21540
Project: Spark
Is
Yu Peng created SPARK-20860:
---
Summary: Make spark-submit download remote files to local in
client mode
Key: SPARK-20860
URL: https://issues.apache.org/jira/browse/SPARK-20860
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-1359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15981794#comment-15981794
]
yu peng commented on SPARK-1359:
i think by randomly shuffle partitions and do gradient De
yu peng created SPARK-20418:
---
Summary: multi-label classification support
Key: SPARK-20418
URL: https://issues.apache.org/jira/browse/SPARK-20418
Project: Spark
Issue Type: New Feature
Co
[
https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
yu peng updated SPARK-19962:
Issue Type: New Feature (was: Wish)
> add DictVectorizor for DataFrame
>
[
https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15928123#comment-15928123
]
yu peng commented on SPARK-19962:
-
yeah, exactly.. i would love to use FeatureHasher when
[
https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926723#comment-15926723
]
yu peng edited comment on SPARK-19962 at 3/15/17 6:42 PM:
--
yeah,
[
https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926723#comment-15926723
]
yu peng commented on SPARK-19962:
-
yeah, with one indexer that performing like mutually e
[
https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926559#comment-15926559
]
yu peng commented on SPARK-19962:
-
>Well, it's maintained by the StringIndexerModel for y
[
https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
yu peng updated SPARK-19962:
Description:
it's really useful to have something like
sklearn.feature_extraction.DictVectorizor
Since ou
[
https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926537#comment-15926537
]
yu peng commented on SPARK-19962:
-
yeah.. StringIndexer does some job on single column an
[
https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
yu peng updated SPARK-19962:
Description:
it's really useful to have something like
sklearn.feature_extraction.DictVectorizor
Since ou
[
https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926458#comment-15926458
]
yu peng commented on SPARK-19962:
-
^ i have updated the description
> add DictVectorizor
[
https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
yu peng updated SPARK-19962:
Description:
it's really useful to have something like
sklearn.feature_extraction.DictVectorizor
Since ou
yu peng created SPARK-19962:
---
Summary: add DictVectorizor for DataFrame
Key: SPARK-19962
URL: https://issues.apache.org/jira/browse/SPARK-19962
Project: Spark
Issue Type: Wish
Components:
Yu Peng created SPARK-17711:
---
Summary: Compress rolled executor logs
Key: SPARK-17711
URL: https://issues.apache.org/jira/browse/SPARK-17711
Project: Spark
Issue Type: New Feature
Repor
yu peng created SPARK-15175:
---
Summary: using csv.DictReader and existing json dataframe reader
as work around to support csv reader
Key: SPARK-15175
URL: https://issues.apache.org/jira/browse/SPARK-15175
Pr
20 matches
Mail list logo