[jira] [Created] (SPARK-34825) pyspark.sql.function.lit is treating '1' the same as 1

2021-03-22 Thread yu peng (Jira)
yu peng created SPARK-34825: --- Summary: pyspark.sql.function.lit is treating '1' the same as 1 Key: SPARK-34825 URL: https://issues.apache.org/jira/browse/SPARK-34825 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-22237) Spark submit script should use downloaded files in standalone/local client mode

2017-10-10 Thread Yu Peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Peng updated SPARK-22237: Description: SPARK-10643 is added to allow spark-submit script to download jars/files from remote hadoop f

[jira] [Created] (SPARK-22237) Spark submit script should use downloaded files in standalone/local client mode

2017-10-10 Thread Yu Peng (JIRA)
Yu Peng created SPARK-22237: --- Summary: Spark submit script should use downloaded files in standalone/local client mode Key: SPARK-22237 URL: https://issues.apache.org/jira/browse/SPARK-22237 Project: Spark

[jira] [Created] (SPARK-21540) add spark.sql.functions.map_keys and spark.sql.functions.map_values

2017-07-26 Thread yu peng (JIRA)
yu peng created SPARK-21540: --- Summary: add spark.sql.functions.map_keys and spark.sql.functions.map_values Key: SPARK-21540 URL: https://issues.apache.org/jira/browse/SPARK-21540 Project: Spark Is

[jira] [Created] (SPARK-20860) Make spark-submit download remote files to local in client mode

2017-05-23 Thread Yu Peng (JIRA)
Yu Peng created SPARK-20860: --- Summary: Make spark-submit download remote files to local in client mode Key: SPARK-20860 URL: https://issues.apache.org/jira/browse/SPARK-20860 Project: Spark Issue

[jira] [Commented] (SPARK-1359) SGD implementation is not efficient

2017-04-24 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15981794#comment-15981794 ] yu peng commented on SPARK-1359: i think by randomly shuffle partitions and do gradient De

[jira] [Created] (SPARK-20418) multi-label classification support

2017-04-20 Thread yu peng (JIRA)
yu peng created SPARK-20418: --- Summary: multi-label classification support Key: SPARK-20418 URL: https://issues.apache.org/jira/browse/SPARK-20418 Project: Spark Issue Type: New Feature Co

[jira] [Updated] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-20 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yu peng updated SPARK-19962: Issue Type: New Feature (was: Wish) > add DictVectorizor for DataFrame >

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-16 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15928123#comment-15928123 ] yu peng commented on SPARK-19962: - yeah, exactly.. i would love to use FeatureHasher when

[jira] [Comment Edited] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926723#comment-15926723 ] yu peng edited comment on SPARK-19962 at 3/15/17 6:42 PM: -- yeah,

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926723#comment-15926723 ] yu peng commented on SPARK-19962: - yeah, with one indexer that performing like mutually e

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926559#comment-15926559 ] yu peng commented on SPARK-19962: - >Well, it's maintained by the StringIndexerModel for y

[jira] [Updated] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yu peng updated SPARK-19962: Description: it's really useful to have something like sklearn.feature_extraction.DictVectorizor Since ou

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926537#comment-15926537 ] yu peng commented on SPARK-19962: - yeah.. StringIndexer does some job on single column an

[jira] [Updated] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yu peng updated SPARK-19962: Description: it's really useful to have something like sklearn.feature_extraction.DictVectorizor Since ou

[jira] [Commented] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926458#comment-15926458 ] yu peng commented on SPARK-19962: - ^ i have updated the description > add DictVectorizor

[jira] [Updated] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yu peng updated SPARK-19962: Description: it's really useful to have something like sklearn.feature_extraction.DictVectorizor Since ou

[jira] [Created] (SPARK-19962) add DictVectorizor for DataFrame

2017-03-15 Thread yu peng (JIRA)
yu peng created SPARK-19962: --- Summary: add DictVectorizor for DataFrame Key: SPARK-19962 URL: https://issues.apache.org/jira/browse/SPARK-19962 Project: Spark Issue Type: Wish Components:

[jira] [Created] (SPARK-17711) Compress rolled executor logs

2016-09-28 Thread Yu Peng (JIRA)
Yu Peng created SPARK-17711: --- Summary: Compress rolled executor logs Key: SPARK-17711 URL: https://issues.apache.org/jira/browse/SPARK-17711 Project: Spark Issue Type: New Feature Repor

[jira] [Created] (SPARK-15175) using csv.DictReader and existing json dataframe reader as work around to support csv reader

2016-05-06 Thread yu peng (JIRA)
yu peng created SPARK-15175: --- Summary: using csv.DictReader and existing json dataframe reader as work around to support csv reader Key: SPARK-15175 URL: https://issues.apache.org/jira/browse/SPARK-15175 Pr