[
https://issues.apache.org/jira/browse/SPARK-17175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Qian Huang updated SPARK-17175:
---
Summary: Add a expert formula to aggregationDepth of SharedParam (was: Add
a expert formula as
Qian Huang created SPARK-17175:
--
Summary: Add a expert formula as default value to aggregationDepth
of SharedParam
Key: SPARK-17175
URL: https://issues.apache.org/jira/browse/SPARK-17175
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-17174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Amit Baghel updated SPARK-17174:
Description:
add_months function currently supports Date types. If Column is Timestamp type
then
[
https://issues.apache.org/jira/browse/SPARK-17174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Amit Baghel updated SPARK-17174:
Description:
add_months function currently supports Date types. If Column is Timestamp type
then
[
https://issues.apache.org/jira/browse/SPARK-17171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17171:
Assignee: Apache Spark
> DAG will list all partitions in the graph
>
[
https://issues.apache.org/jira/browse/SPARK-17171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17171:
Assignee: (was: Apache Spark)
> DAG will list all partitions in the graph
>
[
https://issues.apache.org/jira/browse/SPARK-17171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429577#comment-15429577
]
Apache Spark commented on SPARK-17171:
--
User 'cenyuhai' has created a pull request for this issue:
Amit Baghel created SPARK-17174:
---
Summary: Provide support for Timestamp type Column in add_months
function to return HH:mm:ss
Key: SPARK-17174
URL: https://issues.apache.org/jira/browse/SPARK-17174
[
https://issues.apache.org/jira/browse/SPARK-17090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
DB Tsai resolved SPARK-17090.
-
Resolution: Fixed
Fix Version/s: 2.1.0
Issue resolved by pull request 14717
[
https://issues.apache.org/jira/browse/SPARK-17173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17173:
Assignee: Apache Spark
> Refactor R mllib for easier ml implementations
>
[
https://issues.apache.org/jira/browse/SPARK-17173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17173:
Assignee: (was: Apache Spark)
> Refactor R mllib for easier ml implementations
>
[
https://issues.apache.org/jira/browse/SPARK-17173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429515#comment-15429515
]
Apache Spark commented on SPARK-17173:
--
User 'felixcheung' has created a pull request for this
Felix Cheung created SPARK-17173:
Summary: Refactor R mllib for easier ml implementations
Key: SPARK-17173
URL: https://issues.apache.org/jira/browse/SPARK-17173
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-12666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Josh Rosen resolved SPARK-12666.
Resolution: Fixed
Fix Version/s: 2.1.0
2.0.1
Fixed for 2.0.1 and 2.1.0
[
https://issues.apache.org/jira/browse/SPARK-12666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Josh Rosen updated SPARK-12666:
---
Assignee: Bryan Cutler
> spark-shell --packages cannot load artifacts which are publishLocal'd by
[
https://issues.apache.org/jira/browse/SPARK-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17024:
Assignee: (was: Apache Spark)
> Weird behaviour of the DataFrame when a column name
[
https://issues.apache.org/jira/browse/SPARK-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429500#comment-15429500
]
Apache Spark commented on SPARK-17024:
--
User 'izeigerman' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17024:
Assignee: Apache Spark
> Weird behaviour of the DataFrame when a column name contains
[
https://issues.apache.org/jira/browse/SPARK-17163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429489#comment-15429489
]
Seth Hendrickson commented on SPARK-17163:
--
Just to sum up some key points:
1.
[
https://issues.apache.org/jira/browse/SPARK-17172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Davidson updated SPARK-17172:
Attachment: hiveUDFBug.ipynb
hiveUDFBug.html
> pyspak hiveContext can not
[
https://issues.apache.org/jira/browse/SPARK-17172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429465#comment-15429465
]
Andrew Davidson commented on SPARK-17172:
-
attached a notebook that demonstrates the bug. Also
[
https://issues.apache.org/jira/browse/SPARK-17172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429463#comment-15429463
]
Andrew Davidson commented on SPARK-17172:
-
related bug report :
Andrew Davidson created SPARK-17172:
---
Summary: pyspak hiveContext can not create UDF: Py4JJavaError: An
error occurred while calling None.org.apache.spark.sql.hive.HiveContext.
Key: SPARK-17172
URL:
[
https://issues.apache.org/jira/browse/SPARK-17168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429443#comment-15429443
]
Mathieu D commented on SPARK-17168:
---
This is error-prone, because the scenario I show will drop rows
[
https://issues.apache.org/jira/browse/SPARK-17171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen updated SPARK-17171:
--
Priority: Minor (was: Major)
Issue Type: Improvement (was: Bug)
> DAG will list all partitions
[
https://issues.apache.org/jira/browse/SPARK-17124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wenchen Fan resolved SPARK-17124.
-
Resolution: Fixed
Fix Version/s: 2.1.0
2.0.1
Issue resolved by pull
[
https://issues.apache.org/jira/browse/SPARK-17124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wenchen Fan updated SPARK-17124:
Assignee: Peter Lee
> RelationalGroupedDataset.agg should be order preserving and allow duplicate
[
https://issues.apache.org/jira/browse/SPARK-17168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429435#comment-15429435
]
Takeshi Yamamuro commented on SPARK-17168:
--
Why is having a header in each partition
[
https://issues.apache.org/jira/browse/SPARK-17104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wenchen Fan resolved SPARK-17104.
-
Resolution: Fixed
Fix Version/s: 2.1.0
2.0.1
Issue resolved by pull
[
https://issues.apache.org/jira/browse/SPARK-17104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wenchen Fan updated SPARK-17104:
Assignee: Liang-Chi Hsieh
> LogicalRelation.newInstance should follow the semantics of
>
[
https://issues.apache.org/jira/browse/SPARK-17171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
cen yuhai updated SPARK-17171:
--
Description:
When querying data from a partitioned table, DAG will list all partitions in
the
[
https://issues.apache.org/jira/browse/SPARK-17171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
cen yuhai updated SPARK-17171:
--
Description:
When querying data from a partitioned table, DAG will list all partitions in
the
[
https://issues.apache.org/jira/browse/SPARK-17171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
cen yuhai updated SPARK-17171:
--
Attachment: dag2.png
dag1.png
> DAG will list all partitions in the graph
>
cen yuhai created SPARK-17171:
-
Summary: DAG will list all partitions in the graph
Key: SPARK-17171
URL: https://issues.apache.org/jira/browse/SPARK-17171
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-16508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429401#comment-15429401
]
Apache Spark commented on SPARK-16508:
--
User 'felixcheung' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-17170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17170:
Assignee: Apache Spark
> Enable whole partition pruning for InMemoryTableScanExec
>
[
https://issues.apache.org/jira/browse/SPARK-17170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429390#comment-15429390
]
Apache Spark commented on SPARK-17170:
--
User 'pwoody' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-17170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17170:
Assignee: (was: Apache Spark)
> Enable whole partition pruning for
Patrick Woody created SPARK-17170:
-
Summary: Enable whole partition pruning for InMemoryTableScanExec
Key: SPARK-17170
URL: https://issues.apache.org/jira/browse/SPARK-17170
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-17046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-17046.
---
Resolution: Won't Fix
> prevent user using dataframe.select with empty param list
>
[
https://issues.apache.org/jira/browse/SPARK-17046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen updated SPARK-17046:
--
Affects Version/s: (was: 2.1.0)
Priority: Minor (was: Major)
> prevent user using
[
https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429357#comment-15429357
]
Apache Spark commented on SPARK-16320:
--
User 'srowen' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen updated SPARK-16320:
--
Assignee: Sean Owen
Priority: Minor (was: Critical)
Component/s: Documentation
Qian Huang created SPARK-17169:
--
Summary: To use scala macros to update code when
SharedParamsCodeGen.scala changed
Key: SPARK-17169
URL: https://issues.apache.org/jira/browse/SPARK-17169
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-17086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429348#comment-15429348
]
Sean Owen commented on SPARK-17086:
---
Yeah sounds good -- feel free to make a PR.
> QuantileDiscretizer
Mathieu D created SPARK-17168:
-
Summary: CSV with header is incorrectly read if file is partitioned
Key: SPARK-17168
URL: https://issues.apache.org/jira/browse/SPARK-17168
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-17159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17159:
Assignee: (was: Apache Spark)
> Improve FileInputDStream.findNewFiles list
[
https://issues.apache.org/jira/browse/SPARK-17159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17159:
Assignee: Apache Spark
> Improve FileInputDStream.findNewFiles list performance
>
[
https://issues.apache.org/jira/browse/SPARK-17159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429329#comment-15429329
]
Apache Spark commented on SPARK-17159:
--
User 'steveloughran' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-17164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429326#comment-15429326
]
Herman van Hovell edited comment on SPARK-17164 at 8/20/16 11:13 AM:
-
[
https://issues.apache.org/jira/browse/SPARK-17164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429326#comment-15429326
]
Herman van Hovell commented on SPARK-17164:
---
I tried this in Hive enabled Spark 1.6:
{noformat}
[
https://issues.apache.org/jira/browse/SPARK-17148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429282#comment-15429282
]
cen yuhai commented on SPARK-17148:
---
I don't know the root cause right now, I can't understand
[
https://issues.apache.org/jira/browse/SPARK-16961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429280#comment-15429280
]
Apache Spark commented on SPARK-16961:
--
User 'yanboliang' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-17167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17167:
Assignee: Apache Spark
> Issue Exceptions when Analyze Table on In-Memory Cataloged
[
https://issues.apache.org/jira/browse/SPARK-17167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17167:
Assignee: (was: Apache Spark)
> Issue Exceptions when Analyze Table on In-Memory
[
https://issues.apache.org/jira/browse/SPARK-17167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429266#comment-15429266
]
Apache Spark commented on SPARK-17167:
--
User 'gatorsmile' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-15698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Reynold Xin updated SPARK-15698:
Target Version/s: 2.0.1, 2.1.0
> Ability to remove old metadata for structure streaming
Xiao Li created SPARK-17167:
---
Summary: Issue Exceptions when Analyze Table on In-Memory
Cataloged Tables
Key: SPARK-17167
URL: https://issues.apache.org/jira/browse/SPARK-17167
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-17167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiao Li updated SPARK-17167:
Description: Currently, `Analyze Table` is only for Hive-serde tables. We
should issue exceptions in all
[
https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yanbo Liang resolved SPARK-15018.
-
Resolution: Fixed
Fix Version/s: 2.1.0
> PySpark ML Pipeline raises unclear error when no
[
https://issues.apache.org/jira/browse/SPARK-17165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429261#comment-15429261
]
Apache Spark commented on SPARK-17165:
--
User 'petermaxlee' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-17165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17165:
Assignee: (was: Apache Spark)
> FileStreamSource should not track the list of seen
[
https://issues.apache.org/jira/browse/SPARK-17165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-17165:
Assignee: Apache Spark
> FileStreamSource should not track the list of seen files
[
https://issues.apache.org/jira/browse/SPARK-17138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429258#comment-15429258
]
Yanbo Liang commented on SPARK-17138:
-
[~WeichenXu123] Please hold on this task, since SPARK-17163
[
https://issues.apache.org/jira/browse/SPARK-17137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429255#comment-15429255
]
Yanbo Liang commented on SPARK-17137:
-
Yes, I will do some performance test to weigh the trade-off.
[
https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429253#comment-15429253
]
Yanbo Liang commented on SPARK-17136:
-
Yes, only first order optimizer can scale well in number of
66 matches
Mail list logo