Github user ericl commented on the issue:
https://github.com/apache/spark/pull/16090
Filed
https://issues.apache.org/jira/browse/SPARK-18725
https://issues.apache.org/jira/browse/SPARK-18726
---
If your project is set up for it, you can reply to this email and have your
reply
Github user cloud-fan commented on the issue:
https://github.com/apache/spark/pull/16090
LGTM, merging to master/2.1!
@ericl please create tickets for the other 2 issues
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as
Github user ericl commented on the issue:
https://github.com/apache/spark/pull/16090
Yeah I was wondering if we should also try to fix that. It seems maybe not
as bad since unpartitioned tables usually aren't that big.
We can create separate tickets for investigating that,
Github user cloud-fan commented on the issue:
https://github.com/apache/spark/pull/16090
After looking more at the code, now I agree with your approach. One
question, seems we still scan the files when creating a unpartitioned external
data source table?
---
If your project is set
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69630/
Test PASSed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69630 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69630/consoleFull)**
for PR 16090 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69630 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69630/consoleFull)**
for PR 16090 at commit
Github user ericl commented on the issue:
https://github.com/apache/spark/pull/16090
Not sure I follow - could you explain more on why that would resolve the
issue?
Btw, I reverted this pr to b405635, which passes all tests.
---
If your project is set up for it, you can
Github user cloud-fan commented on the issue:
https://github.com/apache/spark/pull/16090
If we are going to hack it, how about this?
```
val dataSource = DataSource(...)
if (classOf[FileFormat].isAssignableFrom(dataSource.providingClass)) {
Github user ericl commented on the issue:
https://github.com/apache/spark/pull/16090
Seems like the caching broke a bunch of tests. I'll take a look at this
again tomorrow.
On Fri, Dec 2, 2016, 7:49 PM UCB AMPLab wrote:
> Test FAILed.
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69600/
Test FAILed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69600 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69600/consoleFull)**
for PR 16090 at commit
Github user ericl commented on the issue:
https://github.com/apache/spark/pull/16090
cc @rxin please merge unless wenchen gets to it first
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69600 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69600/consoleFull)**
for PR 16090 at commit
Github user ericl commented on the issue:
https://github.com/apache/spark/pull/16090
Fixed by adding a private cache to `Datasource`, which is used to avoid the
duplicate file reads with InMemoryIndex.
---
If your project is set up for it, you can reply to this email and have your
Github user ericl commented on the issue:
https://github.com/apache/spark/pull/16090
Seems like we also create InMemoryFileIndex twice for non-catalog tables.
Let me try to fix that too.
---
If your project is set up for it, you can reply to this email and have your
reply appear on
Github user ericl commented on the issue:
https://github.com/apache/spark/pull/16090
I looked at avoiding the creation of a CatalogFileIndex, but the way table
resolution works right now, the only way is to create some sort of dummy file
index class that does not support scans. It's
Github user cloud-fan commented on the issue:
https://github.com/apache/spark/pull/16090
My main concern is that, in `CreateDataSourceTableCommand`, we call
`DataSource.resolveRelation` to infer the schema and partition columns. At that
time, the table is not created yet, so
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69536/
Test PASSed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69536 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69536/consoleFull)**
for PR 16090 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69534/
Test FAILed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69534 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69534/consoleFull)**
for PR 16090 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69536 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69536/consoleFull)**
for PR 16090 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69534 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69534/consoleFull)**
for PR 16090 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69514/
Test FAILed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69514 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69514/consoleFull)**
for PR 16090 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69514 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69514/consoleFull)**
for PR 16090 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69437/
Test FAILed.
---
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69437 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69437/consoleFull)**
for PR 16090 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69437 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69437/consoleFull)**
for PR 16090 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69435/
Test FAILed.
---
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69435 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69435/consoleFull)**
for PR 16090 at commit
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/16090
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/16090
**[Test build #69435 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69435/consoleFull)**
for PR 16090 at commit
Github user ericl commented on the issue:
https://github.com/apache/spark/pull/16090
@rxin
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the
41 matches
Mail list logo