Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/10805
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is ena
Github user rxin commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-173085873
I've merged this in master. Thanks.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does n
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-173080338
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your projec
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-173080339
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-173080189
**[Test build #49750 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49750/consoleFull)**
for PR 10805 at commit
[`cd9f742`](https://g
Github user rxin commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-173068930
LGTM
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-173065443
**[Test build #49750 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49750/consoleFull)**
for PR 10805 at commit
[`cd9f742`](https://gi
Github user aarondav commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50206066
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -107,3 +114,28 @@ private[csv] object ParseMode
Github user aarondav commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50206079
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -107,3 +114,28 @@ private[csv] object ParseMode
Github user HyukjinKwon commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-173022112
I see. I will anyway try to figure this out though. I somehow this might be
a bit too much as almost all files would have proper extensions and I think the
(almost)
Github user rxin commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-173017720
Yea I'm thinking we should also support specifying options, and it is
"auto" by default which decides based on extensions.
---
If your project is set up for it, you can
Github user HyukjinKwon commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-173016289
Oh yes it does. Actually I am reading compressed files in the test I added
[here](https://github.com/HyukjinKwon/spark/blob/SPARK-12420/sql/core/src/test/scala/org/
Github user rxin commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172944028
Oh one thing: this doesn't support reading with compression yet, does it?
---
If your project is set up for it, you can reply to this email and have your
reply appear on
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172816804
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172816803
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your projec
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172816508
**[Test build #49679 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49679/consoleFull)**
for PR 10805 at commit
[`0245eea`](https://g
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172813600
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your projec
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172813603
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172813154
**[Test build #49676 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49676/consoleFull)**
for PR 10805 at commit
[`6400b76`](https://g
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172790523
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your projec
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172790525
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172789851
**[Test build #49671 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49671/consoleFull)**
for PR 10805 at commit
[`adb9eb2`](https://g
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172787690
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172787688
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your projec
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172785578
**[Test build #49679 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49679/consoleFull)**
for PR 10805 at commit
[`0245eea`](https://gi
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172780590
**[Test build #49676 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49676/consoleFull)**
for PR 10805 at commit
[`6400b76`](https://gi
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172780089
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172780088
Merged build finished. Test FAILed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your projec
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50085687
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVRelation.scala
---
@@ -99,6 +100,15 @@ private[csv] class CSVRelation(
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50085643
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -73,6 +76,14 @@ private[sql] case class CSVParamete
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50085474
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -107,3 +117,28 @@ private[csv] object ParseModes {
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50085424
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -107,3 +117,28 @@ private[csv] object ParseModes {
Github user HyukjinKwon commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-17234
Although `CSVCompressionCodecs` might be shared with JSON datasource, I
will make that share this at the separate PR for JSON.
---
If your project is set up for it
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50081941
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -73,6 +82,12 @@ private[sql] case class CSVParamete
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50081902
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -73,6 +82,12 @@ private[sql] case class CSVParamete
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50081670
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -44,6 +46,13 @@ private[sql] case class CSVParamete
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172768286
**[Test build #49671 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49671/consoleFull)**
for PR 10805 at commit
[`adb9eb2`](https://gi
Github user HyukjinKwon commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172766510
Supported shorten names for compression codecs are below (case insensitive):
`bzip2` -> `org.apache.hadoop.io.compress.BZip2Codec`
`gzip` -> `org.apache.
Github user HyukjinKwon commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172747397
I will resolve conflicts and update this soon.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If yo
Github user rxin commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172745372
Yup we are dropping Hadoop 1.x support, so it is OK to have it only for
Hadoop 2.x.
---
If your project is set up for it, you can reply to this email and have your
reply
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50074461
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -71,6 +71,8 @@ private[sql] case class CSVParameter
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50074385
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -71,6 +71,8 @@ private[sql] case class CSVParameter
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172709720
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172709719
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your projec
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172709347
**[Test build #49643 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49643/consoleFull)**
for PR 10805 at commit
[`e7ebddd`](https://g
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172691164
**[Test build #49643 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49643/consoleFull)**
for PR 10805 at commit
[`e7ebddd`](https://gi
Github user HyukjinKwon commented on a diff in the pull request:
https://github.com/apache/spark/pull/10805#discussion_r50053254
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParameters.scala
---
@@ -71,6 +71,8 @@ private[sql] case class CSVPa
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172517812
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172517811
Merged build finished. Test PASSed.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your projec
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172517601
**[Test build #49592 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49592/consoleFull)**
for PR 10805 at commit
[`5b57fc2`](https://g
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/10805#issuecomment-172496706
**[Test build #49592 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/49592/consoleFull)**
for PR 10805 at commit
[`5b57fc2`](https://gi
GitHub user HyukjinKwon opened a pull request:
https://github.com/apache/spark/pull/10805
[SPARK-12871][SQL] Support to specify the option for compression codec.
https://issues.apache.org/jira/browse/SPARK-12871
This PR added an option to support to specify compression codec.
52 matches
Mail list logo