[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-28 Thread cowtowncoder
Github user cowtowncoder commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-176310354
  
@srowen Ah. Yes, just wanted to mention it; I don't have enough context so 
there may be specific reasons for picking up particular versions. Glad to see 
version upgrades coming along. My interests are selfish, being both the main 
author of Jackson, and a (relative new) Spark user at work. :)

With that, if there are problems from Jackson side I could help, do not 
hesitate to cc me and/or file issues at 
https://github.com/FasterXML/jackson-databind (or other specific components). I 
know version compatibility is an annoying issue, and would like to help if 
possible.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-28 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-176305142
  
I think that's OK. I was intending to propose a series of dependency 
updates for 2.x like this sort of thing. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-28 Thread cowtowncoder
Github user cowtowncoder commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-176300804
  
I know this issue is closed, but one suggestion I have would be to specify 
the latest Jackson 2.5 version, 2.5.5 instead of 2.5.3. While I understand 
caution in general, Jackson patch versions try to limit inclusion of fixes to 
safe ones and specifically there are no API changes. This also means that while 
it is good to keep even patch versions in sync across components there is very 
low risk of incompatibilities even if patch versions do differ.

In this specific case, `jackson-databind` 2.5.4 has 10 bug fixes -- see 
https://github.com/FasterXML/jackson/wiki/Jackson-Release-2.5.4 -- this is the 
main reason I would recommend use of latest version.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-11 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/10256


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-11 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-170484935
  
Merged to master


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-10 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-170346366
  
I'm going to go for this one since I think we may want to, in general, 
consider updating Spark dependencies in 2.x anyway. 2.5 is even old for 
Jackson, so good to bump up here as a side effect I think.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-10 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-170341303
  
**[Test build #2358 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2358/consoleFull)**
 for PR 10256 at commit 
[`a1f75c1`](https://github.com/apache/spark/commit/a1f75c177d0ee95ab247ac1d053e5b4516958cd0).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-10 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-170333530
  
**[Test build #2358 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2358/consoleFull)**
 for PR 10256 at commit 
[`a1f75c1`](https://github.com/apache/spark/commit/a1f75c177d0ee95ab247ac1d053e5b4516958cd0).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-10 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-170333472
  
@BrianLondon it should be running `dev/run-tests`, so that's something you 
can inspect or run locally if you need to. The doc there describes how to run 
one test. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-09 Thread BrianLondon
Github user BrianLondon commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-170270526
  
Manifest updated.  Is there documented somewhere what all the tests Travis 
runs are?  It seems to differ from what's described at 
http://spark.apache.org/docs/latest/building-spark.html


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-08 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-170116355
  
Ah right, you'll need to acknowledge the dep changes in the build file:

```
To update the manifest file, run './dev/test-dependencies.sh 
--replace-manifest'.
diff --git a/dev/deps/spark-deps-hadoop-2.2 
b/dev/pr-deps/spark-deps-hadoop-2.2
index e4373f7..cd3ff29 100644
--- a/dev/deps/spark-deps-hadoop-2.2
+++ b/dev/pr-deps/spark-deps-hadoop-2.2
@@ -84,13 +84,13 @@ hadoop-yarn-server-web-proxy-2.2.0.jar
 httpclient-4.3.2.jar
 httpcore-4.3.2.jar
 ivy-2.4.0.jar
-jackson-annotations-2.4.4.jar
-jackson-core-2.4.4.jar
+jackson-annotations-2.5.3.jar
+jackson-core-2.5.3.jar
 jackson-core-asl-1.9.13.jar
-jackson-databind-2.4.4.jar
+jackson-databind-2.5.3.jar
 jackson-jaxrs-1.9.13.jar
 jackson-mapper-asl-1.9.13.jar
-jackson-module-scala_2.10-2.4.4.jar
+jackson-module-scala_2.10-2.5.3.jar
 jackson-xc-1.9.13.jar
 janino-2.7.8.jar
 jansi-1.4.jar
```

This LGTM but I do think we'll have to keep it for 2.x only.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-08 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-170114543
  
**[Test build #2354 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2354/consoleFull)**
 for PR 10256 at commit 
[`6f3bbfd`](https://github.com/apache/spark/commit/6f3bbfd32d54777f5f479ef0229817f3521723d6).
 * This patch **fails build dependency tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-08 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-170114213
  
**[Test build #2354 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2354/consoleFull)**
 for PR 10256 at commit 
[`6f3bbfd`](https://github.com/apache/spark/commit/6f3bbfd32d54777f5f479ef0229817f3521723d6).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2016-01-08 Thread BrianLondon
Github user BrianLondon commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-170111349
  
Rebased against current master, ran tests (except for docker) locally, and 
tested kinesis example against aws hosted stream.  All worked.

My preference would be to get the version bump in as soon as practicable 
for two reasons.  First, and mostly selfishly, the inability to link any 
library that requires Jackson 2.5.x features is causing us problems in our 
production environment.  Second, we currently can just bump the version number 
because the Spark codebase appears compatible with both Jackson versions at 
present, but the longer we wait the more likely someone introduces an 
incompatible change.

That said, I fully understand the reluctance to replace a heavily used 
library mid release cycle.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-30 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-168074687
  
**[Test build #2271 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2271/consoleFull)**
 for PR 10256 at commit 
[`64858df`](https://github.com/apache/spark/commit/64858df11a590a872914312f55b181a493bfda1f).
 * This patch **fails Scala style tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-30 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-168073788
  
**[Test build #2271 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2271/consoleFull)**
 for PR 10256 at commit 
[`64858df`](https://github.com/apache/spark/commit/64858df11a590a872914312f55b181a493bfda1f).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-21 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-166281133
  
Can this wait until 2.0? I think it's going to anyway now. Then the 
dependency issue is fine.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-18 Thread marmbrus
Github user marmbrus commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-165907516
  
If the SQL/Hive tests are passing I'm good.  @srowen usually has good 
insight on dependency changes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-18 Thread MLnick
Github user MLnick commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-165865257
  
I've also run the Kinesis tests locally (against a live AWS account) 
against this PR and all tests pass. I'm +1 on this subject to agreement for the 
jackson dependency bump, for which I defer to SQL folks e.g. @marmbrus 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-15 Thread BrianLondon
Github user BrianLondon commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-164975298
  
Yeah, that looks like the same underlying issue of using AWS Java SDK 
1.10.0 or later with Spark.  I believe this version bump will fix it.  You can 
sometimes get around it by adding the following to your sbt build:

```scala
dependencyOverrides ++= Set(
  "com.fasterxml.jackson.core" % "jackson-core" % "2.4.4",
  "com.fasterxml.jackson.core" % "jackson-databind" % "2.4.4",
  "com.fasterxml.jackson.core" % "jackson-annotations" % "2.4.4"
)
```

That doesn't work however if the calls to the AWS library use any Jackson 
2.5.x functionality and some of them do.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-15 Thread JoshRosen
Github user JoshRosen commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-164932897
  
FYI, a bunch of `spark-redshift` users have run into the Jackson dependency 
conflict issue that's discussed here; see 
https://github.com/databricks/spark-redshift/issues/133 and 
https://github.com/databricks/spark-redshift/issues/121. Not commenting either 
way w.r.t. the dep. concerns themselves, but just wanted to point out that the 
pain of the Jackson conflict has impacts for a bunch of other applications too.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-15 Thread tdas
Github user tdas commented on a diff in the pull request:

https://github.com/apache/spark/pull/10256#discussion_r47709922
  
--- Diff: pom.xml ---
@@ -169,7 +169,7 @@
 ${scala.version}
 org.scala-lang
 1.9.13
-2.4.4
+2.5.3
--- End diff --

Those tests are i think enabled only in the default jenkins builder.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-15 Thread 3ourroom
Github user 3ourroom commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-164837398
  

NAVER - http://www.naver.com/


3ourr...@naver.com 님께 보내신 메일  이 
다음과 같은 이유로 전송 실패했습니다.



받는 사람이 회원님의 메일을 수신차단 하였습니다. 






---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-15 Thread BrianLondon
Github user BrianLondon commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-164834977
  
I removed the explicit dependence on the AWS Java SDK.  There's a newline 
that was added to `KinesisReceiver.scala` since the last test run.  I think if 
tests are re-run the kinesis part should be included, although someone more 
knowledgable than me about Travis can weigh in.

Tests and the kinesis example worked locally for me for what it's worth.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-14 Thread holdenk
Github user holdenk commented on a diff in the pull request:

https://github.com/apache/spark/pull/10256#discussion_r47591545
  
--- Diff: pom.xml ---
@@ -169,7 +169,7 @@
 ${scala.version}
 org.scala-lang
 1.9.13
-2.4.4
+2.5.3
--- End diff --

:(


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-14 Thread BrianLondon
Github user BrianLondon commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-164553929
  
It appears kcl pulls in 1.10.20.  I'm all for dropping the explicit 
dependency.  That probably wouldn't have prevented the regression in Spark 
1.5.2 though. At any rate, the jackson bump is necessary if someone wants to 
use the latest aws sdk (e.g. for Firehose).  


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-14 Thread holdenk
Github user holdenk commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-164536120
  
I'm not really sure why - skimming the git logs it seems the aws-sdk-java 
sdk dependency was added at the same time as the kcl dependency.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-14 Thread brkyvz
Github user brkyvz commented on a diff in the pull request:

https://github.com/apache/spark/pull/10256#discussion_r47545169
  
--- Diff: pom.xml ---
@@ -169,7 +169,7 @@
 ${scala.version}
 org.scala-lang
 1.9.13
-2.4.4
+2.5.3
--- End diff --

@holdenk Unfortunately the tests that actually mattered didn't run, they 
were ignored :(
```
[info] WithAggregationKinesisStreamSuite:
[info] - KinesisUtils API (202 milliseconds)
[info] - RDD generation (24 milliseconds)
[info] - basic operation [enable by setting env var ENABLE_KINESIS_TESTS=1] 
!!! IGNORED !!!
[info] - custom message handling [enable by setting env var 
ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - failure recovery [enable by setting env var 
ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Prepare KinesisTestUtils [enable by setting env var 
ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] WithoutAggregationKinesisStreamSuite:
[info] - KinesisUtils API (5 milliseconds)
[info] - RDD generation (6 milliseconds)
[info] - basic operation [enable by setting env var ENABLE_KINESIS_TESTS=1] 
!!! IGNORED !!!
[info] - custom message handling [enable by setting env var 
ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - failure recovery [enable by setting env var 
ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Prepare KinesisTestUtils [enable by setting env var 
ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] WithAggregationKinesisBackedBlockRDDSuite:
[info] - Basic reading from Kinesis [enable by setting env var 
ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Read data available in both block manager and Kinesis [enable by 
setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Read data available only in block manager, not in Kinesis [enable 
by setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Read data available only in Kinesis, not in block manager [enable 
by setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Read data available partially in block manager, rest in Kinesis 
[enable by setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Test isBlockValid skips block fetching from block manager [enable 
by setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Test whether RDD is valid after removing blocks from block anager 
[enable by setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] WithoutAggregationKinesisBackedBlockRDDSuite:
[info] - Basic reading from Kinesis [enable by setting env var 
ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Read data available in both block manager and Kinesis [enable by 
setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Read data available only in block manager, not in Kinesis [enable 
by setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Read data available only in Kinesis, not in block manager [enable 
by setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Read data available partially in block manager, rest in Kinesis 
[enable by setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Test isBlockValid skips block fetching from block manager [enable 
by setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
[info] - Test whether RDD is valid after removing blocks from block anager 
[enable by setting env var ENABLE_KINESIS_TESTS=1] !!! IGNORED !!!
```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-14 Thread holdenk
Github user holdenk commented on a diff in the pull request:

https://github.com/apache/spark/pull/10256#discussion_r47543005
  
--- Diff: pom.xml ---
@@ -169,7 +169,7 @@
 ${scala.version}
 org.scala-lang
 1.9.13
-2.4.4
+2.5.3
--- End diff --

@MLnick the tests seem to have run OK in 
https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2203/consoleFull
 (although jenkins seems to be having some issues)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-14 Thread holdenk
Github user holdenk commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-164527869
  
I think the next step is getting one of the jenkins admins (maybe @srowen 
or @brkyvz ) since they are already on this PR to tell jenkins ok to test and 
maybe see if @marmbrus has any concerns about the Jackson upgrade.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-14 Thread MLnick
Github user MLnick commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-164528959
  
@brkyvz @holdenk @tdas any particular reason we depend on `aws-sdk-java` 
explicitly? The KCL brings in those dependencies (in fact from `1.5.0` KCL only 
depends on the necessary AWS SDK components, i.e. `core`, `kinesis`, `dynamodb` 
and `cloudwatch`). Depending only on KCL would make things cleaner and avoid 
potential issues such as not bumping versions at the same time (e.g. #8957 and 
[related mailing list 
discussion](https://mail-archives.apache.org/mod_mbox/spark-user/201512.mbox/%3CCALD%2B6GMXKOR5%3DbGYQhezXbV_qaJsoi2Y2cSbaDETeefWbh84Qg%40mail.gmail.com%3E))


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-14 Thread MLnick
Github user MLnick commented on a diff in the pull request:

https://github.com/apache/spark/pull/10256#discussion_r47540953
  
--- Diff: pom.xml ---
@@ -169,7 +169,7 @@
 ${scala.version}
 org.scala-lang
 1.9.13
-2.4.4
+2.5.3
--- End diff --

@srowen did you manage to run the tests? We'd need to be sure nothing 
breaks with this version bump!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-14 Thread BrianLondon
Github user BrianLondon commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-164465363
  
@holdenk bump


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-11 Thread BrianLondon
Github user BrianLondon commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-163985429
  
@brkyvz Pushed up the KCL version and added a newline to KinesisReciever.  
Tests passed for me locally on a hadoop 2.6 build and the kinesis demos ran 
against a live AWS stream.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-11 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-163945864
  
**[Test build #2203 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2203/consoleFull)**
 for PR 10256 at commit 
[`47d0a59`](https://github.com/apache/spark/commit/47d0a59d059e37555f9558246cf2a615a4fddb08).
 * This patch passes all tests.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-11 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-163927639
  
**[Test build #2203 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2203/consoleFull)**
 for PR 10256 at commit 
[`47d0a59`](https://github.com/apache/spark/commit/47d0a59d059e37555f9558246cf2a615a4fddb08).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-11 Thread srowen
Github user srowen commented on a diff in the pull request:

https://github.com/apache/spark/pull/10256#discussion_r47350412
  
--- Diff: pom.xml ---
@@ -169,7 +169,7 @@
 ${scala.version}
 org.scala-lang
 1.9.13
-2.4.4
+2.5.3
--- End diff --

Yeah, this makes me nervous. I'll run the tests to see what happens.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-10 Thread brkyvz
Github user brkyvz commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-163802104
  
@BrianLondon What happens if you change the version of KCL to 1.6.1? Could 
you also please add a new line to anywhere in extras/kinesis-asl? Otherwise 
Kinesis tests don't run and we have no idea whether this is a safe change.

Thanks!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-10 Thread zsxwing
Github user zsxwing commented on a diff in the pull request:

https://github.com/apache/spark/pull/10256#discussion_r47303246
  
--- Diff: pom.xml ---
@@ -169,7 +169,7 @@
 ${scala.version}
 org.scala-lang
 1.9.13
-2.4.4
+2.5.3
--- End diff --

Is it dangerous to upgrade `jackson`? It's also used by SQL.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-10 Thread JoshRosen
Github user JoshRosen commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-163786007
  
/cc @brkyvz RE: the KCL library.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-10 Thread holdenk
Github user holdenk commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-163782601
  
Cool - if we don't need to upgrade the KCL library then thats probably 
good. Maybe good to see if @tdas or someone similar can take a look?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-10 Thread BrianLondon
Github user BrianLondon commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-163781633
  
I ran the demo with the following two commands against a stream 
(test-stream1) on my AWS account.  It performed as expected.

bin/run-example streaming.KinesisWordProducerASL test-stream1 
https://kinesis.us-east-1.amazonaws.com 100 10
bin/run-example streaming.KinesisWordCountASL testapp test-stream1 
https://kinesis.us-east-1.amazonaws.com

I happy to bump the KCL library as well.  With the bumped 
com.fasterxml.jackson.core version, the project build and passed all tests 
except docker-integration, which I can't run.  AWS Java SDK 1.10.40 calls a 
function that isn't provided by Jackson 2.4.x.  If raising that to 2.5.x isn't 
an option, then we can find whatever the last version of AWS Java SDK to not 
use it is. I know it works with 1.10.24, so somewhere between 1.10.24 and 
1.10.40 could be a temporary solution.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-10 Thread holdenk
Github user holdenk commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-163778571
  
Note we also depend depend on com.fasterxml.jackson.core 2.4.4 in Spark


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-10 Thread holdenk
Github user holdenk commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-163778072
  
Have you tested the current kinesis and works with the new API? Looking 
online it seems like we might also need to bump our kinesis client library if 
we upgrade the general aws sdk version.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/10256#issuecomment-163748299
  
Can one of the admins verify this patch?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-12269][Streaming][Kinesis] Update aws-j...

2015-12-10 Thread BrianLondon
GitHub user BrianLondon opened a pull request:

https://github.com/apache/spark/pull/10256

[SPARK-12269][Streaming][Kinesis] Update aws-java-sdk version

The current Spark Streaming kinesis connector references a quite old 
version 1.9.40 of the AWS Java SDK (1.10.40 is current). Numerous AWS features 
including Kinesis Firehose are unavailable in 1.9. Those two versions of the 
AWS SDK in turn require conflicting versions of Jackson (2.4.4 and 2.5.3 
respectively) such that one cannot include the current AWS SDK in a project 
that also uses the Spark Streaming Kinesis ASL.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/BrianLondon/spark master

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/10256.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #10256


commit 47d0a59d059e37555f9558246cf2a615a4fddb08
Author: BrianLondon 
Date:   2015-12-10T20:49:07Z

Updating aws sdk and jackson versions




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org