Re: [VOTE] Release Apache Hadoop 2.4.0
Hi, Updated a test result log based on the result of 2.4.0-rc0: https://gist.github.com/oza/9965197 IMO, there are some blockers to be fixed: * MAPREDUCE-5815(TestMRAppMaster failure) * YARN-1872(TestDistributedShell failure) * HDFS: TestSymlinkLocalFSFileSystem failure on Linux (I cannot find JIRA about this failure) Now I'm checking the problem reported by Azuryy. Thanks, - Tsuyoshi On Fri, Apr 4, 2014 at 8:55 AM, Tsuyoshi OZAWA ozawa.tsuyo...@gmail.com wrote: Hi, Ran tests and confirmed that some tests(TestSymlinkLocalFSFileSystem) fail. The log of the test failure is as follows: https://gist.github.com/oza/9965197 Should we fix or disable the feature? Thanks, - Tsuyoshi On Mon, Mar 31, 2014 at 6:22 PM, Arun C Murthy a...@hortonworks.com wrote: Folks, I've created a release candidate (rc0) for hadoop-2.4.0 that I would like to get released. The RC is available at: http://people.apache.org/~acmurthy/hadoop-2.4.0-rc0 The RC tag in svn is here: https://svn.apache.org/repos/asf/hadoop/common/tags/release-2.4.0-rc0 The maven artifacts are available via repository.apache.org. Please try the release and vote; the vote will run for the usual 7 days. thanks, Arun -- Arun C. Murthy Hortonworks Inc. http://hortonworks.com/ -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- - Tsuyoshi -- - Tsuyoshi
Build failed in Jenkins: Hadoop-Common-trunk #1089
See https://builds.apache.org/job/Hadoop-Common-trunk/1089/changes Changes: [kihwal] Fixing an error in CHANGES.txt [jeagles] HADOOP-10454. Provide FileContext version of har file system. (Kihwal Lee via jeagles) [zjshen] MAPREDUCE-5818. Added hsadmin command into mapred.cmd. Contributed by Jian He. [wheat9] HDFS-6190. Minor textual fixes in DFSClient. Contributed by Charles Lamb. -- [...truncated 62545 lines...] Adding reference: maven.local.repository [DEBUG] Initialize Maven Ant Tasks parsing buildfile jar:file:/home/jenkins/.m2/repository/org/apache/maven/plugins/maven-antrun-plugin/1.7/maven-antrun-plugin-1.7.jar!/org/apache/maven/ant/tasks/antlib.xml with URI = jar:file:/home/jenkins/.m2/repository/org/apache/maven/plugins/maven-antrun-plugin/1.7/maven-antrun-plugin-1.7.jar!/org/apache/maven/ant/tasks/antlib.xml from a zip file parsing buildfile jar:file:/home/jenkins/.m2/repository/org/apache/ant/ant/1.8.2/ant-1.8.2.jar!/org/apache/tools/ant/antlib.xml with URI = jar:file:/home/jenkins/.m2/repository/org/apache/ant/ant/1.8.2/ant-1.8.2.jar!/org/apache/tools/ant/antlib.xml from a zip file Class org.apache.maven.ant.tasks.AttachArtifactTask loaded from parent loader (parentFirst) +Datatype attachartifact org.apache.maven.ant.tasks.AttachArtifactTask Class org.apache.maven.ant.tasks.DependencyFilesetsTask loaded from parent loader (parentFirst) +Datatype dependencyfilesets org.apache.maven.ant.tasks.DependencyFilesetsTask Setting project property: test.build.dir - https://builds.apache.org/job/Hadoop-Common-trunk/ws/trunk/hadoop-common-project/target/test-dir Setting project property: test.exclude.pattern - _ Setting project property: hadoop.assemblies.version - 3.0.0-SNAPSHOT Setting project property: test.exclude - _ Setting project property: distMgmtSnapshotsId - apache.snapshots.https Setting project property: project.build.sourceEncoding - UTF-8 Setting project property: java.security.egd - file:///dev/urandom Setting project property: distMgmtSnapshotsUrl - https://repository.apache.org/content/repositories/snapshots Setting project property: distMgmtStagingUrl - https://repository.apache.org/service/local/staging/deploy/maven2 Setting project property: avro.version - 1.7.4 Setting project property: test.build.data - https://builds.apache.org/job/Hadoop-Common-trunk/ws/trunk/hadoop-common-project/target/test-dir Setting project property: commons-daemon.version - 1.0.13 Setting project property: hadoop.common.build.dir - https://builds.apache.org/job/Hadoop-Common-trunk/ws/trunk/hadoop-common-project/../../hadoop-common-project/hadoop-common/target Setting project property: testsThreadCount - 4 Setting project property: maven.test.redirectTestOutputToFile - true Setting project property: jdiff.version - 1.0.9 Setting project property: build.platform - Linux-i386-32 Setting project property: project.reporting.outputEncoding - UTF-8 Setting project property: distMgmtStagingName - Apache Release Distribution Repository Setting project property: protobuf.version - 2.5.0 Setting project property: failIfNoTests - false Setting project property: protoc.path - ${env.HADOOP_PROTOC_PATH} Setting project property: jersey.version - 1.9 Setting project property: distMgmtStagingId - apache.staging.https Setting project property: distMgmtSnapshotsName - Apache Development Snapshot Repository Setting project property: ant.file - https://builds.apache.org/job/Hadoop-Common-trunk/ws/trunk/hadoop-common-project/pom.xml [DEBUG] Setting properties with prefix: Setting project property: project.groupId - org.apache.hadoop Setting project property: project.artifactId - hadoop-common-project Setting project property: project.name - Apache Hadoop Common Project Setting project property: project.description - Apache Hadoop Common Project Setting project property: project.version - 3.0.0-SNAPSHOT Setting project property: project.packaging - pom Setting project property: project.build.directory - https://builds.apache.org/job/Hadoop-Common-trunk/ws/trunk/hadoop-common-project/target Setting project property: project.build.outputDirectory - https://builds.apache.org/job/Hadoop-Common-trunk/ws/trunk/hadoop-common-project/target/classes Setting project property: project.build.testOutputDirectory - https://builds.apache.org/job/Hadoop-Common-trunk/ws/trunk/hadoop-common-project/target/test-classes Setting project property: project.build.sourceDirectory - https://builds.apache.org/job/Hadoop-Common-trunk/ws/trunk/hadoop-common-project/src/main/java Setting project property: project.build.testSourceDirectory - https://builds.apache.org/job/Hadoop-Common-trunk/ws/trunk/hadoop-common-project/src/test/java Setting project property: localRepository -id: local url: file:///home/jenkins/.m2/repository/ layout: none Setting project property: settings.localRepository - /home/jenkins/.m2/repository Setting project property:
[jira] [Created] (HADOOP-10463) Bring RawLocalFileSystem test coverage to 100%
jay vyas created HADOOP-10463: - Summary: Bring RawLocalFileSystem test coverage to 100% Key: HADOOP-10463 URL: https://issues.apache.org/jira/browse/HADOOP-10463 Project: Hadoop Common Issue Type: Test Components: fs Reporter: jay vyas RawLocalFileSystem coverage is at about 80% (measured with cobertura) at the moment. A few notable untested code paths are: * primitiveMkdir * markSupported * int read() Lets get it as close as possible to 100% coverage. In the process of analyzing existing abstract tests which exersize RawLocalFileSystem, we will also pave the way for HADOOP-10461. -- This message was sent by Atlassian JIRA (v6.2#6252)
Re: [VOTE] Release Apache Hadoop 2.4.0
Thanks for helping Tsuyoshi. Pls mark them as Blockers and set the fix-version to 2.4.1. Thanks again. Arun On Apr 3, 2014, at 11:38 PM, Tsuyoshi OZAWA ozawa.tsuyo...@gmail.com wrote: Hi, Updated a test result log based on the result of 2.4.0-rc0: https://gist.github.com/oza/9965197 IMO, there are some blockers to be fixed: * MAPREDUCE-5815(TestMRAppMaster failure) * YARN-1872(TestDistributedShell failure) * HDFS: TestSymlinkLocalFSFileSystem failure on Linux (I cannot find JIRA about this failure) Now I'm checking the problem reported by Azuryy. Thanks, - Tsuyoshi On Fri, Apr 4, 2014 at 8:55 AM, Tsuyoshi OZAWA ozawa.tsuyo...@gmail.com wrote: Hi, Ran tests and confirmed that some tests(TestSymlinkLocalFSFileSystem) fail. The log of the test failure is as follows: https://gist.github.com/oza/9965197 Should we fix or disable the feature? Thanks, - Tsuyoshi On Mon, Mar 31, 2014 at 6:22 PM, Arun C Murthy a...@hortonworks.com wrote: Folks, I've created a release candidate (rc0) for hadoop-2.4.0 that I would like to get released. The RC is available at: http://people.apache.org/~acmurthy/hadoop-2.4.0-rc0 The RC tag in svn is here: https://svn.apache.org/repos/asf/hadoop/common/tags/release-2.4.0-rc0 The maven artifacts are available via repository.apache.org. Please try the release and vote; the vote will run for the usual 7 days. thanks, Arun -- Arun C. Murthy Hortonworks Inc. http://hortonworks.com/ -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- - Tsuyoshi -- - Tsuyoshi -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Plans of moving towards JDK7 in trunk
Hi, There have been multiple discussions on deprecating supports of JDK6 and moving towards JDK7. It looks to me that the consensus is that now hadoop is ready to drop the support of JDK6 and to move towards JDK7. Based on the consensus, I wonder whether it is a good time to start the migration. Here are my understandings of the current status: 1. There is no more public updates of JDK6 since Feb 2013. Users no longer get fixes of security vulnerabilities through official public updates. 2. Hadoop core is stuck with out-of-date dependency unless moving towards JDK7. (see http://hadoop.6.n7.nabble.com/very-old-dependencies-td71486.html) The implementation can also benefit from it thanks to the new functionalities in JDK7. 3. The code is ready for JDK7. Cloudera and Hortonworks have successful stories of supporting Hadoop on JDK7. It seems that the real work of moving to JDK7 is minimal. We only need to (1) make sure the jenkins are running on top of JDK7, and (2) to update the minimum required Java version from 6 to 7. Therefore I propose that let's move towards JDK7 in trunk in the short term. Your feedbacks are appreciated. Regards, Haohui -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: Plans of moving towards JDK7 in trunk
Haohui, Is the idea to compile/test with JDK7 and recommend it for runtime and stop there? Or to start using JDK7 API stuff as well? If the later is the case, then backporting stuff to branch-2 may break and patches may have to be refactored for JDK6. Given that branch-2 got GA status not so long ago, I assume it will be active for a while. What are your thoughts on this regard? Thanks On Fri, Apr 4, 2014 at 2:29 PM, Haohui Mai h...@hortonworks.com wrote: Hi, There have been multiple discussions on deprecating supports of JDK6 and moving towards JDK7. It looks to me that the consensus is that now hadoop is ready to drop the support of JDK6 and to move towards JDK7. Based on the consensus, I wonder whether it is a good time to start the migration. Here are my understandings of the current status: 1. There is no more public updates of JDK6 since Feb 2013. Users no longer get fixes of security vulnerabilities through official public updates. 2. Hadoop core is stuck with out-of-date dependency unless moving towards JDK7. (see http://hadoop.6.n7.nabble.com/very-old-dependencies-td71486.html) The implementation can also benefit from it thanks to the new functionalities in JDK7. 3. The code is ready for JDK7. Cloudera and Hortonworks have successful stories of supporting Hadoop on JDK7. It seems that the real work of moving to JDK7 is minimal. We only need to (1) make sure the jenkins are running on top of JDK7, and (2) to update the minimum required Java version from 6 to 7. Therefore I propose that let's move towards JDK7 in trunk in the short term. Your feedbacks are appreciated. Regards, Haohui -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- Alejandro
Re: Plans of moving towards JDK7 in trunk
I'm referring to the later case. Indeed migrating JDK7 for branch-2 is more difficult. I think one reasonable approach is to put the hdfs / yarn clients into separate jars. The client-side jars can only use JDK6 APIs, so that downstream projects running on top of JDK6 continue to work. The HDFS/YARN/MR servers need to be run on top of JDK7, and we're free to use JDK7 APIs inside them. Given the fact that there're way more code in the server-side compared to the client-side, having the ability to use JDK7 in the server-side only might still be a win. The downside I can think of is that it might complicate the effort of publishing maven jars, but this should be an one-time issue. ~Haohui On Fri, Apr 4, 2014 at 2:37 PM, Alejandro Abdelnur t...@cloudera.comwrote: Haohui, Is the idea to compile/test with JDK7 and recommend it for runtime and stop there? Or to start using JDK7 API stuff as well? If the later is the case, then backporting stuff to branch-2 may break and patches may have to be refactored for JDK6. Given that branch-2 got GA status not so long ago, I assume it will be active for a while. What are your thoughts on this regard? Thanks On Fri, Apr 4, 2014 at 2:29 PM, Haohui Mai h...@hortonworks.com wrote: Hi, There have been multiple discussions on deprecating supports of JDK6 and moving towards JDK7. It looks to me that the consensus is that now hadoop is ready to drop the support of JDK6 and to move towards JDK7. Based on the consensus, I wonder whether it is a good time to start the migration. Here are my understandings of the current status: 1. There is no more public updates of JDK6 since Feb 2013. Users no longer get fixes of security vulnerabilities through official public updates. 2. Hadoop core is stuck with out-of-date dependency unless moving towards JDK7. (see http://hadoop.6.n7.nabble.com/very-old-dependencies-td71486.html) The implementation can also benefit from it thanks to the new functionalities in JDK7. 3. The code is ready for JDK7. Cloudera and Hortonworks have successful stories of supporting Hadoop on JDK7. It seems that the real work of moving to JDK7 is minimal. We only need to (1) make sure the jenkins are running on top of JDK7, and (2) to update the minimum required Java version from 6 to 7. Therefore I propose that let's move towards JDK7 in trunk in the short term. Your feedbacks are appreciated. Regards, Haohui -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- Alejandro -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: Plans of moving towards JDK7 in trunk
Please don't forget the mac os build on JDK 7. :) On Fri, Apr 4, 2014 at 3:15 PM, Haohui Mai h...@hortonworks.com wrote: I'm referring to the later case. Indeed migrating JDK7 for branch-2 is more difficult. I think one reasonable approach is to put the hdfs / yarn clients into separate jars. The client-side jars can only use JDK6 APIs, so that downstream projects running on top of JDK6 continue to work. It might not be as clear cut. For clients to run clean on JDK 6, not only the client projects/artifacts but also any of their dependencies must be free of JDK 7 code. And this obviously includes things like hadoop-common (or any downstream dependencies for that matter). The HDFS/YARN/MR servers need to be run on top of JDK7, and we're free to use JDK7 APIs inside them. Given the fact that there're way more code in the server-side compared to the client-side, having the ability to use JDK7 in the server-side only might still be a win. The downside I can think of is that it might complicate the effort of publishing maven jars, but this should be an one-time issue. Could you elaborate on why it would complicate maven jar publication? Perhaps I'm over-simplifying things, but I would have thought it could be easily achieved by marking certain project poms with source/target 1.6 in their maven compiler plugin configuration while upgrading the default setting to 1.7. Do you anticipate more issues? ~Haohui On Fri, Apr 4, 2014 at 2:37 PM, Alejandro Abdelnur t...@cloudera.com wrote: Haohui, Is the idea to compile/test with JDK7 and recommend it for runtime and stop there? Or to start using JDK7 API stuff as well? If the later is the case, then backporting stuff to branch-2 may break and patches may have to be refactored for JDK6. Given that branch-2 got GA status not so long ago, I assume it will be active for a while. What are your thoughts on this regard? Thanks On Fri, Apr 4, 2014 at 2:29 PM, Haohui Mai h...@hortonworks.com wrote: Hi, There have been multiple discussions on deprecating supports of JDK6 and moving towards JDK7. It looks to me that the consensus is that now hadoop is ready to drop the support of JDK6 and to move towards JDK7. Based on the consensus, I wonder whether it is a good time to start the migration. Here are my understandings of the current status: 1. There is no more public updates of JDK6 since Feb 2013. Users no longer get fixes of security vulnerabilities through official public updates. 2. Hadoop core is stuck with out-of-date dependency unless moving towards JDK7. (see http://hadoop.6.n7.nabble.com/very-old-dependencies-td71486.html) The implementation can also benefit from it thanks to the new functionalities in JDK7. 3. The code is ready for JDK7. Cloudera and Hortonworks have successful stories of supporting Hadoop on JDK7. It seems that the real work of moving to JDK7 is minimal. We only need to (1) make sure the jenkins are running on top of JDK7, and (2) to update the minimum required Java version from 6 to 7. Therefore I propose that let's move towards JDK7 in trunk in the short term. Your feedbacks are appreciated. Regards, Haohui -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- Alejandro -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: Plans of moving towards JDK7 in trunk
bq. It might not be as clear cut... Totally agree. I think the key is that we can do the work in an incremental way. We can only introduce JDK7 dependency on the server side. In order to do this we need to separate the client-side code to separate jars. I've already proposed to create a hdfs-client jar in the hdfs-dev mailing list. bq. I would have thought it could be easily achieved by marking certain project poms with source/target 1.6 in their maven compiler plugin configuration while upgrading the default setting to 1.7. Do you anticipate more issues? Correct me if I'm wrong, but I think that's enough. The work should be minimal. ~Haohui On Fri, Apr 4, 2014 at 3:43 PM, Sangjin Lee sj...@apache.org wrote: Please don't forget the mac os build on JDK 7. :) On Fri, Apr 4, 2014 at 3:15 PM, Haohui Mai h...@hortonworks.com wrote: I'm referring to the later case. Indeed migrating JDK7 for branch-2 is more difficult. I think one reasonable approach is to put the hdfs / yarn clients into separate jars. The client-side jars can only use JDK6 APIs, so that downstream projects running on top of JDK6 continue to work. It might not be as clear cut. For clients to run clean on JDK 6, not only the client projects/artifacts but also any of their dependencies must be free of JDK 7 code. And this obviously includes things like hadoop-common (or any downstream dependencies for that matter). The HDFS/YARN/MR servers need to be run on top of JDK7, and we're free to use JDK7 APIs inside them. Given the fact that there're way more code in the server-side compared to the client-side, having the ability to use JDK7 in the server-side only might still be a win. The downside I can think of is that it might complicate the effort of publishing maven jars, but this should be an one-time issue. Could you elaborate on why it would complicate maven jar publication? Perhaps I'm over-simplifying things, but I would have thought it could be easily achieved by marking certain project poms with source/target 1.6 in their maven compiler plugin configuration while upgrading the default setting to 1.7. Do you anticipate more issues? ~Haohui On Fri, Apr 4, 2014 at 2:37 PM, Alejandro Abdelnur t...@cloudera.com wrote: Haohui, Is the idea to compile/test with JDK7 and recommend it for runtime and stop there? Or to start using JDK7 API stuff as well? If the later is the case, then backporting stuff to branch-2 may break and patches may have to be refactored for JDK6. Given that branch-2 got GA status not so long ago, I assume it will be active for a while. What are your thoughts on this regard? Thanks On Fri, Apr 4, 2014 at 2:29 PM, Haohui Mai h...@hortonworks.com wrote: Hi, There have been multiple discussions on deprecating supports of JDK6 and moving towards JDK7. It looks to me that the consensus is that now hadoop is ready to drop the support of JDK6 and to move towards JDK7. Based on the consensus, I wonder whether it is a good time to start the migration. Here are my understandings of the current status: 1. There is no more public updates of JDK6 since Feb 2013. Users no longer get fixes of security vulnerabilities through official public updates. 2. Hadoop core is stuck with out-of-date dependency unless moving towards JDK7. (see http://hadoop.6.n7.nabble.com/very-old-dependencies-td71486.html) The implementation can also benefit from it thanks to the new functionalities in JDK7. 3. The code is ready for JDK7. Cloudera and Hortonworks have successful stories of supporting Hadoop on JDK7. It seems that the real work of moving to JDK7 is minimal. We only need to (1) make sure the jenkins are running on top of JDK7, and (2) to update the minimum required Java version from 6 to 7. Therefore I propose that let's move towards JDK7 in trunk in the short term. Your feedbacks are appreciated. Regards, Haohui -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- Alejandro -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure
Re: Plans of moving towards JDK7 in trunk
So, you want to compile hdfs/yarn/mapred clients (and hadoop-common and hadoop-auth) with JDK6 and the rest with JDK7? On Fri, Apr 4, 2014 at 3:15 PM, Haohui Mai h...@hortonworks.com wrote: I'm referring to the later case. Indeed migrating JDK7 for branch-2 is more difficult. I think one reasonable approach is to put the hdfs / yarn clients into separate jars. The client-side jars can only use JDK6 APIs, so that downstream projects running on top of JDK6 continue to work. The HDFS/YARN/MR servers need to be run on top of JDK7, and we're free to use JDK7 APIs inside them. Given the fact that there're way more code in the server-side compared to the client-side, having the ability to use JDK7 in the server-side only might still be a win. The downside I can think of is that it might complicate the effort of publishing maven jars, but this should be an one-time issue. ~Haohui On Fri, Apr 4, 2014 at 2:37 PM, Alejandro Abdelnur t...@cloudera.com wrote: Haohui, Is the idea to compile/test with JDK7 and recommend it for runtime and stop there? Or to start using JDK7 API stuff as well? If the later is the case, then backporting stuff to branch-2 may break and patches may have to be refactored for JDK6. Given that branch-2 got GA status not so long ago, I assume it will be active for a while. What are your thoughts on this regard? Thanks On Fri, Apr 4, 2014 at 2:29 PM, Haohui Mai h...@hortonworks.com wrote: Hi, There have been multiple discussions on deprecating supports of JDK6 and moving towards JDK7. It looks to me that the consensus is that now hadoop is ready to drop the support of JDK6 and to move towards JDK7. Based on the consensus, I wonder whether it is a good time to start the migration. Here are my understandings of the current status: 1. There is no more public updates of JDK6 since Feb 2013. Users no longer get fixes of security vulnerabilities through official public updates. 2. Hadoop core is stuck with out-of-date dependency unless moving towards JDK7. (see http://hadoop.6.n7.nabble.com/very-old-dependencies-td71486.html) The implementation can also benefit from it thanks to the new functionalities in JDK7. 3. The code is ready for JDK7. Cloudera and Hortonworks have successful stories of supporting Hadoop on JDK7. It seems that the real work of moving to JDK7 is minimal. We only need to (1) make sure the jenkins are running on top of JDK7, and (2) to update the minimum required Java version from 6 to 7. Therefore I propose that let's move towards JDK7 in trunk in the short term. Your feedbacks are appreciated. Regards, Haohui -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- Alejandro -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- Alejandro
Re: [VOTE] Release Apache Hadoop 2.4.0
I built the release from the rc tag, enabled timeline history service and ran a sleep job on a pseudo-distributed cluster. I encourage another rc, for 2.4.0 (non-binding) 1) Despite the discussion on YARN-1701, timeline AHS still sets yarn.timeline-service.generic-application-history.fs-history-store.uri to a location under ${hadoop.log.dir} that is meant for local file system, but uses it on HDFS by default. 2) Critical patch for WebHdfs/Hftp to fix the filesystem contract HDFS-6143 is not included 3) Several patches that already proved themselves useful for diagnostics in production and have been available for some months are still not included. MAPREDUCE-5044/YARN-1515 is the most obvious example. Our users need to see where the task container JVM got stuck when it was timed out by AM. Thanks, Gera On Fri, Apr 4, 2014 at 3:51 PM, Azuryy azury...@gmail.com wrote: Arun, Do you mean you will cut another RC for 2.4? Sent from my iPhone5s On 2014年4月5日, at 3:50, Arun C. Murthy a...@hortonworks.com wrote: Thanks for helping Tsuyoshi. Pls mark them as Blockers and set the fix-version to 2.4.1. Thanks again. Arun On Apr 3, 2014, at 11:38 PM, Tsuyoshi OZAWA ozawa.tsuyo...@gmail.com wrote: Hi, Updated a test result log based on the result of 2.4.0-rc0: https://gist.github.com/oza/9965197 IMO, there are some blockers to be fixed: * MAPREDUCE-5815(TestMRAppMaster failure) * YARN-1872(TestDistributedShell failure) * HDFS: TestSymlinkLocalFSFileSystem failure on Linux (I cannot find JIRA about this failure) Now I'm checking the problem reported by Azuryy. Thanks, - Tsuyoshi On Fri, Apr 4, 2014 at 8:55 AM, Tsuyoshi OZAWA ozawa.tsuyo...@gmail.com wrote: Hi, Ran tests and confirmed that some tests(TestSymlinkLocalFSFileSystem) fail. The log of the test failure is as follows: https://gist.github.com/oza/9965197 Should we fix or disable the feature? Thanks, - Tsuyoshi On Mon, Mar 31, 2014 at 6:22 PM, Arun C Murthy a...@hortonworks.com wrote: Folks, I've created a release candidate (rc0) for hadoop-2.4.0 that I would like to get released. The RC is available at: http://people.apache.org/~acmurthy/hadoop-2.4.0-rc0 The RC tag in svn is here: https://svn.apache.org/repos/asf/hadoop/common/tags/release-2.4.0-rc0 The maven artifacts are available via repository.apache.org. Please try the release and vote; the vote will run for the usual 7 days. thanks, Arun -- Arun C. Murthy Hortonworks Inc. http://hortonworks.com/ -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- - Tsuyoshi -- - Tsuyoshi -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
Re: [VOTE] Release Apache Hadoop 2.4.0
+1 for another rc. There have been quite a few issues found (handful marked blocker) and this is only the first release candidate, seems like the point of having multiple release candidates is to iterate with another one that addresses the major issues found with the previous one. On Fri, Apr 4, 2014 at 5:06 PM, Gera Shegalov g...@shegalov.com wrote: I built the release from the rc tag, enabled timeline history service and ran a sleep job on a pseudo-distributed cluster. I encourage another rc, for 2.4.0 (non-binding) 1) Despite the discussion on YARN-1701, timeline AHS still sets yarn.timeline-service.generic-application-history.fs-history-store.uri to a location under ${hadoop.log.dir} that is meant for local file system, but uses it on HDFS by default. 2) Critical patch for WebHdfs/Hftp to fix the filesystem contract HDFS-6143 is not included 3) Several patches that already proved themselves useful for diagnostics in production and have been available for some months are still not included. MAPREDUCE-5044/YARN-1515 is the most obvious example. Our users need to see where the task container JVM got stuck when it was timed out by AM. Thanks, Gera On Fri, Apr 4, 2014 at 3:51 PM, Azuryy azury...@gmail.com wrote: Arun, Do you mean you will cut another RC for 2.4? Sent from my iPhone5s On 2014年4月5日, at 3:50, Arun C. Murthy a...@hortonworks.com wrote: Thanks for helping Tsuyoshi. Pls mark them as Blockers and set the fix-version to 2.4.1. Thanks again. Arun On Apr 3, 2014, at 11:38 PM, Tsuyoshi OZAWA ozawa.tsuyo...@gmail.com wrote: Hi, Updated a test result log based on the result of 2.4.0-rc0: https://gist.github.com/oza/9965197 IMO, there are some blockers to be fixed: * MAPREDUCE-5815(TestMRAppMaster failure) * YARN-1872(TestDistributedShell failure) * HDFS: TestSymlinkLocalFSFileSystem failure on Linux (I cannot find JIRA about this failure) Now I'm checking the problem reported by Azuryy. Thanks, - Tsuyoshi On Fri, Apr 4, 2014 at 8:55 AM, Tsuyoshi OZAWA ozawa.tsuyo...@gmail.com wrote: Hi, Ran tests and confirmed that some tests(TestSymlinkLocalFSFileSystem) fail. The log of the test failure is as follows: https://gist.github.com/oza/9965197 Should we fix or disable the feature? Thanks, - Tsuyoshi On Mon, Mar 31, 2014 at 6:22 PM, Arun C Murthy a...@hortonworks.com wrote: Folks, I've created a release candidate (rc0) for hadoop-2.4.0 that I would like to get released. The RC is available at: http://people.apache.org/~acmurthy/hadoop-2.4.0-rc0 The RC tag in svn is here: https://svn.apache.org/repos/asf/hadoop/common/tags/release-2.4.0-rc0 The maven artifacts are available via repository.apache.org. Please try the release and vote; the vote will run for the usual 7 days. thanks, Arun -- Arun C. Murthy Hortonworks Inc. http://hortonworks.com/ -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You. -- - Tsuyoshi -- - Tsuyoshi -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.