[jira] [Commented] (YARN-2701) Potential race condition in startLocalizer when using LinuxContainerExecutor

2014-10-22 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14180955#comment-14180955
 ] 

Binglin Chang commented on YARN-2701:
-

Hi Xuan Gong, thanks for the patch. The patch LGTM; there is just an 
indentation issue in the check_dir definition.

 Potential race condition in startLocalizer when using LinuxContainerExecutor  
 --

 Key: YARN-2701
 URL: https://issues.apache.org/jira/browse/YARN-2701
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Xuan Gong
Assignee: Xuan Gong
Priority: Blocker
 Fix For: 2.6.0

 Attachments: YARN-2701.1.patch, YARN-2701.2.patch, YARN-2701.3.patch, 
 YARN-2701.4.patch, YARN-2701.5.patch, YARN-2701.6.patch, 
 YARN-2701.addendum.1.patch, YARN-2701.addendum.2.patch


 When LinuxContainerExecutor runs startLocalizer, it goes through the native 
 code in container-executor.c. 
 {code}
   if (stat(npath, &sb) != 0) {
     if (mkdir(npath, perm) != 0) {
 {code}
 This uses a check-then-create approach to create the appDir under /usercache. 
 If two containers try to do this at the same time, a race condition may 
 occur.
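The check-then-create hazard described above is not specific to C. As a rough illustration only (this is Java, not the native container-executor.c code, and not the YARN-2701 patch), the sketch below contrasts the racy pattern with the usual tolerant one: attempt the create unconditionally and treat "already exists" as success. The class and method names are hypothetical.

```java
import java.io.IOException;
import java.nio.file.FileAlreadyExistsException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical illustration; not part of Hadoop.
class DirCreateSketch {

    // Racy: another process can create the directory between the check and the create.
    static void checkThenCreate(Path dir) throws IOException {
        if (!Files.exists(dir)) {
            Files.createDirectory(dir); // may throw FileAlreadyExistsException under a race
        }
    }

    // Tolerant: create unconditionally and accept that someone else got there first.
    static void createIfAbsent(Path dir) throws IOException {
        try {
            Files.createDirectory(dir);
        } catch (FileAlreadyExistsException e) {
            if (!Files.isDirectory(dir)) {
                throw e; // something exists at that path, but it is not a directory
            }
        }
    }

    public static void main(String[] args) throws IOException {
        Path base = Files.createTempDirectory("usercache-sketch");
        Path appDir = base.resolve("appcache");
        createIfAbsent(appDir);
        createIfAbsent(appDir); // second call is a no-op rather than an error
        System.out.println(Files.isDirectory(appDir));
    }
}
```

In C the analogous move would be to call mkdir() directly and accept an EEXIST error, but whether the actual patch does this is not stated in this thread.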



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (YARN-2161) Fix build on macosx: YARN parts

2014-10-20 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14177969#comment-14177969
 ] 

Binglin Chang commented on YARN-2161:
-

Hi [~xgong], sorry for breaking the code. I see that in YARN-2701 you already 
had a fix, but decided in the end to revert the code to be safer. However, 
this breaks the mac build. How about using #ifdef to use the old code when 
compiling on glibc 2.10 (http://linux.die.net/man/2/openat) and your fix 
otherwise?

 Fix build on macosx: YARN parts
 ---

 Key: YARN-2161
 URL: https://issues.apache.org/jira/browse/YARN-2161
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Fix For: 2.6.0

 Attachments: YARN-2161.v1.patch, YARN-2161.v2.patch


 When compiling on macosx with -Pnative, there are several warnings and 
 errors. Fixing them would help Hadoop developers who work in a macosx env.





[jira] [Updated] (YARN-2161) Fix build on macosx: YARN parts

2014-09-24 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2161:

Attachment: YARN-2161.v2.patch

Thanks for the review, Allen. Attaching a new version of the patch. Changes:
1. use cmake CHECK_FUNCTION_EXISTS to check whether fcloseall exists
2. change user bin to user daemon (which both linux and macosx have)
3. add some of the fixes in YARN-1327 (setpgid and libgen.h)


 Fix build on macosx: YARN parts
 ---

 Key: YARN-2161
 URL: https://issues.apache.org/jira/browse/YARN-2161
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2161.v1.patch, YARN-2161.v2.patch


 When compiling on macosx with -Pnative, there are several warnings and 
 errors. Fixing them would help Hadoop developers who work in a macosx env.





[jira] [Updated] (YARN-2051) Fix code bug and add more unit tests for PBImpls

2014-07-14 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2051:


Attachment: YARN-2051.v1.patch

This patch uses reflection to test all PBImpl records in hadoop-yarn-common. 
For each PBImpl record, it tests the get/set methods and 
serialization/deserialization. 
95 new test cases are added. Running those tests exposed bugs in some classes, 
which are fixed here.
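
The reflection-driven get/set check described above can be sketched roughly as follows. This is only an illustration of the idea, not the code in YARN-2051.v1.patch: `SampleRecord` stands in for a PBImpl record, and `sampleValue` and `checkGetSet` are hypothetical helpers that handle just two property types.

```java
import java.lang.reflect.Method;

// Hypothetical illustration; not the test added by the patch.
class ReflectiveRecordCheck {

    // Stand-in for a PBImpl record.
    public static class SampleRecord {
        private int priority;
        private String queue;
        public int getPriority() { return priority; }
        public void setPriority(int priority) { this.priority = priority; }
        public String getQueue() { return queue; }
        public void setQueue(String queue) { this.queue = queue; }
    }

    // Sample value for the property types this sketch understands.
    static Object sampleValue(Class<?> type) {
        if (type == int.class) return 42;
        if (type == String.class) return "sample";
        return null; // unsupported type: the caller skips the property
    }

    // For every setX/getX pair, set a value and verify the getter returns it.
    // Returns the number of properties that were checked.
    static int checkGetSet(Class<?> recordClass) throws Exception {
        Object record = recordClass.getDeclaredConstructor().newInstance();
        int checked = 0;
        for (Method setter : recordClass.getMethods()) {
            if (!setter.getName().startsWith("set") || setter.getParameterCount() != 1) {
                continue;
            }
            Object value = sampleValue(setter.getParameterTypes()[0]);
            if (value == null) {
                continue;
            }
            Method getter;
            try {
                getter = recordClass.getMethod("get" + setter.getName().substring(3));
            } catch (NoSuchMethodException e) {
                continue; // no matching getter, nothing to compare against
            }
            setter.invoke(record, value);
            Object actual = getter.invoke(record);
            if (!value.equals(actual)) {
                throw new AssertionError(setter.getName() + "/" + getter.getName() + " mismatch");
            }
            checked++;
        }
        return checked;
    }

    public static void main(String[] args) throws Exception {
        System.out.println(checkGetSet(SampleRecord.class) + " properties checked");
    }
}
```

A real test over protobuf-backed records would also need to round-trip each record through its proto form; that part is left out of this sketch.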


 Fix code bug and add more unit tests for PBImpls
 

 Key: YARN-2051
 URL: https://issues.apache.org/jira/browse/YARN-2051
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Junping Du
Assignee: Binglin Chang
Priority: Critical
 Attachments: YARN-2051.v1.patch


 From YARN-2016, we can see that bugs may exist in the PB implementation of 
 the protocol. The bad news is that most of these PBImpls have no unit tests 
 to verify that information is not lost or changed after 
 serialization/deserialization. We should add more tests for them.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (YARN-2051) Fix code bug and add more unit tests for PBImpls

2014-07-14 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14061629#comment-14061629
 ] 

Binglin Chang commented on YARN-2051:
-

The failed test is unrelated; see YARN-2270.

 Fix code bug and add more unit tests for PBImpls
 

 Key: YARN-2051
 URL: https://issues.apache.org/jira/browse/YARN-2051
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Junping Du
Assignee: Binglin Chang
Priority: Critical
 Attachments: YARN-2051.v1.patch


 From YARN-2016, we can see that bugs may exist in the PB implementation of 
 the protocol. The bad news is that most of these PBImpls have no unit tests 
 to verify that information is not lost or changed after 
 serialization/deserialization. We should add more tests for them.





[jira] [Commented] (YARN-2088) Fix code bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder

2014-07-10 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14057302#comment-14057302
 ] 

Binglin Chang commented on YARN-2088:
-

Hi [~jianhe] or [~djp], it looks like there are no more comments. Could you 
help get this committed? Thanks.

 Fix code bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder
 

 Key: YARN-2088
 URL: https://issues.apache.org/jira/browse/YARN-2088
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2088.v1.patch


 Some fields (set, list) are added to the proto builder multiple times. We 
 need to clear those fields before adding; otherwise the resulting proto 
 contains extra contents.
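
To make the failure mode above concrete, here is a minimal sketch using a hand-written stand-in for a protobuf builder. `FakeBuilder`, `applicationTypes`, and the two merge methods are assumptions made for illustration; they are not the real GetApplicationsRequestPBImpl code or the generated protobuf API.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration; not the real PBImpl.
class MergeLocalSketch {

    // Stand-in for a generated protobuf builder with one repeated field.
    static class FakeBuilder {
        final List<String> applicationTypes = new ArrayList<>();

        void addAllApplicationTypes(Iterable<String> values) {
            for (String v : values) {
                applicationTypes.add(v);
            }
        }

        void clearApplicationTypes() {
            applicationTypes.clear();
        }
    }

    final FakeBuilder builder = new FakeBuilder();
    final Set<String> localTypes = new LinkedHashSet<>();

    // Buggy variant: every merge appends the local set again.
    void mergeWithoutClear() {
        builder.addAllApplicationTypes(localTypes);
    }

    // Fixed variant: clear the repeated field first, then add.
    void mergeWithClear() {
        builder.clearApplicationTypes();
        builder.addAllApplicationTypes(localTypes);
    }

    public static void main(String[] args) {
        MergeLocalSketch buggy = new MergeLocalSketch();
        buggy.localTypes.add("MAPREDUCE");
        buggy.mergeWithoutClear();
        buggy.mergeWithoutClear();
        System.out.println(buggy.builder.applicationTypes.size()); // prints 2

        MergeLocalSketch fixed = new MergeLocalSketch();
        fixed.localTypes.add("MAPREDUCE");
        fixed.mergeWithClear();
        fixed.mergeWithClear();
        System.out.println(fixed.builder.applicationTypes.size()); // prints 1
    }
}
```

The second merge stands in for a repeated getProto() call, which is when the extra entries would show up.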





[jira] [Commented] (YARN-2088) Fix code bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder

2014-06-18 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14036859#comment-14036859
 ] 

Binglin Chang commented on YARN-2088:
-

bq. The new unit is failing without this change is because the equals() method 
also calls getProto() which adds the fields into proto again, right ?
Yes

 Fix code bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder
 

 Key: YARN-2088
 URL: https://issues.apache.org/jira/browse/YARN-2088
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2088.v1.patch


 Some fields (set, list) are added to the proto builder multiple times. We 
 need to clear those fields before adding; otherwise the resulting proto 
 contains extra contents.





[jira] [Updated] (YARN-2161) Fix build on macosx: YARN parts

2014-06-16 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2161:


Attachment: YARN-2161.v1.patch

 Fix build on macosx: YARN parts
 ---

 Key: YARN-2161
 URL: https://issues.apache.org/jira/browse/YARN-2161
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2161.v1.patch


 When compiling on macosx with -Pnative, there are several warnings and 
 errors. Fixing them would help Hadoop developers who work in a macosx env.





[jira] [Commented] (YARN-2161) Fix build on macosx: YARN parts

2014-06-16 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14032164#comment-14032164
 ] 

Binglin Chang commented on YARN-2161:
-

Changes:

container-executor.c:
1. make mkdirs more compatible, remove usage of mkdirat/openat
2. use sysconf() to get LOGIN_NAME_MAX
3. macosx doesn't have fcloseall, so close all opened fds on macosx
4. disable cgroup on macosx

test-container-executor.c:
1. macosx does not have user bin, so skip a check
2. change /etc/passwd (does not exist on mac) to /bin/ls


 Fix build on macosx: YARN parts
 ---

 Key: YARN-2161
 URL: https://issues.apache.org/jira/browse/YARN-2161
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2161.v1.patch


 When compiling on macosx with -Pnative, there are several warnings and 
 errors. Fixing them would help Hadoop developers who work in a macosx env.





[jira] [Moved] (YARN-2160) Fix build native library on mac osx

2014-06-14 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang moved HADOOP-9648 to YARN-2160:
-

Affects Version/s: (was: 2.0.5-alpha)
   (was: 1.1.2)
   (was: 1.0.4)
   (was: 1.2.0)
  Key: YARN-2160  (was: HADOOP-9648)
  Project: Hadoop YARN  (was: Hadoop Common)

 Fix build native library on mac osx
 ---

 Key: YARN-2160
 URL: https://issues.apache.org/jira/browse/YARN-2160
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Kirill A. Korinskiy
Assignee: Binglin Chang
 Attachments: HADOOP-9648-native-osx.1.0.4.patch, 
 HADOOP-9648-native-osx.1.1.2.patch, HADOOP-9648-native-osx.1.2.0.patch, 
 HADOOP-9648-native-osx.2.0.5-alpha-rc1.patch, HADOOP-9648.v2.patch


 Some patches to fix building the Hadoop native library on OS X 10.7/10.8.





[jira] [Created] (YARN-2161) Fix build on macosx: YARN parts

2014-06-14 Thread Binglin Chang (JIRA)
Binglin Chang created YARN-2161:
---

 Summary: Fix build on macosx: YARN parts
 Key: YARN-2161
 URL: https://issues.apache.org/jira/browse/YARN-2161
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang


When compiling on macosx with -Pnative, there are several warnings and 
errors. Fixing them would help Hadoop developers who work in a macosx env.






[jira] [Updated] (YARN-2030) Use StateMachine to simplify handleStoreEvent() in RMStateStore

2014-06-08 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2030:


Attachment: YARN-2030.v5.patch

fix compile error

 Use StateMachine to simplify handleStoreEvent() in RMStateStore
 ---

 Key: YARN-2030
 URL: https://issues.apache.org/jira/browse/YARN-2030
 Project: Hadoop YARN
  Issue Type: Improvement
Reporter: Junping Du
Assignee: Binglin Chang
 Attachments: YARN-2030.v1.patch, YARN-2030.v2.patch, 
 YARN-2030.v3.patch, YARN-2030.v4.patch, YARN-2030.v5.patch


 Currently, the logic that handles the different store events in 
 handleStoreEvent() is as follows:
 {code}
 if (event.getType().equals(RMStateStoreEventType.STORE_APP)
     || event.getType().equals(RMStateStoreEventType.UPDATE_APP)) {
   ...
   if (event.getType().equals(RMStateStoreEventType.STORE_APP)) {
     ...
   } else {
     ...
   }
   ...
   try {
     if (event.getType().equals(RMStateStoreEventType.STORE_APP)) {
       ...
     } else {
       ...
     }
   }
   ...
 } else if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)
     || event.getType().equals(RMStateStoreEventType.UPDATE_APP_ATTEMPT)) {
   ...
   if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)) {
     ...
   } else {
     ...
   }
   ...
     if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)) {
       ...
     } else {
       ...
     }
   }
   ...
 } else if (event.getType().equals(RMStateStoreEventType.REMOVE_APP)) {
   ...
 } else {
   ...
 }
 }
 {code}
 This not only confuses people but also easily leads to mistakes. We could 
 use a state machine to simplify this, even though there are no state 
 transitions.
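
As a rough sketch of the direction proposed above, the nested if/else chain can be replaced by one handler per event type. This example uses a plain EnumMap dispatch table, which is a simplification and not YARN's state machine classes; the event type names come from the snippet above, while the class, the handler bodies, and the `log` field are hypothetical.

```java
import java.util.ArrayList;
import java.util.EnumMap;
import java.util.List;
import java.util.Map;
import java.util.function.Consumer;

// Hypothetical illustration; not the RMStateStore implementation.
class StoreEventDispatchSketch {

    enum EventType { STORE_APP, UPDATE_APP, STORE_APP_ATTEMPT, UPDATE_APP_ATTEMPT, REMOVE_APP }

    // Records what each handler did, so the sketch has observable output.
    final List<String> log = new ArrayList<>();

    // One handler per event type replaces the repeated type checks.
    private final Map<EventType, Consumer<String>> handlers = new EnumMap<>(EventType.class);

    StoreEventDispatchSketch() {
        handlers.put(EventType.STORE_APP, id -> log.add("store app " + id));
        handlers.put(EventType.UPDATE_APP, id -> log.add("update app " + id));
        handlers.put(EventType.STORE_APP_ATTEMPT, id -> log.add("store attempt " + id));
        handlers.put(EventType.UPDATE_APP_ATTEMPT, id -> log.add("update attempt " + id));
        handlers.put(EventType.REMOVE_APP, id -> log.add("remove app " + id));
    }

    void handleStoreEvent(EventType type, String id) {
        Consumer<String> handler = handlers.get(type);
        if (handler == null) {
            throw new IllegalArgumentException("Unknown event type: " + type);
        }
        handler.accept(id);
    }

    public static void main(String[] args) {
        StoreEventDispatchSketch store = new StoreEventDispatchSketch();
        store.handleStoreEvent(EventType.STORE_APP, "app_1");
        store.handleStoreEvent(EventType.REMOVE_APP, "app_1");
        System.out.println(store.log);
    }
}
```

A state machine with a single state and one transition per event type gives the same per-event separation, which is presumably why the issue suggests it even without real state transitions.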





[jira] [Updated] (YARN-2030) Use StateMachine to simplify handleStoreEvent() in RMStateStore

2014-06-07 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2030:


Attachment: YARN-2030.v4.patch

Thanks for the comments, [~jianhe]. I updated the patch to make getProto an 
abstract method in the record classes.

 Use StateMachine to simplify handleStoreEvent() in RMStateStore
 ---

 Key: YARN-2030
 URL: https://issues.apache.org/jira/browse/YARN-2030
 Project: Hadoop YARN
  Issue Type: Improvement
Reporter: Junping Du
Assignee: Binglin Chang
 Attachments: YARN-2030.v1.patch, YARN-2030.v2.patch, 
 YARN-2030.v3.patch, YARN-2030.v4.patch




[jira] [Updated] (YARN-2030) Use StateMachine to simplify handleStoreEvent() in RMStateStore

2014-06-05 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2030:


Attachment: YARN-2030.v3.patch

Thanks for the comments, [~djp] and [~jianhe]. I updated the patch to make 
ApplicationAttemptStateData and ApplicationStateData abstract classes. 
bq. Accordingly storeApplicationStateInternal can take in ApplicationStateData 
instead of ApplicationStateDataPBImpl as the argument to avoid the type cast.
I tried to change the updateApplicationAttemptStateInternal parameter type 
from PBImpl to the abstract record, but it looks like some RMStateStore 
implementations (FileSystemRMStateStore and ZKRMStateStore) require the 
parameter to be a PBImpl (so that they can use toProto to serialize).

 Use StateMachine to simplify handleStoreEvent() in RMStateStore
 ---

 Key: YARN-2030
 URL: https://issues.apache.org/jira/browse/YARN-2030
 Project: Hadoop YARN
  Issue Type: Improvement
Reporter: Junping Du
Assignee: Binglin Chang
 Attachments: YARN-2030.v1.patch, YARN-2030.v2.patch, 
 YARN-2030.v3.patch




[jira] [Commented] (YARN-2030) Use StateMachine to simplify handleStoreEvent() in RMStateStore

2014-06-03 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14016325#comment-14016325
 ] 

Binglin Chang commented on YARN-2030:
-

Hi [~djp], I think the patch is ready for review. Could you help review it?

 Use StateMachine to simplify handleStoreEvent() in RMStateStore
 ---

 Key: YARN-2030
 URL: https://issues.apache.org/jira/browse/YARN-2030
 Project: Hadoop YARN
  Issue Type: Improvement
Reporter: Junping Du
Assignee: Binglin Chang
 Attachments: YARN-2030.v1.patch, YARN-2030.v2.patch




[jira] [Updated] (YARN-2103) Inconsistency between viaProto flag and initial value of SerializedExceptionProto.Builder

2014-06-01 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2103:


Attachment: YARN-2103.v3.patch

 Inconsistency between viaProto flag and initial value of 
 SerializedExceptionProto.Builder
 -

 Key: YARN-2103
 URL: https://issues.apache.org/jira/browse/YARN-2103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2103.v1.patch, YARN-2103.v2.patch, 
 YARN-2103.v3.patch


 Bug 1:
 {code}
   SerializedExceptionProto proto = SerializedExceptionProto
   .getDefaultInstance();
   SerializedExceptionProto.Builder builder = null;
   boolean viaProto = false;
 {code}
 Since viaProto is false, we should initialize builder rather than proto.
 Bug 2:
 The class does not provide hashCode() and equals() like other PBImpl records. 
 Because this class is used in other records, this may affect their behavior.
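
The inconsistency in Bug 1 can be illustrated with hand-written stand-ins for the proto and builder types. This is a sketch under assumptions, not the contents of the YARN-2103 patch: `FakeProto`, `FakeBuilder`, and the `message` field are hypothetical, and getProto() and maybeInitBuilder() follow the common PBImpl shape. With `builder == null` and `viaProto == false`, a getProto() written this way would dereference a null builder; starting with a builder instance keeps the initial state consistent with the flag.

```java
// Hypothetical illustration; not SerializedExceptionPBImpl itself.
class ViaProtoSketch {

    // Immutable message stand-in.
    static final class FakeProto {
        final String message;
        FakeProto(String message) { this.message = message; }
    }

    // Mutable builder stand-in.
    static final class FakeBuilder {
        String message = "";
        FakeBuilder() { }
        FakeBuilder(FakeProto from) { this.message = from.message; }
        FakeProto build() { return new FakeProto(message); }
    }

    // Initial state consistent with viaProto == false: the builder holds the data.
    private FakeProto proto = null;
    private FakeBuilder builder = new FakeBuilder();
    private boolean viaProto = false;

    FakeProto getProto() {
        proto = viaProto ? proto : builder.build();
        viaProto = true;
        return proto;
    }

    private void maybeInitBuilder() {
        if (viaProto || builder == null) {
            builder = new FakeBuilder(proto);
        }
        viaProto = false;
    }

    void setMessage(String message) {
        maybeInitBuilder();
        builder.message = message;
    }

    String getMessage() {
        return viaProto ? proto.message : builder.message;
    }

    public static void main(String[] args) {
        ViaProtoSketch record = new ViaProtoSketch();
        System.out.println(record.getProto().message.isEmpty()); // safe on a fresh record
        record.setMessage("boom");
        System.out.println(record.getProto().message);
    }
}
```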





[jira] [Commented] (YARN-2103) Inconsistency between viaProto flag and initial value of SerializedExceptionProto.Builder

2014-06-01 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14014924#comment-14014924
 ] 

Binglin Chang commented on YARN-2103:
-

Thanks for the review and the extra tests, [~ozawa]. I have updated the patch.

 Inconsistency between viaProto flag and initial value of 
 SerializedExceptionProto.Builder
 -

 Key: YARN-2103
 URL: https://issues.apache.org/jira/browse/YARN-2103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2103.v1.patch, YARN-2103.v2.patch, 
 YARN-2103.v3.patch




[jira] [Updated] (YARN-2103) Inconsistency between viaProto flag and initial value of SerializedExceptionProto.Builder

2014-05-29 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2103:


Attachment: YARN-2103.v1.patch

Thanks for the review, [~djp]. Updated the patch to address your comments.

 Inconsistency between viaProto flag and initial value of 
 SerializedExceptionProto.Builder
 -

 Key: YARN-2103
 URL: https://issues.apache.org/jira/browse/YARN-2103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2103.v1.patch, YARN-2103.v2.patch




[jira] [Updated] (YARN-2103) Inconsistency between viaProto flag and initial value of SerializedExceptionProto.Builder

2014-05-29 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2103:


Attachment: (was: YARN-2103.v1.patch)

 Inconsistency between viaProto flag and initial value of 
 SerializedExceptionProto.Builder
 -

 Key: YARN-2103
 URL: https://issues.apache.org/jira/browse/YARN-2103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2103.v1.patch, YARN-2103.v2.patch




[jira] [Updated] (YARN-2103) Inconsistency between viaProto flag and initial value of SerializedExceptionProto.Builder

2014-05-29 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2103:


Attachment: YARN-2103.v2.patch

Uploaded the wrong patch... resubmitting.

 Inconsistency between viaProto flag and initial value of 
 SerializedExceptionProto.Builder
 -

 Key: YARN-2103
 URL: https://issues.apache.org/jira/browse/YARN-2103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2103.v1.patch, YARN-2103.v2.patch




[jira] [Commented] (YARN-2103) Fix code bug in SerializedExceptionPBImpl

2014-05-27 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14009347#comment-14009347
 ] 

Binglin Chang commented on YARN-2103:
-

I plan to add a generic test covering all PBImpls in YARN-2051, so separate 
tests are not needed.

 Fix code bug in SerializedExceptionPBImpl
 -

 Key: YARN-2103
 URL: https://issues.apache.org/jira/browse/YARN-2103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2103.v1.patch


 {code}
   SerializedExceptionProto proto = SerializedExceptionProto
   .getDefaultInstance();
   SerializedExceptionProto.Builder builder = null;
   boolean viaProto = false;
 {code}
 Since viaProto is false, we should initialize builder rather than proto.





[jira] [Commented] (YARN-2103) Inconsistency between viaProto flag and initial value of SerializedExceptionProto.Builder

2014-05-27 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010700#comment-14010700
 ] 

Binglin Chang commented on YARN-2103:
-

Hi [~ozawa], thanks for reviewing the patch and for the comments. I used the 
original title because the bug isn't just about the inconsistent viaProto, but 
also about the lack of equals and hashCode methods (which affects other 
records that use SerializedException). I guess I should point out all the bugs 
in the JIRA. 

About the code format: most PBImpl classes use this common code:
{code}
  private void maybeInitBuilder() {
    if (viaProto || builder == null) {
      builder = GetApplicationsRequestProto.newBuilder(proto);
    }
    viaProto = false;
  }

  @Override
  public int hashCode() {
    return getProto().hashCode();
  }

  @Override
  public boolean equals(Object other) {
    if (other == null)
      return false;
    if (other.getClass().isAssignableFrom(this.getClass())) {
      return this.getProto().equals(this.getClass().cast(other).getProto());
    }
    return false;
  }

{code}

You can see this in GetApplicationsRequestPBImpl/GetApplicationsResponsePBImpl; 
I just followed those patterns. Maybe we can change them all in another JIRA; 
changing them may not fit into this JIRA. 

bq.  How about adding concrete tests as a first step of generic tests on 
YARN-2051. 
After the generic tests are added, those old tests will probably be redundant 
and can be removed. I guess we can discuss this in the future. I can provide a 
separate test for now.



 Inconsistency between viaProto flag and initial value of 
 SerializedExceptionProto.Builder
 -

 Key: YARN-2103
 URL: https://issues.apache.org/jira/browse/YARN-2103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2103.v1.patch




[jira] [Updated] (YARN-2103) Inconsistency between viaProto flag and initial value of SerializedExceptionProto.Builder

2014-05-27 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2103:


Description: 
Bug 1:
{code}
  SerializedExceptionProto proto = SerializedExceptionProto
  .getDefaultInstance();
  SerializedExceptionProto.Builder builder = null;
  boolean viaProto = false;
{code}

Since viaProto is false, we should initialize builder rather than proto.

Bug 2:
The class does not provide hashCode() and equals() like other PBImpl records. 
Because this class is used in other records, this may affect their behavior.



  was:
{code}
  SerializedExceptionProto proto = SerializedExceptionProto
  .getDefaultInstance();
  SerializedExceptionProto.Builder builder = null;
  boolean viaProto = false;
{code}

Since viaProto is false, we should initialize builder rather than proto.



 Inconsistency between viaProto flag and initial value of 
 SerializedExceptionProto.Builder
 -

 Key: YARN-2103
 URL: https://issues.apache.org/jira/browse/YARN-2103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2103.v1.patch




[jira] [Commented] (YARN-1911) NullPointerException in ResourceOption.getOverCommitTimeout()

2014-05-26 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14008608#comment-14008608
 ] 

Binglin Chang commented on YARN-1911:
-

+1. I also hit this bug while working on YARN-2051; I hope it gets committed soon.


 NullPointerException in ResourceOption.getOverCommitTimeout()
 -

 Key: YARN-1911
 URL: https://issues.apache.org/jira/browse/YARN-1911
 Project: Hadoop YARN
  Issue Type: Bug
  Components: api
Affects Versions: 2.3.0
Reporter: Kenji Kikushima
Assignee: Kenji Kikushima
 Attachments: YARN-1911.patch


 I'm trying to implement a REST API for dynamic configuration, but a 
 NullPointerException occurs when ResourceOption.getOverCommitTimeout() is 
 called. Here is sample code with the error message.
 {code}
 @Test
 public void testGetOverCommitTimeout() {
   Resource capability = Resource.newInstance(4096, 4);
   ResourceOption resourceOption = ResourceOption.newInstance(capability,
       RMNode.OVER_COMMIT_TIMEOUT_MILLIS_DEFAULT);
   Assert.assertEquals(RMNode.OVER_COMMIT_TIMEOUT_MILLIS_DEFAULT,
       resourceOption.getOverCommitTimeout());
 }
 {code}
 {code}
 java.lang.NullPointerException
   at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:187)
   at org.apache.hadoop.yarn.api.records.impl.pb.ResourceOptionPBImpl.getOverCommitTimeout(ResourceOptionPBImpl.java:63)
   at org.apache.hadoop.yarn.server.resourcemanager.TestResourceOption.testGetOverCommitTimeout(TestResourceOption.java:33)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:45)
   at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:15)
   at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:42)
   at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:20)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:263)
   at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:68)
   at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:47)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:231)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:60)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:50)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:222)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:300)
   at org.eclipse.jdt.internal.junit4.runner.JUnit4TestReference.run(JUnit4TestReference.java:50)
   at org.eclipse.jdt.internal.junit.runner.TestExecution.run(TestExecution.java:38)
   at org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.runTests(RemoteTestRunner.java:467)
   at org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.runTests(RemoteTestRunner.java:683)
   at org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.run(RemoteTestRunner.java:390)
   at org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.main(RemoteTestRunner.java:197)
 {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (YARN-2103) Fix code bug in SerializedExceptionPBImpl

2014-05-26 Thread Binglin Chang (JIRA)
Binglin Chang created YARN-2103:
---

 Summary: Fix code bug in SerializedExceptionPBImpl
 Key: YARN-2103
 URL: https://issues.apache.org/jira/browse/YARN-2103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang


{code}
  SerializedExceptionProto proto = SerializedExceptionProto
  .getDefaultInstance();
  SerializedExceptionProto.Builder builder = null;
  boolean viaProto = false;
{code}

Since viaProto is false, we should initialize builder rather than proto.
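
To illustrate the point, here is a minimal sketch of the proto/builder/viaProto pattern. It is not the actual SerializedExceptionPBImpl code: FakeProto, FakeBuilder and SketchRecord are hypothetical stand-ins that only mimic the split between an immutable message and a mutable builder. With viaProto == false the record is in "builder mode", so the builder is the field that has to be non-null from the start.

```java
// Hypothetical stand-in for an immutable protobuf message (not generated code).
final class FakeProto {
  final String message;
  FakeProto(String message) { this.message = message; }
  static FakeProto getDefaultInstance() { return new FakeProto(""); }
  FakeBuilder toBuilder() { return new FakeBuilder(message); }
}

// Hypothetical stand-in for the mutable builder of FakeProto.
final class FakeBuilder {
  private String message;
  FakeBuilder(String message) { this.message = message; }
  FakeBuilder setMessage(String m) { this.message = m; return this; }
  String getMessage() { return message; }
  FakeProto build() { return new FakeProto(message); }
}

// Sketch of a record: viaProto says which of proto/builder is current.
class SketchRecord {
  private FakeProto proto = null;
  // viaProto starts out false, so the builder is what gets initialized.
  private FakeBuilder builder = FakeProto.getDefaultInstance().toBuilder();
  private boolean viaProto = false;

  void setMessage(String m) {
    maybeInitBuilder();
    builder.setMessage(m);
  }

  String getMessage() {
    return viaProto ? proto.message : builder.getMessage();
  }

  FakeProto getProto() {
    proto = viaProto ? proto : builder.build();
    viaProto = true;
    return proto;
  }

  // Switch back to builder mode, copying the current proto if needed.
  private void maybeInitBuilder() {
    if (viaProto) {
      builder = proto.toBuilder();
    }
    viaProto = false;
  }

  public static void main(String[] args) {
    SketchRecord r = new SketchRecord();
    r.setMessage("boom");
    System.out.println(r.getProto().message);
  }
}
```

In this sketch the getter never touches proto while viaProto is false, which is why the initial state of proto does not matter but the initial state of builder does.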






[jira] [Updated] (YARN-2051) Fix code bug and add more unit tests for PBImpls

2014-05-26 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2051:


Summary: Fix code bug and add more unit tests for PBImpls  (was: Add more 
unit tests for PBImpl that didn't get covered)

 Fix code bug and add more unit tests for PBImpls
 

 Key: YARN-2051
 URL: https://issues.apache.org/jira/browse/YARN-2051
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Junping Du
Assignee: Binglin Chang
Priority: Critical

 From YARN-2016, we can see that some bugs could exist in the PB implementation 
 of the protocol. The bad news is that most of these PBImpls don't have any unit 
 tests to verify that the info is not lost or changed after 
 serialization/deserialization. We should add more tests for them.





[jira] [Commented] (YARN-1871) We should eliminate writing *PBImpl code in YARN

2014-05-26 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14008650#comment-14008650
 ] 

Binglin Chang commented on YARN-1871:
-

Good idea on eliminating the PBImpl code. Some comments:
bq. Make record class become a non-abstract class, add simple getters/setters 
implementation.
I like using a simple getter/setter implementation rather than the current 
complex cache that mixes builder, proto and fields.
It would be better to keep the old API unchanged and hide the implementation. 
If we can generate PBImpl toProto/toRecord, why can't we generate the simple 
getters/setters too?

bq. serialization a record to Proto type using reflection
We had better generate code. Reflection can be used in test code, but I'm 
afraid using reflection for ser/de-ser in RPC code is not acceptable.

bq. There are only some limited patterns in current *PBImpl
There are some complex situations: read-only types/properties, generic types, 
recursive types, and name mismatches between record and proto. When doing 
YARN-2051, I found some situations hard to automate; they need special 
treatment.




 We should eliminate writing *PBImpl code in YARN
 

 Key: YARN-1871
 URL: https://issues.apache.org/jira/browse/YARN-1871
 Project: Hadoop YARN
  Issue Type: Improvement
  Components: api
Affects Versions: 2.4.0
Reporter: Wangda Tan
Assignee: Wangda Tan
 Attachments: YARN-1871.demo.patch


 Currently, we need to write PBImpl classes one by one. After running find . 
 -name *PBImpl*.java | xargs wc -l under the hadoop source code directory, we 
 can see there are more than 25,000 LOC. I think we should improve this, which 
 will be very helpful for YARN developers making changes to YARN protocols.
 There are only some limited patterns in the current *PBImpl classes:
 * Simple types, like string, int32, float.
 * List<?> types
 * Map<?> types
 * Enum types
 Code generation should be enough to generate such PBImpl classes.
 Some other requirements are:
 * Leave other related code alone, like service implementations (e.g. 
 ContainerManagerImpl).
 * (If possible) Forward compatibility: developers can write their own PBImpl 
 classes or generate them.





[jira] [Commented] (YARN-2051) Fix code bug and add more unit tests for PBImpls

2014-05-26 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14008654#comment-14008654
 ] 

Binglin Chang commented on YARN-2051:
-

Hi [~wangda], thanks for the notice.

 Fix code bug and add more unit tests for PBImpls
 

 Key: YARN-2051
 URL: https://issues.apache.org/jira/browse/YARN-2051
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Junping Du
Assignee: Binglin Chang
Priority: Critical

 From YARN-2016, we can see that some bugs could exist in the PB implementation 
 of the protocol. The bad news is that most of these PBImpls don't have any unit 
 tests to verify that the info is not lost or changed after 
 serialization/deserialization. We should add more tests for them.





[jira] [Updated] (YARN-2103) Fix code bug in SerializedExceptionPBImpl

2014-05-26 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2103:


Attachment: YARN-2103.v1.patch

 Fix code bug in SerializedExceptionPBImpl
 -

 Key: YARN-2103
 URL: https://issues.apache.org/jira/browse/YARN-2103
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2103.v1.patch


 {code}
   SerializedExceptionProto proto = SerializedExceptionProto
   .getDefaultInstance();
   SerializedExceptionProto.Builder builder = null;
   boolean viaProto = false;
 {code}
 Since viaProto is false, we should initialize builder rather than proto.





[jira] [Commented] (YARN-2088) Fix code bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder

2014-05-22 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14005722#comment-14005722
 ] 

Binglin Chang commented on YARN-2088:
-

Hi Zhiguo, thanks for the comments, nice catch.
Those two lines are used in every record class, so deleting them in a single 
place would actually break the code convention, and it's not related to this jira.
We may discuss whether to delete them all in another jira.


 Fix code bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder
 

 Key: YARN-2088
 URL: https://issues.apache.org/jira/browse/YARN-2088
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2088.v1.patch


 Some fields (sets, lists) are added to proto builders multiple times. We need 
 to clear those fields before adding, otherwise the resulting proto contains 
 extra contents.





[jira] [Commented] (YARN-2030) Use StateMachine to simplify handleStoreEvent() in RMStateStore

2014-05-22 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14005753#comment-14005753
 ] 

Binglin Chang commented on YARN-2030:
-

Hi Jian He,
Thanks for the comments. It looks like the PBImpl already has ProtoBase as its 
superclass, so we can't change the interface to an abstract class.

{code}
public class ApplicationAttemptStateDataPBImpl
    extends ProtoBase<ApplicationAttemptStateDataProto> 
    implements ApplicationAttemptStateData {
{code}


 Use StateMachine to simplify handleStoreEvent() in RMStateStore
 ---

 Key: YARN-2030
 URL: https://issues.apache.org/jira/browse/YARN-2030
 Project: Hadoop YARN
  Issue Type: Improvement
Reporter: Junping Du
Assignee: Binglin Chang
 Attachments: YARN-2030.v1.patch, YARN-2030.v2.patch


 Now the logic to handle different store events in handleStoreEvent() is as 
 follows:
 {code}
 if (event.getType().equals(RMStateStoreEventType.STORE_APP)
 || event.getType().equals(RMStateStoreEventType.UPDATE_APP)) {
   ...
   if (event.getType().equals(RMStateStoreEventType.STORE_APP)) {
 ...
   } else {
 ...
   }
   ...
   try {
 if (event.getType().equals(RMStateStoreEventType.STORE_APP)) {
   ...
 } else {
   ...
 }
   } 
   ...
 } else if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)
 || event.getType().equals(RMStateStoreEventType.UPDATE_APP_ATTEMPT)) {
   ...
   if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)) {
 ...
   } else {
 ...
   }
 ...
 if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)) {
   ...
 } else {
   ...
 }
   }
   ...
 } else if (event.getType().equals(RMStateStoreEventType.REMOVE_APP)) {
 ...
 } else {
   ...
 }
 }
 {code}
 This not only confuses people but also easily leads to mistakes. We may 
 leverage a state machine to simplify this, even with no state transitions.
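
As a rough illustration of the idea, the nested if/else chain above can be replaced by a table that maps each event type to one handler. The sketch below is an assumption-laden simplification: it uses a plain EnumMap rather than the YARN state machine classes, and StoreEventType, StoreEventDispatcher and the string log are hypothetical names invented for the example.

```java
import java.util.ArrayList;
import java.util.EnumMap;
import java.util.List;
import java.util.Map;
import java.util.function.Consumer;

// Hypothetical event types mirroring the names used in the snippet above.
enum StoreEventType {
  STORE_APP, UPDATE_APP, STORE_APP_ATTEMPT, UPDATE_APP_ATTEMPT, REMOVE_APP
}

class StoreEventDispatcher {
  // Records what each handler did, so the behaviour is easy to inspect.
  final List<String> log = new ArrayList<>();

  // One handler per event type instead of repeated equals() checks.
  private final Map<StoreEventType, Consumer<String>> handlers =
      new EnumMap<>(StoreEventType.class);

  StoreEventDispatcher() {
    handlers.put(StoreEventType.STORE_APP, id -> log.add("store app " + id));
    handlers.put(StoreEventType.UPDATE_APP, id -> log.add("update app " + id));
    handlers.put(StoreEventType.STORE_APP_ATTEMPT, id -> log.add("store attempt " + id));
    handlers.put(StoreEventType.UPDATE_APP_ATTEMPT, id -> log.add("update attempt " + id));
    handlers.put(StoreEventType.REMOVE_APP, id -> log.add("remove app " + id));
  }

  void handle(StoreEventType type, String id) {
    Consumer<String> handler = handlers.get(type);
    if (handler == null) {
      log.add("unknown event " + type);
      return;
    }
    handler.accept(id);
  }

  public static void main(String[] args) {
    StoreEventDispatcher dispatcher = new StoreEventDispatcher();
    dispatcher.handle(StoreEventType.STORE_APP, "app_1");
    dispatcher.handle(StoreEventType.REMOVE_APP, "app_1");
    System.out.println(dispatcher.log);
  }
}
```

Each event type now has exactly one place where its logic lives, so store and update handling can no longer be interleaved by accident.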





[jira] [Updated] (YARN-2088) Fix code bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder

2014-05-21 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2088:


Attachment: YARN-2088.v1.patch

Bugs and fixes:
1. clear the builder fields before adding Sets/Lists
2. remove the unnecessary maybeInitBuilder call in mergeLocalToBuilder
3. we don't need to construct the Iterable manually; just use the guava library
4. the limit property is not set properly in mergeLocalToBuilder, which may 
cause it to be reset to Long.MAX...
5. add a test assertion in TestGetApplicationsRequest to verify the bug

I ran the test on my local laptop: it failed before the patch and succeeded 
after the patch. 
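
Item 1 can be shown with a small sketch. This is not the GetApplicationsRequestPBImpl code: TagsBuilder and TagsRecord are hypothetical stand-ins for a generated builder with one repeated field and a record that caches a local Java list. The only point is that merging the local list into the builder has to be idempotent, because the merge can run more than once.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for a generated builder with one repeated field.
class TagsBuilder {
  private final List<String> tags = new ArrayList<>();

  TagsBuilder addAllTags(Iterable<String> values) {
    for (String v : values) {
      tags.add(v);
    }
    return this;
  }

  TagsBuilder clearTags() {
    tags.clear();
    return this;
  }

  List<String> build() {
    return new ArrayList<>(tags);
  }
}

// Hypothetical record that keeps a local copy of the repeated field.
class TagsRecord {
  private final TagsBuilder builder = new TagsBuilder();
  private List<String> localTags;

  void setTags(List<String> tags) {
    this.localTags = new ArrayList<>(tags);
  }

  // Buggy merge: appends the local list again on every call.
  List<String> getProtoWithoutClear() {
    if (localTags != null) {
      builder.addAllTags(localTags);
    }
    return builder.build();
  }

  // Fixed merge: clear the field first so repeated calls give the same result.
  List<String> getProtoWithClear() {
    if (localTags != null) {
      builder.clearTags();
      builder.addAllTags(localTags);
    }
    return builder.build();
  }

  public static void main(String[] args) {
    TagsRecord buggy = new TagsRecord();
    buggy.setTags(Arrays.asList("a", "b"));
    buggy.getProtoWithoutClear();
    System.out.println(buggy.getProtoWithoutClear());

    TagsRecord fixed = new TagsRecord();
    fixed.setTags(Arrays.asList("a", "b"));
    fixed.getProtoWithClear();
    System.out.println(fixed.getProtoWithClear());
  }
}
```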


 Fix code bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder
 

 Key: YARN-2088
 URL: https://issues.apache.org/jira/browse/YARN-2088
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2088.v1.patch


 Some fields (sets, lists) are added to proto builders multiple times. We need 
 to clear those fields before adding, otherwise the resulting proto contains 
 extra contents.





[jira] [Commented] (YARN-2088) Fix code bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder

2014-05-21 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14004803#comment-14004803
 ] 

Binglin Chang commented on YARN-2088:
-

Based on recent bugs related to API records/PBImpls, I have some doubts about 
the general patterns used in PBImpls (Java fields mixed with proto objects, 
cached state), which cause lots of redundant code and confusion. Changing 
that code is a mental challenge and can easily introduce new bugs...


 Fix code bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder
 

 Key: YARN-2088
 URL: https://issues.apache.org/jira/browse/YARN-2088
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Binglin Chang
Assignee: Binglin Chang
 Attachments: YARN-2088.v1.patch


 Some fields (sets, lists) are added to proto builders multiple times. We need 
 to clear those fields before adding, otherwise the resulting proto contains 
 extra contents.





[jira] [Updated] (YARN-2030) Use StateMachine to simplify handleStoreEvent() in RMStateStore

2014-05-20 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2030:


Attachment: YARN-2030.v1.patch

Attached the patch. 
This is a code refactoring; TestFSRMStateStore and TestZKRMStateStore already 
cover the code here, so no additional tests are added.



 Use StateMachine to simplify handleStoreEvent() in RMStateStore
 ---

 Key: YARN-2030
 URL: https://issues.apache.org/jira/browse/YARN-2030
 Project: Hadoop YARN
  Issue Type: Improvement
Reporter: Junping Du
Assignee: Binglin Chang
 Attachments: YARN-2030.v1.patch


 Now the logic to handle different store events in handleStoreEvent() is as 
 follows:
 {code}
 if (event.getType().equals(RMStateStoreEventType.STORE_APP)
 || event.getType().equals(RMStateStoreEventType.UPDATE_APP)) {
   ...
   if (event.getType().equals(RMStateStoreEventType.STORE_APP)) {
 ...
   } else {
 ...
   }
   ...
   try {
 if (event.getType().equals(RMStateStoreEventType.STORE_APP)) {
   ...
 } else {
   ...
 }
   } 
   ...
 } else if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)
 || event.getType().equals(RMStateStoreEventType.UPDATE_APP_ATTEMPT)) {
   ...
   if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)) {
 ...
   } else {
 ...
   }
 ...
 if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)) {
   ...
 } else {
   ...
 }
   }
   ...
 } else if (event.getType().equals(RMStateStoreEventType.REMOVE_APP)) {
 ...
 } else {
   ...
 }
 }
 {code}
 This not only confuses people but also easily leads to mistakes. We may 
 leverage a state machine to simplify this, even with no state transitions.





[jira] [Assigned] (YARN-2051) Add more unit tests for PBImpl that didn't get covered

2014-05-20 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang reassigned YARN-2051:
---

Assignee: Binglin Chang

 Add more unit tests for PBImpl that didn't get covered
 --

 Key: YARN-2051
 URL: https://issues.apache.org/jira/browse/YARN-2051
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Junping Du
Assignee: Binglin Chang
Priority: Critical

 From YARN-2016, we can see that some bugs could exist in the PB implementation 
 of the protocol. The bad news is that most of these PBImpls don't have any unit 
 tests to verify that the info is not lost or changed after 
 serialization/deserialization. We should add more tests for them.





[jira] [Updated] (YARN-2030) Use StateMachine to simplify handleStoreEvent() in RMStateStore

2014-05-20 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-2030:


Attachment: YARN-2030.v2.patch

Attached the v2 patch to fix findbugs warnings.

 Use StateMachine to simplify handleStoreEvent() in RMStateStore
 ---

 Key: YARN-2030
 URL: https://issues.apache.org/jira/browse/YARN-2030
 Project: Hadoop YARN
  Issue Type: Improvement
Reporter: Junping Du
Assignee: Binglin Chang
 Attachments: YARN-2030.v1.patch, YARN-2030.v2.patch


 Now the logic to handle different store events in handleStoreEvent() is as 
 follows:
 {code}
 if (event.getType().equals(RMStateStoreEventType.STORE_APP)
 || event.getType().equals(RMStateStoreEventType.UPDATE_APP)) {
   ...
   if (event.getType().equals(RMStateStoreEventType.STORE_APP)) {
 ...
   } else {
 ...
   }
   ...
   try {
 if (event.getType().equals(RMStateStoreEventType.STORE_APP)) {
   ...
 } else {
   ...
 }
   } 
   ...
 } else if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)
 || event.getType().equals(RMStateStoreEventType.UPDATE_APP_ATTEMPT)) {
   ...
   if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)) {
 ...
   } else {
 ...
   }
 ...
 if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)) {
   ...
 } else {
   ...
 }
   }
   ...
 } else if (event.getType().equals(RMStateStoreEventType.REMOVE_APP)) {
 ...
 } else {
   ...
 }
 }
 {code}
 This not only confuses people but also easily leads to mistakes. We may 
 leverage a state machine to simplify this, even with no state transitions.





[jira] [Commented] (YARN-2051) Add more unit tests for PBImpl that didn't get covered

2014-05-20 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14004371#comment-14004371
 ] 

Binglin Chang commented on YARN-2051:
-

I thought about this. Most of the PB serde validation involves the following 
procedure:
1. set a property on the record using a value (v0)
2. get the proto obj from the record
3. create a new record from the proto obj
4. get the property value (v1) from the new record, validate v0 == v1
This can be automated for all set/get pairs: we just need to use reflection to 
find all get/set pairs of the record class and test each pair. By doing this, 
we save lots of testing code. In the future, when we add new properties to a 
record, there is no need to add/change the testing code :) 

Note: those records look like Java beans, but many of them do not follow 
strict Java bean rules. I tried to leverage commons-beanutils, but it seems it 
is not flexible enough. I will make a patch soon.
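
The four steps above can be sketched roughly as follows. This is only an illustration of the reflection idea, not the planned patch: DemoRecord is a hypothetical record, its copy() method stands in for the real record -> proto -> record round trip, and sampleValue() only knows two property types.

```java
import java.lang.reflect.Method;
import java.util.ArrayList;
import java.util.List;
import java.util.function.UnaryOperator;

// Hypothetical record; copy() stands in for serialization + deserialization.
class DemoRecord {
  private String name;
  private int priority;

  public String getName() { return name; }
  public void setName(String name) { this.name = name; }
  public int getPriority() { return priority; }
  public void setPriority(int priority) { this.priority = priority; }

  DemoRecord copy() {
    DemoRecord r = new DemoRecord();
    r.name = name;
    r.priority = priority;
    return r;
  }
}

class RoundTripChecker {
  // Returns the property names whose value did not survive the round trip.
  static <T> List<String> findMismatches(T record, UnaryOperator<T> roundTrip) {
    List<String> mismatches = new ArrayList<>();
    Class<?> type = record.getClass();
    try {
      for (Method setter : type.getMethods()) {
        if (!setter.getName().startsWith("set") || setter.getParameterCount() != 1) {
          continue;
        }
        String property = setter.getName().substring(3);
        Method getter;
        try {
          getter = type.getMethod("get" + property);
        } catch (NoSuchMethodException e) {
          continue; // no matching getter, so not a get/set pair
        }
        Object v0 = sampleValue(setter.getParameterTypes()[0]);
        if (v0 == null) {
          continue; // property type not covered by this sketch
        }
        setter.invoke(record, v0);             // step 1: set v0
        T restored = roundTrip.apply(record);  // steps 2-3: round trip
        Object v1 = getter.invoke(restored);   // step 4: read v1
        if (!v0.equals(v1)) {
          mismatches.add(property);
        }
      }
    } catch (ReflectiveOperationException e) {
      throw new IllegalStateException(e);
    }
    return mismatches;
  }

  // Sample values for the few property types this sketch supports.
  static Object sampleValue(Class<?> type) {
    if (type == String.class) {
      return "sample";
    }
    if (type == int.class) {
      return 42;
    }
    return null;
  }

  public static void main(String[] args) {
    System.out.println(findMismatches(new DemoRecord(), DemoRecord::copy));
  }
}
```

A new property added to DemoRecord with a supported type would be picked up by the loop without touching the checker, which is the saving described above.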



 Add more unit tests for PBImpl that didn't get covered
 --

 Key: YARN-2051
 URL: https://issues.apache.org/jira/browse/YARN-2051
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Junping Du
Assignee: Binglin Chang
Priority: Critical

 From YARN-2016, we can see that some bugs could exist in the PB implementation 
 of the protocol. The bad news is that most of these PBImpls don't have any unit 
 tests to verify that the info is not lost or changed after 
 serialization/deserialization. We should add more tests for them.





[jira] [Assigned] (YARN-2030) Use StateMachine to simplify handleStoreEvent() in RMStateStore

2014-05-19 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang reassigned YARN-2030:
---

Assignee: Binglin Chang

 Use StateMachine to simplify handleStoreEvent() in RMStateStore
 ---

 Key: YARN-2030
 URL: https://issues.apache.org/jira/browse/YARN-2030
 Project: Hadoop YARN
  Issue Type: Improvement
Reporter: Junping Du
Assignee: Binglin Chang

 Now the logic to handle different store events in handleStoreEvent() is as 
 follows:
 {code}
 if (event.getType().equals(RMStateStoreEventType.STORE_APP)
 || event.getType().equals(RMStateStoreEventType.UPDATE_APP)) {
   ...
   if (event.getType().equals(RMStateStoreEventType.STORE_APP)) {
 ...
   } else {
 ...
   }
   ...
   try {
 if (event.getType().equals(RMStateStoreEventType.STORE_APP)) {
   ...
 } else {
   ...
 }
   } 
   ...
 } else if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)
 || event.getType().equals(RMStateStoreEventType.UPDATE_APP_ATTEMPT)) {
   ...
   if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)) {
 ...
   } else {
 ...
   }
 ...
 if (event.getType().equals(RMStateStoreEventType.STORE_APP_ATTEMPT)) {
   ...
 } else {
   ...
 }
   }
   ...
 } else if (event.getType().equals(RMStateStoreEventType.REMOVE_APP)) {
 ...
 } else {
   ...
 }
 }
 {code}
 This not only confuses people but also easily leads to mistakes. We may 
 leverage a state machine to simplify this, even with no state transitions.





[jira] [Commented] (YARN-1872) TestDistributedShell occasionally fails in trunk

2014-05-15 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13998493#comment-13998493
 ] 

Binglin Chang commented on YARN-1872:
-

Hi, testDSShell fails with an assertion failure; I don't know whether it is relevant:

https://builds.apache.org/job/Hadoop-Yarn-trunk/561/consoleText

testDSShell(org.apache.hadoop.yarn.applications.distributedshell.TestDistributedShell)
  Time elapsed: 27.557 sec   FAILURE!
java.lang.AssertionError: expected:<1> but was:<0>
at org.junit.Assert.fail(Assert.java:88)
at org.junit.Assert.failNotEquals(Assert.java:743)
at org.junit.Assert.assertEquals(Assert.java:118)
at org.junit.Assert.assertEquals(Assert.java:555)
at org.junit.Assert.assertEquals(Assert.java:542)
at org.apache.hadoop.yarn.applications.distributedshell.TestDistributedShell.testDSShell(TestDistributedShell.java:198)


Results :

Failed tests: 
  TestDistributedShell.testDSShell:198 expected:<1> but was:<0>

Tests run: 8, Failures: 1, Errors: 0, Skipped: 0
 

 TestDistributedShell occasionally fails in trunk
 

 Key: YARN-1872
 URL: https://issues.apache.org/jira/browse/YARN-1872
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Ted Yu
Assignee: Hong Zhiguo
 Attachments: TestDistributedShell.out, YARN-1872.patch


 From https://builds.apache.org/job/Hadoop-Yarn-trunk/520/console :
 TestDistributedShell#testDSShellWithCustomLogPropertyFile failed and 
 TestDistributedShell#testDSShell timed out.





[jira] [Commented] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-16 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13849169#comment-13849169
 ] 

Binglin Chang commented on YARN-1463:
-

Thanks for the input Haohui and Vinod. I will update the patch soon. 

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang
 Attachments: YARN-1463.v1.patch, YARN-1463.v2.patch


 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)


[jira] [Updated] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-05 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1463:


Attachment: YARN-1463.v2.patch

Attached a new patch adding comments about the affected tests.

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang
 Attachments: YARN-1463.v1.patch, YARN-1463.v2.patch


 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-12-05 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1450:


Attachment: YARN-1450.v5.patch

Thanks for the explanation and review, Vinod. Attached a new patch addressing 
your comments.

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 YARN-1450.v3.patch, YARN-1450.v4.patch, YARN-1450.v5.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Commented] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-04 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13839022#comment-13839022
 ] 

Binglin Chang commented on YARN-1463:
-

There are already 3 tests affected by this. Maybe it is better to add a 
comment in WebApps rather than in each individual test?

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang
 Attachments: YARN-1463.000.patch, YARN-1463.v1.patch


 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}





[jira] [Updated] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-12-04 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1450:


Attachment: YARN-1450.v4.patch

Attached a new patch. Changes:
Wrote a simple standalone AM to be used in the test.

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 YARN-1450.v3.patch, YARN-1450.v4.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Assigned] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-02 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang reassigned YARN-1463:
---

Assignee: Binglin Chang

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang

 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}





[jira] [Commented] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-02 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13836713#comment-13836713
 ] 

Binglin Chang commented on YARN-1463:
-

HDFS-5545 introduced this bug: it broke the original logic that decides 
whether to init spnego.

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang

 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}





[jira] [Updated] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-02 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1463:


Attachment: YARN-1463.v1.patch

Attached a patch with a simple fix; the test now passes.

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang
 Attachments: YARN-1463.v1.patch


 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}





[jira] [Commented] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-02 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837251#comment-13837251
 ] 

Binglin Chang commented on YARN-1463:
-

Hi Haohui, 
I originally did the same as your patch, but the test still failed with other 
errors on my MacBook Pro.
So I added more checks, just as the original code did, and it now passes.
{code}
Running org.apache.hadoop.yarn.server.TestContainerManagerSecurity
Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 26.663 sec  
FAILURE! - in org.apache.hadoop.yarn.server.TestContainerManagerSecurity
testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
  Time elapsed: 1.735 sec   ERROR!
org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
ResourceManager failed to start. Final state is STOPPED
at 
org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
at 
org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
at 
org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
at 
org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
at 
org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
{code}

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang
 Attachments: YARN-1463.000.patch, YARN-1463.v1.patch


 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}





[jira] [Commented] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-02 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837257#comment-13837257
 ] 

Binglin Chang commented on YARN-1463:
-

Detail log:

2013-12-03 10:30:44,577 WARN  [Thread-321] mortbay.log (Slf4jLog.java:warn(89)) 
- Failed startup of context 
org.mortbay.jetty.webapp.WebAppContext@9ba0281{/,file:/Users/decster/projects/hadoop-trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-common/target/classes/webapps/cluster}
javax.servlet.ServletException: javax.servlet.ServletException: Principal not 
defined in configuration
at 
org.apache.hadoop.security.authentication.server.KerberosAuthenticationHandler.init(KerberosAuthenticationHandler.java:203)
at 
org.apache.hadoop.security.authentication.server.AuthenticationFilter.init(AuthenticationFilter.java:146)
at org.mortbay.jetty.servlet.FilterHolder.doStart(FilterHolder.java:97)
at 
org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
at 
org.mortbay.jetty.servlet.ServletHandler.initialize(ServletHandler.java:713)
at org.mortbay.jetty.servlet.Context.startContext(Context.java:140)
at 
org.mortbay.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1282)
at 
org.mortbay.jetty.handler.ContextHandler.doStart(ContextHandler.java:518)
at 
org.mortbay.jetty.webapp.WebAppContext.doStart(WebAppContext.java:499)
at 
org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
at 
org.mortbay.jetty.handler.HandlerCollection.doStart(HandlerCollection.java:152)
at 
org.mortbay.jetty.handler.ContextHandlerCollection.doStart(ContextHandlerCollection.java:156)
at 
org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
at 
org.mortbay.jetty.handler.HandlerWrapper.doStart(HandlerWrapper.java:130)
at org.mortbay.jetty.Server.doStart(Server.java:224)
at 
org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
at org.apache.hadoop.http.HttpServer.start(HttpServer.java:914)
at org.apache.hadoop.yarn.webapp.WebApps$Builder.start(WebApps.java:245)
at 
org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.startWepApp(ResourceManager.java:820)
at 
org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$RMActiveServices.serviceStart(ResourceManager.java:471)
at 
org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
at 
org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.startActiveServices(ResourceManager.java:844)
at 
org.apache.hadoop.yarn.server.resourcemanager.RMHAProtocolService.transitionToActive(RMHAProtocolService.java:187)
at 
org.apache.hadoop.yarn.server.resourcemanager.RMHAProtocolService.serviceStart(RMHAProtocolService.java:101)
at 
org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
at 
org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
at 
org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:871)
at 
org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
at 
org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper$3.run(MiniYARNCluster.java:242)

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang
 Attachments: YARN-1463.000.patch, YARN-1463.v1.patch


 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}





[jira] [Commented] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-02 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837258#comment-13837258
 ] 

Binglin Chang commented on YARN-1463:
-

As the code shows, HttpServer does not null-check the keys:

{code}
  private void initSpnego(Configuration conf, String hostName,
      String usernameConfKey, String keytabConfKey) throws IOException {
    Map<String, String> params = new HashMap<String, String>();
    String principalInConf = conf.get(usernameConfKey);
    if (principalInConf != null && !principalInConf.isEmpty()) {
      params.put("kerberos.principal", SecurityUtil.getServerPrincipal(
          principalInConf, hostName));
    }
    String httpKeytab = conf.get(keytabConfKey);
    if (httpKeytab != null && !httpKeytab.isEmpty()) {
      params.put("kerberos.keytab", httpKeytab);
    }
    params.put(AuthenticationFilter.AUTH_TYPE, "kerberos");

    defineFilter(webAppContext, SPNEGO_FILTER,
        AuthenticationFilter.class.getName(), params, null);
  }
{code}
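
To illustrate why the missing null check matters, here is a minimal, self-contained sketch. It does not use Hadoop's Configuration; it calls java.util.Properties directly and assumes that a null key ends up in Properties.getProperty. The class and method names (NullKeyLookup, getOrNull, unguardedLookupThrows) and the sample key are hypothetical.

```java
import java.util.Properties;

final class NullKeyLookup {
    /** Returns the property value, or null when the key itself is null. */
    static String getOrNull(Properties props, String key) {
        // Guard first: Properties.getProperty(null) throws NullPointerException.
        return key == null ? null : props.getProperty(key);
    }

    /** Reports whether an unguarded lookup with this key throws an NPE. */
    static boolean unguardedLookupThrows(Properties props, String key) {
        try {
            props.getProperty(key);
            return false;
        } catch (NullPointerException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        Properties props = new Properties();
        // Hypothetical key and value, for illustration only.
        props.setProperty("some.principal.key", "HTTP/_HOST@EXAMPLE.COM");

        System.out.println(unguardedLookupThrows(props, null));      // unguarded null key
        System.out.println(getOrNull(props, null));                  // guarded null key
        System.out.println(getOrNull(props, "some.principal.key"));  // normal lookup
    }
}
```

The guard lives in the caller here, so a null key is treated the same as an unset value instead of aborting startup.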

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang
 Attachments: YARN-1463.000.patch, YARN-1463.v1.patch


 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}





[jira] [Commented] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-02 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837278#comment-13837278
 ] 

Binglin Chang commented on YARN-1463:
-

bq. Webapps enables spnego authentication when security is enabled but no 
principals / keytabs are passed in. This configuration is wrong and it should 
fail.
I thought the same, but when I looked at the original code:
{code}
  if (spnegoPrincipalKey == null
      || conf.get(spnegoPrincipalKey, "").isEmpty()) {
    LOG.warn("Principal for spnego filter is not set");
    initSpnego = false;
  }
  if (spnegoKeytabKey == null
      || conf.get(spnegoKeytabKey, "").isEmpty()) {
    LOG.warn("Keytab for spnego filter is not set");
    initSpnego = false;
  }
{code}

The code logs a WARN instead of an ERROR, which looks like intentional 
behavior, so I kept the original behavior just to be safe. Thoughts?
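
A minimal sketch of the warn-and-skip behavior described above, written without Hadoop dependencies. A plain Map stands in for Configuration and java.util.logging for the Hadoop logger. The names SpnegoGuard, isSet and shouldInitSpnego, and the securityEnabled parameter, are assumptions for illustration and not the actual HttpServer code.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.logging.Logger;

final class SpnegoGuard {
    private static final Logger LOG = Logger.getLogger(SpnegoGuard.class.getName());

    /** True only when the key is non-null and maps to a non-empty value. */
    static boolean isSet(Map<String, String> conf, String key) {
        if (key == null) {
            return false;
        }
        String value = conf.get(key);
        return value != null && !value.isEmpty();
    }

    /**
     * Decides whether the SPNEGO filter should be initialized.
     * A missing principal or keytab only logs a warning and skips the filter,
     * so the server can still start.
     */
    static boolean shouldInitSpnego(Map<String, String> conf, boolean securityEnabled,
                                    String principalKey, String keytabKey) {
        boolean initSpnego = securityEnabled;
        if (!isSet(conf, principalKey)) {
            LOG.warning("Principal for spnego filter is not set");
            initSpnego = false;
        }
        if (!isSet(conf, keytabKey)) {
            LOG.warning("Keytab for spnego filter is not set");
            initSpnego = false;
        }
        return initSpnego;
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        // Security enabled but no keys configured: warn and skip the filter.
        System.out.println(shouldInitSpnego(conf, true, null, null));
    }
}
```

Failing hard instead would mean replacing the two warnings with an exception, which is the alternative discussed in this thread.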

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang
 Attachments: YARN-1463.000.patch, YARN-1463.v1.patch


 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}





[jira] [Commented] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-02 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837310#comment-13837310
 ] 

Binglin Chang commented on YARN-1463:
-

bq. but the test is broken then
I am sorry, what do you mean? Which test? With my original patch, I didn't see 
any test fail.

 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang
 Attachments: YARN-1463.000.patch, YARN-1463.v1.patch


 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}





[jira] [Commented] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-12-02 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837400#comment-13837400
 ] 

Binglin Chang commented on YARN-1450:
-

Hi Vinod, 
Thanks for the explanation. I think I was confused about the purpose of 
UnmanagedAM. The documentation of UnmanagedAM says:
bq. The UnmanagedLauncher is a simple client that launches and unmanaged AM. An 
unmanagedAM is an AM that is not launched and managed by the RM.
I thought it was meant as a tool: new apps can use it to launch an AM locally 
without creating/starting a YarnClient, and old apps that have a workable 
standalone AM can be launched by it too. I did not consider it a tool for 
launching old client-based apps; of course it will fail if it is used to launch 
arbitrary old apps. This leads to two questions:
1. What is the intended usage of UnmanagedAM? To support new special apps that 
need a local AM, or to support all apps? If the latter, it is far from complete, 
so either disable this test or write/use a simple AM in the test.
2. Is distributedshell.ApplicationMaster not supposed to be launched standalone, 
and must it only be called by distributedshell.Client?


 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 YARN-1450.v3.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Commented] (YARN-1464) TestRMNMSecretKeys#testNMUpdation fails with NullPointerException

2013-12-02 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837416#comment-13837416
 ] 

Binglin Chang commented on YARN-1464:
-

It looks like this has the same cause as YARN-1463. Marking as duplicate.

 TestRMNMSecretKeys#testNMUpdation fails with NullPointerException
 -

 Key: YARN-1464
 URL: https://issues.apache.org/jira/browse/YARN-1464
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu

 Here is the stack trace:
 {code}
 testNMUpdation(org.apache.hadoop.yarn.server.TestRMNMSecretKeys)  Time 
 elapsed: 2.704 sec   ERROR!
 java.lang.NullPointerException: null
   at java.util.Hashtable.get(Hashtable.java:334)
   at java.util.Properties.getProperty(Properties.java:932)
   at org.apache.hadoop.conf.Configuration.get(Configuration.java:874)
   at org.apache.hadoop.http.HttpServer.initSpnego(HttpServer.java:892)
   at org.apache.hadoop.http.HttpServer.access$100(HttpServer.java:101)
   at org.apache.hadoop.http.HttpServer$Builder.build(HttpServer.java:323)
   at org.apache.hadoop.yarn.webapp.WebApps$Builder.start(WebApps.java:232)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.startWepApp(ResourceManager.java:820)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$RMActiveServices.serviceStart(ResourceManager.java:471)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.startActiveServices(ResourceManager.java:844)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.RMHAProtocolService.transitionToActive(RMHAProtocolService.java:187)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.RMHAProtocolService.serviceStart(RMHAProtocolService.java:101)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:871)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestRMNMSecretKeys.validateRMNMKeyExchange(TestRMNMSecretKeys.java:69)
   at 
 org.apache.hadoop.yarn.server.TestRMNMSecretKeys.testNMUpdation(TestRMNMSecretKeys.java:49)
 {code}





[jira] [Resolved] (YARN-1464) TestRMNMSecretKeys#testNMUpdation fails with NullPointerException

2013-12-02 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang resolved YARN-1464.
-

Resolution: Duplicate

 TestRMNMSecretKeys#testNMUpdation fails with NullPointerException
 -

 Key: YARN-1464
 URL: https://issues.apache.org/jira/browse/YARN-1464
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu

 Here is the stack trace:
 {code}
 testNMUpdation(org.apache.hadoop.yarn.server.TestRMNMSecretKeys)  Time 
 elapsed: 2.704 sec   ERROR!
 java.lang.NullPointerException: null
   at java.util.Hashtable.get(Hashtable.java:334)
   at java.util.Properties.getProperty(Properties.java:932)
   at org.apache.hadoop.conf.Configuration.get(Configuration.java:874)
   at org.apache.hadoop.http.HttpServer.initSpnego(HttpServer.java:892)
   at org.apache.hadoop.http.HttpServer.access$100(HttpServer.java:101)
   at org.apache.hadoop.http.HttpServer$Builder.build(HttpServer.java:323)
   at org.apache.hadoop.yarn.webapp.WebApps$Builder.start(WebApps.java:232)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.startWepApp(ResourceManager.java:820)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$RMActiveServices.serviceStart(ResourceManager.java:471)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.startActiveServices(ResourceManager.java:844)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.RMHAProtocolService.transitionToActive(RMHAProtocolService.java:187)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.RMHAProtocolService.serviceStart(RMHAProtocolService.java:101)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at 
 org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:871)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestRMNMSecretKeys.validateRMNMKeyExchange(TestRMNMSecretKeys.java:69)
   at 
 org.apache.hadoop.yarn.server.TestRMNMSecretKeys.testNMUpdation(TestRMNMSecretKeys.java:49)
 {code}





[jira] [Commented] (YARN-1463) TestContainerManagerSecurity#testContainerManager fails

2013-12-02 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837420#comment-13837420
 ] 

Binglin Chang commented on YARN-1463:
-

It looks like another test is affected by this: TestRMNMSecretKeys#testNMUpdation.
These tests all expect that the RM can start even if security is enabled but the 
keys are empty. I don't know whether this is intended and legal. Should we keep 
the old behavior, or can someone give more input?


 TestContainerManagerSecurity#testContainerManager fails
 ---

 Key: YARN-1463
 URL: https://issues.apache.org/jira/browse/YARN-1463
 Project: Hadoop YARN
  Issue Type: Test
Reporter: Ted Yu
Assignee: Binglin Chang
 Attachments: YARN-1463.000.patch, YARN-1463.v1.patch


 Here is stack trace:
 {code}
 testContainerManager[1](org.apache.hadoop.yarn.server.TestContainerManagerSecurity)
   Time elapsed: 1.756 sec   ERROR!
 org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.io.IOException: 
 ResourceManager failed to start. Final state is STOPPED
   at 
 org.apache.hadoop.yarn.server.MiniYARNCluster$ResourceManagerWrapper.serviceStart(MiniYARNCluster.java:253)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:121)
   at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
   at 
 org.apache.hadoop.yarn.server.TestContainerManagerSecurity.testContainerManager(TestContainerManagerSecurity.java:110)
 {code}





[jira] [Assigned] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang reassigned YARN-1450:
---

Assignee: Binglin Chang

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Updated] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1450:


Attachment: YARN-1450.v1.patch

Bug cause:
YARN-1303 removed the --shell_command parameter from DistShell.AppMaster because 
DistShell.client already has a --shell_command parameter. Unfortunately, 
TestUnmanagedAMLauncher launches DistShell.AppMaster directly, which causes this bug.
Patch changes:
Add the --shell_command parameter back to DistShell.AppMaster; this parameter takes 
effect only if the shell command file does not exist.
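
The fallback rule in the patch description can be sketched roughly as follows. This is not the code from YARN-1450.v1.patch; ShellCommandResolver and its resolve method are hypothetical names, and the sketch only assumes that the command file, when present, takes precedence over the --shell_command value.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

final class ShellCommandResolver {
    /**
     * Returns the shell command to run: the content of the command file when
     * the file exists, otherwise the value passed via --shell_command.
     */
    static String resolve(Path commandFile, String shellCommandArg) {
        if (commandFile != null && Files.exists(commandFile)) {
            try {
                byte[] bytes = Files.readAllBytes(commandFile);
                return new String(bytes, StandardCharsets.UTF_8).trim();
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
        // No command file: fall back to the command-line parameter.
        if (shellCommandArg == null || shellCommandArg.isEmpty()) {
            throw new IllegalArgumentException(
                "No shell command file and no --shell_command value");
        }
        return shellCommandArg;
    }
}
```

With this ordering, a launch through the client (which ships the command file) is unaffected, while a standalone launch of the AM can still pass the command directly.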


 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Updated] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1450:


Component/s: (was: applications)
 applications/distributed-shell

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Commented] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13834382#comment-13834382
 ] 

Binglin Chang commented on YARN-1450:
-

bq. So, if we launch the AM with --shell_command, we will meet the same issue 
as before (can not allow multiple commands separating with ; or |). Right ?
Actually there is no issue here. The AM is started standalone (there is no 
client -> RM -> NM -> launch AM chain), so we don't need to care about 
escaping/unescaping, and multiple commands separated by ; or |, or commands 
that use env variables, should be fine. There is nothing wrong with 
--shell_command itself: anything you can write to a file you can also pass on 
the command line. The issue in YARN-1303 is caused by the client trying to 
construct a whole command line without proper escaping support. 
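
To illustrate the escaping point: when a launcher hands arguments over as a list (argv), each element reaches the process verbatim, so ; and | are never interpreted by a shell. Trouble only starts when the pieces are concatenated into one command-line string that a shell parses later. A minimal sketch, not the distributed-shell code; the class ArgvSketch and its methods are hypothetical:

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration; none of these names come from Hadoop.
final class ArgvSketch {
    // Build an argument vector: "pwd;ls" stays a single, untouched argument.
    static List<String> buildArgv(String shellCommand, int numContainers) {
        return new ArrayList<>(Arrays.asList(
            "--shell_command", shellCommand,
            "--num_containers", Integer.toString(numContainers)));
    }

    // Naive concatenation into one command line. If a shell later parses
    // this string, the unquoted ';' splits it into two commands.
    static String joinUnescaped(List<String> argv) {
        return String.join(" ", argv);
    }

    public static void main(String[] args) {
        List<String> argv = buildArgv("pwd;ls", 5);
        System.out.println(argv.get(1));         // the command as one argument
        System.out.println(joinUnescaped(argv)); // ambiguous once a shell sees it
    }
}
```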

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Updated] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1450:


Attachment: (was: HADOOP-10130.v2.patch)

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Updated] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1450:


Attachment: HADOOP-10130.v2.patch

Attaching a patch generated with --no-prefix.

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Updated] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1450:


Attachment: YARN-1450.v2.patch

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Commented] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13834410#comment-13834410
 ] 

Binglin Chang commented on YARN-1450:
-

bq. JAVA_HOME /bin/java -Xmx512m 
org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster 
--shell_command pwd;ls --num_containers 5

If you pass this to the current distshell client, it will fail too, like your 
final example in YARN-1303 
[https://issues.apache.org/jira/browse/YARN-1303?focusedCommentId=13828462&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13828462]. 
You should call:

JAVA_HOME /bin/java -Xmx512m 
org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster 
--shell_command pwd;ls --num_containers 5

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Commented] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13834414#comment-13834414
 ] 

Binglin Chang commented on YARN-1450:
-

bq. Seems like the patch is undoing what YARN-1303 did - shouldn't be doing 
that.
This does not undo YARN-1303. I just added another option so that 
applications that need to call distshell.AppMaster directly can use it. It is 
fine that Distshell.Client cannot benefit from it.

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Commented] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13834424#comment-13834424
 ] 

Binglin Chang commented on YARN-1450:
-

Let me state the problem more clearly.
Story in YARN-1303: 
1. The client accepts --shell_command pwd;ls
2. The client cannot construct a whole command line with proper escaping if 
shell_command contains special chars like ;, so it writes pwd;ls to a file 
instead
3. The AM starts, reads shell_command from the file, and gets pwd;ls
Story in this jira, YARN-1450:
1. Some app launches the AM directly, without the client, so the AM accepts 
--shell_command pwd;ls directly
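
The two stories above can be sketched as one lookup on the AM side: take the command from the command-line option when it is given, otherwise read the file the client wrote. This is only an illustration under assumptions. The class ShellCommandSource, its method names, and the rule that the option wins over the file are hypothetical and not taken from the patch.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical sketch; not the actual ApplicationMaster code.
final class ShellCommandSource {
    // Client side (YARN-1303 story): persist the raw command so that no
    // escaping is needed on the AM command line.
    static void writeCommandFile(Path file, String command) throws IOException {
        Files.write(file, command.getBytes(StandardCharsets.UTF_8));
    }

    // AM side: use an explicit --shell_command value if present (YARN-1450
    // story), otherwise fall back to the file written by the client.
    static String resolve(String optionValue, Path commandFile) throws IOException {
        if (optionValue != null && !optionValue.isEmpty()) {
            return optionValue;
        }
        return new String(Files.readAllBytes(commandFile), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        Path file = Files.createTempFile("shellCommands", ".txt");
        try {
            writeCommandFile(file, "pwd;ls");
            System.out.println(resolve(null, file));   // command read from the file
            System.out.println(resolve("date", file)); // command taken from the option
        } finally {
            Files.delete(file);
        }
    }
}
```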


 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Updated] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1450:


Attachment: YARN-1450.v3.patch

The patch didn't apply because my code base was old. Uploading a new patch.

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 YARN-1450.v3.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Commented] (YARN-1450) TestUnmanagedAMLauncher#testDSShell fails on trunk

2013-11-27 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13834498#comment-13834498
 ] 

Binglin Chang commented on YARN-1450:
-

The failing test TestUnmanagedAMLauncher#testDSShell already covers this code, 
so I think a new test case is not needed.

 TestUnmanagedAMLauncher#testDSShell fails on trunk
 --

 Key: YARN-1450
 URL: https://issues.apache.org/jira/browse/YARN-1450
 Project: Hadoop YARN
  Issue Type: Bug
  Components: applications/distributed-shell
Reporter: Akira AJISAKA
Assignee: Binglin Chang
 Attachments: YARN-1450.v1.patch, YARN-1450.v2.patch, 
 YARN-1450.v3.patch, 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher-output.txt,
  
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.txt


 TestUnmanagedAMLauncher fails on trunk. The console output is
 {code}
 Running 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 35.937 sec 
  FAILURE! - in 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher
 testDSShell(org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher)
   Time elapsed: 14.558 sec   ERROR!
 java.lang.RuntimeException: Failed to receive final expected state in 
 ApplicationReport, CurrentState=ACCEPTED, 
 ExpectedStates=FINISHED,FAILED,KILLED
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.monitorApplication(UnmanagedAMLauncher.java:447)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.UnmanagedAMLauncher.run(UnmanagedAMLauncher.java:352)
   at 
 org.apache.hadoop.yarn.applications.unmanagedamlauncher.TestUnmanagedAMLauncher.testDSShell(TestUnmanagedAMLauncher.java:145)
 {code}





[jira] [Commented] (YARN-1170) yarn proto definitions should specify package as 'hadoop.yarn'

2013-09-11 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13764159#comment-13764159
 ] 

Binglin Chang commented on YARN-1170:
-

This patch only changes proto files, and the change does not affect the 
generated Java code. I checked manually by generating cpp output from those 
proto files and saw that the generated cpp files have the correct namespaces.

 yarn proto definitions should specify package as 'hadoop.yarn'
 --

 Key: YARN-1170
 URL: https://issues.apache.org/jira/browse/YARN-1170
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Arun C Murthy
Priority: Blocker
 Attachments: YARN-1170.v1.patch


 yarn proto definitions should specify package as 'hadoop.yarn' similar to 
 protos with 'hadoop.common' and 'hadoop.hdfs' in Common and HDFS respectively.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (YARN-1170) yarn proto definitions should specify package as 'hadoop.yarn'

2013-09-08 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-1170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-1170:


Attachment: YARN-1170.v1.patch

Adding a namespace to the yarn protos also causes some compile errors in 
mapreduce, so this patch also adds the hadoop.mapreduce namespace to the 
mapreduce protos.

 yarn proto definitions should specify package as 'hadoop.yarn'
 --

 Key: YARN-1170
 URL: https://issues.apache.org/jira/browse/YARN-1170
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Arun C Murthy
Priority: Blocker
 Attachments: YARN-1170.v1.patch


 yarn proto definitions should specify package as 'hadoop.yarn' similar to 
 protos with 'hadoop.common' and 'hadoop.hdfs' in Common and HDFS respectively.



[jira] [Created] (YARN-898) Snapshot support for distcp

2013-07-03 Thread Binglin Chang (JIRA)
Binglin Chang created YARN-898:
--

 Summary: Snapshot support for distcp
 Key: YARN-898
 URL: https://issues.apache.org/jira/browse/YARN-898
 Project: Hadoop YARN
  Issue Type: Bug
Affects Versions: 2.1.0-beta
Reporter: Binglin Chang


Add snapshot incremental copy ability to distcp, so we can do iterative 
consistent backup between hadoop clusters.



[jira] [Resolved] (YARN-898) Snapshot support for distcp

2013-07-03 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang resolved YARN-898.


Resolution: Invalid

Sorry, this should have been filed under HADOOP.

 Snapshot support for distcp
 ---

 Key: YARN-898
 URL: https://issues.apache.org/jira/browse/YARN-898
 Project: Hadoop YARN
  Issue Type: Bug
Affects Versions: 2.1.0-beta
Reporter: Binglin Chang

 Add snapshot incremental copy ability to distcp, so we can do iterative 
 consistent backup between hadoop clusters.



[jira] [Created] (YARN-783) RM and NM web server /logs pages link not working

2013-06-07 Thread Binglin Chang (JIRA)
Binglin Chang created YARN-783:
--

 Summary: RM and NM web server /logs pages link not working
 Key: YARN-783
 URL: https://issues.apache.org/jira/browse/YARN-783
 Project: Hadoop YARN
  Issue Type: Bug
Affects Versions: 2.1.0-beta
Reporter: Binglin Chang


The /logs page links on the RM and NM web servers lead to the main default 
apps page.



[jira] [Created] (YARN-784) YARN does not provide Version info in JMX like hdfs and mapreducev1

2013-06-07 Thread Binglin Chang (JIRA)
Binglin Chang created YARN-784:
--

 Summary: YARN does not provide Version info in JMX like hdfs and 
mapreducev1
 Key: YARN-784
 URL: https://issues.apache.org/jira/browse/YARN-784
 Project: Hadoop YARN
  Issue Type: Improvement
Reporter: Binglin Chang
Priority: Minor


Some third-party tools may still need this version info in JMX. It would be 
better to add it for backward compatibility.



[jira] [Updated] (YARN-784) YARN does not provide Version info in JMX like hdfs and mapreducev1

2013-06-07 Thread Binglin Chang (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Binglin Chang updated YARN-784:
---

Affects Version/s: 2.1.0-beta

 YARN does not provide Version info in JMX like hdfs and mapreducev1
 ---

 Key: YARN-784
 URL: https://issues.apache.org/jira/browse/YARN-784
 Project: Hadoop YARN
  Issue Type: Improvement
Affects Versions: 2.1.0-beta
Reporter: Binglin Chang
Priority: Minor

 Some third-party tools may still need this version info in JMX. It would be 
 better to add it for backward compatibility.



[jira] [Commented] (YARN-783) RM and NM web server /logs pages link not working

2013-06-07 Thread Binglin Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13678112#comment-13678112
 ] 

Binglin Chang commented on YARN-783:


The problem seems to exist only in the extra jetty context roots, like /logs/ 
and /static/. Non-root paths, like /logs/userlogs/, work fine.

 RM and NM web server /logs pages link not working
 -

 Key: YARN-783
 URL: https://issues.apache.org/jira/browse/YARN-783
 Project: Hadoop YARN
  Issue Type: Bug
Affects Versions: 2.1.0-beta
Reporter: Binglin Chang

 The /logs page links on the RM and NM web servers lead to the main default 
 apps page.
