Re: problems when I use hadoop2.6

2014-10-30 Thread Rui Zhang
asspath or something similar. On 29 October 2014 15:24, Rui Zhang wrote: Hi, I am using the hadoop2.6 I compiled myself. When I run exists command, it has this error. What's the problem? I have included all the libraries in the slider_client.xml. ~/Slider_Vertica/Linux64/slider-0.51.0/bin/

Re: application not started when I kill a nodemanager manually

2014-10-29 Thread Rui Zhang
02:48 PM, Billie Rinaldi wrote: Do your nodes have enough resources for all of the requested components to start? On Tue, Oct 28, 2014 at 11:40 AM, Rui Zhang wrote: Made the fix but still cannot make it. Actually, the steps to reproduce in SLIDER-439 is different from mine. What I do is firs

problems when I use hadoop2.6

2014-10-29 Thread Rui Zhang
at org.apache.slider.core.main.ServiceLauncher.serviceMain(ServiceLauncher.java:628) at org.apache.slider.Slider.main(Slider.java:49) 2014-10-29 11:09:28,604 [main] INFO util.ExitUtil - Exiting with status 56 Thanks, Rui -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

hadoop2.6

2014-10-28 Thread Rui Zhang
The latest slider requires hadoop2.6. I found it is not released. How can I get the hadoop2.6 tar ball now? Should I compile the source code myself or can you please provide me the tar ball? Thanks, Rui -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Re: application not started when I kill a nodemanager manually

2014-10-28 Thread Rui Zhang
It seems I didn't use the new one after compilation. Forget my question. I am trying the new one. On 10/28/2014 02:40 PM, Rui Zhang wrote: Made the fix but still cannot make it. Actually, the steps to reproduce in SLIDER-439 is different from mine. What I do is first use "freeze&qu

Re: application not started when I kill a nodemanager manually

2014-10-28 Thread Rui Zhang
RY_CONTAINER for Container container_1414519516219_0002_01_000003 Thanks, Rui On 10/28/2014 01:47 PM, Sumit Mohanty wrote: There is a bug fix that went in few days back - https://issues.apache.org/jira/browse/SLIDER-439 - that specifically fixed this issue. thanks -Sumit On Tue, Oct 28,

application not started when I kill a nodemanager manually

2014-10-28 Thread Rui Zhang
Hi, When I killed a node manager manually and restart the application, it seems that an instance previously ran on that node manager is not able to restart. Why is this? I think Yarn should allocate a container on a different machine for this instance, right? Thanks, Rui -- Rui Zhang

One question about flex method

2014-10-16 Thread Rui Zhang
Hi. everyone. When the new size is less than the original size in the flex command, how to determine which nodes to stop? Thanks. -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Re: ssl error after changing to slider-dev branch

2014-08-20 Thread Rui Zhang
PM, Rui Zhang wrote: I found this in the slider-out log: Extension ec_point_formats, formats: [uncompressed, ansiX962_compressed_prime, ansiX962_compressed_char2] Extension elliptic_curves, curve names: {sect571r1, sect571k1, secp521r1, sect409k1, sect409r1, secp384r1, sect283k1, sect283r1

Re: ssl error after changing to slider-dev branch

2014-08-20 Thread Rui Zhang
. Thanks, Rui On 08/20/2014 01:59 PM, Jon Maron wrote: "jvm.heapsize": "256M", "jvm.opts": "-Djavax.net.debug=all" -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Re: ssl error after changing to slider-dev branch

2014-08-20 Thread Rui Zhang
bound to port 33238 If it’s an attempt to that latter port then you apparently haven’t picked up the fix for SLIDER-333. — Jon On Aug 20, 2014, at 12:17 PM, Rui Zhang wrote: Hi, Jon, I tried the new version but the error still exists. I attached my slider-client.xml. Is there something

Re: ssl error after changing to slider-dev branch

2014-08-20 Thread Rui Zhang
. You may want to checkout the latest from develop branch and see if that works better. — Jon On Aug 19, 2014, at 4:41 PM, Rui Zhang wrote: Log attached. This is generated when I run the command logger example. On 08/19/2014 03:38 PM, Jon Maron wrote: I guess send the full agent and AM logs

Re: ssl error after changing to slider-dev branch

2014-08-19 Thread Rui Zhang
Log attached. This is generated when I run the command logger example. On 08/19/2014 03:38 PM, Jon Maron wrote: I guess send the full agent and AM logs - I just this morning setup a cluster with no issue (admittedly this was on centos 6.4) — Jon On Aug 19, 2014, at 3:20 PM, Rui Zhang wrote

Re: ssl error after changing to slider-dev branch

2014-08-19 Thread Rui Zhang
rnoon (finishing up my current patch) and see if I can recreate the issue. — Jon On Aug 19, 2014, at 2:52 PM, Rui Zhang wrote: Tried so many methods. Changing the signature algorithm to sha256 in the java code and adding the cert to trusted list. All does not work and the same error.

Re: ssl error after changing to slider-dev branch

2014-08-19 Thread Rui Zhang
2014, at 12:05 PM, Rui Zhang wrote: Having done all of these but still got this error. It also says that it is not verified when I opened the link in the browser. Maybe there is some issue with my openssl. I will try to solve and report to you my progress. Thanks. On 08/15/2014 11:17 AM

Re: ssl error after changing to slider-dev branch

2014-08-15 Thread Rui Zhang
has been modified to communicate via SSL. That code is downloaded to each launched container from /slider/agent HDFS folder (slider-agent.tar.gz). If you have installed an up to date version of slider you’ll need to update that file in HDFS. -- Rui Zhang Software engineer Intern Vertica, an HP

Re: ssl error after changing to slider-dev branch

2014-08-15 Thread Rui Zhang
: On 15 August 2014 06:59, Rui Zhang wrote: Exception in thread "main" java.lang.NoClassDefFoundError: com/codahale/metrics/MetricRegistry at org.apache.slider.server.appmaster.SliderAppMaster.< clinit>(SliderAppMaster.java:206) Caused by: java.lang.ClassN

Re: ssl error after changing to slider-dev branch

2014-08-14 Thread Rui Zhang
ible to start with a fresh installation directory and HDFS resources? The default communication mechanism was changed to SSL, and there have been updates to accommodate that update across the board. Going Mobile On Aug 14, 2014, at 4:18 PM, Rui Zhang wrote: Yes. I removed all the f

Re: ssl error after changing to slider-dev branch

2014-08-14 Thread Rui Zhang
Yes. I removed all the files in the original directory and clone the new branch and then compiled. On 08/14/2014 04:15 PM, Jon Maron wrote: Also, can you describe the upgrade process? Did you reinstall or just update the core jar? Going Mobile On Aug 14, 2014, at 3:54 PM, Rui Zhang

Re: ssl error after changing to slider-dev branch

2014-08-14 Thread Rui Zhang
trace from either the agent logs or the application master log? Thanks! Going Mobile On Aug 14, 2014, at 3:54 PM, Rui Zhang wrote: Hi, everyone, I have changed to the dev branch but now it has this error. ERROR 2014-08-14 15:52:28,244 NetUtil.py:52 - [Errno 8] _ssl.c:510: EOF occurred in

ssl error after changing to slider-dev branch

2014-08-14 Thread Rui Zhang
. How to solve it? Thanks -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

[jira] [Commented] (SLIDER-239) Component instances should be able to advertise associated ports

2014-08-06 Thread Rui Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SLIDER-239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14088279#comment-14088279 ] Rui Zhang commented on SLIDER-239: -- Thanks. But how to get the port number in the py

When the stop method and restart method will be called in the python agents

2014-07-30 Thread Rui Zhang
also found I can override the function restart, and when is restart method called? Thanks, -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Role History Question

2014-07-30 Thread Rui Zhang
those machines if the role history does not ensure this, we will lose data when restarting. Thanks, -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Re: Start a role always IN_PROGRESS status

2014-07-24 Thread Rui Zhang
tput=False, *wait_for_finish=False* ) See the memecached sample I shared. How does your start() implementation look like? -Sumit On Thu, Jul 24, 2014 at 1:56 PM, Rui Zhang wrote: Actually Vertica is not killed by the script and run perfectly. But Slider will think that Vertica is killed and try a

Re: Start a role always IN_PROGRESS status

2014-07-24 Thread Rui Zhang
Actually Vertica is not killed by the script and run perfectly. But Slider will think that Vertica is killed and try again. On 07/24/2014 04:52 PM, Rui Zhang wrote: Hi, thanks for your reply. I know why it hangs. I have read the code of agent/PythonExecutor.py It blocks in the line

Re: Start a role always IN_PROGRESS status

2014-07-24 Thread Rui Zhang
where its enclosed within "". http://slider.incubator.apache.org/docs/slider_specs/hello_world_slider_app.html is a work-in-progress doc for creating an application package which will provide you the details. -Sumit On Thu, Jul 24, 2014 at 12:26 PM, Rui Zhang wrote: Hi, I can start my

Start a role always IN_PROGRESS status

2014-07-24 Thread Rui Zhang
Hi, I can start my own created package now. But it always stay IN_PROGRESS status according to the log and after a long time it shows Failed. However, my application runs perfectly without any errors. Why is this? How to determine the start is completed or not? Thanks. -- Rui Zhang

Re: What is the status used for?

2014-07-23 Thread Rui Zhang
I have checked the agent.log. It is not a failure of Status. It's a failure when start each role. But they actually have started normally without any error. On 07/23/2014 06:13 PM, Rui Zhang wrote: And actually my application has already been started and runs perfectly. On 07/23/2014

Re: What is the status used for?

2014-07-23 Thread Rui Zhang
And actually my application has already been started and runs perfectly. On 07/23/2014 06:02 PM, Rui Zhang wrote: Hi, I have commented out the status code in the python agents. But the status action will always lead to a failure after a couple of of IN PROGRESS. What does the status used

What is the status used for?

2014-07-23 Thread Rui Zhang
Hi, I have commented out the status code in the python agents. But the status action will always lead to a failure after a couple of of IN PROGRESS. What does the status used for? How can I avoid the failure? Thanks, -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh

Convenient way to know the number of running instances for a role

2014-07-18 Thread Rui Zhang
Hi, Is there a convenient way to know the number of running instances for a role? Such as REST APIs? Thanks -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Re: Port number issues

2014-07-17 Thread Rui Zhang
u have any other specific scenario that would benefit from preferred port? thanks Sumit On Wed, Jul 16, 2014 at 8:51 AM, Rui Zhang wrote: One more question, if I understand correctly, I can tell the registry I want a port number and it will tell me the port number allocated to me. And how can I

Re: Port number issues

2014-07-16 Thread Rui Zhang
One more question, if I understand correctly, I can tell the registry I want a port number and it will tell me the port number allocated to me. And how can I publish my configurations? One more thing, can I tell the registry my preferred port numbers? On 07/16/2014 11:28 AM, Rui Zhang wrote

Re: Port number issues

2014-07-16 Thread Rui Zhang
:11 AM, Rui Zhang wrote: My command should set the port number for my application. So if Slider allocates the port number for it, I don't need to set it in the python script any more, right? Vertica nodes are peers and can all be connected by clients. The dba needs to know the port numb

[jira] [Created] (SLIDER-239) Add port number advertising feature

2014-07-16 Thread Rui Zhang (JIRA)
Rui Zhang created SLIDER-239: Summary: Add port number advertising feature Key: SLIDER-239 URL: https://issues.apache.org/jira/browse/SLIDER-239 Project: Slider Issue Type: Wish

Re: Port number issues

2014-07-16 Thread Rui Zhang
ility. -Sumit On Tue, Jul 15, 2014 at 12:24 PM, Rui Zhang wrote: Hi, If Slider give me containers on the same machine for the same role of my application. Is it possible for the python agents in Slider to know there are the same roles that are going to be started in the same machine and avoid the

Port number issues

2014-07-15 Thread Rui Zhang
running, I can check by catching the exception easily. What if multiple same roles are starting simultaneously? Thanks, -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Questions about the containers and processes

2014-07-08 Thread Rui Zhang
machine A. Will these processes be running in the container on machine A or not? Thanks -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Re: error when I run flex

2014-07-08 Thread Rui Zhang
It should fail because I run on the same machine with the same port number but why this error? On 07/08/2014 10:11 AM, Rui Zhang wrote: ./slider flex cl3 --component [component name] 2 --filesystem hdfs://localhost:9000/ And the component name is the one in resource.json. On 07/07/2014 08

Re: error when I run flex

2014-07-08 Thread Rui Zhang
./slider flex cl3 --component [component name] 2 --filesystem hdfs://localhost:9000/ And the component name is the one in resource.json. On 07/07/2014 08:05 PM, Sumit Mohanty wrote: What is the exact command you used? On Mon, Jul 7, 2014 at 1:55 PM, Rui Zhang wrote: Hi, I have tried to

error when I run flex

2014-07-07 Thread Rui Zhang
s the problem? I am using slider-0.3.0. Thanks, -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Re: IP address of containers

2014-07-07 Thread Rui Zhang
17:34, Rui Zhang wrote: Hi, How can I know the ip address or hostname of containers after they are allocated? Because if I want to start an distributed application, I need to set the host names. YARN chooses where to run them -you can get the list of the containers via the web UI/REST API once

IP address of containers

2014-07-07 Thread Rui Zhang
by Yarn? Thanks, Rui -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Changing cluster size?

2014-07-02 Thread Rui Zhang
Hi, Is there a way to change the size of an application cluster in Slider? For example, change the number of nodes of existing HBase cluster running on Slider. Thanks, Rui -- Rui Zhang Software engineer Intern Vertica, an HP Company rzh...@vertica.com

Some tutorials to run my own apps on Slider

2014-06-20 Thread Rui Zhang
Hi, I have just ran HBase on Slider successfully. Now I want to run my own app on Slider such as Vertica, is there some tutorials for me to do this? What is the start point? Thanks, Rui

Re: HBase example run

2014-06-20 Thread Rui Zhang
etained at directory identified by "yarn.nodemanager.log-dirs". You can look for component instance logs @ a path similar to ./log/application_1403274157370_0002/container_1403274157370_0002_01_09/app/log/hbase-yarn-master-c6401.ambari.apache.org.log -Sumit On Fri, Jun 20, 2014 at 9:3

Re: HBase example run

2014-06-20 Thread Rui Zhang
l.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:442) at org.apache.hadoop.yarn.client.api.async.impl.AMRMClientAsyncImpl$CallbackHandlerThread.run(AMRMClientAsyncImpl.java:275) On 06/20/2014 11:55 AM, Ted Yu wrote: Let us know if you encounter any other error. Cheers On Fri

Re: HBase example run

2014-06-20 Thread Rui Zhang
I am stupid. There is a punctuation error in this configuration file. Sorry about that. Thanks. Billie and Ted. On 06/20/2014 11:13 AM, Rui Zhang wrote: My slider-client.xml file is like this: yarn.application.classpath /opt/hadoop/etc/hadoop,/opt/hadoop/*,/opt/hadoop/lib/*,/opt

Re: HBase example run

2014-06-20 Thread Rui Zhang
r/lib/hadoop-mapreduce/*,/usr/lib/hadoop-mapreduce/lib/* On Thu, Jun 19, 2014 at 3:27 PM, Rui Zhang wrote: Now there is a different error after I set the SLIDER_CLASSPATH. 14/06/19 18:24:06 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java cl

Re: HBase example run

2014-06-19 Thread Rui Zhang
PATH_EXTRA=$HADOOP_CONF_DIR` On 6/19/14, 2:49 PM, Rui Zhang wrote: Hi, I am new to Slider and try to run the HBase example under your instruction. But I have met an error, the logs for the container are: OpenJDK 64-Bit Server VM warning: You have loaded library /opt/hadoop/lib/native/libhadoop.so.1

Re: HBase example run

2014-06-19 Thread Rui Zhang
Yes. I am using 0.30. Is the version a problem? On 06/19/2014 05:58 PM, Ted Yu wrote: Looks like you were using 0.30 Can you build from tip of develop branch ? You would get 0.31.0-incubating-SNAPSHOT. Thanks On Thu, Jun 19, 2014 at 2:49 PM, Rui Zhang wrote: Hi, I am new to Slider and

HBase example run

2014-06-19 Thread Rui Zhang
Hi, I am new to Slider and try to run the HBase example under your instruction. But I have met an error, the logs for the container are: OpenJDK 64-Bit Server VM warning: You have loaded library /opt/hadoop/lib/native/libhadoop.so.1.0.0 which might have disabled stack guard. The VM will try t