----- Original Message -----
> From: "Luke Meyer" <lme...@redhat.com>
> To: "Matt Wringe" <mwri...@redhat.com>
> Cc: "Srinivas Naga Kotaru (skotaru)" <skot...@cisco.com>, "users" 
> <users@lists.openshift.redhat.com>
> Sent: Tuesday, June 14, 2016 5:10:05 PM
> Subject: Re: Metrics deployment
> 
> The readiness probe status seems like an important indicator to me:
> 
> Readiness probe failed: cat: /etc/ld.so.conf.d/*.conf: No such file or
> directory
> 
> What could cause that failure? Or is that a red herring...

red herring. Its on the readiness probe and happens when the readiness probe 
should be failing for Cassandra. It looks like we have a bug in our readiness 
probe script, but its benign and can be ignored.

> 
> On Tue, Jun 14, 2016 at 1:53 PM, Matt Wringe <mwri...@redhat.com> wrote:
> 
> > ----- Original Message -----
> > > From: "Srinivas Naga Kotaru (skotaru)" <skot...@cisco.com>
> > > To: "Matt Wringe" <mwri...@redhat.com>
> > > Cc: users@lists.openshift.redhat.com
> > > Sent: Tuesday, June 14, 2016 1:37:01 PM
> > > Subject: Re: Metrics deployment
> > >
> > > I removed readiness probes from both hawkular-cassandra-1 &
> > hawkular-metrics
> > > as both status shows probes failed.
> >
> > You should not have to remove the probes, this indicates that something is
> > wrong with your installation.
> >
> > >
> > > It looks good now. Both containers looks and running
> > > (hawkular-cassandra-1-kr8ka , hawkular-metrics-vhe3u) however
> > heapster-7yl34
> > > logs still shows Could not connect to
> > > https://hawkular-metrics:443/hawkular/metrics/status. Curl exit code: 6.
> > > Status Code 000.
> > >
> > > Are we good or still had issues?
> > >
> > >
> > > # oc get pods
> > > NAME                         READY     STATUS    RESTARTS   AGE
> > > hawkular-cassandra-1-kr8ka   1/1       Running   0          6m
> > > hawkular-metrics-vhe3u       1/1       Running   2          5m
> > > heapster-7yl34               0/1       Running   2          5m
> > >
> > >
> > >
> > >
> > >
> > > --
> > > Srinivas Kotaru
> > >
> > > On 6/14/16, 10:07 AM, "Srinivas Naga Kotaru (skotaru)" <
> > skot...@cisco.com>
> > > wrote:
> > >
> > > >Matt
> > > >
> > > >Just want to share more info by running describe pod.
> > > >
> > > >It seems to be health probe failing. Do you think it is the issue?
> > > >
> > > >
> > > >
> > > ># oc describe pod hawkular-cassandra-1-it5uh
> > > >Name:                hawkular-cassandra-1-it5uh
> > > >Namespace:   openshift-infra
> > > >Node:                l3inpn-id2-003.cisco.com/173.36.96.16
> > > >Start Time:  Tue, 14 Jun 2016 16:36:21 +0000
> > > >Labels:
> > > >
> >  
> > metrics-infra=hawkular-cassandra,name=hawkular-cassandra-1,type=hawkular-cassandra
> > > >Status:              Running
> > > >IP:          10.1.9.2
> > > >Controllers: ReplicationController/hawkular-cassandra-1
> > > >Containers:
> > > >  hawkular-cassandra-1:
> > > >    Container ID:
> > > >
> >  docker://17a9575eb655145859a9207f5c4bde7456f947e27188a056ff2bd08c4ce6ae5d
> > > >    Image:
> > registry.access.redhat.com/openshift3/metrics-cassandra:latest
> > > >    Image ID:
> > > >
> >  docker://ee2117c9848298ca5a0cbbce354fd4adff370435225324ab9d60cd9cd9a95c53
> > > >    Ports:           9042/TCP, 9160/TCP, 7000/TCP, 7001/TCP
> > > >    Command:
> > > >      /opt/apache-cassandra/bin/cassandra-docker.sh
> > > >      --cluster_name=hawkular-metrics
> > > >      --data_volume=/cassandra_data
> > > >      --internode_encryption=all
> > > >      --require_node_auth=true
> > > >      --enable_client_encryption=true
> > > >      --require_client_auth=true
> > > >      --keystore_file=/secret/cassandra.keystore
> > > >      --keystore_password_file=/secret/cassandra.keystore.password
> > > >      --truststore_file=/secret/cassandra.truststore
> > > >      --truststore_password_file=/secret/cassandra.truststore.password
> > > >      --cassandra_pem_file=/secret/cassandra.pem
> > > >    QoS Tier:
> > > >      cpu:           BestEffort
> > > >      memory:                BestEffort
> > > >    State:           Running
> > > >      Started:               Tue, 14 Jun 2016 16:37:01 +0000
> > > >    Ready:           True
> > > >    Restart Count:   0
> > > >    Readiness:               exec
> > [/opt/apache-cassandra/bin/cassandra-docker-ready.sh]
> > > >    delay=0s timeout=1s period=10s #success=1 #failure=3
> > > >    Environment Variables:
> > > >      CASSANDRA_MASTER:      true
> > > >      POD_NAMESPACE: openshift-infra (v1:metadata.namespace)
> > > >Conditions:
> > > >  Type               Status
> > > >  Ready      True
> > > >Volumes:
> > > >  cassandra-data:
> > > >    Type:    PersistentVolumeClaim (a reference to a
> > PersistentVolumeClaim in
> > > >    the same namespace)
> > > >    ClaimName:       metrics-cassandra-1
> > > >    ReadOnly:        false
> > > >  hawkular-cassandra-secrets:
> > > >    Type:    Secret (a volume populated by a Secret)
> > > >    SecretName:      hawkular-cassandra-secrets
> > > >  cassandra-token-4urfd:
> > > >    Type:    Secret (a volume populated by a Secret)
> > > >    SecretName:      cassandra-token-4urfd
> > > >Events:
> > > >  FirstSeen  LastSeen        Count   From
> >       SubobjectPath                           Type            Reason
> >   Message
> > > >  ---------  --------        -----   ----
> >       -------------                           --------        ------
> > > >             -------
> > > >  27m                27m             1       {default-scheduler }
> >                                                       Normal
> > Scheduled       Successfully
> > > >  assigned hawkular-cassandra-1-it5uh to l3inpn-id2-003.cisco.com
> > > >  27m                27m             1       {kubelet
> > l3inpn-id2-003.cisco.com}
> > > >     spec.containers{hawkular-cassandra-1}   Normal          Pulling
> >      pulling image
> > > >  "registry.access.redhat.com/openshift3/metrics-cassandra:latest"
> > > >  27m                27m             1       {kubelet
> > l3inpn-id2-003.cisco.com}
> > > >     spec.containers{hawkular-cassandra-1}   Normal          Pulled
> >       Successfully pulled
> > > >  image "registry.access.redhat.com/openshift3/metrics-cassandra:latest
> > "
> > > >  27m                27m             1       {kubelet
> > l3inpn-id2-003.cisco.com}
> > > >     spec.containers{hawkular-cassandra-1}   Normal          Created
> >      Created container
> > > >  with docker id 17a9575eb655
> > > >  27m                27m             1       {kubelet
> > l3inpn-id2-003.cisco.com}
> > > >     spec.containers{hawkular-cassandra-1}   Normal          Started
> >      Started container
> > > >  with docker id 17a9575eb655
> > > >  27m                26m             3       {kubelet
> > l3inpn-id2-003.cisco.com}
> > > >     spec.containers{hawkular-cassandra-1}   Warning         Unhealthy
> >      Readiness probe
> > > >  failed: cat: /etc/ld.so.conf.d/*.conf: No such file or directory
> > > >nodetool: Failed to connect to '127.0.0.1:7199' - ConnectException:
> > > >'Connection refused'.
> > > >Cassandra not in the up and normal state. Current state is
> > > >/opt/apache-cassandra/bin/cassandra-docker-ready.sh: line 28: [: =:
> > unary
> > > >operator expected
> > > >
> > > >
> > > >
> > > >
> > > >
> > > >--
> > > >Srinivas Kotaru
> > > >
> > > >On 6/14/16, 10:00 AM, "Srinivas Naga Kotaru (skotaru)" <
> > skot...@cisco.com>
> > > >wrote:
> > > >
> > > >>Matt
> > > >>
> > > >>DNS service is working
> > > >>
> > > >>
> > > >>~ dig +short @master-01 kubernetes.default.svc.cluster.local
> > > >>172.30.0.1
> > > >>~  dig +short @master-01  jenkins.alln-test.svc.cluster.local
> > > >>172.30.85.148
> > > >>~  dig +short @master-01 cakephp-example.alln-test.svc.cluster.local
> > > >>172.30.31.6
> > > >>
> > > >>I captured hawkular-metrics, it shows the problem. It seems to be
> > unable to
> > > >>connect Cassandra cluster
> > > >>
> > > >># oc exec hawkular-metrics-qldd8 cat /opt/eap/standalone/log/server.log
> > > >>
> > > >>12:40:58,848 INFO  [org.jboss.as.clustering.infinispan] (ServerService
> > > >>Thread Pool -- 31) JBAS010280: Activating Infinispan subsystem.
> > > >>12:40:58,898 INFO  [org.jboss.as.webservices] (ServerService Thread
> > Pool --
> > > >>48) JBAS015537: Activating WebServices Extension
> > > >>12:40:58,901 WARN  [org.jboss.as.txn] (ServerService Thread Pool -- 46)
> > > >>JBAS010153: Node identifier property is set to the default value.
> > Please
> > > >>make sure it is unique.
> > > >>12:40:58,902 INFO  [org.jboss.as.security] (ServerService Thread Pool
> > --
> > > >>44) JBAS013371: Activating Security Subsystem
> > > >>12:40:58,912 INFO  [org.jboss.as.naming] (ServerService Thread Pool --
> > 39)
> > > >>JBAS011800: Activating Naming Subsystem
> > > >>12:40:58,928 INFO  [org.jboss.as.jsf] (ServerService Thread Pool -- 37)
> > > >>JBAS012615: Activated the following JSF Implementations: [main, 1.2]
> > > >>12:40:58,939 INFO  [org.jboss.as.connector.logging] (MSC service thread
> > > >>1-2) JBAS010408: Starting JCA Subsystem (IronJacamar
> > > >>1.0.32.Final-redhat-1)
> > > >>12:40:58,981 INFO  [org.jboss.as.security] (MSC service thread 1-2)
> > > >>JBAS013370: Current PicketBox version=4.1.1.Final-redhat-1
> > > >>12:40:58,961 INFO  [org.jboss.as.connector.subsystems.datasources]
> > > >>(ServerService Thread Pool -- 27) JBAS010403: Deploying JDBC-compliant
> > > >>driver class org.h2.Driver (version 1.3)
> > > >>12:40:59,039 INFO  [org.jboss.as.naming] (MSC service thread 1-2)
> > > >>JBAS011802: Starting Naming Service
> > > >>12:40:59,052 INFO  [org.jboss.as.mail.extension] (MSC service thread
> > 1-4)
> > > >>JBAS015400: Bound mail session [java:jboss/mail/Default]
> > > >>12:40:59,614 INFO  [org.apache.coyote.http11.Http11Protocol] (MSC
> > service
> > > >>thread 1-1) JBWEB003001: Coyote HTTP/1.1 initializing on :
> > > >>http-/0.0.0.0:8444
> > > >>12:40:59,620 INFO  [org.apache.coyote.http11.Http11Protocol] (MSC
> > service
> > > >>thread 1-2) JBWEB003001: Coyote HTTP/1.1 initializing on :
> > > >>http-/0.0.0.0:8080
> > > >>12:40:59,621 INFO  [org.apache.coyote.http11.Http11Protocol] (MSC
> > service
> > > >>thread 1-1) JBWEB003000: Coyote HTTP/1.1 starting on: http-/
> > 0.0.0.0:8444
> > > >>12:40:59,627 INFO  [org.apache.coyote.http11.Http11Protocol] (MSC
> > service
> > > >>thread 1-2) JBWEB003000: Coyote HTTP/1.1 starting on: http-/
> > 0.0.0.0:8080
> > > >>12:40:59,654 INFO  [org.jboss.as.connector.subsystems.datasources] (MSC
> > > >>service thread 1-3) JBAS010400: Bound data source
> > > >>[java:jboss/datasources/ExampleDS]
> > > >>12:40:59,985 INFO  [org.jboss.as.server.deployment] (MSC service thread
> > > >>1-2) JBAS015876: Starting deployment of "activemq-rar.rar"
> > (runtime-name:
> > > >>"activemq-rar.rar")
> > > >>12:40:59,986 INFO  [org.jboss.as.server.deployment] (MSC service thread
> > > >>1-1) JBAS015876: Starting deployment of
> > "hawkular-metrics-api-jaxrs.war"
> > > >>(runtime-name: "hawkular-metrics-api-jaxrs.war")
> > > >>12:40:59,987 INFO  [org.jboss.as.server.deployment.scanner] (MSC
> > service
> > > >>thread 1-3) JBAS015012: Started FileSystemDeploymentService for
> > directory
> > > >>/opt/eap/standalone/deployments
> > > >>12:41:00,011 INFO  [org.jboss.as.remoting] (MSC service thread 1-2)
> > > >>JBAS017100: Listening on 127.0.0.1:9999
> > > >>12:41:00,030 WARN  [org.jboss.as.domain.http.api] (MSC service thread
> > 1-3)
> > > >>JBAS015102: Unable to load console module for slot eap, disabling
> > console
> > > >>12:41:00,042 INFO  [org.jboss.ws.common.management] (MSC service thread
> > > >>1-4) JBWS022052: Starting JBoss Web Services - Stack CXF Server
> > > >>4.3.5.Final-redhat-3
> > > >>12:41:00,045 INFO  [org.jboss.as.remoting] (MSC service thread 1-4)
> > > >>JBAS017100: Listening on 0.0.0.0:4447
> > > >>12:41:01,361 INFO  [org.jboss.as.connector.deployers.RADeployer] (MSC
> > > >>service thread 1-3) IJ020001: Required license terms for
> > >
> > >>file:/opt/eap/standalone/tmp/vfs/temp/temp912bb9d1372c33d3/activemq-rar.rar-a722089a00658182/contents/
> > > >>12:41:01,784 WARN  [org.jboss.as.dependency.private] (MSC service
> > thread
> > > >>1-1) JBAS015867: Deployment
> > "deployment.hawkular-metrics-api-jaxrs.war" is
> > > >>using a private module ("org.codehaus.jackson.jackson-core-asl:main")
> > > >>which may be changed or removed in future versions without notice.
> > > >>12:41:01,785 WARN  [org.jboss.as.dependency.private] (MSC service
> > thread
> > > >>1-1) JBAS015867: Deployment
> > "deployment.hawkular-metrics-api-jaxrs.war" is
> > > >>using a private module ("org.codehaus.jackson.jackson-core-asl:main")
> > > >>which may be changed or removed in future versions without notice.
> > > >>12:41:01,785 WARN  [org.jboss.as.dependency.private] (MSC service
> > thread
> > > >>1-1) JBAS015867: Deployment
> > "deployment.hawkular-metrics-api-jaxrs.war" is
> > > >>using a private module ("org.codehaus.jackson.jackson-mapper-asl:main")
> > > >>which may be changed or removed in future versions without notice.
> > > >>12:41:01,785 WARN  [org.jboss.as.dependency.private] (MSC service
> > thread
> > > >>1-1) JBAS015867: Deployment
> > "deployment.hawkular-metrics-api-jaxrs.war" is
> > > >>using a private module ("org.codehaus.jackson.jackson-mapper-asl:main")
> > > >>which may be changed or removed in future versions without notice.
> > > >>12:41:01,843 INFO  [org.jboss.weld.deployer] (MSC service thread 1-1)
> > > >>JBAS016002: Processing weld deployment hawkular-metrics-api-jaxrs.war
> > > >>12:41:01,941 INFO  [org.jboss.weld.deployer] (MSC service thread 1-1)
> > > >>JBAS016005: Starting Services for CDI deployment:
> > > >>hawkular-metrics-api-jaxrs.war
> > > >>12:41:01,986 INFO  [org.jboss.weld.Version] (MSC service thread 1-1)
> > > >>WELD-000900 1.1.31 (redhat)
> > > >>12:41:02,013 INFO  [org.jboss.weld.deployer] (MSC service thread 1-3)
> > > >>JBAS016008: Starting weld service for deployment
> > > >>hawkular-metrics-api-jaxrs.war
> > > >>12:41:02,820 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:02,835 INFO  [org.jboss.web] (ServerService Thread Pool -- 51)
> > > >>JBAS018210: Register web context: /hawkular/metrics
> > > >>12:41:02,958 INFO  [org.jboss.as.server] (ServerService Thread Pool --
> > 28)
> > > >>JBAS015859: Deployed "hawkular-metrics-api-jaxrs.war" (runtime-name :
> > > >>"hawkular-metrics-api-jaxrs.war")
> > > >>12:41:02,959 INFO  [org.jboss.as.server] (ServerService Thread Pool --
> > 28)
> > > >>JBAS015859: Deployed "activemq-rar.rar" (runtime-name :
> > > >>"activemq-rar.rar")
> > > >>12:41:02,994 INFO  [org.jboss.as] (Controller Boot Thread)
> > JBAS015961: Http
> > > >>management interface listening on http://127.0.0.1:9990/management
> > > >>12:41:02,994 INFO  [org.jboss.as] (Controller Boot Thread) JBAS015951:
> > > >>Admin console listening on http://127.0.0.1:9990
> > > >>12:41:02,994 INFO  [org.jboss.as] (Controller Boot Thread) JBAS015874:
> > > >>JBoss EAP 6.4.4.GA (AS 7.5.4.Final-redhat-4) started in 6593ms -
> > Started
> > > >>267 of 303 services (62 services are lazy, passive or on-demand)
> > > >>12:41:02,999 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra:
> > unknown
> > > >>error
> > > >>12:41:02,999 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [1] Retrying
> > > >>connecting to Cassandra cluster in [1]s...
> > > >>12:41:04,000 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:04,001 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:04,001 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [2] Retrying
> > > >>connecting to Cassandra cluster in [2]s...
> > > >>12:41:06,002 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:06,002 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:06,003 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [3] Retrying
> > > >>connecting to Cassandra cluster in [3]s...
> > > >>12:41:08,203 INFO  [org.jboss.resteasy.cdi.i18n] (http-/0.0.0.0:8080
> > -2)
> > > >>RESTEASY006050: Found BeanManager at java:comp/BeanManager
> > > >>12:41:08,299 INFO  [org.hibernate.validator.internal.util.Version]
> > > >>(http-/0.0.0.0:8080-2) HV000001: Hibernate Validator
> > 4.3.2.Final-redhat-2
> > > >>12:41:08,418 INFO
> > [org.hawkular.metrics.api.jaxrs.HawkularMetricsRestApp]
> > > >>(http-/0.0.0.0:8080-2) HAWKMETRICS200001: Hawkular Metrics starting
> > > >>12:41:08,421 INFO  [org.jboss.resteasy.resteasy_jaxrs.i18n]
> > > >>(http-/0.0.0.0:8080-2) RESTEASY000320: Deploying
> > > >>javax.ws.rs.core.Application: class
> > >
> > >>org.hawkular.metrics.api.jaxrs.HawkularMetricsRestApp$Proxy$_$$_WeldClientProxy
> > > >>12:41:08,421 INFO
> > [org.hawkular.metrics.api.jaxrs.HawkularMetricsRestApp]
> > > >>(http-/0.0.0.0:8080-2) HAWKMETRICS200001: Hawkular Metrics starting
> > > >>12:41:08,422 INFO
> > [org.hawkular.metrics.api.jaxrs.HawkularMetricsRestApp]
> > > >>(http-/0.0.0.0:8080-2) Virtual clock is disabled
> > > >>12:41:08,422 INFO
> > [org.hawkular.metrics.api.jaxrs.HawkularMetricsRestApp]
> > > >>(http-/0.0.0.0:8080-2) Virtual clock is disabled
> > > >>12:41:09,003 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:09,004 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:09,004 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [4] Retrying
> > > >>connecting to Cassandra cluster in [4]s...
> > > >>12:41:13,004 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:13,123 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra:
> > unknown
> > > >>error
> > > >>12:41:13,124 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [5] Retrying
> > > >>connecting to Cassandra cluster in [1]s...
> > > >>12:41:14,124 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:14,124 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:14,125 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [6] Retrying
> > > >>connecting to Cassandra cluster in [2]s...
> > > >>12:41:16,125 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:16,126 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:16,126 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [7] Retrying
> > > >>connecting to Cassandra cluster in [3]s...
> > > >>12:41:19,126 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:19,127 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:19,127 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [8] Retrying
> > > >>connecting to Cassandra cluster in [4]s...
> > > >>12:41:23,128 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:23,249 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra:
> > unknown
> > > >>error
> > > >>12:41:23,249 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [9] Retrying
> > > >>connecting to Cassandra cluster in [1]s...
> > > >>12:41:24,250 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:24,250 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:24,250 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [10] Retrying
> > > >>connecting to Cassandra cluster in [2]s...
> > > >>12:41:26,251 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:26,251 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:26,251 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [11] Retrying
> > > >>connecting to Cassandra cluster in [3]s...
> > > >>12:41:29,252 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:29,252 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:29,253 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [12] Retrying
> > > >>connecting to Cassandra cluster in [4]s...
> > > >>12:41:33,253 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:33,376 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra:
> > unknown
> > > >>error
> > > >>12:41:33,377 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [13] Retrying
> > > >>connecting to Cassandra cluster in [1]s...
> > > >>12:41:34,377 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:34,378 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:34,378 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [14] Retrying
> > > >>connecting to Cassandra cluster in [2]s...
> > > >>12:41:36,378 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:36,379 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:36,379 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [15] Retrying
> > > >>connecting to Cassandra cluster in [3]s...
> > > >>12:41:39,379 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:39,380 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:39,380 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [16] Retrying
> > > >>connecting to Cassandra cluster in [4]s...
> > > >>12:41:43,380 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:43,503 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra:
> > unknown
> > > >>error
> > > >>12:41:43,504 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [17] Retrying
> > > >>connecting to Cassandra cluster in [1]s...
> > > >>12:41:44,504 INFO
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200002: Initializing
> > metrics
> > > >>service
> > > >>12:41:44,505 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200003: Could not connect
> > to
> > > >>Cassandra cluster - assuming its not up yet: hawkular-cassandra
> > > >>12:41:44,505 WARN
> > [org.hawkular.metrics.api.jaxrs.MetricsServiceLifecycle]
> > > >>(metricsservice-lifecycle-thread) HAWKMETRICS200004: [18] Retrying
> > > >>connecting to Cassandra cluster in [2]s...
> > > >>
> > > >>--
> > > >>Srinivas Kotaru
> > > >>
> > > >>On 6/14/16, 6:06 AM, "Matt Wringe" <mwri...@redhat.com> wrote:
> > > >>
> > > >>>----- Original Message -----
> > > >>>> From: "Srinivas Naga Kotaru (skotaru)" <skot...@cisco.com>
> > > >>>> To: "Matt Wringe" <mwri...@redhat.com>
> > > >>>> Cc: users@lists.openshift.redhat.com
> > > >>>> Sent: Monday, June 13, 2016 7:26:06 PM
> > > >>>> Subject: Re: Metrics deployment
> > > >>>>
> > > >>>> Matt
> > > >>>>
> > > >>>> PV issue resolved. Was able to to see PV successfully bounded and
> > > >>>> Casandra
> > > >>>> container has been running. However, it seems puzzle not fully yet
> > > >>>> solved.
> > > >>>
> > > >>>Are you sure the OpenShift DNS server is running?
> > > >>>
> > > >>>If you are running OSE 3.1, can you please follow this
> > > >>>https://access.redhat.com/solutions/2329131 and see if you are now
> > seeing
> > > >>>errors in the Hawkular Metrics logs (essentially just run `oc exec
> > > >>>hawkular-metrics-xxxxx cat /opt/eap/standalone/log/server.log`)
> > > >>>
> > > >>>>
> > > >>>> I could see other container(heapster) not coming up, and seeing
> > below
> > > >>>> errors
> > > >>>>
> > > >>>> [skotaru@l3imas-id2-01 metrics]$ oc logs -f heapster-fnkdc
> > > >>>> Endpoint Check in effect. Checking
> > > >>>> https://hawkular-metrics:443/hawkular/metrics/status
> > > >>>> Could not connect to
> > > >>>> https://hawkular-metrics:443/hawkular/metrics/status.
> > > >>>> Curl exit code: 6. Status Code 000
> > > >>>> 'https://hawkular-metrics:443/hawkular/metrics/status' is not
> > accessible
> > > >>>> [HTTP status code: 000. Curl exit code 6]. Retrying.
> > > >>>> Could not connect to
> > > >>>> https://hawkular-metrics:443/hawkular/metrics/status.
> > > >>>> Curl exit code: 6. Status Code 000
> > > >>>> 'https://hawkular-metrics:443/hawkular/metrics/status' is not
> > accessible
> > > >>>> [HTTP status code: 000. Curl exit code 6]. Retrying.
> > > >>>>
> > > >>>>
> > > >>>> # oc get pv
> > > >>>> pv-5gb-0011   5Gi        RWO           Bound
> > > >>>> openshift-infra/metrics-cassandra-1             22m
> > > >>>>
> > > >>>>
> > > >>>> $ oc get pods
> > > >>>> NAME                         READY     STATUS      RESTARTS   AGE
> > > >>>> hawkular-cassandra-1-2pzd7   1/1       Running     0          20m
> > > >>>> hawkular-metrics-mf5qf       0/1       Running     7          20m
> > > >>>> heapster-fnkdc               0/1       Error       6          20m
> > > >>>> metrics-deployer-cvep0       0/1       Completed   0          21m
> > > >>>>
> > > >>>> # oc logs -f hawkular-metrics-mf5qf
> > > >>>>
> > > >>>> 19:20:00,819 INFO  [org.xnio] (MSC service thread 1-2) XNIO Version
> > > >>>> 3.0.14.GA-redhat-1
> > > >>>> 19:20:00,831 INFO  [org.jboss.as.server] (Controller Boot Thread)
> > > >>>> JBAS015888:
> > > >>>> Creating http management service using socket-binding
> > (management-http)
> > > >>>> 19:20:00,834 INFO  [org.xnio.nio] (MSC service thread 1-2) XNIO NIO
> > > >>>> Implementation Version 3.0.14.GA-redhat-1
> > > >>>> 19:20:00,844 INFO  [org.jboss.remoting] (MSC service thread 1-2)
> > JBoss
> > > >>>> Remoting version 3.3.5.Final-redhat-1
> > > >>>>
> > > >>>> $ oc logs -f heapster-fnkdc
> > > >>>> Endpoint Check in effect. Checking
> > > >>>> https://hawkular-metrics:443/hawkular/metrics/status
> > > >>>> Could not connect to
> > > >>>> https://hawkular-metrics:443/hawkular/metrics/status.
> > > >>>> Curl exit code: 6. Status Code 000
> > > >>>> 'https://hawkular-metrics:443/hawkular/metrics/status' is not
> > accessible
> > > >>>> [HTTP status code: 000. Curl exit code 6]. Retrying.
> > > >>>> Could not connect to
> > > >>>> https://hawkular-metrics:443/hawkular/metrics/status.
> > > >>>> Curl exit code: 6. Status Code 000
> > > >>>> 'https://hawkular-metrics:443/hawkular/metrics/status' is not
> > accessible
> > > >>>> [HTTP status code: 000. Curl exit code 6]. Retrying.
> > > >>>> Could not connect to
> > > >>>> https://hawkular-metrics:443/hawkular/metrics/status.
> > > >>>> Curl exit code: 6. Status Code 000
> > > >>>>
> > > >>>> $ oc logs -f hawkular-cassandra-1-2pzd7
> > > >>>> INFO  23:00:24 Starting listening for CQL clients on
> > > >>>> hawkular-cassandra-1-2pzd7/10.1.6.2:9042...
> > > >>>> INFO  23:00:24 Binding thrift service to
> > > >>>> hawkular-cassandra-1-2pzd7/10.1.6.2:9160
> > > >>>> INFO  23:00:24 enabling encrypted thrift connections between client
> > and
> > > >>>> server
> > > >>>> INFO  23:00:24 Listening for thrift clients...
> > > >>>> INFO  23:00:26 Created default superuser role 'cassandra'
> > > >>>>
> > > >>>> # oc get svc
> > > >>>> NAME                       CLUSTER-IP       EXTERNAL-IP   PORT(S)
> > > >>>> AGE
> > > >>>> hawkular-cassandra         172.30.2.13      <none>
> > > >>>> 9042/TCP,9160/TCP,7000/TCP,7001/TCP   25m
> > > >>>> hawkular-cassandra-nodes   None             <none>
> > > >>>> 9042/TCP,9160/TCP,7000/TCP,7001/TCP   25m
> > > >>>> hawkular-metrics           172.30.117.176   <none>        443/TCP
> > > >>>> 25m
> > > >>>> heapster                   172.30.107.135   <none>        80/TCP
> > > >>>> 25m
> > > >>>>
> > > >>>> #curl -I 172.30.117.176:443//hawkular/metrics/status
> > > >>>>
> > > >>>> HTTP/1.1 504 Gateway Timeout
> > > >>>> Mime-Version: 1.0
> > > >>>> Date: Mon, 13 Jun 2016 23:25:47 GMT
> > > >>>> Content-Type: text/html
> > > >>>> Connection: keep-alive
> > > >>>> Proxy-Connection: keep-alive
> > > >>>> Content-Length: 1572
> > > >>>>
> > > >>>> --
> > > >>>> Srinivas Kotaru
> > > >>>>
> > > >>>> On 6/13/16, 2:33 PM, "Srinivas Naga Kotaru (skotaru)"
> > > >>>> <skot...@cisco.com>
> > > >>>> wrote:
> > > >>>>
> > > >>>> >Matt
> > > >>>> >
> > > >>>> >That is good catch. I ran without USE_PERSISTENT_STORAGE=false and
> > > >>>> >working
> > > >>>> >
> > > >>>> >I adjusted PV to 5Gi and reran. Will update progress.
> > > >>>> >
> > > >>>> >Thanks you for your help so far.
> > > >>>> >
> > > >>>> >--
> > > >>>> >Srinivas Kotaru
> > > >>>> >
> > > >>>> >On 6/13/16, 2:27 PM, "Matt Wringe" <mwri...@redhat.com> wrote:
> > > >>>> >
> > > >>>> >>
> > > >>>> >>
> > > >>>> >>----- Original Message -----
> > > >>>> >>> From: "Srinivas Naga Kotaru (skotaru)" <skot...@cisco.com>
> > > >>>> >>> To: "Matt Wringe" <mwri...@redhat.com>
> > > >>>> >>> Cc: users@lists.openshift.redhat.com
> > > >>>> >>> Sent: Monday, June 13, 2016 5:21:01 PM
> > > >>>> >>> Subject: Re: Metrics deployment
> > > >>>> >>>
> > > >>>> >>> Oh ok
> > > >>>> >>>
> > > >>>> >>> Am using PV for metrics
> > > >>>> >>>
> > > >>>> >>> description: "The persistent volume size for each of the
> > Cassandra
> > > >>>> >>> nodes"
> > > >>>> >>>   name: CASSANDRA_PV_SIZE
> > > >>>> >>>   value: "10Gi"
> > > >>>> >>>
> > > >>>> >>> oc get pv
> > > >>>> >>> NAME          CAPACITY   ACCESSMODES   STATUS      CLAIM
> > > >>>> >>> REASON
> > > >>>> >>> AGE
> > > >>>> >>> pv-1gb-001    1Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-1gb-002    1Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-1gb-003    1Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-1gb-004    1Gi        RWO           Bound
> >  thlatt/mongodb
> > > >>>> >>> 4d
> > > >>>> >>> pv-1gb-005    1Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-2gb-0010   2Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-2gb-006    2Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-2gb-007    2Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-2gb-008    2Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-2gb-009    2Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-5gb-0011   5Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-5gb-0012   5Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-5gb-0013   5Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-5gb-0014   5Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>> pv-5gb-0015   5Gi        RWO           Available
> > > >>>> >>> 4d
> > > >>>> >>>
> > > >>>> >>> am running with below command
> > > >>>> >>>
> > > >>>> >>> $ oc new-app -f metrics-deployer.yaml  ( hardcoded HOSTNAME,
> > > >>>> >>> MASTER_API
> > > >>>> >>> and
> > > >>>> >>> PV info so not passing any parameters)
> > > >>>> >>>
> > > >>>> >>
> > > >>>> >>I would suspect that Cassandra is blocked because its waiting for
> > 10Gi
> > > >>>> >>PV
> > > >>>> >>to become available, and none of the PV listed above are big
> > enough.
> > > >>>> >>
> > > >>>> >>>
> > > >>>> >>> --
> > > >>>> >>> Srinivas Kotaru
> > > >>>> >>>
> > > >>>> >>> On 6/13/16, 2:12 PM, "Matt Wringe" <mwri...@redhat.com> wrote:
> > > >>>> >>>
> > > >>>> >>> >----- Original Message -----
> > > >>>> >>> >> From: "Srinivas Naga Kotaru (skotaru)" <skot...@cisco.com>
> > > >>>> >>> >> To: "Matt Wringe" <mwri...@redhat.com>
> > > >>>> >>> >> Cc: users@lists.openshift.redhat.com
> > > >>>> >>> >> Sent: Monday, June 13, 2016 4:55:55 PM
> > > >>>> >>> >> Subject: Re: Metrics deployment
> > > >>>> >>> >>
> > > >>>> >>> >> Matt
> > > >>>> >>> >>
> > > >>>> >>> >> Thanks for looking into. I rerun the setup, but had the same
> > > >>>> >>> >> issue
> > > >>>> >>> >>
> > > >>>> >>> >> # oc get pods
> > > >>>> >>> >> NAME                         READY     STATUS
> > > >>>> >>> >> RESTARTS
> > > >>>> >>> >> AGE
> > > >>>> >>> >> hawkular-cassandra-1-y2egy   0/1       ContainerCreating   0
> > > >>>> >>> >> 5m
> > > >>>> >>> >> hawkular-metrics-4b16f       0/1       Running             1
> > > >>>> >>> >> 4m
> > > >>>> >>> >> heapster-x2gj2               0/1       Running             2
> > > >>>> >>> >> 4m
> > > >>>> >>> >> metrics-deployer-9v7vc       0/1       Completed           0
> > > >>>> >>> >> 6m
> > > >>>> >>> >>
> > > >>>> >>> >> $ oc logs -f hawkular-cassandra-1-y2egy
> > > >>>> >>> >> Error from server: container "hawkular-cassandra-1" in pod
> > > >>>> >>> >> "hawkular-cassandra-1-y2egy" is waiting to start:
> > > >>>> >>> >> ContainerCreating
> > > >>>> >>> >
> > > >>>> >>> >Ok, so it looks like something is blocking the Cassandra pod
> > from
> > > >>>> >>> >starting.
> > > >>>> >>> >
> > > >>>> >>> >If you are using persistent storage, Cassandra will not start
> > until
> > > >>>> >>> >the
> > > >>>> >>> >PV
> > > >>>> >>> >is available. There may be some more information about
> > Cassandra in
> > > >>>> >>> >the
> > > >>>> >>> >pod
> > > >>>> >>> >section of the console under events.
> > > >>>> >>> >
> > > >>>> >>> >What command did you use when deploying the deployer?
> > > >>>> >>> >
> > > >>>> >>> >>
> > > >>>> >>> >> $ oc logs -f hawkular-metrics-4b16f
> > > >>>> >>> >>
> > > >>>> >>> >> 16:54:25,703 DEBUG [org.jboss.as.config] (MSC service thread
> > 1-4)
> > > >>>> >>> >> VM
> > > >>>> >>> >> Arguments: -Duser.home=/home/jboss -Duser.name=jboss
> > > >>>> >>> >> -D[Standalone]
> > > >>>> >>> >> -XX:+UseCompressedOops -verbose:gc
> > > >>>> >>> >> -Xloggc:/opt/eap/standalone/log/gc.log
> > > >>>> >>> >> -XX:+PrintGCDetails -XX:+PrintGCDateStamps
> > > >>>> >>> >> -XX:+UseGCLogFileRotation
> > > >>>> >>> >> -XX:NumberOfGCLogFiles=5 -XX:GCLogFileSize=3M
> > > >>>> >>> >> -XX:-TraceClassUnloading
> > > >>>> >>> >> -Xms1303m -Xmx1303m -XX:MaxPermSize=256m
> > > >>>> >>> >> -Djava.net.preferIPv4Stack=true
> > > >>>> >>> >> -Djboss.modules.system.pkgs=org.jboss.logmanager
> > > >>>> >>> >> -Djava.awt.headless=true
> > > >>>> >>> >> -Djboss.modules.policy-permissions=true
> > > >>>> >>> >>
> > -Xbootclasspath/p:/opt/eap/jboss-modules.jar:/opt/eap/modules/system/layers/base/org/jboss/logmanager/main/jboss-logmanager-1.5.4.Final-redhat-1.jar:/opt/eap/modules/system/layers/base/org/jboss/logmanager/ext/main/javax.json-1.0.4.jar:/opt/eap/modules/system/layers/base/org/jboss/logmanager/ext/main/jboss-logmanager-ext-1.0.0.Alpha2-redhat-1.jar
> > > >>>> >>> >> -Djava.util.logging.manager=org.jboss.logmanager.LogManager
> > > >>>> >>> >>
> > -javaagent:/opt/eap/jolokia.jar=port=8778,protocol=https,caCert=/var/run/secrets/
> > kubernetes.io/serviceaccount/ca.crt,clientPrincipal=cn=system:master-proxy,useSslClientAuthentication=true,extraClientCheck=true,host=0.0.0.0,discoveryEnabled=false
> > > >>>> >>> >> -Djava.security.egd=file:/dev/./urandom
> > > >>>> >>> >> -Dorg.jboss.boot.log.file=/opt/eap/standalone/log/server.log
> > > >>>> >>> >>
> > -Dlogging.configuration=file:/opt/eap/standalone/configuration/logging.properties
> > > >>>> >>> >> 16:54:27,079 INFO  [org.xnio] (MSC service thread 1-3) XNIO
> > > >>>> >>> >> Version
> > > >>>> >>> >> 3.0.14.GA-redhat-1
> > > >>>> >>> >> 16:54:27,083 INFO  [org.xnio.nio] (MSC service thread 1-3)
> > XNIO
> > > >>>> >>> >> NIO
> > > >>>> >>> >> Implementation Version 3.0.14.GA-redhat-1
> > > >>>> >>> >> 16:54:27,101 INFO  [org.jboss.as.server] (Controller Boot
> > Thread)
> > > >>>> >>> >> JBAS015888:
> > > >>>> >>> >> Creating http management service using socket-binding
> > > >>>> >>> >> (management-http)
> > > >>>> >>> >> 16:54:27,104 INFO  [org.jboss.remoting] (MSC service thread
> > 1-3)
> > > >>>> >>> >> JBoss
> > > >>>> >>> >> Remoting version 3.3.5.Final-redhat-1
> > > >>>> >>> >>
> > > >>>> >>> >> $ oc logs -f heapster-x2gj2
> > > >>>> >>> >> Endpoint Check in effect. Checking
> > > >>>> >>> >> https://hawkular-metrics:443/hawkular/metrics/status
> > > >>>> >>> >> Could not connect to
> > > >>>> >>> >> https://hawkular-metrics:443/hawkular/metrics/status.
> > > >>>> >>> >> Curl exit code: 6. Status Code 000
> > > >>>> >>> >> 'https://hawkular-metrics:443/hawkular/metrics/status' is
> > not
> > > >>>> >>> >> accessible
> > > >>>> >>> >> [HTTP status code: 000. Curl exit code 6]. Retrying.
> > > >>>> >>> >> Could not connect to
> > > >>>> >>> >> https://hawkular-metrics:443/hawkular/metrics/status.
> > > >>>> >>> >> Curl exit code: 6. Status Code 000
> > > >>>> >>> >> 'https://hawkular-metrics:443/hawkular/metrics/status' is
> > not
> > > >>>> >>> >> accessible
> > > >>>> >>> >> [HTTP status code: 000. Curl exit code 6]. Retrying.
> > > >>>> >>> >> Could not connect to
> > > >>>> >>> >> https://hawkular-metrics:443/hawkular/metrics/status.
> > > >>>> >>> >> Curl exit code: 6. Status Code 000
> > > >>>> >>> >>
> > > >>>> >>> >>
> > > >>>> >>> >>  $ oc logs -f metrics-deployer-9v7vc
> > > >>>> >>> >>
> > > >>>> >>> >> ++ oc create -f -
> > > >>>> >>> >> serviceaccount "heapster" created
> > > >>>> >>> >> service "heapster" created
> > > >>>> >>> >> replicationcontroller "heapster" created
> > > >>>> >>> >> + echo 'Success!'
> > > >>>> >>> >> Success!
> > > >>>> >>> >>
> > > >>>> >>> >> --
> > > >>>> >>> >> Srinivas Kotaru
> > > >>>> >>> >>
> > > >>>> >>> >> On 6/13/16, 1:49 PM, "Matt Wringe" <mwri...@redhat.com>
> > wrote:
> > > >>>> >>> >>
> > > >>>> >>> >> >
> > > >>>> >>> >> >
> > > >>>> >>> >> >----- Original Message -----
> > > >>>> >>> >> >> From: "Srinivas Naga Kotaru (skotaru)" <skot...@cisco.com
> > >
> > > >>>> >>> >> >> To: users@lists.openshift.redhat.com
> > > >>>> >>> >> >> Sent: Monday, June 13, 2016 3:58:12 PM
> > > >>>> >>> >> >> Subject: Metrics deployment
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> Hi
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> Am trying to configure metrics in our newly installed
> > > >>>> >>> >> >> clusters. Am
> > > >>>> >>> >> >> seeing
> > > >>>> >>> >> >> below errors once metrics-deploy script was successful. I
> > used
> > > >>>> >>> >> >> our
> > > >>>> >>> >> >> environment specific HAWKULAR_METRICS_HOSTNAME and
> > MASTER_URL
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> # oc new-app -f metrics-deployer.yaml
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> Note: customized, CASSANDARA PV, MASTER_URL, and
> > > >>>> >>> >> >> HAWKULAR_METRICS_HOSTNAME
> > > >>>> >>> >> >> (
> > > >>>> >>> >> >> hard coded as values)
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> template "hawkular-heapster" created
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> Deploying the Heapster component
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> ++ echo 'Deploying the Heapster component'
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> ++ '[' -n '' ']'
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> ++ oc create -f -
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> ++ oc process hawkular-heapster -v
> > > >>>> >>> >> >> IMAGE_PREFIX=
> > registry.access.redhat.com/openshift3/,IMAGE_VERSION=latest,MASTER_URL=https://lae3-alln-int-idev01.cisco.com:443,NODE_ID=nodename
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> serviceaccount "heapster" created
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> service "heapster" created
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> replicationcontroller "heapster" created
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> + echo 'Success!'
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> Success!
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> # oc get pods
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> NAME READY STATUS RESTARTS AGE
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> hawkular-cassandra-1-9nzio 0/1 ContainerCreating 0 4m
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> hawkular-metrics-hi7mb 0/1 Running 1 4m
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> heapster-e8gbu 0/1 Running 2 4m
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> metrics-deployer-64703 0/1 ContainerCreating 0 3s
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> metrics-deployer-cd1nf 0/1 Completed 0 5m
> > > >>>> >>> >> >>
> > > >>>> >>> >> >
> > > >>>> >>> >> >It looks like none of your containers are fully up and
> > running
> > > >>>> >>> >> >yet.
> > > >>>> >>> >> >
> > > >>>> >>> >> >Without Cassandra running, Hawkular Metrics will not run,
> > and
> > > >>>> >>> >> >Heapster
> > > >>>> >>> >> >will
> > > >>>> >>> >> >wait until Hawkular Metrics is fully running.
> > > >>>> >>> >> >
> > > >>>> >>> >> >Do you see anything in the Cassandra logs? The first step
> > will
> > > >>>> >>> >> >be to
> > > >>>> >>> >> >get
> > > >>>> >>> >> >Cassandra running properly.
> > > >>>> >>> >> >
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> $ oc logs -f heapster-e8gbu
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> Endpoint Check in effect. Checking
> > > >>>> >>> >> >> https://hawkular-metrics:443/hawkular/metrics/status
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> Could not connect to
> > > >>>> >>> >> >> https://hawkular-metrics:443/hawkular/metrics/status.
> > > >>>> >>> >> >> Curl exit code: 6. Status Code 000
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> 'https://hawkular-metrics:443/hawkular/metrics/status'
> > is not
> > > >>>> >>> >> >> accessible
> > > >>>> >>> >> >> [HTTP status code: 000. Curl exit code 6]. Retrying.
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> Could not connect to
> > > >>>> >>> >> >> https://hawkular-metrics:443/hawkular/metrics/status.
> > > >>>> >>> >> >> Curl exit code: 6. Status Code 000
> > > >>>> >>> >> >
> > > >>>> >>> >> >Heapster waits until Hawkular Metrics is started before
> > trying
> > > >>>> >>> >> >to
> > > >>>> >>> >> >push
> > > >>>> >>> >> >metrics to it. The issue that you are seeing is because
> > Heapster
> > > >>>> >>> >> >could
> > > >>>> >>> >> >not
> > > >>>> >>> >> >properly connect to Hawkular Metrics. Until the Hawkular
> > Metrics
> > > >>>> >>> >> >service
> > > >>>> >>> >> >is
> > > >>>> >>> >> >fully up, Heapster will not be able to connect to it.
> > > >>>> >>> >> >
> > > >>>> >>> >> >
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> What is the wrong? Why it checking just hawkular-metrics
> > > >>>> >>> >> >> rather
> > > >>>> >>> >> >> full
> > > >>>> >>> >> >> routing
> > > >>>> >>> >> >> URL which was provided as HAWKULAR_METRICS_HOSTNAME
> > > >>>> >>> >> >
> > > >>>> >>> >> >The Hawkular Metrics service has two hostnames: the internal
> > > >>>> >>> >> >hostname
> > > >>>> >>> >> >used
> > > >>>> >>> >> >by the internal components (eg 'hawkular-metrics') and the
> > > >>>> >>> >> >external
> > > >>>> >>> >> >hostname (eg what is configured via
> > HAWKULAR_METRICS_HOSTNAME).
> > > >>>> >>> >> >The
> > > >>>> >>> >> >OpenShift dns server will resolve hostnames to the name of
> > > >>>> >>> >> >services,
> > > >>>> >>> >> >which
> > > >>>> >>> >> >is where the internal 'hawkular-metrics' comes from.
> > > >>>> >>> >> >
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> --
> > > >>>> >>> >> >>
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> Srinivas Kotaru
> > > >>>> >>> >> >>
> > > >>>> >>> >> >> _______________________________________________
> > > >>>> >>> >> >> users mailing list
> > > >>>> >>> >> >> users@lists.openshift.redhat.com
> > > >>>> >>> >> >>
> > http://lists.openshift.redhat.com/openshiftmm/listinfo/users
> > > >>>> >>> >> >>
> > > >>>> >>> >>
> > > >>>> >>> >>
> > > >>>> >>>
> > > >>>> >>>
> > > >>>> >
> > > >>>>
> > > >>>>
> > > >>
> > > >
> > >
> > >
> >
> > _______________________________________________
> > users mailing list
> > users@lists.openshift.redhat.com
> > http://lists.openshift.redhat.com/openshiftmm/listinfo/users
> >
> 

_______________________________________________
users mailing list
users@lists.openshift.redhat.com
http://lists.openshift.redhat.com/openshiftmm/listinfo/users

Reply via email to