subject:"strange problem\: my ES server almost lost all its data. \(All shards failed\)"

Re: strange problem: my ES server almost lost all its data. (All shards failed)

2014-05-02 Thread joergpra...@gmail.com

On FreeBSD, do you have multicast on IPv6 enabled? You should disable IPv6
on the JVM.

Seems you received a severe network error from the OS.

Jörg

On Thu, May 1, 2014 at 11:46 PM, Patrick Proniewski
elasticsea...@patpro.net wrote:

Hello,

I'm running a small server with logstash, ES, Kibana. Tonight, I've
restarted my ES process. Very bad idea: it restarted with lots of errors,
and finally lost all its data.
Basically, before restart, I've had:

elasticsearch/nodes/0/indices/logstash-2014.*
elasticsearch/nodes/0/_state/

after restart, I've had:

elasticsearch/nodes/0/indices/logstash-2014.*
elasticsearch/nodes/0/_state/
elasticsearch/nodes/1/indices/logstash-2014.05.01
elasticsearch/nodes/1/_state/

Then, Kibana was not able to find anything (dashboards lost, etc.).

I've stopped Logstash, stopped Elasticsearch, waited a bit and checked
everything is down, then restarted ES. It looked OK, then I've restarted
Logstash, and I was able to access my dashboards again. I've just lost 15
minutes of data.
Now I can see that elasticsearch/nodes/0 is the current working directory,
and I can browse old data and current data.
elasticsearch/nodes/1 is not used anymore.

I'm running FreeBSD, and used the service command to restart ES. When
attempting the second shutdown, the script wouldn't find the pid file, so
I've had to kill the Java process.

I don't understand what happened. But I don't feel comfortable putting ES
in production. Full log for first and second restart here:
http://patpro.net/elastic.log

Any idea?
Regards,
Patrick

--
You received this message because you are subscribed to the Google Groups
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/DEC08780-FC7C-44F7-B7B8-B70215060351%40patpro.net
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAKdsXoG2Dvb2RTGdyukXOKS1DYGsnDTNQLnzCX%2Ba%2Bx%2B-KiuXjQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Re: strange problem: my ES server almost lost all its data. (All shards failed)

2014-05-02 Thread Patrick Proniewski

Hi Jörg,

Thank you for your reply.
The service script includes an option that might deal with IPv6, but it's not
active:

# Force the JVM to use IPv4 stack
# elasticshearch_props-Djava.net.preferIPv4Stack=true

(http://svnweb.freebsd.org/ports/head/textproc/elasticsearch/files/elasticsearch.in?revision=349955)

In past years, I used to disable IPv6 everywhere (kernel, ports compilation,
etc.) but now I don't bother anymore.
Do you mean I should use this option to force IPv4?

Thanks,
Patrick

On 2 mai 2014, at 09:38, joergpra...@gmail.com wrote:

On FreeBSD, do you have multicast on IPv6 enabled? You should disable IPv6
on the JVM.

Seems you received a severe network error from the OS.

Jörg

On Thu, May 1, 2014 at 11:46 PM, Patrick Proniewski
elasticsea...@patpro.net wrote:

Hello,

elasticsearch/nodes/0/indices/logstash-2014.*
elasticsearch/nodes/0/_state/

after restart, I've had:

elasticsearch/nodes/0/indices/logstash-2014.*
elasticsearch/nodes/0/_state/
elasticsearch/nodes/1/indices/logstash-2014.05.01
elasticsearch/nodes/1/_state/

Then, Kibana was not able to find anything (dashboards lost, etc.).

I'm running FreeBSD, and used the service command to restart ES. When
attempting the second shutdown, the script wouldn't find the pid file, so
I've had to kill the Java process.

I don't understand what happened. But I don't feel comfortable putting ES
in production. Full log for first and second restart here:
http://patpro.net/elastic.log

Any idea?
Regards,
Patrick

--
You received this message because you are subscribed to the Google Groups
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/AA0BC7BA-8856-4A23-A172-0601BC0B4FEE%40patpro.net.
For more options, visit https://groups.google.com/d/optout.

Re: strange problem: my ES server almost lost all its data. (All shards failed)

2014-05-02 Thread joergpra...@gmail.com

Yes, you should use this option.

Some FreeBSD kernels seem to have difficulties to run UDP multicast on IPv6
together with IPv4 properly, so I would like to suggest disabling IPv6 use
on the JVM.

Jörg

On Fri, May 2, 2014 at 10:23 AM, Patrick Proniewski
elasticsea...@patpro.net wrote:

Hi Jörg,

Thank you for your reply.
The service script includes an option that might deal with IPv6, but it's
not active:

# Force the JVM to use IPv4 stack
# elasticshearch_props-Djava.net.preferIPv4Stack=true

(
http://svnweb.freebsd.org/ports/head/textproc/elasticsearch/files/elasticsearch.in?revision=349955
)

In past years, I used to disable IPv6 everywhere (kernel, ports
compilation, etc.) but now I don't bother anymore.
Do you mean I should use this option to force IPv4?

Thanks,
Patrick

On 2 mai 2014, at 09:38, joergpra...@gmail.com wrote:

On FreeBSD, do you have multicast on IPv6 enabled? You should disable
IPv6
on the JVM.

Seems you received a severe network error from the OS.

Jörg

On Thu, May 1, 2014 at 11:46 PM, Patrick Proniewski
elasticsea...@patpro.net wrote:

Hello,

I'm running a small server with logstash, ES, Kibana. Tonight, I've
restarted my ES process. Very bad idea: it restarted with lots of
errors,
and finally lost all its data.
Basically, before restart, I've had:

elasticsearch/nodes/0/indices/logstash-2014.*
elasticsearch/nodes/0/_state/

after restart, I've had:

elasticsearch/nodes/0/indices/logstash-2014.*
elasticsearch/nodes/0/_state/
elasticsearch/nodes/1/indices/logstash-2014.05.01
elasticsearch/nodes/1/_state/

Then, Kibana was not able to find anything (dashboards lost, etc.).

I've stopped Logstash, stopped Elasticsearch, waited a bit and checked
everything is down, then restarted ES. It looked OK, then I've restarted
Logstash, and I was able to access my dashboards again. I've just lost
15
minutes of data.
Now I can see that elasticsearch/nodes/0 is the current working
directory,
and I can browse old data and current data.
elasticsearch/nodes/1 is not used anymore.

I'm running FreeBSD, and used the service command to restart ES. When
attempting the second shutdown, the script wouldn't find the pid file,
so
I've had to kill the Java process.

I don't understand what happened. But I don't feel comfortable putting
ES
in production. Full log for first and second restart here:
http://patpro.net/elastic.log

Any idea?
Regards,
Patrick

--
You received this message because you are subscribed to the Google Groups
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/AA0BC7BA-8856-4A23-A172-0601BC0B4FEE%40patpro.net
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAKdsXoGN_zdUi9cCAoAgprBh-Lxtu_g1ejSQWT_nZU0fd_YRTA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Re: strange problem: my ES server almost lost all its data. (All shards failed)

2014-05-02 Thread Patrick Proniewski

Thank you for the tip, Jörg.
I've activated this option and carefully restarted. I've re-read yesterday's
log file, and now I think may be the new ES instance started before the former
one was completely terminated. This too can cause some network/socket trouble.
I might try and add a short sleep into the restart command.

On 2 mai 2014, at 14:07, joergpra...@gmail.com wrote:

Yes, you should use this option.

Some FreeBSD kernels seem to have difficulties to run UDP multicast on IPv6
together with IPv4 properly, so I would like to suggest disabling IPv6 use
on the JVM.

Jörg

On Fri, May 2, 2014 at 10:23 AM, Patrick Proniewski
elasticsea...@patpro.net wrote:

Hi Jörg,

Thank you for your reply.
The service script includes an option that might deal with IPv6, but it's
not active:

# Force the JVM to use IPv4 stack
# elasticshearch_props-Djava.net.preferIPv4Stack=true

(
http://svnweb.freebsd.org/ports/head/textproc/elasticsearch/files/elasticsearch.in?revision=349955
)

In past years, I used to disable IPv6 everywhere (kernel, ports
compilation, etc.) but now I don't bother anymore.
Do you mean I should use this option to force IPv4?

Thanks,
Patrick

--
You received this message because you are subscribed to the Google Groups
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CE3C61C8-3EC1-49A0-A6DC-F38432CF123C%40patpro.net.
For more options, visit https://groups.google.com/d/optout.

strange problem: my ES server almost lost all its data. (All shards failed)

2014-05-01 Thread Patrick Proniewski

Hello,

I'm running a small server with logstash, ES, Kibana. Tonight, I've restarted
my ES process. Very bad idea: it restarted with lots of errors, and finally
lost all its data.
Basically, before restart, I've had:

elasticsearch/nodes/0/indices/logstash-2014.*
elasticsearch/nodes/0/_state/

after restart, I've had:

elasticsearch/nodes/0/indices/logstash-2014.*
elasticsearch/nodes/0/_state/
elasticsearch/nodes/1/indices/logstash-2014.05.01
elasticsearch/nodes/1/_state/

Then, Kibana was not able to find anything (dashboards lost, etc.).

I've stopped Logstash, stopped Elasticsearch, waited a bit and checked
everything is down, then restarted ES. It looked OK, then I've restarted
Logstash, and I was able to access my dashboards again. I've just lost 15
minutes of data.
Now I can see that elasticsearch/nodes/0 is the current working directory, and
I can browse old data and current data.
elasticsearch/nodes/1 is not used anymore.

I'm running FreeBSD, and used the service command to restart ES. When
attempting the second shutdown, the script wouldn't find the pid file, so I've
had to kill the Java process.

I don't understand what happened. But I don't feel comfortable putting ES in
production. Full log for first and second restart here:
http://patpro.net/elastic.log

Any idea?
Regards,
Patrick

--
You received this message because you are subscribed to the Google Groups
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/DEC08780-FC7C-44F7-B7B8-B70215060351%40patpro.net.
For more options, visit https://groups.google.com/d/optout.

Re: strange problem: my ES server almost lost all its data. (All shards failed)

Re: strange problem: my ES server almost lost all its data. (All shards failed)

Re: strange problem: my ES server almost lost all its data. (All shards failed)

Re: strange problem: my ES server almost lost all its data. (All shards failed)

strange problem: my ES server almost lost all its data. (All shards failed)

5 matches

Site Navigation

Mail list logo

Footer information