date:20121226

tiny patch for java7 on os X

2012-12-26 Thread Robert Muir

i installed java7 on my os X... with the following build patch
pylucene seems to work fine (tests pass etc).
I think java7 is just pickier about -source/-target both being set for
jcc. And the extensions should use the same explicit source/target (or
the build can hit classfile version problems).

Index: extensions.xml
===
--- extensions.xml  (revision 1425975)
+++ extensions.xml  (working copy)
@@ -16,7 +16,7 @@

   target name=compile
 mkdir dir=${classes.dir}/
-javac srcdir=java destdir=${classes.dir} classpathref=classpath /
+javac srcdir=java destdir=${classes.dir}
classpathref=classpath source=1.5 target=1.5 /
   /target

   target name=jar depends=compile
Index: jcc/setup.py
===
--- jcc/setup.py(revision 1425975)
+++ jcc/setup.py(working copy)
@@ -149,7 +149,7 @@
 LFLAGS['linux2'] = LFLAGS['linux2/%s' %(machine)]

 JAVAC = {
-'darwin': ['javac', '-target', '1.5'],
+'darwin': ['javac', '-source', '1.5', '-target', '1.5'],
 'ipod': ['jikes', '-cp', '/usr/share/classpath/glibj.zip'],
 'linux2': ['javac'],
 'sunos5': ['javac'],




http://pastebin.com/raw.php?i=qHpMw9Na

Re: [VOTE] Release PyLucene 3.6.2

2012-12-26 Thread Andi Vajda


On Dec 26, 2012, at 10:50, Robert Muir rcm...@gmail.com wrote:

 On OS X, i had to run 'make test' twice. The first time, i got a strange 
 error:
 
 Installed 
 /Users/rmuir/pylucene/pylucene-3.6.2-1/build/test/lucene-3.6.2-py2.7-macosx-10.7-x86_64.egg
 Processing dependencies for lucene==3.6.2
 Searching for lucene==3.6.2
 Reading http://pypi.python.org/simple/lucene/
 Couldn't find index page for 'lucene' (maybe misspelled?)
 Scanning index of all packages (this may take a while)
 Reading http://pypi.python.org/simple/
 No local packages or download links found for lucene==3.6.2
 error: Could not find suitable distribution for
 Requirement.parse('lucene==3.6.2')
 make: *** [install-test] Error 1
 
 I just ran it again and it worked...

Very strange. Why would it go out to pypi to install unrelated packages ?
Odd. Did you run just 'make' first before running 'make test' ? (my workflow).

Andi..

 
 On Tue, Dec 25, 2012 at 6:56 PM, Andi Vajda va...@apache.org wrote:
 
 The PyLucene 3.6.2-1 release tracking the recent release of Apache Lucene
 3.6.2 is ready.
 
 A release candidate is available from:
 http://people.apache.org/~vajda/staging_area/
 
 A list of changes in this release can be seen at:
 http://svn.apache.org/repos/asf/lucene/pylucene/branches/pylucene_3_6/CHANGES
 
 PyLucene 3.6.2 is built with JCC 2.15 included in these release artifacts:
 http://svn.apache.org/repos/asf/lucene/pylucene/trunk/jcc/CHANGES
 
 A list of Lucene Java changes can be seen at:
 http://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_3_6_2/lucene/CHANGES.txt
 
 Please vote to release these artifacts as PyLucene 3.6.2-1.
 
 Thanks !
 
 Andi..
 
 ps: the KEYS file for PyLucene release signing is at:
 http://svn.apache.org/repos/asf/lucene/pylucene/dist/KEYS
 http://people.apache.org/~vajda/staging_area/KEYS
 
 pps: here is my +1

Re: [VOTE] Release PyLucene 3.6.2

2012-12-26 Thread Robert Muir

On Wed, Dec 26, 2012 at 11:14 AM, Andi Vajda va...@apache.org wrote:

 Very strange. Why would it go out to pypi to install unrelated packages ?
 Odd. Did you run just 'make' first before running 'make test' ? (my workflow).


I just tried make, followed by make test, and it worked fine. So I
think i must have just tried 'make test' in one shot... must be a
little build thing.

doesn't seem like a blocker to me, just seemed a bit odd.

RE: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build # 3421 - Failure!

2012-12-26 Thread Uwe Schindler

Hi Mark,

No, this is a default one with default multiplier, so just ant test.
What's important to reproduce:
- Number of JVMs because this dictates, how many tests are run inside one JVM: 
-Dtests.jvms=2.
- And this is 32 bit Java!
- more settings like used garbage collector in build description on website

Uwe

-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de


 -Original Message-
 From: Mark Miller [mailto:markrmil...@gmail.com]
 Sent: Wednesday, December 26, 2012 3:22 AM
 To: dev@lucene.apache.org
 Subject: Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build #
 3421 - Failure!
 
 Is this one a nightly build?
 
 I can run it and look at it closely tomorrow.
 
 - Mark
 
 On Dec 25, 2012, at 6:04 PM, Uwe Schindler u...@thetaphi.de wrote:
 
  Can we add a finally/try block that catches permgen errors and calls
 System.halt (not exit)? I could add another extra allowance to the security
 manager, disallowing exits.
 
  But we should try to find the issue in the tests, maybe Mark has an idea.
 We have the heap dump readily available, but I don't have the tools to
 inspect it.
 
  Uwe
 
 
 
  Dawid Weiss dawid.we...@cs.put.poznan.pl schrieb:
   the test framework crashes somehow and does not respond anymore.
 
  I think I know exactly how it crashes -- there's not much mystery
  about this: once the permgen is exhausted OOM errors are thrown from
  tests; what happens then is these errors are caught and an attempt is
  made to serialize these errors to the master node. Unfortunately this
  process involves loading some classes that are not yet loaded and,
  since the permgen is already exhausted, everything goes insane (the
  thread apparently just silently quits; there are finally blocks that
  are never reached).
 
  Like I said -- I'll see what I can do about it but I don't have any
  optimistic feelings. This is really riding a critical edge and short
  of preallocating static data structures I don't see any way of
  implementing a clean solution for the problem.
 
  Dawid
 
 
  To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For
  additional commands, e-mail: dev-h...@lucene.apache.org
 
 
  --
  Uwe Schindler
  H.-H.-Meier-Allee 63, 28213 Bremen
  http://www.thetaphi.de
 
 
 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional
 commands, e-mail: dev-h...@lucene.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build # 3421 - Failure!

2012-12-26 Thread Dawid Weiss

I think it would be nice if Mike could add permgen pool stats (mx
bean) to his charts :) This way we would see the average permgen usage
over time -- it's easy to spot the regression then. Something to think
of for the future.

D.

On Wed, Dec 26, 2012 at 9:02 AM, Uwe Schindler u...@thetaphi.de wrote:
 Hi Mark,

 No, this is a default one with default multiplier, so just ant test.
 What's important to reproduce:
 - Number of JVMs because this dictates, how many tests are run inside one 
 JVM: -Dtests.jvms=2.
 - And this is 32 bit Java!
 - more settings like used garbage collector in build description on website

 Uwe

 -
 Uwe Schindler
 H.-H.-Meier-Allee 63, D-28213 Bremen
 http://www.thetaphi.de
 eMail: u...@thetaphi.de


 -Original Message-
 From: Mark Miller [mailto:markrmil...@gmail.com]
 Sent: Wednesday, December 26, 2012 3:22 AM
 To: dev@lucene.apache.org
 Subject: Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build #
 3421 - Failure!

 Is this one a nightly build?

 I can run it and look at it closely tomorrow.

 - Mark

 On Dec 25, 2012, at 6:04 PM, Uwe Schindler u...@thetaphi.de wrote:

  Can we add a finally/try block that catches permgen errors and calls
 System.halt (not exit)? I could add another extra allowance to the security
 manager, disallowing exits.
 
  But we should try to find the issue in the tests, maybe Mark has an idea.
 We have the heap dump readily available, but I don't have the tools to
 inspect it.
 
  Uwe
 
 
 
  Dawid Weiss dawid.we...@cs.put.poznan.pl schrieb:
   the test framework crashes somehow and does not respond anymore.
 
  I think I know exactly how it crashes -- there's not much mystery
  about this: once the permgen is exhausted OOM errors are thrown from
  tests; what happens then is these errors are caught and an attempt is
  made to serialize these errors to the master node. Unfortunately this
  process involves loading some classes that are not yet loaded and,
  since the permgen is already exhausted, everything goes insane (the
  thread apparently just silently quits; there are finally blocks that
  are never reached).
 
  Like I said -- I'll see what I can do about it but I don't have any
  optimistic feelings. This is really riding a critical edge and short
  of preallocating static data structures I don't see any way of
  implementing a clean solution for the problem.
 
  Dawid
 
 
  To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For
  additional commands, e-mail: dev-h...@lucene.apache.org
 
 
  --
  Uwe Schindler
  H.-H.-Meier-Allee 63, 28213 Bremen
  http://www.thetaphi.de


 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional
 commands, e-mail: dev-h...@lucene.apache.org


 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

RE: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build # 3421 - Failure!

2012-12-26 Thread Uwe Schindler

I started jhat on the machine:

http://jenkins.sd-datasolutions.de:7000/

you can inspect the heap dump with it. The Jenkins build was made sticky, so it
stays alive until I delete it. It is also nice to look to the heap dump with
visualvm (shipped with JDK @ jvisualvm heapfile). You should use the same
bitness and version of the JDK (32bit/jdk1.6.0_37) like used for this build
after downloading the heapdump:
http://jenkins.sd-datasolutions.de/job/Lucene-Solr-trunk-Linux/3421/artifact/heapdumps/java_pid13141.hprof

Unfortunately I did not find a good tool to inspect permgen heap only (it
contains loaded classes and interned strings). I checked the heapdump, we have
no strange classloaders involved, all classes seem to be loaded by the standard
app-classloader of the JDK and there are no duplicates (same class loaded
multiple times by different class loaders). So SolrResourceLoader is not the
bad guy as Robert and I expected as a first guess. Interestingly the dump has
milltions of java.lang.String objects (which makes me wonder, I thought Lucene
4.x does no longer use Strings? - BUT Solr, 90% of all strings look like this:
http://jenkins.sd-datasolutions.de:7000/object/0xdbf3d938, contents are similar
to
org.apache.solr.handler:type=RequestHandlerBase,scope=metrics-scope-22344,name=numTimeouts.
The parent object are some huge HashMaps of com.yammer.metrics.core.MetricName
instances).

When looking at the MBean mess, it looks like:
The whole VM is filled with MBean statistics (20% of the total heap!!!), just
for statistics. It looks like the MBean server is not shut down correctly when
the Solr instance shuts down, so it sums up while running tests, every new Solr
instance adds new statistics to the huge MBean maps eating all the heap (and
possibly permgen, because most strings may be interned)! This is a huge leak,
we should fix this (or disable the whole useless MBean shit completely, at
least for tests). Was this strange, never-seen package com.yammer.metrics
introduced recently related to mbeans - or is zookeeper the bad guy?

Uwe

-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de

-Original Message-
From: Mark Miller [mailto:markrmil...@gmail.com]
Sent: Wednesday, December 26, 2012 3:22 AM
To: dev@lucene.apache.org
Subject: Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build #
3421 - Failure!

Is this one a nightly build?

I can run it and look at it closely tomorrow.

- Mark

On Dec 25, 2012, at 6:04 PM, Uwe Schindler u...@thetaphi.de wrote:

Can we add a finally/try block that catches permgen errors and calls
System.halt (not exit)? I could add another extra allowance to the security
manager, disallowing exits.

But we should try to find the issue in the tests, maybe Mark has an idea.
We have the heap dump readily available, but I don't have the tools to
inspect it.

Uwe

Dawid Weiss dawid.we...@cs.put.poznan.pl schrieb:
the test framework crashes somehow and does not respond anymore.

I think I know exactly how it crashes -- there's not much mystery
about this: once the permgen is exhausted OOM errors are thrown from
tests; what happens then is these errors are caught and an attempt is
made to serialize these errors to the master node. Unfortunately this
process involves loading some classes that are not yet loaded and,
since the permgen is already exhausted, everything goes insane (the
thread apparently just silently quits; there are finally blocks that
are never reached).

Like I said -- I'll see what I can do about it but I don't have any
optimistic feelings. This is really riding a critical edge and short
of preallocating static data structures I don't see any way of
implementing a clean solution for the problem.

Dawid

To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For
additional commands, e-mail: dev-h...@lucene.apache.org

--
Uwe Schindler
H.-H.-Meier-Allee 63, 28213 Bremen
http://www.thetaphi.de

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional
commands, e-mail: dev-h...@lucene.apache.org

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

RE: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build # 3421 - Failure!

2012-12-26 Thread Uwe Schindler

Hi,

I did further investigation (with jvisualvm - you can use any version, also the 
newest one with other bitness, it can always read the heap dump - I recommend 
the Java 7 64bit one, its most fancy and does not itself OOM): 

 When looking at the MBean mess, it looks like:
 The whole VM is filled with MBean statistics (20% of the total heap!!!), just
 for statistics. It looks like the MBean server is not shut down correctly when
 the Solr instance shuts down, so it sums up while running tests, every new
 Solr instance adds new statistics to the huge MBean maps eating all the heap
 (and possibly permgen, because most strings may be interned)! This is a
 huge leak, we should fix this (or disable the whole useless MBean shit
 completely, at least for tests). Was this strange, never-seen package
 com.yammer.metrics introduced recently related to mbeans - or is
 zookeeper the bad guy?

It's much worse: the String instances are only 20% of heap, but 26% are used 
for the ConcurrentHashMap.Entry classes holding those references and tons of 
ConcurrentHashMaps and com.yammer.metrics.core instances, eating up 60% of the 
total heap space (only reachable object, not those to be GCed).

The big question: Do we need com.yammer.metrics.core (it is 
metrics-core-2.1.2.jar in solr/core/lib) at all? When was it introduced? Lucene 
3.6 does not have it, neither Solr 4.0. It must be introduced recently - and 
eats up all memory.

Uwe

  -Original Message-
  From: Mark Miller [mailto:markrmil...@gmail.com]
  Sent: Wednesday, December 26, 2012 3:22 AM
  To: dev@lucene.apache.org
  Subject: Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) -
  Build #
  3421 - Failure!
 
  Is this one a nightly build?
 
  I can run it and look at it closely tomorrow.
 
  - Mark
 
  On Dec 25, 2012, at 6:04 PM, Uwe Schindler u...@thetaphi.de wrote:
 
   Can we add a finally/try block that catches permgen errors and calls
  System.halt (not exit)? I could add another extra allowance to the
  security manager, disallowing exits.
  
   But we should try to find the issue in the tests, maybe Mark has an idea.
  We have the heap dump readily available, but I don't have the tools to
  inspect it.
  
   Uwe
  
  
  
   Dawid Weiss dawid.we...@cs.put.poznan.pl schrieb:
the test framework crashes somehow and does not respond anymore.
  
   I think I know exactly how it crashes -- there's not much mystery
   about this: once the permgen is exhausted OOM errors are thrown from
   tests; what happens then is these errors are caught and an attempt
   is made to serialize these errors to the master node. Unfortunately
   this process involves loading some classes that are not yet loaded
   and, since the permgen is already exhausted, everything goes insane
   (the thread apparently just silently quits; there are finally blocks
   that are never reached).
  
   Like I said -- I'll see what I can do about it but I don't have any
   optimistic feelings. This is really riding a critical edge and short
   of preallocating static data structures I don't see any way of
   implementing a clean solution for the problem.
  
   Dawid
  
  
   To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For
   additional commands, e-mail: dev-h...@lucene.apache.org
  
  
   --
   Uwe Schindler
   H.-H.-Meier-Allee 63, 28213 Bremen
   http://www.thetaphi.de
 
 
  -
  To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For
  additional commands, e-mail: dev-h...@lucene.apache.org
 
 
 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional
 commands, e-mail: dev-h...@lucene.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

RE: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build # 3421 - Failure!

2012-12-26 Thread Uwe Schindler

It was introduced by:

Revision: 1403555
Author: romseygeek
Date: Montag, 29. Oktober 2012 23:13:03
Message:
SOLR-1972: Add extra query stats to RequestHandler

Modified : /lucene/dev/trunk/solr/CHANGES.txt
Modified : /lucene/dev/trunk/solr/core/ivy.xml
Modified : 
/lucene/dev/trunk/solr/core/src/java/org/apache/solr/handler/RequestHandlerBase.java
Modified : 
/lucene/dev/trunk/solr/core/src/test/org/apache/solr/core/RequestHandlersTest.java
Modified : 
/lucene/dev/trunk/solr/test-framework/src/java/org/apache/solr/SolrIgnoredThreadsFilter.java

This one adds com.yammer.metrics.core to Solr and causes the huge memory leak. 
I'll reopen the issue.

-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de


 -Original Message-
 From: Uwe Schindler [mailto:u...@thetaphi.de]
 Sent: Wednesday, December 26, 2012 1:36 PM
 To: dev@lucene.apache.org
 Subject: RE: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build #
 3421 - Failure!
 
 Hi,
 
 I did further investigation (with jvisualvm - you can use any version, also 
 the
 newest one with other bitness, it can always read the heap dump - I
 recommend the Java 7 64bit one, its most fancy and does not itself OOM):
 
  When looking at the MBean mess, it looks like:
  The whole VM is filled with MBean statistics (20% of the total
  heap!!!), just for statistics. It looks like the MBean server is not
  shut down correctly when the Solr instance shuts down, so it sums up
  while running tests, every new Solr instance adds new statistics to
  the huge MBean maps eating all the heap (and possibly permgen, because
  most strings may be interned)! This is a huge leak, we should fix this
  (or disable the whole useless MBean shit completely, at least for
  tests). Was this strange, never-seen package com.yammer.metrics
  introduced recently related to mbeans - or is zookeeper the bad guy?
 
 It's much worse: the String instances are only 20% of heap, but 26% are used
 for the ConcurrentHashMap.Entry classes holding those references and tons
 of ConcurrentHashMaps and com.yammer.metrics.core instances, eating up
 60% of the total heap space (only reachable object, not those to be GCed).
 
 The big question: Do we need com.yammer.metrics.core (it is metrics-core-
 2.1.2.jar in solr/core/lib) at all? When was it introduced? Lucene 3.6 does 
 not
 have it, neither Solr 4.0. It must be introduced recently - and eats up all
 memory.
 
 Uwe
 
   -Original Message-
   From: Mark Miller [mailto:markrmil...@gmail.com]
   Sent: Wednesday, December 26, 2012 3:22 AM
   To: dev@lucene.apache.org
   Subject: Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) -
   Build #
   3421 - Failure!
  
   Is this one a nightly build?
  
   I can run it and look at it closely tomorrow.
  
   - Mark
  
   On Dec 25, 2012, at 6:04 PM, Uwe Schindler u...@thetaphi.de wrote:
  
Can we add a finally/try block that catches permgen errors and
calls
   System.halt (not exit)? I could add another extra allowance to the
   security manager, disallowing exits.
   
But we should try to find the issue in the tests, maybe Mark has an
 idea.
   We have the heap dump readily available, but I don't have the tools
   to inspect it.
   
Uwe
   
   
   
Dawid Weiss dawid.we...@cs.put.poznan.pl schrieb:
 the test framework crashes somehow and does not respond
 anymore.
   
I think I know exactly how it crashes -- there's not much mystery
about this: once the permgen is exhausted OOM errors are thrown
from tests; what happens then is these errors are caught and an
attempt is made to serialize these errors to the master node.
Unfortunately this process involves loading some classes that are
not yet loaded and, since the permgen is already exhausted,
everything goes insane (the thread apparently just silently quits;
there are finally blocks that are never reached).
   
Like I said -- I'll see what I can do about it but I don't have
any optimistic feelings. This is really riding a critical edge and
short of preallocating static data structures I don't see any way
of implementing a clean solution for the problem.
   
Dawid
   
   
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For
additional commands, e-mail: dev-h...@lucene.apache.org
   
   
--
Uwe Schindler
H.-H.-Meier-Allee 63, 28213 Bremen http://www.thetaphi.de
  
  
   
   - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For
   additional commands, e-mail: dev-h...@lucene.apache.org
 
 
  -
  To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For
  additional commands, e-mail: dev-h...@lucene.apache.org
 
 
 -
 To unsubscribe, e-mail:

[jira] [Reopened] (SOLR-1972) Need additional query stats in admin interface - median, 95th and 99th percentile

2012-12-26 Thread Uwe Schindler (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-1972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler reopened SOLR-1972:
-


This commit causes a huge memory leak in Solr: The hole heap is filled with 
(interned) Strings, ConcurrentHashMap.Entry, and lots of class instances from 
com.yammer.metrics package. 
This causes the the recent permgen issues in running Solr tests.

We should revert this and investigate how to remove the com.yammer.metrics 
package dependency (or make the stats cleaner). To me it looks like every query 
to solr creates a new entry in those huge maps, causing them to grow and grow 
and grow... There is also no cleanup that shuts down the MBean servers holding 
all those references on core shutdown.

See: 
http://lucene.472066.n3.nabble.com/JENKINS-Lucene-Solr-trunk-Linux-32bit-jdk1-6-0-37-Build-3421-Failure-td4029050.html

 Need additional query stats in admin interface - median, 95th and 99th 
 percentile
 -

 Key: SOLR-1972
 URL: https://issues.apache.org/jira/browse/SOLR-1972
 Project: Solr
  Issue Type: Improvement
  Components: web gui
Affects Versions: 1.4
Reporter: Shawn Heisey
Assignee: Alan Woodward
Priority: Minor
 Fix For: 4.1, 5.0

 Attachments: elyograg-1972-3.2.patch, elyograg-1972-3.2.patch, 
 elyograg-1972-trunk.patch, elyograg-1972-trunk.patch, 
 SOLR-1972-branch3x-url_pattern.patch, SOLR-1972-branch4x.patch, 
 SOLR-1972-branch4x.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, 
 SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, 
 SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, 
 solr1972-metricsregistry-branch4x-failure.log, SOLR-1972.patch, 
 SOLR-1972.patch, SOLR-1972.patch, SOLR-1972.patch, SOLR-1972-url_pattern.patch


 I would like to see more detailed query statistics from the admin GUI.  This 
 is what you can get now:
 requests : 809
 errors : 0
 timeouts : 0
 totalTime : 70053
 avgTimePerRequest : 86.59209
 avgRequestsPerSecond : 0.8148785 
 I'd like to see more data on the time per request - median, 95th percentile, 
 99th percentile, and any other statistical function that makes sense to 
 include.  In my environment, the first bunch of queries after startup tend to 
 take several seconds each.  I find that the average value tends to be useless 
 until it has several thousand queries under its belt and the caches are 
 thoroughly warmed.  The statistical functions I have mentioned would quickly 
 eliminate the influence of those initial slow queries.
 The system will have to store individual data about each query.  I don't know 
 if this is something Solr does already.  It would be nice to have a 
 configurable count of how many of the most recent data points are kept, to 
 control the amount of memory the feature uses.  The default value could be 
 something like 1024 or 4096.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Comment Edited] (SOLR-1972) Need additional query stats in admin interface - median, 95th and 99th percentile

2012-12-26 Thread Uwe Schindler (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-1972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13539526#comment-13539526
 ] 

Uwe Schindler edited comment on SOLR-1972 at 12/26/12 12:50 PM:


This commit causes a huge memory leak in Solr: The whole heap (60% while 
running tests) is filled with (interned) Strings, ConcurrentHashMap.Entry, and 
lots of class instances from the com.yammer.metrics package. This causes the 
the recent permgen issues in running Solr tests.

We should revert this and investigate how to remove the com.yammer.metrics 
package dependency (or make the stats cleaner). To me it looks like every query 
to solr creates a new entry in those huge maps, causing them to grow and grow 
and grow... There is also no cleanup that shuts down the MBean servers holding 
all those references on core shutdown.

See: 
http://lucene.472066.n3.nabble.com/JENKINS-Lucene-Solr-trunk-Linux-32bit-jdk1-6-0-37-Build-3421-Failure-td4029050.html

  was (Author: thetaphi):
This commit causes a huge memory leak in Solr: The hole heap is filled with 
(interned) Strings, ConcurrentHashMap.Entry, and lots of class instances from 
com.yammer.metrics package. 
This causes the the recent permgen issues in running Solr tests.

We should revert this and investigate how to remove the com.yammer.metrics 
package dependency (or make the stats cleaner). To me it looks like every query 
to solr creates a new entry in those huge maps, causing them to grow and grow 
and grow... There is also no cleanup that shuts down the MBean servers holding 
all those references on core shutdown.

See: 
http://lucene.472066.n3.nabble.com/JENKINS-Lucene-Solr-trunk-Linux-32bit-jdk1-6-0-37-Build-3421-Failure-td4029050.html
  
 Need additional query stats in admin interface - median, 95th and 99th 
 percentile
 -

 Key: SOLR-1972
 URL: https://issues.apache.org/jira/browse/SOLR-1972
 Project: Solr
  Issue Type: Improvement
  Components: web gui
Affects Versions: 1.4
Reporter: Shawn Heisey
Assignee: Alan Woodward
Priority: Minor
 Fix For: 4.1, 5.0

 Attachments: elyograg-1972-3.2.patch, elyograg-1972-3.2.patch, 
 elyograg-1972-trunk.patch, elyograg-1972-trunk.patch, 
 SOLR-1972-branch3x-url_pattern.patch, SOLR-1972-branch4x.patch, 
 SOLR-1972-branch4x.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, 
 SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, 
 SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, 
 solr1972-metricsregistry-branch4x-failure.log, SOLR-1972.patch, 
 SOLR-1972.patch, SOLR-1972.patch, SOLR-1972.patch, SOLR-1972-url_pattern.patch


 I would like to see more detailed query statistics from the admin GUI.  This 
 is what you can get now:
 requests : 809
 errors : 0
 timeouts : 0
 totalTime : 70053
 avgTimePerRequest : 86.59209
 avgRequestsPerSecond : 0.8148785 
 I'd like to see more data on the time per request - median, 95th percentile, 
 99th percentile, and any other statistical function that makes sense to 
 include.  In my environment, the first bunch of queries after startup tend to 
 take several seconds each.  I find that the average value tends to be useless 
 until it has several thousand queries under its belt and the caches are 
 thoroughly warmed.  The statistical functions I have mentioned would quickly 
 eliminate the influence of those initial slow queries.
 The system will have to store individual data about each query.  I don't know 
 if this is something Solr does already.  It would be nice to have a 
 configurable count of how many of the most recent data points are kept, to 
 control the amount of memory the feature uses.  The default value could be 
 something like 1024 or 4096.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build # 3421 - Failure!

2012-12-26 Thread Dawid Weiss

Good bug hunting, Mr Holmes!

 Unfortunately I did not find a good tool to inspect permgen heap only (it 
 contains loaded classes and interned strings). I

Not sure but YourKit may be able to do that (in particular if you
attach it using its own agent, not the default one).

 (and possibly permgen, because most strings may be interned)!

Just a note -- interned string pool is no longer stored in permgen as
of 1.7+ so they probably don't contribute to permgen exhaustion. Still
not good that there's so many of them of course ;)

Dawid

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build # 3421 - Failure!

2012-12-26 Thread Dawid Weiss

 Just a note -- interned string pool is no longer stored in permgen as
 of 1.7+ so they probably don't contribute to permgen exhaustion.

And I meant 1.7 build plans of course; the ones using 1.6 may be
affected if these strings are indeed interned.

D.

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4191) Exceptions thrown when /admin/mbeans is accessed during update/commit

2012-12-26 Thread Mark Smith (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-4191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13539538#comment-13539538
 ] 

Mark Smith commented on SOLR-4191:
--

This happening for me as well on Solr 4.0.  Interestingly enough, I'm not doing 
any writes.  I've got my solr app running for the past few weeks, and have not 
done any changes (no updates, no inserts, only selects).  Every few days I see 
this exception, so I simply kill my server and restart, and everything is fine. 
 Please let me know if there is any test or more info that can help.

 Exceptions thrown when /admin/mbeans is accessed during update/commit
 -

 Key: SOLR-4191
 URL: https://issues.apache.org/jira/browse/SOLR-4191
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.1
 Environment: solr-impl 4.1-SNAPSHOT 1421496 - ncindex - 2012-12-13 
 14:56:25
Reporter: Shawn Heisey
 Fix For: 4.1, 5.0

 Attachments: solr-2012-12-14[1].log


 I am getting the following exceptions in quick succession in the solr log 
 when /admin/mbeans is accessed at the moment that an update/commit is 
 happening:
 ERROR - 2012-12-13 18:17:01.930; org.apache.solr.common.SolrException; 
 null:org.eclipse.jetty.io.EofException
 ERROR - 2012-12-13 18:17:01.982; org.apache.solr.common.SolrException; 
 null:org.eclipse.jetty.io.EofException
 WARN  - 2012-12-13 18:17:01.984; org.eclipse.jetty.server.Response; Committed 
 before 500 {msg=Broken pipe,trace=org.eclipse.jetty.io.EofException
 WARN  - 2012-12-13 18:17:01.984; org.eclipse.jetty.servlet.ServletHandler; 
 /solr/s0live/admin/mbeans
 java.lang.IllegalStateException: Committed
 I will attach the full solr log.  Before SOLR-4135 was fixed, I got a *lot* 
 of those exceptions, but these were far less common.  Now these appear to be 
 the only thing I am getting in my logs, which log4j is logging at WARN.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Comment Edited] (SOLR-4191) Exceptions thrown when /admin/mbeans is accessed during update/commit

2012-12-26 Thread Mark Smith (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-4191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13539538#comment-13539538
 ] 

Mark Smith edited comment on SOLR-4191 at 12/26/12 1:55 PM:


This happening for me as well on Solr 4.0.  Interestingly enough, I'm not doing 
any writes.  I've got my solr app running for the past few weeks, and have not 
done any changes (no updates, no inserts, only selects).  Every few days I see 
this exception, so I simply kill my server and restart, and everything is fine. 
 Please let me know if anything I can do to help.

  was (Author: marksmithurbana):
This happening for me as well on Solr 4.0.  Interestingly enough, I'm not 
doing any writes.  I've got my solr app running for the past few weeks, and 
have not done any changes (no updates, no inserts, only selects).  Every few 
days I see this exception, so I simply kill my server and restart, and 
everything is fine.  Please let me know if there is any test or more info that 
can help.
  
 Exceptions thrown when /admin/mbeans is accessed during update/commit
 -

 Key: SOLR-4191
 URL: https://issues.apache.org/jira/browse/SOLR-4191
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.1
 Environment: solr-impl 4.1-SNAPSHOT 1421496 - ncindex - 2012-12-13 
 14:56:25
Reporter: Shawn Heisey
 Fix For: 4.1, 5.0

 Attachments: solr-2012-12-14[1].log


 I am getting the following exceptions in quick succession in the solr log 
 when /admin/mbeans is accessed at the moment that an update/commit is 
 happening:
 ERROR - 2012-12-13 18:17:01.930; org.apache.solr.common.SolrException; 
 null:org.eclipse.jetty.io.EofException
 ERROR - 2012-12-13 18:17:01.982; org.apache.solr.common.SolrException; 
 null:org.eclipse.jetty.io.EofException
 WARN  - 2012-12-13 18:17:01.984; org.eclipse.jetty.server.Response; Committed 
 before 500 {msg=Broken pipe,trace=org.eclipse.jetty.io.EofException
 WARN  - 2012-12-13 18:17:01.984; org.eclipse.jetty.servlet.ServletHandler; 
 /solr/s0live/admin/mbeans
 java.lang.IllegalStateException: Committed
 I will attach the full solr log.  Before SOLR-4135 was fixed, I got a *lot* 
 of those exceptions, but these were far less common.  Now these appear to be 
 the only thing I am getting in my logs, which log4j is logging at WARN.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Cache stats and the log files

2012-12-26 Thread Erick Erickson

H. Well, I guess I can argue that this shouldn't be on by default, it's
just a bit startling. Thanks for finding that!

On a related note, setting the root logging level in the admin UI to
finest changes all of the leaves to FINE level. But no changes happen
in the logging (at least nothing goes out to the console). Doesn't seem
right, anyone seen anything similar?



On Tue, Dec 25, 2012 at 4:23 PM, Jack Krupansky j...@basetechnology.comwrote:

 SOLR-3157

[jira] [Commented] (SOLR-1972) Need additional query stats in admin interface - median, 95th and 99th percentile

2012-12-26 Thread Shawn Heisey (JIRA)

[
https://issues.apache.org/jira/browse/SOLR-1972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13539544#comment-13539544
]

Shawn Heisey commented on SOLR-1972:

This is really useful to me, I'm very sad to learn it's causing problems. I
haven't noticed anything in my own running of branch_4x, which runs for days at
a time. I'm attempting to get some heap dumps so I can compare them. Based on
what I know from my own experience, I don't think this is actually a leak, but
even a minimal test JVM ends up with a lot of request handlers, each of which
has these additional objects. As for seeing what looks like a leak with each
query, after 1024 queries, the growth in non-garbage objects should stop,
because it throws away an entry to make room for the new one.

I wonder if there might be an easy way to make these new statistics optional in
the request handler, so they do not cause memory problems with minimal test
configs. Although it's really cool to be able to see detailed query statistics
on the SolrInfoMBeanHandler, it's unnecessary.

If it does get removed, I guess I'll have to go back to the previous version of
the patch that only has additional statistics on qtime. I depend on these
statistics - the average values otherwise available are nearly useless.

I will attempt a patch. No guarantees about success!

Need additional query stats in admin interface - median, 95th and 99th
percentile
-

Key: SOLR-1972
URL: https://issues.apache.org/jira/browse/SOLR-1972
Project: Solr
Issue Type: Improvement
Components: web gui
Affects Versions: 1.4
Reporter: Shawn Heisey
Assignee: Alan Woodward
Priority: Minor
Fix For: 4.1, 5.0

Attachments: elyograg-1972-3.2.patch, elyograg-1972-3.2.patch,
elyograg-1972-trunk.patch, elyograg-1972-trunk.patch,
SOLR-1972-branch3x-url_pattern.patch, SOLR-1972-branch4x.patch,
SOLR-1972-branch4x.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch,
SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch,
SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch,
solr1972-metricsregistry-branch4x-failure.log, SOLR-1972.patch,
SOLR-1972.patch, SOLR-1972.patch, SOLR-1972.patch, SOLR-1972-url_pattern.patch

I would like to see more detailed query statistics from the admin GUI. This
is what you can get now:
requests : 809
errors : 0
timeouts : 0
totalTime : 70053
avgTimePerRequest : 86.59209
avgRequestsPerSecond : 0.8148785
I'd like to see more data on the time per request - median, 95th percentile,
99th percentile, and any other statistical function that makes sense to
include. In my environment, the first bunch of queries after startup tend to
take several seconds each. I find that the average value tends to be useless
until it has several thousand queries under its belt and the caches are
thoroughly warmed. The statistical functions I have mentioned would quickly
eliminate the influence of those initial slow queries.
The system will have to store individual data about each query. I don't know
if this is something Solr does already. It would be nice to have a
configurable count of how many of the most recent data points are kept, to
control the amount of memory the feature uses. The default value could be
something like 1024 or 4096.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build # 3421 - Failure!

2012-12-26 Thread Michael McCandless

On Wed, Dec 26, 2012 at 4:13 AM, Dawid Weiss
dawid.we...@cs.put.poznan.pl wrote:
 I think it would be nice if Mike could add permgen pool stats (mx
 bean) to his charts :) This way we would see the average permgen usage
 over time -- it's easy to spot the regression then. Something to think
 of for the future.

Hmm this is interesting!

Does this just amount to running top-level ant test
-Dargs=-verbose:gc -XX:+PrintGCDetails -Dtests.jvms=1 and then
parsing the GC stdout for the permgen usage?

Mike McCandless

http://blog.mikemccandless.com

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-1972) Need additional query stats in admin interface - median, 95th and 99th percentile

2012-12-26 Thread Robert Muir (JIRA)

[
https://issues.apache.org/jira/browse/SOLR-1972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13539546#comment-13539546
]

Robert Muir commented on SOLR-1972:
---

{quote}
This commit causes a huge memory leak in Solr
...
We should revert this and investigate how to remove the com.yammer.metrics
package dependency (or make the stats cleaner). To me it looks like every query
to solr creates a new entry in those huge maps, causing them to grow and grow
and grow...
{quote}

Maybe the stats have to be implemented by hand or something. I don't understand
why huge maps or string interning is necessary here.

Lets back out the change and implement it in a non-leaky way.

Need additional query stats in admin interface - median, 95th and 99th
percentile
-

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build # 3421 - Failure!

2012-12-26 Thread Robert Muir

On Wed, Dec 26, 2012 at 5:23 AM, Dawid Weiss
dawid.we...@cs.put.poznan.pl wrote:
 Good bug hunting, Mr Holmes!


I think he should be promoted from policeman to detective!

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Created] (SOLR-4232) Make request handler metrics optional

2012-12-26 Thread Shawn Heisey (JIRA)

Shawn Heisey created SOLR-4232:
--

 Summary: Make request handler metrics optional
 Key: SOLR-4232
 URL: https://issues.apache.org/jira/browse/SOLR-4232
 Project: Solr
  Issue Type: Bug
  Components: web gui
Affects Versions: 4.1, 5.0
Reporter: Shawn Heisey
Priority: Blocker
 Fix For: 4.1, 5.0


Uwe Schindler noticed what looked like a memory leak caused by the addition of 
SOLR-1972.  I don't believe it's actually a leak, but the additional memory 
required does appear to be causing problems for Solr test JVMs.  I think this 
is likely because there are a LOT of request handlers defined for even a very 
minimal test config, each of which ends up with the new objects.

This is an attempt to provide an option for turning on the new statistics only 
when required.  For most people, this will only be required for search handlers.

If this is not successful at fixing the test problems, we can remove metrics 
with this issue.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.6.0_37) - Build # 3421 - Failure!

2012-12-26 Thread Dawid Weiss

i was thinking about adding a hook to memorypool mx bean as far as i
remember it does have 'peak' memory usage for permgen and it could be
charted over time. parsing gc logs is kind of hard because they will differ
depending on the vm and even the gc used.

also, the gc logs are dumped to process descriptors and not to
system.out/err streams and the runner will complain about unrecognized
process output.

this isn't crucial, but i think would be nice to add at some point.

On Wednesday, December 26, 2012, Michael McCandless wrote:

 On Wed, Dec 26, 2012 at 4:13 AM, Dawid Weiss
 dawid.we...@cs.put.poznan.pl javascript:; wrote:
  I think it would be nice if Mike could add permgen pool stats (mx
  bean) to his charts :) This way we would see the average permgen usage
  over time -- it's easy to spot the regression then. Something to think
  of for the future.

 Hmm this is interesting!

 Does this just amount to running top-level ant test
 -Dargs=-verbose:gc -XX:+PrintGCDetails -Dtests.jvms=1 and then
 parsing the GC stdout for the permgen usage?

 Mike McCandless

 http://blog.mikemccandless.com

 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org javascript:;
 For additional commands, e-mail: dev-h...@lucene.apache.org javascript:;

94 matches

Mail list logo