Re: Monitor the QTime.

2011-02-12 Thread Gora Mohanty
On Sat, Feb 12, 2011 at 3:56 AM, Stijn Vanhoorelbeke
stijn.vanhoorelb...@gmail.com wrote:
 QTime is, of course, specific to the query, but it is returned in the
 response XML, so one could run occasional queries to figure it out.
 Please see http://wiki.apache.org/solr/SearchHandler

 Regards,
 Gora


 Yes, this could be a possibility. But then the Solr cache jumps back into
 the picture.
 I cannot simply query the system each minute with the same query - that way
 the result would be completely satisfied by the internal caches. I could
 build a list of heavy queries to do so - but I'd loved to use a more
 straight forward method.
[...]

That is true. The time for the first query gives you some idea, but that
is just one measurement, and can be off for various reasons. You can
do drastic things like restarting the Java container each time before
querying. However, one should be thinking of what it is that one is trying
to measure, as the benefits of the various Solr/Lucene caches do matter
in operations:
* If it is the average time for real queries in a running system, the stats
  component gives an idea of that, as someone already pointed out. You
  can access this from the admin at http://localhost:8983/solr/admin/stats.jsp
  (hostname/port might be different for you).
* One could pass randomly generated strings to each query so as to take
  out the effect of caching. I have been meaning to hack pg_bench to do
  this. There is a caveat even here, as a query for a non-existent string is
  likely to be faster.

Regards,
Gora


Re: Monitor the QTime.

2011-02-12 Thread Gora Mohanty
On Sat, Feb 12, 2011 at 4:54 AM, Stijn Vanhoorelbeke
stijn.vanhoorelb...@gmail.com wrote:
[...]
 Can you access this URL from a web browser (tried but doesn't work ) ? Or
 must this used in jConsole / custom made java program.

Please try http://localhost:8983/solr/admin/stats.jsp (change hostname/port as
needed).

 Could you please point me to a good guide to implement this JMX stuff, cause
 I'm a newbie for JMX.

The easiest way to get access to JMX is indeed a Java console, like jconsole.
There are various open-source JMX clients available, but we could find none
that met our needs, and were being actively maintained. We have been
toying with the idea of a JMX client that offers a REST API to Solr MBeans
(or even to any generic MBeans). This would be a more natural interface for
people used to web development.

Regards,
Gora


Re: SolrCloud (ZooKeeper)

2011-02-12 Thread Stijn Vanhoorelbeke
Hi,

Has anyone used Zookeeper on a Tomcat 5 system?
Could someone point me to a good guide how to implement Zookeeper/solr
combination? The SolCloud wiki page doesn't give much info at all.

Thank you,


Re: Which version of Solr?

2011-02-12 Thread Lance Norskog
There is momentum towards doing a release of 3.x. I would be
comfortable using the 3.x branch.

--- But I'm unable to get SolrJ to work due to the 'javabin' version
mismatch. I'm using the 1.4.1 version of SolrJ, but I always get an
HTTP response code of 200, but the return entity is simply a null
byte, which does not match the version number of 1 defined in Solr
common.  ---

I've never seen this problem. At this point you are better off
starting with 3.x instead of chasing this problem down.

On Sat, Feb 12, 2011 at 1:37 PM, Jeff Schmidt j...@535consulting.com wrote:
 Hello:

 I'm working on incorporating Solr into a SaaS based life sciences semantic 
 search project. This will be released in about six months. I'm trying to 
 determine which version of Solr makes the most sense. When going to the Solr 
 download page, there are 1.3.0, 1.4.0, and 1.4.1. I've been using 1.4.1 while 
 going through some examples in my Packt book (Solr 1.4 Enterprise Search 
 Server).

 But, I also see that Solr 3.1 and 4.0 are in the works.  According to:

        
 https://issues.apache.org/jira/browse/#selectedTab=com.atlassian.jira.plugin.system.project%3Aroadmap-panel

 there is a high degree of progress on both of those releases; including a 
 slew of bug fixes, new features, performance enhancements etc. Should I be 
 making use of one of the newer versions?  The hierarchical faceting seems 
 like it could be quite useful.  Are there any guesses on when either 3.1 or 
 4.0 will be officially released?

 So far, 1.4.1 has been good. But I'm unable to get SolrJ to work due to the 
 'javabin' version mismatch. I'm using the 1.4.1 version of SolrJ, but I 
 always get an HTTP response code of 200, but the return entity is simply a 
 null byte, which does not match the version number of 1 defined in Solr 
 common.  Anyway, I can follow up on that issue if 1.4.1 is still the most 
 appropriate version to use these days. Otherwise, I'll try again with 
 whatever version you suggest.

 Thanks a lot!

 Jeff
 --
 Jeff Schmidt
 535 Consulting
 j...@535consulting.com
 (650) 423-1068









-- 
Lance Norskog
goks...@gmail.com


Re: Difference between Solr and Lucidworks distribution

2011-02-12 Thread Lance Norskog
There are two distributions.

The company is Lucid Imagination. 'Lucidworks for Solr' is the
certified distribution of Solr 1.4.1, with several enhancements.

Markus refers to 'LucidWorks Enterprise', which is LWE. This is a
separate app with tools and a REST API for managing a Solr instance.

Lance Norskog

On Fri, Feb 11, 2011 at 8:36 AM, Markus Jelsma
markus.jel...@openindex.io wrote:
 It is not free for production environments.
 http://www.lucidimagination.com/lwe/subscriptions-and-pricing

 On Friday 11 February 2011 17:31:22 Greg Georges wrote:
 Hello all,

 I just started watching the webinars from Lucidworks, and they mention
 their distribution which has an installer, etc.. Is there any other
 differences? Is it a good idea to use this free distribution?

 Greg

 --
 Markus Jelsma - CTO - Openindex
 http://www.linkedin.com/in/markus17
 050-8536620 / 06-50258350




-- 
Lance Norskog
goks...@gmail.com


Re: Monitor the QTime.

2011-02-12 Thread Lance Norskog
If you're a unix shell scripting wiz, here are a few strategies.

Tail the logfile and filter for the string 'QTime'. The number is the
very last string in the line. So, strip the text between the timestamp
and the number- sort by the timestamp first and the number second. Now
 grab the first qtime for each timestamp. I don't know a command for
this. This gives you the longest query time for each second.

As a separate trick: tail the logfile and filter for QTime. Then,
strip out all text after the time. Now you have a stream of lines with
a timestamp. Run this through 'uniq -c' and voila! you get the queries
per second for each timestamp.

On Sat, Feb 12, 2011 at 1:51 AM, Gora Mohanty g...@mimirtech.com wrote:
 On Sat, Feb 12, 2011 at 4:54 AM, Stijn Vanhoorelbeke
 stijn.vanhoorelb...@gmail.com wrote:
 [...]
 Can you access this URL from a web browser (tried but doesn't work ) ? Or
 must this used in jConsole / custom made java program.

 Please try http://localhost:8983/solr/admin/stats.jsp (change hostname/port as
 needed).

 Could you please point me to a good guide to implement this JMX stuff, cause
 I'm a newbie for JMX.

 The easiest way to get access to JMX is indeed a Java console, like jconsole.
 There are various open-source JMX clients available, but we could find none
 that met our needs, and were being actively maintained. We have been
 toying with the idea of a JMX client that offers a REST API to Solr MBeans
 (or even to any generic MBeans). This would be a more natural interface for
 people used to web development.

 Regards,
 Gora




-- 
Lance Norskog
goks...@gmail.com


Re: Difference between Solr and Lucidworks distribution

2011-02-12 Thread Andy
Now I'm confused.

In http://www.lucidimagination.com/lwe/subscriptions-and-pricing, the price of 
LucidWorks Enterprise Software is stated as FREE. I thought the price for 
Production was for the support service, not for the software.

But you seem to be saying that 'LucidWorks Enterprise' is a separate software 
that isn't free. Did I misunderstand?

--- On Sat, 2/12/11, Lance Norskog goks...@gmail.com wrote:

 From: Lance Norskog goks...@gmail.com
 Subject: Re: Difference between Solr and Lucidworks distribution
 To: solr-user@lucene.apache.org, markus.jel...@openindex.io
 Date: Saturday, February 12, 2011, 8:10 PM
 There are two distributions.
 
 The company is Lucid Imagination. 'Lucidworks for Solr' is
 the
 certified distribution of Solr 1.4.1, with several
 enhancements.
 
 Markus refers to 'LucidWorks Enterprise', which is LWE.
 This is a
 separate app with tools and a REST API for managing a Solr
 instance.
 
 Lance Norskog
 
 On Fri, Feb 11, 2011 at 8:36 AM, Markus Jelsma
 markus.jel...@openindex.io
 wrote:
  It is not free for production environments.
  http://www.lucidimagination.com/lwe/subscriptions-and-pricing
 
  On Friday 11 February 2011 17:31:22 Greg Georges
 wrote:
  Hello all,
 
  I just started watching the webinars from
 Lucidworks, and they mention
  their distribution which has an installer, etc..
 Is there any other
  differences? Is it a good idea to use this free
 distribution?
 
  Greg
 
  --
  Markus Jelsma - CTO - Openindex
  http://www.linkedin.com/in/markus17
  050-8536620 / 06-50258350
 
 
 
 
 -- 
 Lance Norskog
 goks...@gmail.com