Re: MultiCore problem

2007-12-14 Thread Ryan Mckinley
Yup sorry that wasn't more clear.  I have found that 95% of the time I  
think something does not work, making a concise example makes me  
discover the problem.  If it still does not work, there is something  
concrete to investigate/fix


If you started with the example confit, make small changed till it  
stops working as expected.


Ryan


On Dec 14, 2007, at 7:53 PM, Chris Hostetter  
<[EMAIL PROTECTED]> wrote:




: I'd love to write a test but I'm not a java programmer.

I believe Ryan just means "can you reproduce this with the example
configs?" ... ie: can you describe for us, a way we can take a nightly
snapshot, and reproduce the behavior you are describing.


-Hoss



RE: MultiCore problem

2007-12-14 Thread Chris Hostetter

: I'd love to write a test but I'm not a java programmer.

I believe Ryan just means "can you reproduce this with the example 
configs?" ... ie: can you describe for us, a way we can take a nightly 
snapshot, and reproduce the behavior you are describing.


-Hoss



Re: Replication hooks - changing the index while the slave is running ...

2007-12-14 Thread Tracy Flynn
I've looked at everything suggested - documentation, code and scripts  
and I follow almost everything that's happening.


If I understand correctly, when the updated snapshot is installed on  
the slave:


1) The existing index(es) being used by the Solr slave instance are  
physically deleted
2) The new index snapshots are renamed/moved from their temporary  
installation location to the default index location
3) The slave is sent a 'commit' to force a new IndexReader to start to  
read the new index.


What happens to search requests against the existing/old index during  
step 1) and between steps 1 and 2?  Where do they get information if  
they need to go to disk for results that are not cached? Do they a)  
hang b) produce no results c) error in some other way?



Regards,

Tracy

On Dec 11, 2007, at 9:11 AM, Tracy Flynn wrote:


That's what I was after.

As always, thanks for the quick response.

Tracy

On Dec 11, 2007, at 12:18 AM, Yonik Seeley wrote:

On Dec 10, 2007 11:22 PM, climbingrose <[EMAIL PROTECTED]>  
wrote:
I think there is a event listener interface for hooking into Solr  
events
such as post commit, post optimise and open new searcher. I can't  
remember
on top of my head but if you do a search for *EventListener in  
Eclipse,

you'll find it.
The Wiki shows how to trigger snapshooter after each commit and  
optimise.
You should be able to follow this example to create your own  
listener.


Right... you shouldn't need to implement your own listeners though.
Search for postCommit in the example solrconfig.xml

-Yonik






Re: solr.home via getServletContext().getInitParameter("solr.home")

2007-12-14 Thread Chris Hostetter

: Recently, I had to set up a Jetty with multiple Solr homes (not multi-core).  
I used JNDI with Jetty 6.1.6 to get this going.
: Then Matt Kangas' note from July 2007 caught my attention:
...
:   
:   
:   
...
: Is there a reason why this was not done?

because context params can *only* be set from the web.xml of the war ... 
which isn't something we want to encourage people to do (or suggest that 
they need to do) since it makes it extremely difficult to upgrade later.




-Hoss



RE: MultiCore problem

2007-12-14 Thread Owens, Martin
I was able to run more tests by indexing a second document in core1, now both 
documents appear in both cores, very odd like they are both being searched.

Is there any part of the config that could cause this? since the multicore 
solrconfig wasn't very good I replaced it with the solrconfig from the single 
entry; I replaced the directories to be core based. perhaps this is something o 
do with it?

I'd love to write a test but I'm not a java programmer.

Best Regards, Martin Owens


-Original Message-
From: Ryan McKinley [mailto:[EMAIL PROTECTED]
Sent: Fri 12/14/2007 4:26 PM
To: solr-user@lucene.apache.org
Subject: Re: MultiCore problem
 
I'm not sure off hand...  The junit tests go through this case, can you 
make a repeatable test from the example configs?

thanks
ryan


Owens, Martin wrote:
> Hello everyone,
> 
> Using the 2007-12-13 build I'm finding the multicore stuff isn't working as 
> expected. I have indexed 1 document in core0 and 0 documents in core1; dong a 
> search in core0 leads to a result, but I get the exact same result in core1; 
> as if it's searching core0 again.
> 
> Thoughts?
> 
> Best Regards, Martin Owens
> 




Re: MultiCore problem

2007-12-14 Thread Ryan McKinley
I'm not sure off hand...  The junit tests go through this case, can you 
make a repeatable test from the example configs?


thanks
ryan


Owens, Martin wrote:

Hello everyone,

Using the 2007-12-13 build I'm finding the multicore stuff isn't working as 
expected. I have indexed 1 document in core0 and 0 documents in core1; dong a 
search in core0 leads to a result, but I get the exact same result in core1; as 
if it's searching core0 again.

Thoughts?

Best Regards, Martin Owens





MultiCore problem

2007-12-14 Thread Owens, Martin

Hello everyone,

Using the 2007-12-13 build I'm finding the multicore stuff isn't working as 
expected. I have indexed 1 document in core0 and 0 documents in core1; dong a 
search in core0 leads to a result, but I get the exact same result in core1; as 
if it's searching core0 again.

Thoughts?

Best Regards, Martin Owens


Re: Solr Tutorial Issue

2007-12-14 Thread Yousef Ourabi
can you run: "cat /etc/hosts" and paste the output in an email.



- Original Message -
From: "Kirk Beers" <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, December 14, 2007 12:46:13 PM (GMT-0800) America/Los_Angeles
Subject: Re: Solr Tutorial Issue

Hi Hoss,

I get an error that reads java.net.UnknownHostException kirk : kirk

I will point out that I am new to Linux as well !

Thanks

Kirk


Chris Hostetter wrote:
> This thread in general is really confusing to me ... if you are following 
> along withthe tutorial then tomcat should never enter the equation ... 
> "java -jar start.jar" will use a copy of Jetty that is included in the 
> Solr release to spin up a self contained webserver totally indepent of any 
> application server you may already have installed.  (that's the whole 
> reason for the start.jar, you don't need to worry about wether you have a 
> servlet container installed correctly)
>
> When you run "java -jar start.jar" do you get logging output in your 
> console?  is there anything in that logging output that looks like a stack 
> trace?  what gets added to the end of that loggign output when you then 
> hit the url http://localhost:8983/solr/admin/ in your browser?
>
> : I am running Ubuntu, Java1.6 jdk  and tomcat5.5.  I can not seem to get the
> : tutorial to run. The instructions seem simple and clear.
> : 
> : start.jar ran fine but when I used http://localhost:8983/solr/admin/
> : nothing appeared. I also individually copied the
> : apache-solr-nightly/dist/apache-solr-nightly.war and the
> : apache-solr-nightly/example/webapps/solr.war to my tomcat webapps and still
> : nothing!!
>
>
> -Hoss
>
>   




Re: Solr Tutorial Issue

2007-12-14 Thread Kirk Beers

Hi Hoss,

I get an error that reads java.net.UnknownHostException kirk : kirk

I will point out that I am new to Linux as well !

Thanks

Kirk


Chris Hostetter wrote:
This thread in general is really confusing to me ... if you are following 
along withthe tutorial then tomcat should never enter the equation ... 
"java -jar start.jar" will use a copy of Jetty that is included in the 
Solr release to spin up a self contained webserver totally indepent of any 
application server you may already have installed.  (that's the whole 
reason for the start.jar, you don't need to worry about wether you have a 
servlet container installed correctly)


When you run "java -jar start.jar" do you get logging output in your 
console?  is there anything in that logging output that looks like a stack 
trace?  what gets added to the end of that loggign output when you then 
hit the url http://localhost:8983/solr/admin/ in your browser?


: I am running Ubuntu, Java1.6 jdk  and tomcat5.5.  I can not seem to get the
: tutorial to run. The instructions seem simple and clear.
: 
: start.jar ran fine but when I used http://localhost:8983/solr/admin/

: nothing appeared. I also individually copied the
: apache-solr-nightly/dist/apache-solr-nightly.war and the
: apache-solr-nightly/example/webapps/solr.war to my tomcat webapps and still
: nothing!!


-Hoss

  




Re: embeddedsolr and solrj index update request

2007-12-14 Thread Chris Hostetter

: Subject: embeddedsolr and solrj index update request
: In-Reply-To: <[EMAIL PROTECTED]>

http://people.apache.org/~hossman/#threadhijack

Thread Hijacking on Mailing Lists

When starting a new discussion on a mailing list, please do not reply to 
an existing message, instead start a fresh email.  Even if you change the 
subject line of your email, other mail headers still track which thread 
you replied to and your question is "hidden" in that thread and gets less 
attention.   It makes following discussions in the mailing list archives 
particularly difficult.
See Also:  http://en.wikipedia.org/wiki/Thread_hijacking





-Hoss



RE: Python Solr Writer

2007-12-14 Thread Chris Hostetter

: That would be a python solr client, not a solr writer using this:
: 
: 
http://lucene.apache.org/solr/api/org/apache/solr/request/PythonResponseWriter.html
: 
: Not sure how the hell it's supposed to work to be honest.

Can you clarify your question: Are you asking how the java code works 
internally, or are you asking how to make Solr use it to generate Python 
output?  (just add "&wt=python" to your URLs, or change your 
solrconfig.xml to explicitly declare the python  and 
make it the default...

http://wiki.apache.org/solr/QueryResponseWriter
http://wiki.apache.org/solr/SolPython


-Hoss



Re: Solr Tutorial Issue

2007-12-14 Thread Chris Hostetter

This thread in general is really confusing to me ... if you are following 
along withthe tutorial then tomcat should never enter the equation ... 
"java -jar start.jar" will use a copy of Jetty that is included in the 
Solr release to spin up a self contained webserver totally indepent of any 
application server you may already have installed.  (that's the whole 
reason for the start.jar, you don't need to worry about wether you have a 
servlet container installed correctly)

When you run "java -jar start.jar" do you get logging output in your 
console?  is there anything in that logging output that looks like a stack 
trace?  what gets added to the end of that loggign output when you then 
hit the url http://localhost:8983/solr/admin/ in your browser?

: I am running Ubuntu, Java1.6 jdk  and tomcat5.5.  I can not seem to get the
: tutorial to run. The instructions seem simple and clear.
: 
: start.jar ran fine but when I used http://localhost:8983/solr/admin/
: nothing appeared. I also individually copied the
: apache-solr-nightly/dist/apache-solr-nightly.war and the
: apache-solr-nightly/example/webapps/solr.war to my tomcat webapps and still
: nothing!!


-Hoss



Re: Solr Tutorial Issue

2007-12-14 Thread Yousef Ourabi
Kirk,

Perhaps I mis-understood your initial question. If you are just following the 
get started tutorial you have to do a few things to get solr running on tomcat.

Here is the Wiki resource: http://wiki.apache.org/solr/SolrTomcat

Hope this helps.

-Yousef

- Original Message -
From: "Kirk Beers" <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, December 14, 2007 11:56:51 AM (GMT-0800) America/Los_Angeles
Subject: Re: Solr Tutorial Issue

Hi Yousef,

Thanks for the reply!  I am presently using the  download from the 
Apache site and don't seem to be having much luck either.

Yousef Ourabi wrote:
> Kirk,
> I'm also having trouble with the apt-installed tomcat5.5 instance -- though I 
> haven't really taken enough time to figure it out. The same Context fragment 
> works fine on the Debian etch apt-installed tomcat -- so I'm fairly sure it's 
> specific to Ubuntu.
>
> A quick work-around would be to just download a copy of tomcat from apache 
> and run that (export JAVA_HOME, bin/catalina.sh start)
>
> -Yousef
>
> - Original Message -
> From: "Kirk Beers" <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
> Sent: Friday, December 14, 2007 11:18:31 AM (GMT-0800) America/Los_Angeles
> Subject: Solr Tutorial Issue
>
> Hi,
>
> I am running Ubuntu, Java1.6 jdk  and tomcat5.5.  I can not seem to get 
> the tutorial to run. The instructions seem simple and clear.
>
> start.jar ran fine but when I used http://localhost:8983/solr/admin/
> nothing appeared. I also individually copied the 
> apache-solr-nightly/dist/apache-solr-nightly.war and the 
> apache-solr-nightly/example/webapps/solr.war to my tomcat webapps and 
> still nothing!!
>
> Any feedback would be greatly appreciated ASAP
>
> Thanks
>
> Kirk
>
>   




Re: Solr Tutorial Issue

2007-12-14 Thread Kirk Beers

Hi Yousef,

Thanks for the reply!  I am presently using the  download from the 
Apache site and don't seem to be having much luck either.


Yousef Ourabi wrote:

Kirk,
I'm also having trouble with the apt-installed tomcat5.5 instance -- though I 
haven't really taken enough time to figure it out. The same Context fragment 
works fine on the Debian etch apt-installed tomcat -- so I'm fairly sure it's 
specific to Ubuntu.

A quick work-around would be to just download a copy of tomcat from apache and 
run that (export JAVA_HOME, bin/catalina.sh start)

-Yousef

- Original Message -
From: "Kirk Beers" <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, December 14, 2007 11:18:31 AM (GMT-0800) America/Los_Angeles
Subject: Solr Tutorial Issue

Hi,

I am running Ubuntu, Java1.6 jdk  and tomcat5.5.  I can not seem to get 
the tutorial to run. The instructions seem simple and clear.


start.jar ran fine but when I used http://localhost:8983/solr/admin/
nothing appeared. I also individually copied the 
apache-solr-nightly/dist/apache-solr-nightly.war and the 
apache-solr-nightly/example/webapps/solr.war to my tomcat webapps and 
still nothing!!


Any feedback would be greatly appreciated ASAP

Thanks

Kirk

  




Bug with deleteByQuery

2007-12-14 Thread Jörg Kiegeland
When I execute deleteByQuery, and afterwards I add new documents which 
match this query, then these documents are deleted though I added the 
documents AFTER the call to deleteByQuery .

(Even a call of  commit() seems not to help.)
Any idea?

PS: I use communication via HTTP /solrj / nightly build.



Re: sint not works when omitted or Integer.MAX_VALUE?

2007-12-14 Thread Jörg Kiegeland

There is a bug but it is not the bug I just described.. to follow..


Re: Solr replication

2007-12-14 Thread Bill Au
On Dec 14, 2007 7:00 AM, Dilip.TS <[EMAIL PROTECTED]> wrote:

> Hi,
> I have the following requirement for SOLR Collection Distribution using
> Embedded Solr with the Jetty server:
>
> I have different data folders for multiple instances of SOLR within the
> Same
> application.
> Im using the same SOLR_HOME with a single bin and conf folder.
>
> My query is:
> 1)Is is possible to have the same SOLR_HOME for multiple solr instances
> and
> still be able to
>  achieve Solr Distribution?
>  (As i understand that we need to have differnet rsync port for different
> solr instances)


Yes, solr distribution will work for multiple solr instances even if  they
all use the same SOLR_HOME.
All the distribution scripts have a command line argument for specifying the
data directory.


>
> 2)Can i get some more information about how to start this rsyncd daemon
> and
>  which is the best way of doing it i.e. to start during system reboot or
> doing it manually?


Please note that the rsyncd
-CollectionDistributionScripts#head-1e6cdce516ecf1eb31bffceaccf2abeb72bdce81

So it is best to configure the master server to run the rsyncd-start script
at system boot time.  If the rsync daemon has for some reasons been
disabled, it will not be started automatically at system reboot even if it
is configured to do so.  If rsyncd is started manually, then one will have
to remember to start it every time the master server is rebooted.


>
> 3)Let me know if my understanding is correct. We require 1 Master Server
> and
> a minimum of 1 slave server.
>  The master server and the slave server cannot be running on the same
> machine. Am i right?
>
>  In the case of the SOLR Distribution, if the SOLR server acts as the
> Master server
>  then how about this slave server ? Is it the Application server which
> calls the Master SOLR Server
>  acts as slave server?


Both the master and slave are SOLR servers.  Typically they are on different
machines.
It doesn't make sense (at least not to me) to have both of them on the same
machine.


>
> 4)I observe the file scripts.conf for master server:
>solr_port=8983
>rsyncd_port=18983
>
>+Enable and start rsync:
> rsyncd-enable; rsyncd-start
>+Run snapshooter:
> snapshooter
>
>Just to confirm is it mandatory that the solr master server should have
> the solr_port as 8983 only?



It does not to be 8983.   That's just an example.


>
>
> 5) How do we enable and start rsync? The link to
> SolrCollectionDistributionScripts mentions about
>   installing rsyncd daemon either during system boot time or by manually.
>   Which method is more preferrable?
>   How do we achieve this as iam not clear on this?


>
> 6) How do we setup crontab to run snappuller and snapinstaller
> periodically?



How to start rsyncd  at system boot time and setup crontab to run snappuller
and snapinstaller depends on the OS that Solr is running on.


>
>
>
> Regards,
> Dilip TS
> Starmark Services Pvt. Ltd.
>
>


Re: Solr Tutorial Issue

2007-12-14 Thread Yousef Ourabi
Kirk,
I'm also having trouble with the apt-installed tomcat5.5 instance -- though I 
haven't really taken enough time to figure it out. The same Context fragment 
works fine on the Debian etch apt-installed tomcat -- so I'm fairly sure it's 
specific to Ubuntu.

A quick work-around would be to just download a copy of tomcat from apache and 
run that (export JAVA_HOME, bin/catalina.sh start)

-Yousef

- Original Message -
From: "Kirk Beers" <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, December 14, 2007 11:18:31 AM (GMT-0800) America/Los_Angeles
Subject: Solr Tutorial Issue

Hi,

I am running Ubuntu, Java1.6 jdk  and tomcat5.5.  I can not seem to get 
the tutorial to run. The instructions seem simple and clear.

start.jar ran fine but when I used http://localhost:8983/solr/admin/
nothing appeared. I also individually copied the 
apache-solr-nightly/dist/apache-solr-nightly.war and the 
apache-solr-nightly/example/webapps/solr.war to my tomcat webapps and 
still nothing!!

Any feedback would be greatly appreciated ASAP

Thanks

Kirk



Solr Tutorial Issue

2007-12-14 Thread Kirk Beers

Hi,

I am running Ubuntu, Java1.6 jdk  and tomcat5.5.  I can not seem to get 
the tutorial to run. The instructions seem simple and clear.


start.jar ran fine but when I used http://localhost:8983/solr/admin/
nothing appeared. I also individually copied the 
apache-solr-nightly/dist/apache-solr-nightly.war and the 
apache-solr-nightly/example/webapps/solr.war to my tomcat webapps and 
still nothing!!


Any feedback would be greatly appreciated ASAP

Thanks

Kirk


Re: Problems with SOLR Collection Distribution

2007-12-14 Thread Bill Au
Try setting data_dir to \opt\solr\data.

I noticed that in the error message it shows "\" in the path to snapshooter
(ie \opt\solr\bin\snapshooter)
but "/" in the path to the data directory (ie /opt/solr/data).

Bill

On Dec 13, 2007 7:46 AM, Dilip.TS <[EMAIL PROTECTED]> wrote:

> Hi,
> When i changed the previous entries to the following,
> Strangely i got the error=3 this time...
> Anybody aware of what are these error codes 2,3 corresponds to?
> Is there any links for the same when one can get some info about these
> errors?
>
> scripts.conf
>
>user=jetty-6.1.6
>solr_hostname=localhost
>solr_port=80
>rsyncd_port=10080
>data_dir=/opt/solr/data/data_tenantID_1
>webapp_name=solr
>master_host=localhost
>master_data_dir=/opt/solr/data
>master_status_dir=/opt/solr/logs
>
> entry for postCommit:
>
> 
>/opt/solr/bin/snapshooter
>  solr/bin
>  true
>  -u jetty-6.1.6 -d
> /opt/solr/data
>   
>
> 
>
> The stack trace is as follows:
>
> INFO: start commit(optimize=false,waitFlush=false,waitSearcher=true)
> Dec 13, 2007 6:12:19 PM org.apache.solr.update.DirectUpdateHandler2
> doDeletions
> INFO: DirectUpdateHandler2 deleting and removing dups for 1 ids
> Dec 13, 2007 6:12:19 PM org.apache.solr.search.SolrIndexSearcher 
> INFO: Opening [EMAIL PROTECTED] DirectUpdateHandler2
> Dec 13, 2007 6:12:19 PM org.apache.solr.update.DirectUpdateHandler2
> doDeletions
> INFO: DirectUpdateHandler2 docs deleted=1
> Dec 13, 2007 6:12:19 PM org.apache.solr.common.SolrException log
> SEVERE: java.io.IOException: CreateProcess: \opt\solr\bin\snapshooter "-u
> jetty-6.1.6" "-d /opt/solr/data" error=3
>at java.lang.ProcessImpl.create(Native Method)
>at java.lang.ProcessImpl.(ProcessImpl.java:81)
>at java.lang.ProcessImpl.start(ProcessImpl.java:30)
>at java.lang.ProcessBuilder.start(ProcessBuilder.java:451)
>at java.lang.Runtime.exec(Runtime.java:591)
>at
> org.apache.solr.core.RunExecutableListener.exec(RunExecutableListener.java
> :7
> 3)
>at
> org.apache.solr.core.RunExecutableListener.postCommit
> (RunExecutableListener.
> java:100)
>at
> org.apache.solr.update.UpdateHandler.callPostCommitCallbacks(
> UpdateHandler.j
> ava:100)
>at
> org.apache.solr.update.DirectUpdateHandler2.commit(
> DirectUpdateHandler2.java
> :550)
>at com.plateau.psa.impl.EmbeddedSolr.commit(EmbeddedSolr.java:1008)
>at
> com.plateau.psa.impl.EmbeddedSolr.updateDocuments(EmbeddedSolr.java:877)
>
>
>
> -Original Message-
> From: Dilip.TS [mailto:[EMAIL PROTECTED]
> Sent: Thursday, December 13, 2007 5:44 PM
> To: solr-user@lucene.apache.org
> Subject: RE: Problems with SOLR Collection Distribution
>
>
> Hi,
> I am using the Jetty-6.1.6 server for the Embedded Solr.
>
> I made the following entries as follows:
>
> scripts.conf
>
> user=jetty-6.1.6
> solr_hostname=localhost
> solr_port=5000
> rsyncd_port=15000
> data_dir=solr/data/data_tenantID_1
> webapp_name=solr
> master_host=localhost
> master_data_dir=solr/data
> master_status_dir=solr/log
>
> and the corresponding entry for postCommit
>
>
>  snapshooter
>  solr/bin
>  false
>   -u jetty-6.1.6 -d /solr/data
> 
>
>
>
> Iam getting the following error.Am i missing something?
>
> INFO: Opening [EMAIL PROTECTED] DirectUpdateHandler2
> Dec 13, 2007 5:39:57 PM org.apache.solr.update.DirectUpdateHandler2
> doDeletions
> INFO: DirectUpdateHandler2 docs deleted=1
> Dec 13, 2007 5:39:57 PM org.apache.solr.common.SolrException log
> SEVERE: java.io.IOException: CreateProcess: snapshooter "-u jetty-6.1.6"
> "-d
> /solr/data" error=2
>at java.lang.ProcessImpl.create(Native Method)
>at java.lang.ProcessImpl.(ProcessImpl.java:81)
>at java.lang.ProcessImpl.start(ProcessImpl.java:30)
>at java.lang.ProcessBuilder.start(ProcessBuilder.java:451)
>at java.lang.Runtime.exec(Runtime.java:591)
>at
> org.apache.solr.core.RunExecutableListener.exec(RunExecutableListener.java
> :7
> 3)
>at
> org.apache.solr.core.RunExecutableListener.postCommit
> (RunExecutableListener.
> java:100)
>at
> org.apache.solr.update.UpdateHandler.callPostCommitCallbacks(
> UpdateHandler.j
> ava:100)
>at
> org.apache.solr.update.DirectUpdateHandler2.commit(
> DirectUpdateHandler2.java
> :550)
>at com.plateau.psa.impl.EmbeddedSolr.commit(EmbeddedSolr.java:1008)
>at
> com.plateau.psa.impl.EmbeddedSolr.updateDocuments(EmbeddedSolr.java:877)
>
>
>
>
>
> -Original Message-
> From: Dilip.TS [mailto:[EMAIL PROTECTED]
> Sent: Thursday, December 13, 2007 4:50 PM
> To: SOLR
> Subject: Problems with SOLR Collection Distribution
>
>
> Hi,
>  Iam trying to establish the SOLR Collection Distribution for my
> application
> using Embedded Solr.
> I have made the following entries in the scripts.conf file and included
> the
> entries for the postCommit and postOptimize
> operation in the solrconfig.

Re: Disabling the cache?

2007-12-14 Thread Mike Klaas
I would recommend limiting the documentCache to a small number  
(10-20), rather than zero.  Otherwise, you will retrieve the  
documents multiple times in one request if you are doing highlighting.


-Mike

On 14-Dec-07, at 2:24 AM, Koji Sekiguchi wrote:


Just comment out them:



regards,

Koji

Robert Young wrote:

Hi,

Is it possible to disable all the caches in Solr. We want to be able
to load test our Solr based application but we don't want the caches
to affect the results (we're using Apache benchmark so just sending
the same request over and over again).

Cheers
Rob








Re: Python Solr Writer

2007-12-14 Thread Mike Klaas
Not sure if this helps, but note that the work is done in  
PythonWriter, which is a subclass of JSONWriter.  Most of the work is  
done by JSONWriter due to the similarities of syntax.


-Mike

On 14-Dec-07, at 10:19 AM, Owens, Martin wrote:


That would be a python solr client, not a solr writer using this:

http://lucene.apache.org/solr/api/org/apache/solr/request/ 
PythonResponseWriter.html


Not sure how the hell it's supposed to work to be honest.

-Original Message-
From: [EMAIL PROTECTED] on behalf of Ed Summers
Sent: Fri 12/14/2007 11:30 AM
To: solr-user@lucene.apache.org
Subject: Re: Python Solr Writer

Do you mean something like:

  http://svn.apache.org/repos/asf/lucene/solr/trunk/client/python/ 
solr.py


//Ed

On Dec 14, 2007 10:20 AM, Owens, Martin  
<[EMAIL PROTECTED]> wrote:
I'm having some trouble understanding how the solr writer  
intergrates into python, I can't find any examples so does anyone  
have any good examples of a python writer?


Best Regards, Martin Owens







sint not works when omitted or Integer.MAX_VALUE?

2007-12-14 Thread Jörg Kiegeland

Hi,

I have following field definition:

  

("slong" as defined in the Solr example schema).

Now initially my Solr index is empty.
I realized that as long as I do not add a document which defines a value 
for X which is not Integer.MAX_VALUE, documents added to the index get 
completely lost!


My workaround is to initially (when the index is still empty) add one 
document with a value for field X of e.g. 1.
In the following, I can add arbitrary documents as it should be, e.g. 
which do not define a value for X at all or which define the value 
Integer.MAX_VALUE or which define any other value.


This workaround works for me, so it seems to be a bug somewhere..




RE: Python Solr Writer

2007-12-14 Thread Owens, Martin
That would be a python solr client, not a solr writer using this:

http://lucene.apache.org/solr/api/org/apache/solr/request/PythonResponseWriter.html

Not sure how the hell it's supposed to work to be honest.

-Original Message-
From: [EMAIL PROTECTED] on behalf of Ed Summers
Sent: Fri 12/14/2007 11:30 AM
To: solr-user@lucene.apache.org
Subject: Re: Python Solr Writer
 
Do you mean something like:

  http://svn.apache.org/repos/asf/lucene/solr/trunk/client/python/solr.py

//Ed

On Dec 14, 2007 10:20 AM, Owens, Martin <[EMAIL PROTECTED]> wrote:
> I'm having some trouble understanding how the solr writer intergrates into 
> python, I can't find any examples so does anyone have any good examples of a 
> python writer?
>
> Best Regards, Martin Owens
>



Re: Python Solr Writer

2007-12-14 Thread Ed Summers
Do you mean something like:

  http://svn.apache.org/repos/asf/lucene/solr/trunk/client/python/solr.py

//Ed

On Dec 14, 2007 10:20 AM, Owens, Martin <[EMAIL PROTECTED]> wrote:
> I'm having some trouble understanding how the solr writer intergrates into 
> python, I can't find any examples so does anyone have any good examples of a 
> python writer?
>
> Best Regards, Martin Owens
>


Dynamic autowarming queries

2007-12-14 Thread Robert Purdy

Hey there, is there a current way to specify dynamic autowarming queries in
the solrconfig.xml? If not is it possible for the solrconfg.xml to read from
a file that I can update on the fly to warm new dynamic fields in the schema
or just new queries to warm to prevent a restart of solr and so the changes
are picked up each time a new searcher is warmed? 

Also is it possible to warm a dynamic field ie (category_*) in the warming
section in the solrconfig.xml? If so what does it store in the cache if say
I have category_1, category_2, category_3, category_4 dynamically stored in
the index? Does it make a separate bitset for each field or does it combine
it all of them somehow?

Thanks Robert.
-- 
View this message in context: 
http://www.nabble.com/Dynamic-autowarming-queries-tp14339575p14339575.html
Sent from the Solr - User mailing list archive at Nabble.com.



how to intersect a doclist with a docset and get a doclist back?

2007-12-14 Thread Britske

Is there  a way to get a doclist based on intersecting an existing doclist
with a docset? 

However doing doclist.intersection(docset)  returns docset. 
Is there something I'm missing here? 

I figured this must be possible since the order of the returned doclist is
the same as the order of the inserted doclist. 

Thanks,
Geert-Jan
-- 
View this message in context: 
http://www.nabble.com/how-to-intersect-a-doclist-with-a-docset-and-get-a-doclist-back--tp14338755p14338755.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Disabling the cache?

2007-12-14 Thread Walter Underwood
That is not a very useful load test, since it doesn't match what
you'll see in production. About half our requests are served
from cache. Cache hits are all CPU, cache misses are heavy
on IO. Testing with all cache misses will under-estimate CPU
buy a huge amount.

It is very hard to simulate a real query load without using
real queries. If you have a log of real queries, use that
with JMeter. You can load a variable with a new query for
each request with ${__StringFromFile(queries.txt)}
  
http://jakarta.apache.org/jmeter/usermanual/functions.html#__StringFromFile

wunder

On 12/14/07 1:59 AM, "Robert Young" <[EMAIL PROTECTED]> wrote:

> Hi,
> 
> Is it possible to disable all the caches in Solr. We want to be able
> to load test our Solr based application but we don't want the caches
> to affect the results (we're using Apache benchmark so just sending
> the same request over and over again).
> 
> Cheers
> Rob



Python Solr Writer

2007-12-14 Thread Owens, Martin
I'm having some trouble understanding how the solr writer intergrates into 
python, I can't find any examples so does anyone have any good examples of a 
python writer?

Best Regards, Martin Owens


Re: Newbie question about Solr use in web applications

2007-12-14 Thread Stuart Sierra
On Dec 13, 2007 9:20 PM, solruser2 <[EMAIL PROTECTED]> wrote:
> Let's say I have a database containing people, groups, and projects (these
> all have different fields). I want to index these different kinds of objects
> with a view to eventually present search results from all three types mashed
> together and sorted by relevance. Using separate indices (and thus separate
> Solr processes) would make mashing the results together very difficult so
> I'm guessing I just add the separate fields to the schema along with an
> 'object_type' field or equivalent?

That is the approach I would take.  Having three separate indices
would make your searches slower and more complicated.

> Secondly should I just store the database row id for each object (while
> still indexing the field contents) so a query on the index returns a list of
> id's that I can then fetch from the database?

It depends. :)  If you want highlighted snippets in your search
results, then you have to store the field contents in the index.  In
some situations you can make your search pages faster by storing all
the critical fields (the ones you want to appear in search results) in
the index, so that you don't have to fetch a dozen records from the
database just to display a list of search results.  On the other hand,
if your database records are small and you don't need highlighting, it
may be faster to only store database ID's in the index.

Hope this sheds some light,
-Stuart Sierra
AltLaw.org


RE: Solr replication

2007-12-14 Thread Dilip.TS
Hi,
I have the following requirement for SOLR Collection Distribution using
Embedded Solr with the Jetty server:

I have different data folders for multiple instances of SOLR within the Same
application.
Im using the same SOLR_HOME with a single bin and conf folder.

My query is:
1)Is is possible to have the same SOLR_HOME for multiple solr instances and
still be able to
  achieve Solr Distribution?
 (As i understand that we need to have differnet rsync port for different
solr instances)

2)Can i get some more information about how to start this rsyncd daemon and
  which is the best way of doing it i.e. to start during system reboot or
doing it manually?

3)Let me know if my understanding is correct. We require 1 Master Server and
a minimum of 1 slave server.
  The master server and the slave server cannot be running on the same
machine. Am i right?

  In the case of the SOLR Distribution, if the SOLR server acts as the
Master server
  then how about this slave server ? Is it the Application server which
calls the Master SOLR Server
  acts as slave server?

4)I observe the file scripts.conf for master server:
solr_port=8983
rsyncd_port=18983

+Enable and start rsync:
 rsyncd-enable; rsyncd-start
+Run snapshooter:
 snapshooter

Just to confirm is it mandatory that the solr master server should have
the solr_port as 8983 only?

5) How do we enable and start rsync? The link to
SolrCollectionDistributionScripts mentions about
   installing rsyncd daemon either during system boot time or by manually.
   Which method is more preferrable?
   How do we achieve this as iam not clear on this?

6) How do we setup crontab to run snappuller and snapinstaller periodically?



Regards,
Dilip TS
Starmark Services Pvt. Ltd.



Re: How to effectively search inside fields that should be indexed with changing them.

2007-12-14 Thread Brian Carmalt

Hello Otis,

The example I provided was a simplified one. The real usecase is that 
will have to dynamically adapt to field values, from which
we have no idea what form they will have.. So unfortunately, a custom 
tokenizer will not work. I changed the n-gram values to min=max= 2
and I can match sub terms inside the fields that are analyzed with the 
NGramTokenizer. But I haven't had the time to test it completely.

Can you quickly outline why n-grams are not good solution for my problem?

Thanks, Brian

Otis Gospodnetic schrieb:

Brian,

This is not really a job for n-grams.  It sounds like you'll want to write a 
custom Tokenizer that has knowledge about this particular pattern, knows how to 
split input like the one in your example, and produce multiple tokens out of 
it.  For the natural language part you can probably get away with one of the 
existing tokenizers/analyzers/factories.  For the first part you'll likely want 
to extract (W+)0+ -- 1 or morel etters followed by 1 or more zeros as one 
token, and then 0+(D+) -- 1 or more zeros followed by 1 or more digits.

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

- Original Message 
From: Brian Carmalt <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Tuesday, December 11, 2007 9:17:32 AM
Subject: How to effectively search inside fields that should be indexed with 
changing them.

Hello all,

 The titles of our docs have the form "ABC0001231-This is an important 
doc.pdf". I would like to be able to
search for 'important', or '1231',  or 'ABC000*', or 'This is an 
important doc'  in the title field. I looked a the NGramTokenizer and 
tried to use it.
In the index it doesn't seem to work, I cannot get any hits. The 
analysis tool on the admin pages shows me that the
ngram tokenizing works by highlighting the matches between the indexed 
value and a query. I have set the

min and max ngram size to 2 and 6, with side equal to left.

Can anyone recommend a procedure that will allow me to search as stated
 
above?


I would also like to find out more about how to use the NgramTokenizer,
 
but have found little in the form of

documentation. Anyone know about any good sources?

Thanks,

Brian




  




Re: Disabling the cache?

2007-12-14 Thread Koji Sekiguchi

Just comment out them:



regards,

Koji

Robert Young wrote:

Hi,

Is it possible to disable all the caches in Solr. We want to be able
to load test our Solr based application but we don't want the caches
to affect the results (we're using Apache benchmark so just sending
the same request over and over again).

Cheers
Rob

  




Disabling the cache?

2007-12-14 Thread Robert Young
Hi,

Is it possible to disable all the caches in Solr. We want to be able
to load test our Solr based application but we don't want the caches
to affect the results (we're using Apache benchmark so just sending
the same request over and over again).

Cheers
Rob


RE: embeddedsolr and solrj index update request

2007-12-14 Thread Sandeep Shetty
Hi Ryan

i am running solr 1.3

in my solrconfig.xml i can see  is that the right one?

Regards,
Sandeep

-Original Message-
From: Ryan Mckinley [mailto:[EMAIL PROTECTED]
Sent: 13 December 2007 17:43
To: solr-user@lucene.apache.org
Subject: Re: embeddedsolr and solrj index update request


What version solr are you running?
Do you have an updateRequestJandler registered to /update?

On Dec 13, 2007, at 8:36 AM, Sandeep Shetty <[EMAIL PROTECTED]
 > wrote:

> Hi all,
>
> i am using embeddedsolr and solrj to create and update a large
> index. however even after adding documents and commiting documents
> i.e.
> server.add(solrDocs);
> server.commit();
> i am  not able to see the new documents in search results. I have to
> restart the server and run the same search to see the results. is
> there anything i am doing wrong here?
>
> Thanks in advance for your help
> Sandeep
>
>
> This email is confidential and may also be privileged. If you are
> not the intended recipient please notify us immediately by
> telephoning +44 (0)20 7840 4300 or email [EMAIL PROTECTED]
> You should not copy it or use it for any purpose nor disclose its
> contents to any other person. Touch Local cannot accept liability
> for statements made which are clearly the sender's own and are not
> made on behalf of the firm.
>
> Touch Local Limited
> Registered Number: 2885607
> VAT Number: GB896112114
> 9th Floor, 89 Albert Embankment, London SE1 7TP
> +44 (0)20 7840 4300

This email is confidential and may also be privileged. If you are not the 
intended recipient please notify us immediately by telephoning +44 (0)20 7840 
4300 or email [EMAIL PROTECTED] You should not copy it or use it for any 
purpose nor disclose its contents to any other person. Touch Local cannot 
accept liability for statements made which are clearly the sender's own and are 
not made on behalf of the firm.

Touch Local Limited
Registered Number: 2885607
VAT Number: GB896112114
9th Floor, 89 Albert Embankment, London SE1 7TP
+44 (0)20 7840 4300