Re: what does the version parameter in the query mean?

2009-05-22 Thread Shalin Shekhar Mangar
On Fri, May 22, 2009 at 7:40 AM, Anshuman Manur wrote: > Ah, I see! Thank you so much for the response! > > I'm using SolrJ, so I probably don't need to set the XML version since the wiki > tells me that it uses binary as a default! > > SolrJ automatically adds the correct version parameter/value.

Re: No sanity checks before replicating files?

2009-05-22 Thread Shalin Shekhar Mangar
I think this problem might happen when there are uncommitted changes in S2 and the master S1 comes back online. In that case, slave's generation is still less than master's and installation of index diff from master may fail. However, I do not understand a few points. Damien, if S1 comes back onli

Re: How to index large set data

2009-05-22 Thread Jianbin Dai
About 2.8 million docs were created in total. Only the first run finished. On my 2nd try, it hangs forever at the end of indexing (I guess right before the commit), with CPU usage at 100%. A total of 5G (2050) index files are created. Now I have two problems: 1. Why does it hang there and fail? 2. how can i

Re: Solr statistics of top searches and results returned

2009-05-22 Thread Umar Shah
Hi, good feature to have. Maintaining the top N would also require storing all the search queries done so far and keeping them updated (or at least in some time window). Having pluggable persistent storage for all-time search queries would be great. Tell me how I can help. -umar On Fri, May 22, 2009 at 12

Re: clustering SOLR-769

2009-05-22 Thread Stanislaw Osinski
Hi there, > Is it possible to specify more than one snippet field or should I use copy > field to copy two or three fields into a single field and specify it in the > snippet field. Currently, you can specify only one snippet field, so you'd need to use copy. Cheers, S.

solr replication 1.3

2009-05-22 Thread Ashish P
I want to add a master/slave configuration for Solr. I have the following Solr configuration: I am using Solr 1.3 on Windows. I am also using EmbeddedSolrServer. In this case, is it possible to set up a master/slave configuration? My second question is if I use Solr 1.4 which has solr replication usin

Solr in cluster

2009-05-22 Thread Reza Safari
Hi, One of the problems I have with Lucene is the lock obtained by the IndexWriter. I want to use one Solr running inside a cluster behind the load balancer. Are multiple webservers able to write and commit to Lucene using Solr without locking issues etc? Is Solr the solution for concurrency

Re: clustering SOLR-769

2009-05-22 Thread Grant Ingersoll
On May 22, 2009, at 4:40 AM, Stanislaw Osinski wrote: Hi there, Is it possible to specify more than one snippet field or should I use copy field to copy two or three fields into a single field and specify it in the snippet field. Currently, you can specify only one snippet field, so you

Re: solr replication 1.3

2009-05-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Fri, May 22, 2009 at 3:12 PM, Ashish P wrote: > > I want to add master slave configuration for solr. I have following solr > configuration: > I am using solr 1.3 on windows. I am also using EmbeddedSolrServer. > In this case is it possible to perform master slave configuration?? > > My second q

Re: How to index large set data

2009-05-22 Thread Grant Ingersoll
Can you parallelize this? I don't know that the DIH can handle it, but having multiple threads sending docs to Solr is the best performance wise, so maybe you need to look at alternatives to pulling with DIH and instead use a client to push into Solr. On May 22, 2009, at 3:42 AM, Jianbin

Re: How to index large set data

2009-05-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
there is already an issue for writing to Solr in multiple threads SOLR-1089 On Fri, May 22, 2009 at 6:08 PM, Grant Ingersoll wrote: > Can you parallelize this?  I don't know that the DIH can handle it, but > having multiple threads sending docs to Solr is the best performance wise, > so maybe yo

Re: Solr in cluster

2009-05-22 Thread Otis Gospodnetic
Reza, You can't have multiple Solr instances write to the same index at the same time. But you can add documents to a single Solr instance in parallel (e.g. from multiple threads of one or more applications) and Solr will do the right thing without you having to put JMS or some other type of qu
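The pattern described in the reply above (several threads adding documents to one Solr instance) can be sketched as follows. This is a minimal sketch under stated assumptions, not the approach any poster actually used: `index_in_parallel`, `build_add_xml`, and `chunked` are hypothetical helper names, and `send` stands in for whatever function POSTs an XML body to the instance's update handler. Only the `<add><doc><field name="...">` message shape comes from Solr's XML update format.

```python
import xml.etree.ElementTree as ET
from concurrent.futures import ThreadPoolExecutor


def build_add_xml(docs):
    """Render a list of {field: value} dicts as a Solr XML <add> message."""
    add = ET.Element("add")
    for doc in docs:
        doc_el = ET.SubElement(add, "doc")
        for name, value in doc.items():
            field = ET.SubElement(doc_el, "field", name=name)
            field.text = str(value)
    return ET.tostring(add, encoding="unicode")


def chunked(items, size):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def index_in_parallel(docs, send, batch_size=100, workers=4):
    """Send batches from several threads to ONE Solr instance.

    `send` is a hypothetical callable (an assumption, not a Solr API)
    that POSTs one XML body to the instance's update handler.
    Returns the number of batches that were sent.
    """
    batches = [build_add_xml(chunk) for chunk in chunked(docs, batch_size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # list() waits for every batch and re-raises any worker exception.
        list(pool.map(send, batches))
    return len(batches)
```

All threads here talk to a single instance, which matches the constraint in the reply: parallel writers are fine as long as only one Solr instance owns the index. Passing `send` in as a parameter keeps the sketch independent of any particular HTTP client.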

Re: How to index large set data

2009-05-22 Thread Otis Gospodnetic
Hi, Those settings are a little "crazy". Are you sure you want to give Solr/Lucene 3G to buffer documents before flushing them to disk? Are you sure you want to use the mergeFactor of 1000? Check the logs to see if there are any errors. Look at the index directory to see if Solr is actu

Re: Solr in cluster

2009-05-22 Thread Reza Safari
Master work. This is exactly what I'm looking for. Now I'm happy :) Gr, Reza On May 22, 2009, at 4:23 PM, Otis Gospodnetic wrote: Reza, You can't have multiple Solr instances write to the same index at the same time. But you can add documents to a single Solr instance in parallel (e.g. f

Re: Plugin Not Found

2009-05-22 Thread Jeff Newburn
I have included the configuration and the log for the error on startup. It does appear it tries to load the lib but then simply can't reference it. explicit 0.01 productId^10.0 personality^15.0 subCate

Re: How to index large set data

2009-05-22 Thread Jianbin Dai
I don't know exactly what this 3G RAM buffer is used for. But what I noticed was that both the index size and the number of files kept increasing, but it got stuck in the commit. --- On Fri, 5/22/09, Otis Gospodnetic wrote: > From: Otis Gospodnetic > Subject: Re: How to index large set data > To: solr-user@luce

Re: How to index large set data

2009-05-22 Thread Jianbin Dai
If I do the xml parsing by myself and use embedded client to do the push, would it be more efficient than DIH? --- On Fri, 5/22/09, Grant Ingersoll wrote: > From: Grant Ingersoll > Subject: Re: How to index large set data > To: solr-user@lucene.apache.org > Date: Friday, May 22, 2009, 5:38 A

Filtering query terms

2009-05-22 Thread Branca Marco
Hi, I am experiencing problems using filters. I'm using the following version of Solr: solr/nightly of 2009-04-12 The part of the schema.xml I'm using for setting filters is the following:

RE: Filtering query terms

2009-05-22 Thread Ensdorf Ken
> When I try testing the filter "solr.LowerCaseFilterFactory" I get > different results calling the following urls: > > 1. http://[server-ip]:[server-port]/solr/[core- > name]/select/?q=all%3Apapa&version=2.2&start=0&rows=10&indent=on > 2. http://[server-ip]:[server-port]/solr/[core- > name]/sele

Re: Multicore Solr not showing Cache Stats

2009-05-22 Thread Otis Gospodnetic
Old email. Hoss, thanks for doing this. I had a closer look at my solrconfig.xml and found that I didn't put elements around the settings for caches. Solr didn't complain, so I didn't notice earlier... Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Messa

R: Filtering query terms

2009-05-22 Thread Branca Marco
Thank you very much for the instantaneous support. I couldn't find the conflict for hours :( When I have a response for the ISOLatin1AccentFilterFactory I will write it on the mailing-list. Thanks again, Marco Da: Ensdorf Ken [ensd...@zoominfo.com] Invia

Re: How to index large set data

2009-05-22 Thread Otis Gospodnetic
If the number of files and the index size were increasing, that means Solr was still working. It's possible it's taking extra long because of such high settings. Bring them both down and try. For example, don't go over 20 with mergeFactor, and try just 1GB for ramBufferSizeMB. Bona fortuna! Otis

Re: R: Filtering query terms

2009-05-22 Thread Otis Gospodnetic
Marco, Open-source can be good like that. :) See http://www.jroller.com/otis/entry/lucene_solr_nutch_amazing_tech for a similar example Ciao, Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Branca Marco > To: "solr-user@lucene.apache.org

DIH uses == instead of = in SQL

2009-05-22 Thread Eric Pugh
I am getting this error: Caused by: com.mysql.jdbc.exceptions.jdbc4.MySQLSyntaxErrorException: You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near '=='1433'' at line 1 at sun.reflect.NativeConstructo

Document Boosts don't seem to be having an effect

2009-05-22 Thread Jodi Showers
Greetings - first post here - hoping someone can direct me - grasping at straws. thank you in advance. Jodi I'm trying to tune the sort order using a combination of document and query time boosts. When searching for the term 'builder' with almost identical quantities of this term, and a mu

Re: DIH uses == instead of = in SQL

2009-05-22 Thread Otis Gospodnetic
Eric, WHERE institution_id=1433 vs. WHERE institution_id==1433 Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Eric Pugh > To: solr-user@lucene.apache.org > Sent: Friday, May 22, 2009 2:43:59 PM > Subject: DIH uses == instead of = in SQ

Data Import Handler - parentDeltaImport

2009-05-22 Thread Michael Korthuis
I have the data-config.xml detailed below (Stripped down a bit for simplicity) - When I run the delta import, the design_template delta query is running and modified rows are being returned. However, the parentDeltaQuery is never executed. Any thoughts? Thanks, Michael

Re: Data Import Handler - parentDeltaImport

2009-05-22 Thread Michael Korthuis
I have the data-config.xml detailed below (Stripped down a bit for simplicity) - When I run the delta import, the design_template delta query is running and modified rows are being returned. However, the parentDeltaQuery is never executed. Any thoughts? Thanks, Michael

How to use DIH to index attributes in xml file

2009-05-22 Thread Jianbin Dai
I have an xml file like this 301.46 In the data-config.xml, I use but how can I index "id", "mid"? Thanks.

solr machine freeze up during first replication after optimization

2009-05-22 Thread Kyle Lau
Hi all, We recently started running into this solr slave server freeze up problem. After looking into the logs and the timing of such occurrences, it seems that the problem always follows the first replication after an optimization. Once the server freezes up, we are unable to ssh into it, but pi

Re: solr machine freeze up during first replication after optimization

2009-05-22 Thread Otis Gospodnetic
Hm, are you sure this is not a network/switch/disk/something like that problem? Also, precisely because you have such a large index I'd avoid optimizing the index and then replicating it. My wild guess is that simply rsyncing this much data over the network kills your machines. Have you tried

questions about Clustering

2009-05-22 Thread Koji Sekiguchi
I'm thinking of using the clustering (SOLR-769) function for my project. I have a couple of questions: 1. if q=*:* is requested, Carrot2 will receive "MatchAllDocsQuery" via attributes. Is it OK? 2. I'd like to use it on an environment other than English, e.g. Japanese. I've implemented Carrot2Japanese

Re: DIH uses == instead of = in SQL

2009-05-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
are you using delta-import w/o a deltaImportQuery ? pls paste the relevant portion of data-config.xml On Sat, May 23, 2009 at 12:13 AM, Eric Pugh wrote: > I am getting this error: > > Caused by: com.mysql.jdbc.exceptions.jdbc4.MySQLSyntaxErrorException: You > have an error in your SQL syntax; che

Re: How to use DIH to index attributes in xml file

2009-05-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
Wildcards are not supported. You must use the full XPath. On Sat, May 23, 2009 at 4:55 AM, Jianbin Dai wrote: > > I have an xml file like this > > > > > 301.46 > > > In the data-config.xml, I use > > > but how can I index "id", "mid"? > > T

Re: How to index large set data

2009-05-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
no need to use embedded Solrserver. you can use SolrJ with streaming in multiple threads On Fri, May 22, 2009 at 8:36 PM, Jianbin Dai wrote: > > If I do the xml parsing by myself and use embedded client to do the push, > would it be more efficient than DIH? > > > --- On Fri, 5/22/09, Grant Inger

Re: Data Import Handler - parentDeltaImport

2009-05-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
How do you know it is not being executed? Also use deltaImportQuery if you are using Solr 1.4. On Sat, May 23, 2009 at 4:29 AM, Michael Korthuis wrote: > I have the data-config.xml detailed below (Stripped down a bit for > simplicity) - > When I run the delta import, the design_template delta q

Re: How to index large set data

2009-05-22 Thread Jianbin Dai
Hi Paul, but in your previous post, you said "there is already an issue for writing to Solr in multiple threads SOLR-1089". Do you think using SolrJ alone would be better than DIH? Thanks and have a good weekend! --- On Fri, 5/22/09, Noble Paul നോബിള്‍ नोब्ळ् wrote: > no need to use embedded

Re: How to use DIH to index attributes in xml file

2009-05-22 Thread Jianbin Dai
Oh, I guess I didn't say it clearly in my post. I didn't use wild cards in xpath. My question was how to index attributes "id" and "mid" in the following xml file. 301.46 In the data-config.xml, I use but what are the xpath for

Re: How to use DIH to index attributes in xml file

2009-05-22 Thread Shalin Shekhar Mangar
On Sat, May 23, 2009 at 10:31 AM, Jianbin Dai wrote: > > Oh, I guess I didn't say it clearly in my post. > I didn't use wild cards in xpath. My question was how to index attributes > "id" and "mid" in the following xml file. > > > > >30