Re: Restarting tomcat deletes all Solr indexes

2009-05-12 Thread Andrey Klochkov
Hi, I know that when starting Solr checks index directory existence, and creates new fresh index if it doesn't exist. Does it help? If no, the next step I'd do in your case is patching SolrCore.initIndex method - insert some logging, or run EmbeddedSolrServer with debugger etc. On Mon, May 11,

How to deal with Mark invalid?

2009-05-12 Thread Nikolai Derzhak
Good day, people. We use solr to search in mailboxes (dovecot). But with some bad messages solr 1.4-dev generate error: SEVERE: java.io.IOException: Mark invalid at java.io.BufferedReader.reset(BufferedReader.java:485) at

Custom Servlet Filter, Where to put filter-mappings

2009-05-12 Thread Jacob Singh
Hi folks, I just wrote a Servlet Filter to handle authentication for our service. Here's what I did: 1. Created a dir in contrib 2. Put my project in there, I took the dataimporthandler build.xml as an example and modified it to suit my needs. Worked great! 3. ant dist now builds my jar and

Re: QueryElevationComponent : hot update of elevate.xml

2009-05-12 Thread Nicolas Pastorino
Hi, On May 7, 2009, at 6:03 , Noble Paul നോബിള്‍ नोब्ळ् wrote: going forward the java based replication is going to be the preferred means replicating index. It does not support replicating files in the dataDir , it only supports replicating index files and conf files (files in conf dir). I

Solr Loggin issue

2009-05-12 Thread Sagar Khetkade
Hi, I have solr implemented in multi-core scenario and also implemented solr-560-slf4j.patch for implementing the logging. But the problem I am facing is that the logs are going to the stdout.log file not the log file that I have mentioned in the log4j.properties file. Can anybody give me

Re: Restarting tomcat deletes all Solr indexes

2009-05-12 Thread KK
Thanks for your response @aklochkov. But I again noticed that something is wrong in my solr/tomcat config[I spent a lot of time making solr run], b'coz in the solr admin page [ http://localhost:8080/solr/admin/] what I see is that the $CWD is the location where from I restarted tomcat and seems

Geographical search based on latitude and longitude

2009-05-12 Thread Norman Leutner
Hi together, I'm new to Solr and want to port a geographical range search from MySQL to Solr. Currently I'm using some mathematical functions (based on GRS80 modell) directly within MySQL to calculate the actual distance from the locations within the database to a current location (lat and

Re: Geographical search based on latitude and longitude

2009-05-12 Thread Grant Ingersoll
See https://issues.apache.org/jira/browse/SOLR-773. In other words, we're working on it and would love some help! -Grant On May 12, 2009, at 7:12 AM, Norman Leutner wrote: Hi together, I'm new to Solr and want to port a geographical range search from MySQL to Solr. Currently I'm using

Re: Restarting tomcat deletes all Solr indexes

2009-05-12 Thread KK
One more information I would like to add. The entry in solr stats page says this: readerDir : org.apache.lucene.store.FSDirectory@/home/kk/solr/data/index when I ran from /home/kk and this: readerDir : org.apache.lucene.store.FSDirectory@ /home/kk/junk/solr/data/index after running from

AW: Geographical search based on latitude and longitude

2009-05-12 Thread Norman Leutner
So are you using boundary box to find results within a given range(km) like mentioned here: http://www.nsshutdown.com/projects/lucene/whitepaper/locallucene_v2.html ? Best regards Norman Leutner all2e GmbH -Ursprüngliche Nachricht- Von: Grant Ingersoll [mailto:gsing...@apache.org]

Re: fieldType without tokenizer

2009-05-12 Thread sunnyfr
hi I tried but Ive an error : May 12 15:48:51 solr-test jsvc.exec[2583]: May 12, 2009 3:48:51 PM org.apache.solr.common.SolrException log SEVERE: org.apache.solr.common.SolrException: Error loading class 'solr.KeywordTokenizer' ^Iat

Re: fieldType without tokenizer

2009-05-12 Thread Erik Hatcher
Use KeywordTokenizerFactory. Pasted from Solr's example schema.xml: tokenizer class=solr.KeywordTokenizerFactory/ Erik On May 12, 2009, at 9:49 AM, sunnyfr wrote: hi I tried but Ive an error : May 12 15:48:51 solr-test jsvc.exec[2583]: May 12, 2009 3:48:51 PM

Re: fieldType without tokenizer

2009-05-12 Thread Koji Sekiguchi
It must be KeywordTokenizer*Factory* :) Koji sunnyfr wrote: hi I tried but Ive an error : May 12 15:48:51 solr-test jsvc.exec[2583]: May 12, 2009 3:48:51 PM org.apache.solr.common.SolrException log SEVERE: org.apache.solr.common.SolrException: Error loading class 'solr.KeywordTokenizer' ^Iat

Re: Facet counts for common terms of the searched field

2009-05-12 Thread sachin78
Does anybody have answer to this post.I have a similar requirement. Suppose I have free text field say I index the field.If I search for textfield:copper.I have to get facet counts for the most common words found in a textfield. ie. example:search for textfield:glass should return facet counts

Re: Facet counts for common terms of the searched field

2009-05-12 Thread Matt Weber
You may have to take care of this at index time. You can create a new multivalued field that has minimal processing. Then at index time, index the full contents of textfield as normal, but then also split it on whitespace and index each word in the new field you just created. Now you

Re: Facet counts for common terms of the searched field

2009-05-12 Thread sachin78
Thanks Matt for your reply. What do you mean by frequency(the default)? Can you please provide an example schema and query will look like. --Sachin Matt Weber-2 wrote: You may have to take care of this at index time. You can create a new multivalued field that has minimal processing.

Re: How to deal with Mark invalid?

2009-05-12 Thread Nikolai Derzhak
OK. I've applied dirty hack as temporary solution: in src/java/org/apache/solr/analysis/HTMLStripReader.java of 1.4-dev - enclosed io.reset in try structure. ( * @version $Id: HTMLStripReader.java 646799 2008-04-10 13:36:23Z yonik $) private void restoreState() throws IOException { try

Re: How to deal with Mark invalid?

2009-05-12 Thread Yonik Seeley
I just committed a minor match suggested by Jim Murphy in SOLR-42 to slightly lower the safe read ahead limit to avoid reading beyond a a mark. Could you try out trunk (or wait until the next nightly build?) -Yonik http://www.lucidimagination.com On Tue, May 12, 2009 at 10:57 AM, Nikolai

Re: Facet counts for common terms of the searched field

2009-05-12 Thread Matt Weber
I mean you can sort the facet results by frequency, which happens to be the default behavior. Here is an example field for your schema: field name=textfieldfacet type=string indexed=true stored=true multiValued=true / Here is an example query:

Newbie question

2009-05-12 Thread Wayne Pope
Hi, We're implemented search into our product here at our very small company, and the developer who integrated Solr has left. I'm picking up the code base and have run into a problem , which I imagine is simple to solve. I have this request:

Re: Solr Loggin issue

2009-05-12 Thread Jay Hill
Usually that means there is another log4j.properties or log4j.xml file in your classpath that is being found before the one you are intending to use. Check your classpath for other versions of these files. -Jay On Tue, May 12, 2009 at 3:38 AM, Sagar Khetkade sagar.khetk...@hotmail.comwrote:

Replication master+slave

2009-05-12 Thread Bryan Talbot
For replication in 1.4, the wiki at http://wiki.apache.org/solr/SolrReplication says that a node can be both the master and a slave: A node can act as both master and slave. In that case both the master and slave configuration lists need to be present inside the ReplicationHandler

Re: AW: Geographical search based on latitude and longitude

2009-05-12 Thread Grant Ingersoll
Yes, that is part of it, but there is more to it. See Yonik's comment about needs further down. On May 12, 2009, at 7:36 AM, Norman Leutner wrote: So are you using boundary box to find results within a given range(km) like mentioned here:

Re: Restarting tomcat deletes all Solr indexes

2009-05-12 Thread Shalin Shekhar Mangar
You can fix the path of the index in your solrconfig.xml On Tue, May 12, 2009 at 4:48 PM, KK dioxide.softw...@gmail.com wrote: One more information I would like to add. The entry in solr stats page says this: readerDir : org.apache.lucene.store.FSDirectory@/home/kk/solr/data/index when I

Re: Newbie question

2009-05-12 Thread Shalin Shekhar Mangar
On Tue, May 12, 2009 at 9:48 PM, Wayne Pope waynemailingli...@gmail.comwrote: I have this request: http://localhost:8983/solr/select?start=0rows=20qt=dismaxq=copyhl=truehl.snippets=4hl.fragsize=50facet=truefacet.mincount=1facet.limit=8facet.field=typefq=company-id%3A1wt=javabinversion=2.2

Re: Replication master+slave

2009-05-12 Thread Shalin Shekhar Mangar
On Tue, May 12, 2009 at 10:42 PM, Bryan Talbot btal...@aeriagames.comwrote: For replication in 1.4, the wiki at http://wiki.apache.org/solr/SolrReplication says that a node can be both the master and a slave: A node can act as both master and slave. In that case both the master and slave

error when seting queryResultWindowSize to zero

2009-05-12 Thread Marc Sturlese
I have seen that if I set the value of queryResultWindowSize to 0 in solrconfig.xml solr will return an error of divided by zero. Checking the source I have seen it can be fixed in SolrIndexSearcher. At the end of the function getDocListC it's coded: if (maxDocRequested

RE: Selective Searches Based on User Identity

2009-05-12 Thread Terence Gannon
Paul -- thanks for the reply, I appreciate it. That's a very practical approach, and is worth taking a closer look at. Actually, taking your idea one step further, perhaps three fields; 1) ownerUid (uid of the document's owner) 2) grantedUid (uid of users who have been granted access), and 3)

Re: error when seting queryResultWindowSize to zero

2009-05-12 Thread Yonik Seeley
On Tue, May 12, 2009 at 3:03 PM, Marc Sturlese marc.sturl...@gmail.com wrote: I have seen that if I set the value of queryResultWindowSize  to 0 in solrconfig.xml solr will return an error of divided by zero. Seems like a configuration error since requesting that results be retrieved in 0 size

Re: Selective Searches Based on User Identity

2009-05-12 Thread Matt Weber
I also work with the FAST Enterprise Search engine and this is exactly how their Security Access Module works. They actually use a modified base-32 encoded value for indexing, but that is because they don't have the luxury of untokenized/un-processed String fields like Solr. Thanks, Matt

Re: Selective Searches Based on User Identity

2009-05-12 Thread Jay Hill
The only downside would be that you would have to update a document anytime a user was granted or denied access. You would have to query before the update to get the current values for grantedUID and deniedUID, remove/add values, and update the index. If you don't have a lot of changes in the

RE: Selective Searches Based on User Identity

2009-05-12 Thread Terence Gannon
Thanks for the tip. I went to their website (www.fastsearch.com), and got as far as the second line, top left 'A Microsoft Subsidiary'...at which point, hopes of it being another open source solution quickly faded. ;-) Seriously, though, it looks like an interesting product, but open source is a

Re: Selective Searches Based on User Identity

2009-05-12 Thread Matt Weber
Here is a good presentation on search security from the Infonortics Search Conference that was held a few weeks ago. http://www.infonortics.com/searchengines/sh09/slides/kehoe.pdf The approach you are using is called early-binding. As Jay mentioned, one of the downsides is updating the

Who is running 1.4 nightly in production?

2009-05-12 Thread Walter Underwood
We're planning our move to 1.4, and want to run one of our production servers with the new code. Just to feel better about it, is anyone else running 1.4 in production? I'm building 2009-05-11 right now. wuner

Re: Who is running 1.4 nightly in production?

2009-05-12 Thread Matthew Runo
We're using 1.4-dev 749558:749756M that we built on 2009-03-03 13:10:05 for our master/slave production environment using the Java Replication code. Thanks for your time! Matthew Runo Software Engineer, Zappos.com mr...@zappos.com - 702-943-7833 On May 12, 2009, at 2:02 PM, Walter

camel-casing and dismax troubles

2009-05-12 Thread Geoffrey Young
hi all :) I'm having trouble with camel-cased query strings and the dismax handler. a user query LeAnn Rimes isn't matching the indexed term Leann Rimes even though both are lower-cased in the end. furthermore, the analysis tool shows a match. the debug query looks like

Re: how to manually add data to indexes generated by nutch-1.0 using solr

2009-05-12 Thread alxsss
Tried to add a new record using curl http://localhost:8983/solr/update -H Content-Type: text/xml --data-binary 'add doc boost=2.5 field name=segment20090512170318/field field name=digest86937aaee8e748ac3007ed8b66477624/field field name=boost0.21189615/field field name=urltest.com/field

Re: Who is running 1.4 nightly in production?

2009-05-12 Thread Erik Hatcher
We run a not too distant trunk (1.4, probably a month or so ago) version of Solr on LucidFind at http://www.lucidimagination.com/search Erik On May 12, 2009, at 5:02 PM, Walter Underwood wrote: We're planning our move to 1.4, and want to run one of our production servers with the

Re: how to manually add data to indexes generated by nutch-1.0 using solr

2009-05-12 Thread Erik Hatcher
send a commit/ request afterwards, or you can add ?commit=true to the /update request with the adds. Erik On May 12, 2009, at 8:57 PM, alx...@aim.com wrote: Tried to add a new record using curl http://localhost:8983/solr/update -H Content-Type: text/xml -- data-binary 'add doc

RE: Selective Searches Based on User Identity

2009-05-12 Thread Terence Gannon
In reply to both Matt and Jay's comments, the particular situation I'm dealing with is one where rights will change relatively little once they are established.  Typically a document will be loaded and indexed, and a decision will be made on sharing that more-or-less immediately.  It might change

Re: Replication master+slave

2009-05-12 Thread Jian Han Guo
I was looking at the same problem, and had a discussion with Noble. You can use a hack to achieve what you want, see https://issues.apache.org/jira/browse/SOLR-1154 Thanks, Jianhan On Tue, May 12, 2009 at 5:13 PM, Bryan Talbot btal...@aeriagames.comwrote: So how are people managing

RE: Solr Loggin issue

2009-05-12 Thread Sagar Khetkade
I have only one log4j.properties file in classpath and even if i configure for the particular package where the solr exception would come then also the same issue. I had removed the logger for my application and using only for solr logging. ~Sagar Date: Tue, 12 May 2009 09:59:01