Strange behavior
Hi all Please take a look at this strange behavior (connected with stemming I suppose): type: fieldtype name=customTextField class=solr.TextField indexed=true stored=false analyzer type=query tokenizer class=solr.StandardTokenizerFactory/ filter class=solr.StopFilterFactory ignoreCase=true words= stopwords.txt/ filter class=solr.EnglishPorterFilterFactory protected= protwords.txt/ filter class=solr.LowerCaseFilterFactory/ /analyzer analyzer type=index tokenizer class=solr.StandardTokenizerFactory/ filter class=solr.StopFilterFactory ignoreCase=true words= stopwords.txt/ filter class=solr.EnglishPorterFilterFactory protected= protwords.txt/ filter class=solr.LowerCaseFilterFactory/ /analyzer /fieldtype field: field name=name type=customTextField indexed=true stored=false/ I'm adding a document: adddocfield name=id99/fieldfield name=nameApple/field/doc/add commit/ Queriyng name:apple - 0 results. Searching name:Apple - 1 result. But name:appl* - 1 result Adding next document: adddocfield name=id8/fieldfield name=nameSomenamele/field/doc/add commit/ Searching for name:somenamele - 1 result, for name:Somenamele - 1 result What is the problem with Apple ? Maybe StandardTokenizer understands it as trademark :) ? Thank you in advence -- Best regards, Traut
Re: Strange behavior
Thank you, it works. Stemming filter works only with lowercased words? On Feb 12, 2008 4:29 PM, Yonik Seeley [EMAIL PROTECTED] wrote: Try putting the stemmer after the lowercase filter. -Yonik On Feb 12, 2008 9:15 AM, Traut [EMAIL PROTECTED] wrote: Hi all Please take a look at this strange behavior (connected with stemming I suppose): type: fieldtype name=customTextField class=solr.TextField indexed=true stored=false analyzer type=query tokenizer class=solr.StandardTokenizerFactory/ filter class=solr.StopFilterFactory ignoreCase=true words= stopwords.txt/ filter class=solr.EnglishPorterFilterFactory protected= protwords.txt/ filter class=solr.LowerCaseFilterFactory/ /analyzer analyzer type=index tokenizer class=solr.StandardTokenizerFactory/ filter class=solr.StopFilterFactory ignoreCase=true words= stopwords.txt/ filter class=solr.EnglishPorterFilterFactory protected= protwords.txt/ filter class=solr.LowerCaseFilterFactory/ /analyzer /fieldtype field: field name=name type=customTextField indexed=true stored=false/ I'm adding a document: adddocfield name=id99/fieldfield name=nameApple/field/doc/add commit/ Queriyng name:apple - 0 results. Searching name:Apple - 1 result. But name:appl* - 1 result Adding next document: adddocfield name=id8/fieldfield name=nameSomenamele/field/doc/add commit/ Searching for name:somenamele - 1 result, for name:Somenamele - 1 result What is the problem with Apple ? Maybe StandardTokenizer understands it as trademark :) ? Thank you in advence -- Best regards, Traut -- Best regards, Traut
Re: i think it is time to release new solr version
+ 1 Looking forward to get new release version :) On Jan 28, 2008 6:01 AM, j. L [EMAIL PROTECTED] wrote: because lucene 2.3.0 today released.. -- regards j.L -- Best regards, Traut
SolrJ commit problem
Hi I've got a problem with solrj from nightly build (from 2007-11-12). I have this code: solrClient = new CommonsHttpSolrServer(new URL(indexServerUrl)); and after add operation firing solrClient.commit(true, true); But commit operation is not processing in Solr as I can see in log files (but I can see in debug mode that status 200 is returning after executing getHttpConnection().executeMethod(method); in SolrJ client class file) Command from console actually do the trick [EMAIL PROTECTED] ~]$ curl http://traut-base:/-solr-network/update -H Content-Type: text/xml --data-binary 'commit/' I must say that I'm trying to use SolrJ client from nightly build with Solr server release 1.2. Most likely it is actually the root of the problem so, can I use Solr release 1.2 with nightly-build SolrJ client? Are there any problems? What can you cay about my commit problem? Thank you in advance -- Best regards, Traut
leading wildcards
Hi I found the thread about enabling leading wildcards in Solr as additional option in config file. I've got nightly Solr build and I can't find any options connected with leading wildcards in config files. How I can enable leading wildcard queries in Solr? Thank you -- Best regards, Traut
Re: solr workflow ?
rtfm :) http://lucene.apache.org/solr/tutorial.html On Nov 12, 2007 4:33 PM, Dwarak R [EMAIL PROTECTED] wrote: Hi Guys How do we add word documents / pdf / text / etc documents in solr ?. How do the content of the files are stored or indexed ?. Are these documents stored as XML in the SOLR filesystem ? Regards Dwarak R This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender[EMAIL PROTECTED] immediately and delete the original. Any other use of the email by you is prohibited. -- Best regards, Traut
Re: leading wildcards
Seems like there is no way to enable leading wildcard queries except code editing and files repacking. :( On 11/12/07, Bill Au [EMAIL PROTECTED] wrote: The related bug is still open: http://issues.apache.org/jira/browse/SOLR-218 Bill On Nov 12, 2007 10:25 AM, Traut [EMAIL PROTECTED] wrote: Hi I found the thread about enabling leading wildcards in Solr as additional option in config file. I've got nightly Solr build and I can't find any options connected with leading wildcards in config files. How I can enable leading wildcard queries in Solr? Thank you -- Best regards, Traut -- Best regards, Traut
query syntax
Hi I have in index document with field name and its value is somename123 Why I can't find anything with query name:somename123* but there are results on query name:somename123* As far as I understand first query is just fine. why Solr didn't understand it? I'm using StandardRequestHandler thank you -- Best regards, Traut
Re: query syntax
So you think my problem is caused by different index/query data analysis? i'll check it. thank you On Nov 6, 2007 6:02 PM, Yonik Seeley [EMAIL PROTECTED] wrote: On 11/6/07, Traut [EMAIL PROTECTED] wrote: I have in index document with field name and its value is somename123 Why I can't find anything with query name:somename123* This is a prefix query. No analysis is done on the prefix, so it may not match analysis that was done when the document was indexed. For example, if you use WordDelimiterFilter, this may be indexed as somename 123 but there are results on query name:somename123* This is not a prefix query. The * will most likely be removed by the analyzer, leaving you effectively with a query of name:somename123 -Yonik -- Best regards, Traut
Re: can solr do it?
you can implement oscache as additional layer in Solr :) On 9/25/07, climbingrose [EMAIL PROTECTED] wrote: I don't think you can with the current Solr because each instance runs in a separate web app. On 9/25/07, James liu [EMAIL PROTECTED] wrote: if use multi solr with one index, it will cache individually. so i think can it share their cache.(they have same config) -- regards jl -- Regards, Cuong Hoang -- Best regards, Traut
Re: Search with in search results
Create a second query with IDs from first query? (id:XX id:YY id:ZZ) ? On 4/26/07, Suresh Kannan [EMAIL PROTECTED] wrote: How to extend a query type? I need to implement search with in search results without faceted. Suresh -- Best regards, Traut
[question] speed
Hi all I'm working on some presentation for my co-workers about Lucene/Solr index/search process. The question is why lucene (especially Solr) is such a fast engine? Has it something to do with index file formats (http://lucene.apache.org/java/docs/fileformats.html)? or is it all about using reverse indexing strategy? I'm have been digging into Lucene/Solr for nearly three month but only as an advanced user so I am not familiar with the details of engine implementation thank you Traut
Re: Solr logo poll
B
Re: failing post-optimize command execution
What about access rights on file snapshooter and on directories in path /home/solr/solr/bin ? Maybe this is the root of the problem? On 3/28/07, galo [EMAIL PROTECTED] wrote: Hi, I've configured my solrconfig.xml to execute a snapshoot after an optimize is made but I keep getting the following exception in the tomcat logs: SEVERE: java.io.IOException: Cannot run program snapshooter (in directory /home/solr/solr/bin): java.io.IOException: error=2, No such file or directory I'm certain the path and filename is correct.. does anybody have problems with this? Cheers, galo -- Best regards, Traut
Re: Reindex only records that changed
additional field in your DB as flag? 1 - dirty, 0 - clean. Debra wrote: Hi all, This is not a direct solr issue but I need it for indexing. Is there a way to check if a database record changed since the last index (with out using a specail flag field that has to be set any-where the record is updated). I would like to re-index only records that changed. TIA Debra