Re: Interesting stuff; Solr as a syslog store.

2010-02-13 Thread Kay Kay
Thanks Antonio for sharing this. I believe this could be one of the interesting case studies for Solr In Action, if you are interested in sharing a bit more - I am sure the authors would be more interested for upcoming revisions. -- K K. On 02/12/2010 06:02 PM, Antonio Lobato wrote: Hey

Re: Query elevation based on field

2010-02-13 Thread Grant Ingersoll
On Feb 10, 2010, at 8:46 PM, Jason Chaffee wrote: Is it possible to do query elevation based on field? Basically, I would like to search the same term on three different fields: q=field1:term OR field2:term OR field3:term and I would like to sort the results by fourth

Re: ExternalFileField

2010-02-13 Thread Grant Ingersoll
On Feb 11, 2010, at 5:34 AM, Julian Hille wrote: Hi, were trying to implement another sortby Algorythm which is calculate outside of our solr Server. Is there a limit for the lines in that outside file? Cause we sometimes have 1.5 million lines in some situations. Also is this a

Re: ExternalFileField

2010-02-13 Thread Koji Sekiguchi
Most of the other files have rows about 1000-3000 thousand. What does happen if im writing that file, and then solr tries to read it, is there somekind of timeout? Again, if I recall correctly, it loads up the file at startup or after commit (the later depends on the location of the

Highlighting Inconsistency

2010-02-13 Thread Jan
Hi gurus, I am having some issues with making the highlighting work properly. If I search for a word in a title field and request a highlighted summary from another long_description field, this works on some documents, but on some doesn't. Have you seen anything like this before? Example

Re: migrating from solr 1.3 to 1.4

2010-02-13 Thread Mat Brown
On Fri, Feb 12, 2010 at 19:46, Sachin Sebastian sac...@gossamer-threads.com wrote: Hi there,       I'm trying to migrate from solr 1.3 to solr 1.4 and I've few issues. Initially my localsolr was throwing NullPointer exception and I fixed it by changing type of lat and lng to 'tdouble'. But

Re: Solr 1.4: Full import FileNotFoundException

2010-02-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
can we confirm that the user does not have multiple DIH configured? any request for an import, while an import is going on, is rejected On Sat, Feb 13, 2010 at 11:40 AM, Chris Hostetter hossman_luc...@fucit.org wrote: : concurrent imports are not allowed in DIH, unless u setup multiple DIH

Re: Highlighting Inconsistency

2010-02-13 Thread Ahmet Arslan
I am having some issues with making the highlighting work properly. If I search for a word in a title field and request a highlighted summary from another long_description field, this works on some documents, but on some doesn't. Have you seen anything like this before? Default value of

Re: reloading sharedlib folder

2010-02-13 Thread Mark Miller
Ryan Kennedy wrote: Have you tried using the RELOAD command in the core admin? http://wiki.apache.org/solr/CoreAdmin#RELOAD I'm not sure if the sharedlib classloader is global or if it's local to each core, but if it's local to each core then a RELOAD may do the trick. Ryan You

Re: Highlighting Inconsistency

2010-02-13 Thread Jan
I am having some issues with making the highlighting work properly. If I search for a word in a title field and request a highlighted summary from another long_description field, this works on some documents, but on some doesn't. Have you seen anything like this before? Default

Re: Highlighting Inconsistency

2010-02-13 Thread Ahmet Arslan
The contents of the long_description field are actually pretty short - max. 2000 characters. But I've tried setting it to -1 as well, and still the same results. Then we should confirm that long_description really contains term terminator. What is numFound when you execute this query?

Re: How to reindex data without restarting server

2010-02-13 Thread Emad Mushtaq
How do I SWAP the old_core with the new_core. Is it to be done manually or does solr provide with a command for doing so. What if I don't make a new core, make changes to the existing core, reindex it and then utilize the RELOAD command. Would this be a bad approach ? On Sat, Feb 13, 2010 at

Re: Removing single-term results / reordering

2010-02-13 Thread Jan Høydahl / Cominvent
Hi, This is probably due to length normalization. Normally this is wanted, as you want to penalize partial match vs a more exact match. Try specifying omitNorms=true on your field. You should ask yourself what kind of relevancy or sorting you really need in your project. If you search short

Re: Highlighting Inconsistency

2010-02-13 Thread Jan
Then we should confirm that long_description really contains term terminator. What is numFound when you execute this query? http://localhost:8983/solr/select?q=long_description:terminatorfl=sku,title,long_descriptionhl=truehl.fl=long_descriptionfq=sku:10699058 The response is: result

Re: Highlighting Inconsistency

2010-02-13 Thread Ahmet Arslan
What is numFound when you execute this query? http://localhost:8983/solr/select?q=long_description:terminatorfl=sku,title,long_descriptionhl=truehl.fl=long_descriptionfq=sku:10699058 The response is: result name=response numFound=1 start=0 The odd thing is that if I do a search in

Re: Highlighting Inconsistency

2010-02-13 Thread Jan
Interesting there is a parameter (hl.requireFieldMatch) about this but default value is false. Interesting indeed! I have tried setting hl.requireFieldMatch manually to false before - but no luck. Are you using some default highlighting parameters defined in solrconfig.xml? You can

Re: Highlighting Inconsistency

2010-02-13 Thread Ahmet Arslan
The title and long_description are almost the same. The only difference is that I've taken out the solr.SnowballPorterFilterFactory filter, so that the titles are not stemmed. I actually as part of the test before writing to this list I tried creating the type that has only the standard

Re: Highlighting Inconsistency

2010-02-13 Thread Jan
I mean solr's Http caching. When testing from browser I usually disable it with in solrconfig.xml. I have enabled httpCaching never304=true, changed the title field to be the same text type, cleared the index, restarted and re-indexed and - it works! I then changed the title field

Re: cannot match on phrase queries

2010-02-13 Thread Erick Erickson
It's really hard to help unless you include the analysis and query schema for the field in question since so much of how things work is dependent upon those choices. Also include the query you fire at SOLR I suspect that omitTermFreqAndPositions is irrelevant Erick On Fri, Feb 12, 2010

Re: parsing strings into phrase queries

2010-02-13 Thread Erick Erickson
I don't see a good way to fix this without some heuristic you'd have to implement to munge your query. There's no good for SOLR to intuit that what you want is a partial match in this case. If you can create some rules like remove any single letters after numbers in the query that would be good

Facet search concept problem

2010-02-13 Thread Ranveer Kumar
Hi All, My concept still not clear about facet search. I am trying to search using facet query. I am indexing data from three table, following is the detail of table: table name: news news_id news_details table name : article article_id article_details table name: blog blog_id blog_details I

problem with edgengramtokenfilter and highlighter

2010-02-13 Thread Joe Calderon
i ran into a problem while using the edgengramtokenfilter, it seems to report incorrect offsets when generating tokens, more specifically all the tokens have offset 0 and term length as start and end, this leads to goofy highlighting behavior when creating edge grams for tokens beyond the first

Re: problem with edgengramtokenfilter and highlighter

2010-02-13 Thread Robert Muir
Joe, can you open a Lucene JIRA issue for this? I just glanced at the code and it looks like a bug to me. On Sun, Feb 14, 2010 at 12:07 AM, Joe Calderon calderon@gmail.comwrote: i ran into a problem while using the edgengramtokenfilter, it seems to report incorrect offsets when generating

Upgrading Tika in Solr

2010-02-13 Thread Liam O'Boyle
Afternoon, I've got a large collections of documents which I'm attempting to add to a Solr index using Tika via the ExtractingRequestHandler, but there are a large number that it has problems with (PDFs, PPTX and XLS documents mainly). I've tried them with the most recent stand alone version