Re: delta-import of rich documents like word and pdf files!

2011-11-12 Thread Erick Erickson
Can you give more details about what you're trying to do? It looks like you're using DataImportHandler? What defines a document needing to be re-indexed? How do you expect to be able to identify them??? Perhaps you can review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Sat, Nov

Re: Missing Results

2011-11-12 Thread Erick Erickson
I suspect that your index doesn't contain what you think it does, or your query isn't doing what you think. Here's what I'd do: 1> use Luke or perhaps TermsComponent to see what is actually in your designations field. Watch out for deleted documents (this is where an optimize might b

Re: Preventing DIH to stop on error

2011-11-12 Thread Erick Erickson
Interesting, it is marked as "fixed" in 1.5, but there never was a 1.5 released, wonder if it got lost somehow? Could you open a new bug and attach this patch? I'm afraid that having it attached to 1597 will be too confusing, 1.4 was long enough ago that this might as well be a new one Best E

Re: Delete by Query with limited number of rows

2011-11-12 Thread Yury Kats
On 11/12/2011 4:08 PM, mikr00 wrote: > Similar to a first in first out list. The problem is: It's easy to check the > limit, but how can I delete the oldest documents to go again below the > limit? Can I do it with a delete by query request? In that case, I would > probably have to limit the number

Re: writing to shards

2011-11-12 Thread Erick Erickson
You currently have to handle it programmatically. They are entirely separate Solr instances for all that they talk to each other, there's no way currently to just hand of the new document and have Solr "do the right thing", so you have to address the shard specifically... Some of the new Solr clou

Re: Solr 3.3 Sorting is not working for long fields

2011-11-12 Thread Erick Erickson
Well, 3.3 has been around for quite a while, I'd suspect that something this fundamental would have been found... Is your field multi-valued? And what kind of field is studyid? You really have to provide more details, input, output, etc to get reasonable help. It might help to review: http://wik

Sorting by aggregates of grouped documents' fields.

2011-11-12 Thread jonhinson
I'm performing a search on booking dates for properties that can have variable prices depending on the date. I'm doing a query that filters by a bounding box, a date range, and makes sure that the available flag is true. I then group by the property_id. Is there a way to sort (and perhaps do an fra

Delete by Query with limited number of rows

2011-11-12 Thread mikr00
I have the following problem and can't seem to find a solution: I'm building up a frequently updated solr index. In order to deal with limited ressources I would like to limit the total number of documents in the index. In other words: I would like to declare that no more than (for example) 1.000.

Re: TikaEntityProcesor Exception Handling

2011-11-12 Thread Mark Miller
I'd file a JIRA issue. On Nov 12, 2011, at 10:39 AM, David T. Webb wrote: > Same result on onError="continue" . > > Any help is appreciatedthank you. > > -- > Sincerely, > David Webb > > > > -Original Message- > From: David T. Webb [mailto:david.w...@brightmove.com] > Sent: Sat

RE: TikaEntityProcesor Exception Handling

2011-11-12 Thread David T. Webb
Same result on onError="continue" . Any help is appreciatedthank you. -- Sincerely, David Webb -Original Message- From: David T. Webb [mailto:david.w...@brightmove.com] Sent: Saturday, November 12, 2011 10:27 AM To: solr-user@lucene.apache.org Subject: RE: TikaEntityProcesor Excep

RE: TikaEntityProcesor Exception Handling

2011-11-12 Thread David T. Webb
I found the answer with the onError="skip" on the Entity, However, after adding that parameter to the data-config.xml, the index processing still stops when the TikaEntityProcessor throws an Exception. Nov 12, 2011 10:22:16 AM org.apache.solr.common.SolrException log SEVERE: Full Import failed:or

TikaEntityProcesor Exception Handling

2011-11-12 Thread David T. Webb
When indexing over 2MM documents with Solr and the TikaEntityProcessor, the indexing fails if Tika encounters an exception with one of the documents. How can I tell Solr to keep going and just ignore the failed documents from the Tika Processor? Thanks. -- Sincerely, David Webb

Re: Facet counts incorrect on filtered field when using groups

2011-11-12 Thread Rubinho
Hi Erick, Well, these records are trips, each trip has multiple variants (departuredates, number of days). Each variant has his own record. Grouping is used to show only 1 record/trip. When you search in this minidatabase (with 3 records) without a fq, then you receive 1 facetcount (grouped by t

Re: Group.ngroup parameter memory consumption

2011-11-12 Thread Martijn v Groningen
BTW this applies for 4.0-dev. In 3x the String instance from a StringIndex is directly used, this is then put into a list. So there is no extra object instance created per group matching the query. Martijn On 12 November 2011 08:49, Rafał Kuć wrote: > Hello! > > Thanks, that's what I was looking