Re: Need help with facets

2010-08-11 Thread Ahmet Arslan
--- On Wed, 8/11/10, Moazzam Khan moazz...@gmail.com wrote: From: Moazzam Khan moazz...@gmail.com Subject: Re: Need help with facets To: solr-user@lucene.apache.org Date: Wednesday, August 11, 2010, 1:32 AM Thanks Ahmet, that worked! Here's another issue I have: Like I said before,

Problem instantiating CommonsHttpSolrServer using solrj

2010-08-11 Thread bijeet singh
Hi all, I'm trying to use solrj for indexing in solr, but when I try to instantiate the server using: SolrServer server = new CommonsHttpSolrServer("http://localhost:8080/solr"); I get the following runtime error: Exception in thread "main" java.lang.NoClassDefFoundError:
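
A NoClassDefFoundError at runtime generally means a class that was available at compile time is not on the runtime classpath. The error message above is cut off before the class name, so the candidates listed below are only assumptions about jars a SolrJ 1.4-era client commonly depends on. A minimal sketch, using only the JDK, that reports which of them are visible:

```java
import java.util.Arrays;
import java.util.List;

/** Reports whether classes a SolrJ client may need are visible on the classpath. */
final class ClasspathCheck {

    /** Returns true if the named class can be located (without initializing it). */
    static boolean isPresent(String className) {
        try {
            Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // NoClassDefFoundError is a LinkageError; treat both cases as "missing".
            return false;
        }
    }

    public static void main(String[] args) {
        // Assumed candidates: the thread does not say which class was missing.
        List<String> candidates = Arrays.asList(
            "org.apache.solr.client.solrj.impl.CommonsHttpSolrServer",
            "org.apache.commons.httpclient.HttpClient",
            "org.apache.commons.logging.LogFactory",
            "org.slf4j.Logger");
        for (String name : candidates) {
            System.out.println((isPresent(name) ? "found   " : "MISSING ") + name);
        }
    }
}
```

Running this with the same classpath as the failing program should show which jar, if any, is absent.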

Delta-import with solrj client

2010-08-11 Thread Hando420
Greetings. I have a solrj client for fetching data from a database. I am using delta-import for fetching the data. If a column is changed in the database, delta-import (using a timestamp) indexes the latest column value, but there are duplicate values in the index similar to the column but the data is

Re: solr query result not read the latest xml file

2010-08-11 Thread Jan Høydahl / Cominvent
Hi, Yes, this is normal behavior. This is because Solr is *document* based, it does not know about *files*. What happens here is that your source database (or whatever) has had deletions within this category in addition to updates, and you need to relay those to Solr. The best way to

timestamp field

2010-08-11 Thread Frederico Azeiteiro
Hi, I have on my schema <field name="timestamp" type="date" indexed="true" stored="true" default="NOW" /> This field is returned as <date name="timestamp">2010-08-11T10:11:03.354Z</date> For an article added at 2010-08-11T11:11:03.354Z! And the server has the time of 2010-08-11T11:11:03.354Z...

Re: timestamp field

2010-08-11 Thread Jan Høydahl / Cominvent
Hi, Which time zone are you located in? Do you have DST? Solr uses UTC internally for dates, which means that NOW will be the time in London right now :) Does that appear to be right 4 u? Also see this thread: http://search-lucene.com/m/hqBed2jhu2e2/ -- Jan Høydahl, search solution architect

RE: timestamp field

2010-08-11 Thread Frederico Azeiteiro
Hi Jan, Dah, I didn't know that :( I always thought it used the server time. Anyway, just out of curiosity, the hour is UTC but NOT the time in London right now. London is UTC+1 (same as here in Portugal) :). So, London Solr users should have the same problem. Well, I must be careful when

Re: Delta-import with solrj client

2010-08-11 Thread kenf_nc
Short answer is no, there isn't a way. Solr doesn't have the concept of 'Update' to an indexed document. You need to add the full document (all 'columns') each time any one field changes. If doing that in your DataImportHandler logic is difficult you may need to write a separate Update Service

Re: timestamp field

2010-08-11 Thread Mark Allan
For what it's worth, London and the rest of the UK is currently observing British Summer Time (called Daylight Savings Time in other parts of the world) which is why we appear to be UTC+1 between the last Sunday in March and the last Sunday in October. Mark On 11 Aug 2010, at 12:36 pm,
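
To make the one-hour difference in this thread concrete, here is a small sketch using java.time (Java 8 or later, which postdates this thread). It converts the UTC value Solr returned into local time for the two zones mentioned. The class and method names are illustrative only.

```java
import java.time.Instant;
import java.time.ZoneId;
import java.time.ZonedDateTime;

/** Converts a UTC timestamp, as Solr returns it, into a local wall-clock time. */
final class UtcToLocal {

    /** Parses an ISO-8601 UTC instant and views it in the given time zone. */
    static ZonedDateTime inZone(String utcTimestamp, String zoneId) {
        return Instant.parse(utcTimestamp).atZone(ZoneId.of(zoneId));
    }

    public static void main(String[] args) {
        String fromSolr = "2010-08-11T10:11:03.354Z"; // value quoted in the thread
        // Both zones observe summer time in August, so both are UTC+1.
        System.out.println(inZone(fromSolr, "Europe/Lisbon"));
        System.out.println(inZone(fromSolr, "Europe/London"));
        // In winter London is back on UTC, so the offset disappears.
        System.out.println(inZone("2010-01-11T10:11:03.354Z", "Europe/London"));
    }
}
```

The stored value is the same instant either way; only the presentation changes with the zone.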

Re: Delta-import with solrj client

2010-08-11 Thread Jan Høydahl / Cominvent
Hi, Make sure you use a proper ID field, which does *not* change even if the content in the database changes. In this way, when your delta-import fetches changed rows to index, they will update the existing rows in your index. -- Jan Høydahl, search solution architect Cominvent AS -
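
The two replies in this thread (whole-document replacement, and a stable ID) can be illustrated with a toy model. This is not Solr code: it is a plain map keyed by ID, written as a sketch under the assumption that an add with an existing unique key replaces the stored document as a whole.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Toy stand-in for an index keyed by a unique ID field (not actual Solr code). */
final class UniqueKeyDemo {
    private final Map<String, Map<String, String>> docsById = new LinkedHashMap<>();

    /** Adds a document; an existing document with the same ID is replaced entirely. */
    void add(String id, Map<String, String> fields) {
        docsById.put(id, new LinkedHashMap<>(fields));
    }

    int size() {
        return docsById.size();
    }

    /** Returns the stored field value, or null if the document or field is absent. */
    String field(String id, String name) {
        Map<String, String> doc = docsById.get(id);
        return doc == null ? null : doc.get(name);
    }

    public static void main(String[] args) {
        UniqueKeyDemo index = new UniqueKeyDemo();
        Map<String, String> v1 = new LinkedHashMap<>();
        v1.put("title", "old title");
        v1.put("body", "some text");
        index.add("row-1", v1);

        // Same ID, only one field supplied: the old document is replaced, not merged.
        Map<String, String> v2 = new LinkedHashMap<>();
        v2.put("title", "new title");
        index.add("row-1", v2);
        System.out.println(index.size() + " doc(s), body=" + index.field("row-1", "body"));

        // If the ID changes along with the content, the old entry stays as a duplicate.
        index.add("row-1-changed", v2);
        System.out.println(index.size() + " doc(s)");
    }
}
```

In this model the duplicates described in the original question appear only when the key itself changes between imports.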

Re: Solr 1.4 - stats page slow

2010-08-11 Thread Yonik Seeley
FYI, I opened https://issues.apache.org/jira/browse/SOLR-2036 for this. -Yonik http://www.lucidimagination.com On Tue, Aug 10, 2010 at 8:35 PM, entdeveloper cameron.develo...@gmail.com wrote: Apologies if this was resolved, but we just deployed Solr 1.4.1 and the stats page takes over a

DataImportHandler in Solr 1.4.1: exception handling in FileListEntityProcessor

2010-08-11 Thread Sascha Szott
Hi folks, why does FileListEntityProcessor ignore onError=continue and abort indexing if a directory or a file does not exist? I'm using both XPathEntityProcessor and FileListEntityProcessor with onError set to continue. In case a directory or file is not present an Exception is thrown and

Re: DataImportHandler in Solr 1.4.1: exception handling in FileListEntityProcessor

2010-08-11 Thread Sascha Szott
Sorry, there was a mistake in the stack trace. The correct one is: SEVERE: Full Import failed org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir' value: /home/doe/foo is not a directory Processing Document # 3 at

Re: Solr Doc Lucene Doc !?

2010-08-11 Thread stockii
I have a question about the Solr index mechanism with DIH ... I am trying to understand how Solr indexes a doc, and in which code elements Solr uses Lucene. This is my understanding so far: DIH uses the SolrWriter to add a doc. To create a SolrInputDocument, SolrWriter uses the addUpdateCommand. This Command

RE: PDF file

2010-08-11 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Thanks so much for your help! I got a "Remote Streaming is disabled" error. Would you please tell me if I missed something? Thanks, -Original Message- From: Jayendra Patil [mailto:jayendra.patil@gmail.com] Sent: Tuesday, August 10, 2010 8:51 PM To: solr-user@lucene.apache.org Subject:

Re: Solr Doc Lucene Doc !?

2010-08-11 Thread stockii
oh, i see that i mixed DIH classes with other Solr classes ^^

RE: PDF file

2010-08-11 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Thanks, I knew how to enable Streaming. But I got another error, ERROR:unknown field 'metadata_trapped'. Does anyone know how to match up with SolrCell metadata? I found the following in schema.xml. I don't know how to make changes for PDF. <!-- Common metadata fields, named specifically to

SolrException log

2010-08-11 Thread Bastian Spitzer
Hi, we are using solr 1.4.1 in a master-slave setup with replication, requests are loadbalanced to both instances. This is working fine, but the slave sometimes behaves strangely and logs a SolrException (trace below). We have been using 1.4.1 for weeks now, and this has happened only a few times so

RE: Improve Query Time For Large Index

2010-08-11 Thread Burton-West, Tom
Hi Peter, Can you give a few more examples of slow queries? Are they phrase queries? Boolean queries? prefix or wildcard queries? If one word queries are your slow queries, then CommonGrams won't help. CommonGrams will only help with phrase queries. How are you using termvectors? That may

Re: how to support implicit trailing wildcards

2010-08-11 Thread yandong yao
Hi Jan, It seems q=mount OR mount* sorts the documents that include mount differently from q=mount. I changed it to q=mount^100 OR (mount?* -mount)^1.0, and it tests well. Thanks very much! 2010/8/10 Jan Høydahl / Cominvent jan@cominvent.com Hi, You don't need to duplicate the content

Re: Improve Query Time For Large Index

2010-08-11 Thread Robert Muir
On Wed, Aug 11, 2010 at 11:47 AM, Burton-West, Tom tburt...@umich.edu wrote: Hi Peter, Can you give a few more examples of slow queries? Are they phrase queries? Boolean queries? prefix or wildcard queries? If one word queries are your slow queries, then CommonGrams won't help. CommonGrams

Re: Need help with facets

2010-08-11 Thread Moazzam Khan
That's awesome. Thanks Ahmet! On Wed, Aug 11, 2010 at 1:50 AM, Ahmet Arslan iori...@yahoo.com wrote: --- On Wed, 8/11/10, Moazzam Khan moazz...@gmail.com wrote: From: Moazzam Khan moazz...@gmail.com Subject: Re: Need help with facets To: solr-user@lucene.apache.org Date: Wednesday,

Analysing SOLR logfiles

2010-08-11 Thread Jay Flattery
Hi there, Just wondering what tools people use to analyse SOLR log files. We're looking to do things like extracting common queries, calculating average QTime and hits, returning particularly slow/expensive queries, etc. Would prefer not to code something (completely) from scratch.

Data Import Handler Query

2010-08-11 Thread Manali Joshi
Hi, I have installed solr 1.4 and am trying to use the Data Import Handler to import data from a database. I have 2 tables which share a 1 to many relation (1 Story to Many Images). I want my index to contain attributes regarding “Story” and also all “Images” that it has. Based on the DIH

RE: Filter Performance in Solr 1.3

2010-08-11 Thread Bargar, Matthew B
The search with the filter takes longer than a search for the same term but no filter after repeated searches, after the cache should have come into play. To be more specific, this happens on filters that exclude very few results from the overall set. For instance, type:video returns few results

Re: how to support implicit trailing wildcards

2010-08-11 Thread Jan Høydahl / Cominvent
I guess q=mount OR (mount*)^0.01 would work equally as well, i.e. diminishing the effect of wildcard matches. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Training in Europe - www.solrtraining.com On 11. aug. 2010, at 17.53, yandong yao wrote: Hi Jan, Seems

Re: Data Import Handler Query

2010-08-11 Thread kenf_nc
It may not be the data config. In schema.xml, are the fields that the image data goes into set to multiValued=true? Although, I would think the last image would be stored, not the first, but haven't really tested this.

bug or feature???

2010-08-11 Thread Jean-Sebastien Vachon
Hi, Can someone tell me why the two following queries do not return the same results? Is that a bug or a feature? http://localhost:8983/jobs/select?fq=title:(NOT janitor)&fq=description:(NOT janitor)&q=*:* http://localhost:8983/jobs/select?q=title:(NOT janitor) AND description:(NOT janitor)

General questions about distributed solr shards

2010-08-11 Thread JohnRodey
1) Is there any information on preferred maximum sizes for a single solr index. I've read some people say 10 million, some say 80 million, etc... Is there any official recommendation or has anyone experimented with large datasets into the tens of billions? 2) Is there any down side to running

Indexing and ExtractingRequestHandler

2010-08-11 Thread Harry Hochheiser
I'm trying to use Solr to index the contents of an Excel file, using the ExtractingRequestHandler (CSV handler won't work for me - I need to consider the whole spreadsheet as one document), and I'm running into some trouble. Is there any way to see what's going on during the indexing process? I'm

Re: Analysing SOLR logfiles

2010-08-11 Thread Jan Høydahl / Cominvent
Have a look at www.splunk.com -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Training in Europe - www.solrtraining.com On 11. aug. 2010, at 19.34, Jay Flattery wrote: Hi there, Just wondering what tools people use to analyse SOLR log files. We're looking to
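
For a quick look before adopting a full tool, the statistics asked about in this thread can be roughed out in a few lines. The sketch below assumes request log lines of the form "... params={...} hits=N status=N QTime=N"; that layout is an assumption here and should be checked against your own log output. The class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Rough QTime statistics from request log lines (log layout is an assumption). */
final class SolrLogStats {
    // Captures the params block, the hit count and the query time in milliseconds.
    private static final Pattern REQUEST =
        Pattern.compile("params=\\{(.*)\\} hits=(\\d+) status=\\d+ QTime=(\\d+)");

    /** Mean QTime over all lines that look like request log entries; 0.0 if none. */
    static double averageQTime(List<String> lines) {
        long total = 0;
        int count = 0;
        for (String line : lines) {
            Matcher m = REQUEST.matcher(line);
            if (m.find()) {
                total += Long.parseLong(m.group(3));
                count++;
            }
        }
        return count == 0 ? 0.0 : (double) total / count;
    }

    /** Returns the params of every request whose QTime is at or above the threshold. */
    static List<String> slowQueries(List<String> lines, long thresholdMs) {
        List<String> slow = new ArrayList<>();
        for (String line : lines) {
            Matcher m = REQUEST.matcher(line);
            if (m.find() && Long.parseLong(m.group(3)) >= thresholdMs) {
                slow.add(m.group(1));
            }
        }
        return slow;
    }

    public static void main(String[] args) {
        List<String> sample = new ArrayList<>();
        sample.add("INFO: [] webapp=/solr path=/select params={q=mount} hits=12 status=0 QTime=4");
        sample.add("INFO: [] webapp=/solr path=/select params={q=title:foo&rows=10} hits=3 status=0 QTime=250");
        System.out.println("average QTime: " + averageQTime(sample));
        System.out.println("slow: " + slowQueries(sample, 100));
    }
}
```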

Re: bug or feature???

2010-08-11 Thread Jan Høydahl / Cominvent
Your syntax looks a bit funny. Which version of Solr are you using? Pure negative queries are not supported, try q=(*:* -title:janitor) instead. Also, for debugging what's going on, please add debugQuery=true and share the parsed query for both cases with us. -- Jan Høydahl, search solution
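
As a sketch of the suggestion above, the helper below builds a select URL in which each filter uses the "*:* -field:term" form and every parameter is URL-encoded and joined with "&". The base URL and field names come from the question; the class and method names are hypothetical, and whether the rewritten filters match the original intent is best confirmed with debugQuery=true.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

/** Builds a select URL with an encoded main query and zero or more filter queries. */
final class SelectUrl {

    /** URL-encodes a single parameter value as UTF-8. */
    static String encode(String value) {
        try {
            return URLEncoder.encode(value, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is required to be supported by every JVM.
            throw new IllegalStateException(e);
        }
    }

    /** Appends q and each fq as separate, properly delimited parameters. */
    static String build(String base, String q, String... filters) {
        StringBuilder url = new StringBuilder(base).append("?q=").append(encode(q));
        for (String fq : filters) {
            url.append("&fq=").append(encode(fq));
        }
        return url.toString();
    }

    public static void main(String[] args) {
        System.out.println(build("http://localhost:8983/jobs/select", "*:*",
            "*:* -title:janitor", "*:* -description:janitor"));
    }
}
```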

Re: Data Import Handler Query

2010-08-11 Thread Manali Joshi
I tried making the schema fields that get the image data to multiValued=true. But it still gets only the first image data. It doesn't have information about all the images. On Wed, Aug 11, 2010 at 1:15 PM, kenf_nc ken.fos...@realestate.com wrote: It may not be the data config. Do you have

Re: Indexing and ExtractingRequestHandler

2010-08-11 Thread Jan Høydahl / Cominvent
Hi, You can try the Tika command line to parse your Excel file, then you will see the exact textual output from it, which will be indexed into Solr, and thus inspect whether something is missing. Are you sure you use a version of Luke which supports your version of Lucene? -- Jan Høydahl, search

Re: DIH transformer script size limitations with Jetty?

2010-08-11 Thread harrysmith
To follow up on my own question, it appears this is only an issue when using the DataImport console debugging tools. It looks like when submitting the debugging request, the data-config.xml is sent via a GET request, which would fail. However, using the exact same data-config.xml via a

DIH - Insert another record After first load

2010-08-11 Thread Girish
Hi, I did a load of the data with DIH, and the data is now loaded. Next I want to load records dynamically as and when I receive them. Use cases: 1. I did a load of 7MM records and now everything is working fine. 2. A new record is received, now I want to add this new record into the indexed

How to OR facet queries

2010-08-11 Thread Frank A
Hi, I have 3 facet fields (A,B,C) the values of each facet field will be shown as check boxes to users: Field A [x] Val1a [x] Val2a [] Val3a Field B [x] Val1b [] Val2b [] Val3b Within a field if the user selects two items I want the queries to be an OR query. Currently I'm generating

Re: How to OR facet queries

2010-08-11 Thread Geek Gamer
On Thu, Aug 12, 2010 at 7:12 AM, Frank A fsa...@gmail.com wrote: Hi, I have 3 facet fields (A,B,C) the values of each facet field will be shown as check boxes to users: Field A [x] Val1a [x] Val2a [] Val3a Field B [x] Val1b [] Val2b [] Val3b Within a field if the user selects
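
The reply in this thread is cut off, so the following is only one possible approach, not necessarily the one proposed: build one filter query per field, OR-ing the selected values inside it, and send each as a separate fq so that the fields are combined with AND. The class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Builds one filter query per facet field, OR-ing the selected values within a field. */
final class FacetFilters {

    /** Returns e.g. A:("Val1a" OR "Val2a"); values are quoted and escaped. */
    static String orFilter(String field, List<String> values) {
        StringBuilder sb = new StringBuilder(field).append(":(");
        for (int i = 0; i < values.size(); i++) {
            if (i > 0) {
                sb.append(" OR ");
            }
            String escaped = values.get(i).replace("\\", "\\\\").replace("\"", "\\\"");
            sb.append('"').append(escaped).append('"');
        }
        return sb.append(')').toString();
    }

    /** One filter string per field with at least one selected value. */
    static List<String> build(Map<String, List<String>> selections) {
        List<String> filters = new ArrayList<>();
        for (Map.Entry<String, List<String>> e : selections.entrySet()) {
            if (!e.getValue().isEmpty()) {
                filters.add(orFilter(e.getKey(), e.getValue()));
            }
        }
        return filters;
    }

    public static void main(String[] args) {
        // Mirrors the check boxes in the question: two values in A, one in B, none in C.
        Map<String, List<String>> selected = new LinkedHashMap<>();
        selected.put("A", Arrays.asList("Val1a", "Val2a"));
        selected.put("B", Arrays.asList("Val1b"));
        selected.put("C", new ArrayList<String>());
        for (String fq : build(selected)) {
            System.out.println("fq=" + fq);
        }
    }
}
```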

Re: DIH transformer script size limitations with Jetty?

2010-08-11 Thread Girish Pandit
Have you tried changing the -Xmx value to bump to -Xmx1300m? I had some problem with DIH loading the data and when I bumped the memory everything worked fine! harrysmith wrote: To follow up on my own question, it appears this is only an issue when using the DataImport console debugging

Re: Indexing and ExtractingRequestHandler

2010-08-11 Thread Harry Hochheiser
Thanks. I've done Tika command line to parse the Excel file, and I see contents in it that don't appear to be indexed. I've tried the path of using Tika to parse the Excel and then using extracting request handler to index the resulting text, and that doesn't work either. As far as Luke goes,

Re: Schema Definition Question

2010-08-11 Thread Lance Norskog
Can you do a DB join on OurID? That makes the association in the database, before it gets to the DataImportHandler. On Sun, Aug 8, 2010 at 6:17 PM, Frank A fsa...@gmail.com wrote: Hi, I have a db handler with the following definition: <entity name="place" query="select

In multicore env, can I make it access core0 by default

2010-08-11 Thread Chengyang
That is, I want accessing http://localhost/solr/select?q=*:* to be equivalent to accessing http://localhost/solr/core0/select?q=*:*.

Re: Schema Definition Question

2010-08-11 Thread harrysmith
I think I know where you're headed, I was struggling with the same issue. In my case, using results from Solr I link to a detailed profile using an ID, but I am displaying the String value. I was looking for something like: <doc> <arr name="ITEM"> <str name="ITEM_ID">12345</str> <sub_doc> <str