Re: SolrServer instances

2011-08-26 Thread Jonty Rhods
do I also required to close the connection from solr server (CommonHttpSolrServer). regards On Fri, Aug 26, 2011 at 9:45 AM, Jonty Rhods jonty.rh...@gmail.com wrote: Deal all please help I am stuck here as I have not much experience.. thanks On Thu, Aug 25, 2011 at 6:51 PM, Jonty Rhods

solr indexing exception

2011-08-26 Thread abhijit bashetti
Hi, I am using DIH for indexing 50K documents . I am using 64-bit machine with 4GB RAM I got the following exception: org.apache.solr.handler.dataimport.DataImportHandlerException: java.lang.OutOfMemoryError: Java heap space at

how to differentiate multiple datasources when building solr query....

2011-08-26 Thread vighnesh
hi all I have a two data sources in data-config file and i need data from first datasource , second datasource and from both .how can acheive this in solr query. example like: first datasource:

Re: how to differentiate multiple datasources when building solr query....

2011-08-26 Thread Lance Norskog
Did you mean datasource-1 and datasource-2 ? On Fri, Aug 26, 2011 at 2:41 AM, vighnesh svighnesh...@gmail.com wrote: hi all I have a two data sources in data-config file and i need data from first datasource , second datasource and from both .how can acheive this in solr query. example

Re: how to differentiate multiple datasources when building solr query....

2011-08-26 Thread vighnesh
yes those are two data-sources name . how can i get the data from only datasource-1 or from data source-2 or from both -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-differentiate-multiple-datasources-when-building-solr-query-tp3286309p3286325.html Sent from the Solr

Re: solr indexing exception

2011-08-26 Thread Gora Mohanty
On Fri, Aug 26, 2011 at 1:47 PM, abhijit bashetti abhijitbashe...@gmail.com wrote: Hi, I am using DIH for indexing 50K documents . I am using 64-bit machine with 4GB RAM How much memory is allocated to Solr? What is the approximate size of the data being indexed into Solr. Regards, Gora

Re: missing field in schema browser on solr admin

2011-08-26 Thread Erik Hatcher
Is the field stored? Do you see it on documents when you do a q=*:* search? How is that field defined and populated? (exact config/code needed here) Erik On Aug 25, 2011, at 23:07 , deniz wrote: hi all... i have added a new field to index... but now when i check solr admin, i see

Re: SolrServer instances

2011-08-26 Thread François Schiettecatte
Sounds to me that you are looking for HTTP Persistent Connections (connection keep-alive as opposed to close), and a singleton object. This would be outside SOLR per se. A few caveats though, I am not sure if tomcat supports keep-alive, and I am not sure how SOLR deals with multiple requests

Re: Solr Implementations

2011-08-26 Thread Erick Erickson
See below On Thu, Aug 25, 2011 at 4:22 PM, zarni aung zau...@gmail.com wrote: First, I would like to apologize if this is a repeat question but can't seem to get the right answer anywhere.   - What happens to pending documents when the server dies abruptly?  I   understand that when the

Re: how to differentiate multiple datasources when building solr query....

2011-08-26 Thread Erik Hatcher
Vighnesh - What you're looking for is DataImportHandler's TemplateTransformer. Docs here: http://wiki.apache.org/solr/DataImportHandler#TemplateTransformer Basically just enable the TemplateTransformer in each of your DIH configs then set a literal field value like this differently for each

Re: Paging over mutlivalued field results?

2011-08-26 Thread Erick Erickson
OK, I think I have it. It's a problem, indeed. And no, there's no way I know of to make a doc fetch only bring back some range of values in a multivalued field. So you're stuck with either getting the whole book back and peeling out the pages (how do you know which sentences are on which page

Re: where should i keep the class files to perform scheduling?

2011-08-26 Thread Erick Erickson
Actually, the easiest thing to do would be to make a cron job on *nix or use task scheduler on windows to fire off a delta-import request to your solr server on a schedule you'd like. The code you reference appears to be for Solr 1.2, which is way old... Best Erick On Fri, Aug 26, 2011 at 7:45

Re: how to differentiate multiple datasources when building solr query....

2011-08-26 Thread Erick Erickson
Although I'd really recommend using underscore rather than hyphen, since '-' is a query operator and it'll mess you up *sometime* G... Best Erick On Fri, Aug 26, 2011 at 8:43 AM, Erik Hatcher erik.hatc...@gmail.com wrote: Vighnesh - What you're looking for is DataImportHandler's

Re: how to differentiate multiple datasources when building solr query....

2011-08-26 Thread vighnesh
thanx for giving response i am unable to configure this please provide any samples code for how to use template transformer -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-differentiate-multiple-datasources-when-building-solr-query-tp3286309p3286816.html Sent from

Re: where should i keep the class files to perform scheduling?

2011-08-26 Thread Igor MILOVANOVIC
Easiest way is to do it via cron job. 2011/8/26 nagarjuna nagarjuna.avul...@gmail.com hi everybody... i dont know about how to perform DIH scheduling for fullimport in solri got little bit information from

Re: where should i keep the class files to perform scheduling?

2011-08-26 Thread nagarjuna
Thank u very much for ur reply Erick Erickson i am using solr 3.3.0 version and i have no idea about the cron job i thought that it would be for unix but i am using windows and i would like to integrate my scheduling task with my solr server please give me the suggestion -- View this

Re: Paging over mutlivalued field results?

2011-08-26 Thread darren
Many thanks Erick. I think a good feature to add to Solr to address this is to allow a query to return either the document as a result or the matching (multivalued) fields of a document as individual results (subject to paging too). Because sometimes the field value (only) is the desired result

Re: Paging over mutlivalued field results?

2011-08-26 Thread Erik Hatcher
The way folks have addressed this situation to date is to model the multivalued fields as additional documents too. On Aug 26, 2011, at 09:32 , dar...@ontrenet.com dar...@ontrenet.com wrote: Many thanks Erick. I think a good feature to add to Solr to address this is to allow a query to

commas in synonyms.txt are not escaping

2011-08-26 Thread Moore, Gary
I have a number of chemical names containing commas which I'm mapping in index_synonyms.txt thusly: 2\,4-D-butotyl=Aqua-Kleen,BRN 1996617,Bladex-B,Brush killer 64,Butoxy-D 3,CCRIS 8562 According to the sample synonyms.txt, the comma above should be. i.e. a\,a=b\,b.The problem is that

Solr and client app on same Jetty?

2011-08-26 Thread Arcadius Ahouansou
Hello. I have Solr running on Jetty and I also have a web client application running on another jetty instance on the same box. The question is: wouldn't it be better to run the client and solr on the very same jetty instance? I came across http://wiki.apache.org/solr/Solrj#EmbeddedSolrServer

Re: commas in synonyms.txt are not escaping

2011-08-26 Thread Alexei Martchenko
Gary, please post the entire field declaration so I can try to reproduce here 2011/8/26 Moore, Gary gary.mo...@ars.usda.gov I have a number of chemical names containing commas which I'm mapping in index_synonyms.txt thusly: 2\,4-D-butotyl=Aqua-Kleen,BRN 1996617,Bladex-B,Brush killer

RE: commas in synonyms.txt are not escaping

2011-08-26 Thread Moore, Gary
Here you go -- I'm just hacking the text field at the moment. Thanks, Gary fieldType name=text class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.SynonymFilterFactory

Re: commas in synonyms.txt are not escaping

2011-08-26 Thread Alexei Martchenko
Gary, isn't your wordDelimiter removing your commas in the query time? have u tried it in the analyzer? 2011/8/26 Moore, Gary gary.mo...@ars.usda.gov Here you go -- I'm just hacking the text field at the moment. Thanks, Gary fieldType name=text class=solr.TextField positionIncrementGap=100

Re: Solr and client app on same Jetty?

2011-08-26 Thread Gérard Dupont
Hi, On 26 August 2011 16:23, Arcadius Ahouansou arcad...@menelic.com wrote: Hello. I have Solr running on Jetty and I also have a web client application running on another jetty instance on the same box. The question is: wouldn't it be better to run the client and solr on the very same

Re: Solr Implementations

2011-08-26 Thread zarni aung
Thank you so much for your response Erik. On Fri, Aug 26, 2011 at 8:30 AM, Erick Erickson erickerick...@gmail.comwrote: See below On Thu, Aug 25, 2011 at 4:22 PM, zarni aung zau...@gmail.com wrote: First, I would like to apologize if this is a repeat question but can't seem to get the

what is scheduling ? why should we do this?how to achieve this ?

2011-08-26 Thread nagarjuna
i dont know what exactly the meaning of scheduling in solr and why should i do ?and how can achieve this please help me to do this i already saw this link http://wiki.apache.org/solr/DataImportHandler?highlight=%28scheduling%29#Scheduling except this one please send me any sample codes or links

core creation and instanceDir parameter

2011-08-26 Thread Gérard Dupont
Hi all, Playing with multicore and dynamic creation of new core, I found out that there is one mandatory parameter instanceDir which is mandaotry to find out the location of solrconfig.xml and schema.xml. Since all my cores share the same configuration (found realtively to the $SOLR_HOME defined

New IndexSearcher and autowarming

2011-08-26 Thread Mike Austin
I would like to have the ability to keep requests from being slowed from new document adds and commits by having a separate index that gets updated. Basically a read-only and an updatable index. After the update index has finished updating with new adds and commits, I'd like to switch the update

Re: commas in synonyms.txt are not escaping

2011-08-26 Thread Yonik Seeley
On Fri, Aug 26, 2011 at 10:17 AM, Moore, Gary gary.mo...@ars.usda.gov wrote: I have a number of chemical names containing commas which I'm mapping in index_synonyms.txt thusly: 2\,4-D-butotyl=Aqua-Kleen,BRN 1996617,Bladex-B,Brush killer 64,Butoxy-D 3,CCRIS 8562 According to the sample

Re: commas in synonyms.txt are not escaping

2011-08-26 Thread Yonik Seeley
On Fri, Aug 26, 2011 at 11:16 AM, Yonik Seeley yo...@lucidimagination.com wrote: On Fri, Aug 26, 2011 at 10:17 AM, Moore, Gary gary.mo...@ars.usda.gov wrote: I have a number of chemical names containing commas which I'm mapping in index_synonyms.txt thusly: 2\,4-D-butotyl=Aqua-Kleen,BRN

RE: commas in synonyms.txt are not escaping

2011-08-26 Thread Moore, Gary
Thanks, Yonik. Gary -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Friday, August 26, 2011 11:25 AM To: solr-user@lucene.apache.org Subject: Re: commas in synonyms.txt are not escaping On Fri, Aug 26, 2011 at 11:16 AM, Yonik Seeley

RE: commas in synonyms.txt are not escaping

2011-08-26 Thread Moore, Gary
Alexi, Yes but no difference. This is apparently an issue introduced in 3.*. Thanks for your help. -Gary -Original Message- From: Alexei Martchenko [mailto:ale...@superdownloads.com.br] Sent: Friday, August 26, 2011 10:45 AM To: solr-user@lucene.apache.org Subject: Re: commas in

DIH importing

2011-08-26 Thread Mark
We are currently delta-importing using DIH after which all of our servers have to download the full index (16G). This obviously puts quite a strain on our slaves while they are syncing over the index. Is there anyway not to sync over the whole index, but rather just the parts that have

Re: Paging over mutlivalued field results?

2011-08-26 Thread darren
Yeah, I've resigned that this is the most practical workaround. But it also means a 100-1 explosion in my index size. For every book document, there will now be a 100 (for example) sentence documents from it. What's the best way to submit a feature request for Solr? Many thanks. On Fri, 26 Aug

Re: DIH importing

2011-08-26 Thread simon
It sounds as though you are optimizing the index after the delta import. If you don't do that, then only new segments will be replicated and syncing will be much faster. On Fri, Aug 26, 2011 at 12:08 PM, Mark static.void@gmail.com wrote: We are currently delta-importing using DIH after

Re: New IndexSearcher and autowarming

2011-08-26 Thread simon
The multicore API (see http://wiki.apache.org/solr/CoreAdmin ) allows you to swap, unload, reload cores. That should allow you to do what you want, -Simon On Fri, Aug 26, 2011 at 11:13 AM, Mike Austin mike.aus...@juggle.comwrote: I would like to have the ability to keep requests from being

Re: New IndexSearcher and autowarming

2011-08-26 Thread Erick Erickson
Why doesn't standard replication with auto-warming work for you? You can control how often replication gets triggered by controlling your commit points and/or your replication interval. This seems easier than maintaining cores like your problem statement indicates. Best Erick On Fri, Aug 26,

Re: New IndexSearcher and autowarming

2011-08-26 Thread Mike Austin
Hi Erick, It might work. I've only worked with solr having one index on one server over a year ago so I might need to just research more about the replication. I am using windows and I remember that replication on windows had some issues with scripts and hard links, however it looks like we have

syntax for functions used in the fq parameter

2011-08-26 Thread Jason Toy
I'm trying to limit my data to only docs that have the word 'foo' appear at least once. I am trying to use: fq=termfreqdata,'foo'):[1+TO+*] but I get the syntax error: Caused by: org.apache.lucene.queryparser.classic.ParseException: Encountered : : at line 1, column 33. Was expecting one of:

Re: syntax for functions used in the fq parameter

2011-08-26 Thread Erick Erickson
Why doesn't AND text:foo fill this requirement? Best Erick On Fri, Aug 26, 2011 at 2:27 PM, Jason Toy jason...@gmail.com wrote: I'm trying to limit my data to only docs that have the word 'foo' appear at least once. I am trying to use: fq=termfreqdata,'foo'):[1+TO+*] but I get the syntax

Re: where should i keep the class files to perform scheduling?

2011-08-26 Thread simon
The built-in DIH scheduling was never implemented as far as I know - the Wiki section is just a design proposal and explicitly says Hasn't been committed to SVN (published only here) On Windows, you can use the Task Scheduler to do the kinds of things that cron does on Unix/Linux. -Simon On

Solr Geodist

2011-08-26 Thread solrnovice
Hi, i am trying to return distance in the solr query, by passing in the fl=geodist() and i dont see distance being returned. We have a field called coordinates which is configured as latlong and when i perform the following search, i do see results

Re: Solr Geodist

2011-08-26 Thread Erick Erickson
When I try this form the stock solr example (using store rather than coordinate for the field, your first example gives me an error of d must be 0 or some such. When I add a d value to the query, both your first and second queries work just fine and the second returns a geodist value in the

Re: Solr Geodist

2011-08-26 Thread solrnovice
Eric, thanks for the quick response. I left out the d value, yes, when you perform a spatial query, we should have a distance of d0, sorry about that. What is the setting of your store value, i mean in the schema, was it marked at LatLong. For some reason i dont see the geodist() being returned

Re: Highlight on alternateField

2011-08-26 Thread Val Minyaylo
Thanks a lot Koji. On 8/25/2011 5:04 PM, Koji Sekiguchi wrote: (11/08/26 2:32), Val Minyaylo wrote: Hi there, I am trying to utilize highlighting alternateField and can't get highlights on the results from targeted fields. Is this expected behavior or am I understanding alternateFields

Re: where should i keep the class files to perform scheduling?

2011-08-26 Thread Gora Mohanty
On Fri, Aug 26, 2011 at 6:51 PM, nagarjuna nagarjuna.avul...@gmail.com wrote: Thank u very much for ur reply Erick Erickson    i am using solr 3.3.0 version  and i have no idea about the cron job i thought that it would be for unix but i am using windows and i would like to integrate my

synonyms vs replacements

2011-08-26 Thread Robert Petersen
Hello all, Which is better? Say you add an index time synonym between nunchuck and nunchuk and then both words will be in the document and both will be searchable. I can get the same exact behavior by putting an index time replacement of nunchuck = nunchuk and a search time replacement of

Viewing the complete document from within the index

2011-08-26 Thread karthik
Hi Everyone, I am trying to see whats the best way to view the entire document as its indexed within solr/lucene. I have tried to use Luke but it's still showing me the fields that i have configured to be returned back [ie., stored=true] unless I am not enabling some option in the tool. Is there

Shingle and Query Performance

2011-08-26 Thread Lord Khan Han
Hi, We are indexing news document from the various sites. Currently we have 200K docs indexed. Total index size is 36 gig. There is also attachement to the news (pdf -docs etc) So document size could be high (ie 10mb). We are using some complex queries which includes around 30 - 40 terms per

auto suggestion with text_en field

2011-08-26 Thread Paul
Sorry if this has been asked before, but I couldn't seem to find it... I've got a fairly simple index, and I'm searching on a field of type text_en, and the results are good: I search for computer and I get back hits for computer, computation, computational, computing. I also want to create an

Re: Shingle and Query Performance

2011-08-26 Thread Erik Hatcher
On Aug 26, 2011, at 17:49 , Lord Khan Han wrote: We are indexing news document from the various sites. Currently we have 200K docs indexed. Total index size is 36 gig. There is also attachement to the news (pdf -docs etc) So document size could be high (ie 10mb). We are using some complex

getting data from only one database

2011-08-26 Thread mss.mss
hi , we created a solr which is connected to two databases and we created a jquery auto complete.in two databases we r having keywords and it is default search.so beside the search button we r ctearing more more drop down list and nmaed the two databases when the user click one one database and