more than one text corpus with solr?

2012-06-30 Thread Giovanni Gherdovich
Hi all, I am experimenting with Solr, and I feel the need to index more than just one corpus and search them with Solr independently. Is it possible to have this setup? Several independent indices all managed by the same Solr instance? Cheers, Giovanni

how to retrieve a doc from its docID ?

2012-06-30 Thread Giovanni Gherdovich
Hi all, when querying my Solr instance, the answers I get are the document IDs of my docs. Here is what one of my docs looks like: -- 8< -- 8< -- <add> <doc> <field name="text">hello solar!</field> <field name="id">123</field> </doc> </add> -- 8< -- 8< --

querying thru solritas gives me zero results

2012-06-30 Thread Giovanni Gherdovich
Hi all, this morning I was very proud of myself since I managed to set up Solritas ( http://wiki.apache.org/solr/VelocityResponseWriter ) for the Solr instance on my server (Ubuntu Natty). This joy lasted only half a minute, since the only query that gets more than zero results with Solritas is

Re: how to retrieve a doc from its docID ?

2012-06-30 Thread Sascha Szott
Hi, did you include the fl parameter in the Solr query URL? If that's the case make sure that the field name 'text' is mentioned there. You should also make sure that the field definition (in schema.xml) for 'text' says stored=true, otherwise the field will not be returned. -Sascha
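
As a rough illustration of the fl advice above, here is a minimal Python sketch that builds a select URL requesting specific fields. The base URL, the /select path, and the helper name build_select_url are assumptions made for the example, not details taken from this thread; the fields only come back if they are declared stored=true in schema.xml.

```python
from urllib.parse import urlencode


def build_select_url(base_url, query, fields):
    """Return a Solr select URL that lists the wanted fields in fl.

    base_url and the /select handler path are assumptions for this sketch.
    """
    params = {"q": query, "fl": ",".join(fields)}
    return base_url.rstrip("/") + "/select?" + urlencode(params)


# Hypothetical local instance; adjust host, port and path to your setup.
url = build_select_url("http://localhost:8983/solr/", "text:hello", ["id", "text"])
print(url)
```

The helper only builds the URL; fetching it is left to whatever HTTP client you already use.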

Re: querying thru solritas gives me zero results

2012-06-30 Thread Sascha Szott
Hi, Solritas uses the dismax query parser. The dismax config parameter 'qf' specifies the index fields to be searched in. Make sure that 'name' is your default search field. -Sascha Giovanni Gherdovich g.gherdov...@gmail.com wrote: Hi all, this morning I was very proud of myself since

Re: querying thru solritas gives me zero results

2012-06-30 Thread Erik Hatcher
To debug this, you can add debugQuery=true&wt=xml to get the full classic Solr XML output that drives it all. Erik On Jun 30, 2012, at 7:36, Giovanni Gherdovich g.gherdov...@gmail.com wrote: Hi all, this morning I was very proud of myself since I managed to set up solritas (
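
A small Python sketch of the debugging tip above: it takes an existing query URL and sets debugQuery=true and wt=xml on it, replacing any wt value already present. The function name with_debug and the example URL are hypothetical.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_debug(url):
    """Return url with debugQuery=true and wt=xml set in its query string."""
    parts = urlsplit(url)
    params = dict(parse_qsl(parts.query))
    # Overwrite any existing writer type so the plain XML response comes back.
    params.update({"debugQuery": "true", "wt": "xml"})
    return urlunsplit(parts._replace(query=urlencode(params)))


# Hypothetical query URL; only the two added parameters come from the thread.
print(with_debug("http://localhost:8983/solr/select?q=hello"))
```

Note that dict() keeps only the last value of a repeated parameter, so this sketch is not suitable for URLs that repeat a key such as fq.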

Re: Using custom user-defined caches to store user app data while indexing

2012-06-30 Thread Dmitry Kan
Hello! If you implement SolrCoreAware interface in your custom UpdateRequestProcessorFactory, you could then access your cache via Solr Core in the inform method, I think. Haven't tried it myself, but it looks logical to me to start from there. // Dmitry On Fri, Jun 29, 2012 at 4:44 PM, Iana

Re: querying thru solritas gives me zero results

2012-06-30 Thread Giovanni Gherdovich
Hello Sascha, Sascha: "Solritas uses the dismax query parser. The dismax config parameter 'qf' specifies the index fields to be searched in. Make sure that 'name' is your default search field." I am not sure I understand this; I have no field named 'name'. My documents are like -- 8< --

Re: more than one text corpus with solr?

2012-06-30 Thread Giovanni Gherdovich
2012/6/30 Afroz Ahmad: "You can set up multiple cores, each core managing a different index. See http://wiki.apache.org/solr/CoreAdmin" Thank you very much, Ahmad, for this hint. Cheers, Giovanni

Re: how to retrieve a doc from its docID ?

2012-06-30 Thread Jack Krupansky
Don't try doing this with the text field of the Solr example schema, which is a catchall field that is populated via CopyFields. Rather, add the original source field(s) to fl that was/were copied to the catchall field. The catchall field is designed for indexing, not result display. But do

Re: querying thru solritas gives me zero results

2012-06-30 Thread Giovanni Gherdovich
2012/6/30 Erik Hatcher: "Debugging this you can add debugQuery=true&wt=xml to get the full classic Solr XML output that drives it all." Thank you Erik, I'll see what I get from it. Cheers, GGhh

Re: how do I search the archives for solr-user

2012-06-30 Thread Jack Krupansky
Just use a simple Google search for any Solr question using specific technical terms. Google will find the Solr archives as well as quite a few discussions on StackOverflow. -- Jack Krupansky -Original Message- From: Giovanni Gherdovich Sent: Saturday, June 30, 2012 5:39 AM To:

Re: how to retrieve a doc from its docID ?

2012-06-30 Thread Giovanni Gherdovich
Sascha: "You should also make sure that the field definition (in schema.xml) for 'text' says stored=true, otherwise the field will not be returned." I guess you're hitting my problem. The field I want to search on is declared with stored=false in the schema.xml: -- 8< -- 8< --

how do I trash a whole index and start over?

2012-06-30 Thread Giovanni Gherdovich
Hi all, how do I trash a whole index and start over with a fresh index of my corpus? I need this because I have modified my schema.xml since my last indexing, and I'd like the changes to be taken into account. Cheers, Giovanni

Atomic Multicore Operations - E.G. Move Docs

2012-06-30 Thread Nicholas Ball
Hey all, Trying to figure out the best way to perform atomic operations across multiple cores on the same Solr instance, i.e. a multi-core environment. An example would be to move a set of docs from one core onto another core and ensure that a soft commit is done at the exact same time. If one

documentation on the pragmatics behind the example schema.xml

2012-06-30 Thread Giovanni Gherdovich
Hi all, in the example schema.xml I can find a wide variety of fieldType and field definitions, already there to be used. I believe each of them has been designed for a specific use case, with some pragmatics in mind. Where can I find documentation on what those field / fieldTypes were designed for? Is

Re: how do I trash a whole index and start over?

2012-06-30 Thread Dmitry Kan
Hello, The easiest way is to remove what's inside the data/index directory; in case you have a spell-checker index, remove it as well. This requires a restart of the Solr instance. Another way, without restarting the server, is to issue a deleteByQuery over HTTP. When you are done, you need to reindex your
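
For the second option above (delete by query over HTTP), a minimal Python sketch follows. It only builds the POST request for a delete-everything message and does not send it. The update URL and the function name are assumptions for the example; the `<delete><query>*:*</query></delete>` body and the commit=true parameter are the standard XML update message and request parameter as far as I know, but check them against your Solr version.

```python
import urllib.request

# XML update message asking Solr to delete every document matching *:*.
DELETE_ALL_BODY = b"<delete><query>*:*</query></delete>"


def build_delete_all_request(update_url):
    """Build (but do not send) a POST that deletes all docs and commits."""
    return urllib.request.Request(
        update_url + "?commit=true",
        data=DELETE_ALL_BODY,
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )


# Hypothetical update handler URL; adjust to your instance (and core, if any).
req = build_delete_all_request("http://localhost:8983/solr/update")
print(req.get_method(), req.full_url)
# To actually run it: urllib.request.urlopen(req)
```

After the delete, the corpus still has to be reindexed so that the modified schema.xml takes effect.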

difference between stored=false and stored=true ?

2012-06-30 Thread Giovanni Gherdovich
Hi all, when declaring a field in the schema.xml file you can set the attributes 'indexed' and 'stored' to true or false. What is the difference between an indexed=true stored=false field and an indexed=true stored=true field? I guess understanding this would require me to have a closer look at Lucene's

Re: how do I trash a whole index and start over?

2012-06-30 Thread Giovanni Gherdovich
2012/6/30 Dmitry Kan: "Hello, The easiest way is to remove what's inside the data/index directory; in case you have a spell-checker index, remove it as well. This requires a restart of the Solr instance." Thanks Dmitry, I'll go for this solution. Cheers, GGhh

Re: difference between stored=false and stored=true ?

2012-06-30 Thread Jack Krupansky
indexed and stored are independent, orthogonal attributes - you can use any of the four combinations of true and false. indexed is used for search or query, the lookup portion of processing a query request. Once the search/query/lookup is complete and a set of documents is selected, stored is

Re: difference between stored=false and stored=true ?

2012-06-30 Thread François Schiettecatte
Giovanni, stored=true means the data is stored in the index and can be returned with the search results (see the 'fl' parameter). This is independent of indexed=..., which means that you can store but not index a field: indexed=false stored=true. Best regards, François On Jun 30,
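
The independence of indexed and stored described above can be pictured with a toy model in Python. This is not how Lucene is implemented; it is a simplified sketch in which indexed fields feed a lookup table used for search and stored fields are kept for display. The schema, the ToyIndex class, and the 'note' field are all hypothetical.

```python
# Toy schema: one field per combination that matters for the example.
SCHEMA = {
    "id":   {"indexed": True,  "stored": True},
    "text": {"indexed": True,  "stored": False},  # searchable, never returned
    "note": {"indexed": False, "stored": True},   # returned, not searchable
}


class ToyIndex:
    def __init__(self, schema):
        self.schema = schema
        self.postings = {}  # (field, token) -> set of document numbers
        self.stored = []    # per-document dict of stored field values

    def add(self, doc):
        doc_num = len(self.stored)
        for field, value in doc.items():
            if self.schema[field]["indexed"]:
                for token in str(value).lower().split():
                    self.postings.setdefault((field, token), set()).add(doc_num)
        # Only stored fields survive for result display.
        self.stored.append(
            {f: v for f, v in doc.items() if self.schema[f]["stored"]}
        )

    def search(self, field, token):
        hits = sorted(self.postings.get((field, token.lower()), ()))
        return [self.stored[n] for n in hits]


index = ToyIndex(SCHEMA)
index.add({"id": "123", "text": "hello solar!", "note": "draft"})
print(index.search("text", "hello"))  # matches, but 'text' is not in the result
print(index.search("note", "draft"))  # no match: 'note' is not indexed
```

In this model a search on 'text' finds the document but returns only 'id' and 'note', which mirrors the symptom discussed in the docID thread.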

Re: more than one text corpus with solr?

2012-06-30 Thread Gora Mohanty
On 30 June 2012 15:28, Giovanni Gherdovich g.gherdov...@gmail.com wrote: Hi all, I am experimenting with Solr, and I feel the need to index more than just one corpus and search them with Solr independently. Is it possible to have this setup? Several independent indices all managed by the

Re: difference between stored=false and stored=true ?

2012-06-30 Thread Giovanni Gherdovich
Thank you François and Jack for those explanations. Cheers, GGhh 2012/6/30 François Schiettecatte: Giovanni stored=true means the data is stored in the index and [...] 2012/6/30 Jack Krupansky: indexed and stored are independent [...]

Can't find solr.xml

2012-06-30 Thread Nabeel Sulieman
Hi, I really hate bothering this group with something that should be trivial, but I've been googling and experimenting to get this to work for the last week now. I had no trouble getting my simple configuration working on 3.5, but when I moved over to 3.6, I seem to have hit something strange.

Re: more than one text corpus with solr?

2012-06-30 Thread Giovanni Gherdovich
Hi Gora, yes, I was actually looking for a multi-core setup. Thanks! GGhh 2012/6/30 Gora Mohanty: Not quite sure what you mean by "more than one corpus", and by "several independent indices" in this context, but maybe multi-core Solr will meet your needs: http://wiki.apache.org/solr/CoreAdmin

Re: Atomic Multicore Operations - E.G. Move Docs

2012-06-30 Thread Lance Norskog
Index all documents to both cores, but do not call commit until both report that indexing worked. If one of the cores throws an exception, call rollback on both cores. On Sat, Jun 30, 2012 at 6:50 AM, Nicholas Ball nicholas.b...@nodelay.com wrote: Hey all, Trying to figure out the best way
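
The add-to-both, then commit-or-rollback pattern suggested above could be sketched in Python as follows. The core objects here are hypothetical stand-ins with add, commit and rollback methods; no real Solr client library is used, and FakeCore exists only to make the sketch runnable.

```python
def index_to_all(cores, docs):
    """Add docs to every core; commit everywhere only if every add succeeded.

    Returns True after committing, False after rolling back.
    """
    try:
        for core in cores:
            core.add(docs)
    except Exception:
        # At least one core failed: discard uncommitted changes everywhere.
        for core in cores:
            core.rollback()
        return False
    for core in cores:
        core.commit()
    return True


class FakeCore:
    """Hypothetical in-memory stand-in for a client bound to one core."""

    def __init__(self, fail_on_add=False):
        self.fail_on_add = fail_on_add
        self.pending = []
        self.committed = []

    def add(self, docs):
        if self.fail_on_add:
            raise RuntimeError("indexing failed")
        self.pending.extend(docs)

    def commit(self):
        self.committed.extend(self.pending)
        self.pending = []

    def rollback(self):
        self.pending = []


good, bad = FakeCore(), FakeCore(fail_on_add=True)
print(index_to_all([good, bad], [{"id": "1"}]), good.committed)
```

This narrows the window for inconsistency but is not truly atomic: a commit can still fail on the second core after the first has already committed.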

Re: Can't find solr.xml

2012-06-30 Thread Lance Norskog
Try starting with the example/multicore directory. It shows how solr.xml describes different available cores. On Sat, Jun 30, 2012 at 11:28 AM, Nabeel Sulieman nabeel.sulie...@gmail.com wrote: Hi, I really hate bothering this group with something that should be trivial, but I've been

Re: Error loading class 'org.apache.solr.handler.dataimport.DataImportHandler'

2012-06-30 Thread Erick Erickson
What is the exception you're encountering? You might review: http://wiki.apache.org/solr/UsingMailingLists Best, Erick On Thu, Jun 28, 2012 at 2:48 PM, derohit mailrohi...@gmail.com wrote: Hi All, I am facing an exception while trying to use DataImportHandler for indexing. My solrconfig.xml

Re: index writer in searchComponent

2012-06-30 Thread Erick Erickson
Lots of the index modification (all of it?) has been removed in 4.0 from IndexReaders... It seems like you could always get the directory and open a SolrIndexWriter wherever you wanted, but I'm not sure it's a good idea, are there other processes that will be writing to the index at the same

Re: Filtering a query by range returning unexpected results

2012-06-30 Thread Erick Erickson
This works fine for me with 3.6, float fields and even on a currency type. I'm assuming a typo for 15.00.00 BTW. I admit I'm not all that familiar with the currency type, which I infer you're using given the USD bits. But I ran a quick test with currency types and it worked at least the way I

Re: Wildcard searches with leading and ending wildcard

2012-06-30 Thread Erick Erickson
for searching sub-strings, ngrams are generally preferred. To expand on Jack's point. The whole purpose behind reversed wildcards is that without them, searching for *abcd requires that _every_ term in your field be enumerated, which can be very expensive. Adding in reversed wildcards causes this
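
To illustrate the point about leading wildcards, here is a toy Python sketch of the general idea behind reversed terms: if each term is also kept in reversed form in sorted order, a query like *abcd becomes a prefix lookup for "dcba" instead of a scan over every term. This is a simplified model with hypothetical function names, not Solr's actual implementation.

```python
from bisect import bisect_left


def build_reversed_terms(terms):
    """Return the terms reversed and sorted, ready for prefix lookups."""
    return sorted(term[::-1] for term in terms)


def match_leading_wildcard(reversed_terms, suffix):
    """Find terms matching *suffix via a prefix scan over reversed terms."""
    prefix = suffix[::-1]
    start = bisect_left(reversed_terms, prefix)
    matches = []
    for reversed_term in reversed_terms[start:]:
        if not reversed_term.startswith(prefix):
            break  # sorted order: no later term can match
        matches.append(reversed_term[::-1])
    return matches


reversed_terms = build_reversed_terms(["xabcd", "abcd", "abce", "zzz"])
print(match_leading_wildcard(reversed_terms, "abcd"))
```

The binary search jumps straight to the first candidate, so only matching terms (plus one) are visited rather than the whole term list.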

Re: documentation on the pragmatics behind the example schema.xml

2012-06-30 Thread Erick Erickson
Your very best way of figuring this out is to use the admin/analysis page. It will show you the exact effects of each element of the analysis chains for the field type you specify. From there it's just a matter of getting your head around the fact that the various filters and tokenizers can be

Re: index writer in searchComponent

2012-06-30 Thread Peyman Faratin
Hi Erik, the workflow I'd like to implement is:
1- search the index using the incoming query
2- the query is of the type "does entity X exist"
3- if X does not exist in the index then I'd like to add X to the index
Currently I am using a custom search component to achieve this by creating a