Re: Short DismaxRequestHandler Question

2010-05-07 Thread MitchK
Okay, let me be more specific: I got a custom StopWordFilter and a WordMarkingFilter. The WordMarkingFilter is an easy implementation to determine which type a word is. The StopWordFilter (my implementation) removes specific types of words *and* all markers from all words. This leads to a

Embedded Solr search query

2010-05-07 Thread Eric Grobler
Hello Solr community, When a user search on our web page, we need to run 3 related but different queries. For SEO reasons, we cannot use Ajax so at the moment we run 3 queries sequentially inside a PHP script. Allthough Solr is superfast, the extra network overhead can make the 3 queries 400ms

RE: Embedded Solr search query

2010-05-07 Thread caman
Why not write a custom request handler which can parse, split, execute and combine results to your queries? From: Eric Grobler [via Lucene] [mailto:ml-node+783150-1027691461-124...@n3.nabble.com] Sent: Friday, May 07, 2010 1:01 AM To: caman Subject: Embedded Solr search query Hello

Re: Embedded Solr search query

2010-05-07 Thread Svein Parnas
Or send the queries in parallell from the PHP script (use CURL). Svein 2010/5/7 caman aboxfortheotherst...@gmail.com: Why not write a custom request handler which can parse, split, execute and combine results to your queries? From: Eric Grobler [via Lucene]

Re: Embedded Solr search query

2010-05-07 Thread Eric Grobler
Hi Camen, I was hoping someone has done it already :-) I am also new to Solr/lucene, can you perhaps point me to a request handler example page? Thanks and Regards Eric On Fri, May 7, 2010 at 9:05 AM, caman aboxfortheotherst...@gmail.comwrote: Why not write a custom request handler which can

Re: Embedded Solr search query

2010-05-07 Thread Eric Grobler
Hi Svein, Yes we thought of sending parallell queries, but you still have the extra network overhead. Regards Eric On Fri, May 7, 2010 at 9:11 AM, Svein Parnas sv...@trank.no wrote: Or send the queries in parallell from the PHP script (use CURL). Svein 2010/5/7 caman

Re: Short DismaxRequestHandler Question

2010-05-07 Thread MitchK
Btw: This thread helps a lot to understand the difference between qf and pf :-) http://lucene.472066.n3.nabble.com/Dismax-query-phrases-td489994.html#a489995 -- View this message in context: http://lucene.472066.n3.nabble.com/Short-DismaxRequestHandler-Question-tp775913p783379.html Sent from

Long Lucene queries

2010-05-07 Thread Pooja Verlani
Hi all, In my web-app, i have to fire a query thats too long due to the various boosts I have to give. The size changes according to the query and many a times I get a blank page as I probably cross lucene's character limit. Is it possible to post it otherwise, to solr. Shall I be using POST

Re: Long Lucene queries

2010-05-07 Thread Erik Hatcher
On May 7, 2010, at 6:56 AM, Pooja Verlani wrote: In my web-app, i have to fire a query thats too long due to the various boosts I have to give. The size changes according to the query and many a times I get a blank page as I probably cross lucene's character limit. Is it possible to post

schema.xml question

2010-05-07 Thread Antonello Mangone
Hello everyone, my question is Is it possible in schema.xml set a group of fields to use as a default field to query in OR or in AND ??? example: group name=group_name field name=a type=. / field name=b type=. / field name=c type=. / /group

RE: schema.xml question

2010-05-07 Thread Markus Jelsma
You could write your own requestHandler in solrconfig.xml, it'll allow you to predefine parameters for your configured search components.   -Original message- From: Antonello Mangone antonello.mang...@gmail.com Sent: Fri 07-05-2010 15:17 To: solr-user@lucene.apache.org; Subject:

Re: schema.xml question

2010-05-07 Thread Antonello Mangone
For the moment I don't know how to do it, but I'll follow your suggestion :) Thank you very much ... ps. I'm just a novel 2010/5/7 Markus Jelsma markus.jel...@buyways.nl You could write your own requestHandler in solrconfig.xml, it'll allow you to predefine parameters for your configured

RE: How to load Core Properties after Core creation?

2010-05-07 Thread Ankit Bhatnagar
What properties are you adding ? Do you have the persistence = true Ankit -Original Message- From: Ying Huang [mailto:yhu...@capitaliq.com] Sent: Thursday, May 06, 2010 6:33 PM To: solr-user@lucene.apache.org Subject: How to load Core Properties after Core creation? Hi All, Does

Help indexing PDF files

2010-05-07 Thread Leonardo Azize Martins
Hi, I am new in Solr. I would like to index some PDF files. How can I do using example schema from 1.4.0 version? Regards, Leo

RE: Re: schema.xml question

2010-05-07 Thread Markus Jelsma
A requestHandler works as an URL that can have predefined parameters. By default you will be querying the /select/ requestHandler. It, for instance, predefines the default number of rows to return (10) and returns all fields of a document (*).   requestHandler name=standard

RE: Re: schema.xml question

2010-05-07 Thread Markus Jelsma
I forgot, there is actually a proper wiki page on this subject: http://wiki.apache.org/solr/SolrRequestHandler     -Original message- From: Antonello Mangone antonello.mang...@gmail.com Sent: Fri 07-05-2010 15:26 To: solr-user@lucene.apache.org; Subject: Re: schema.xml question For

RE: Help indexing PDF files

2010-05-07 Thread Markus Jelsma
Hi,     The wiki page [1] on this subject will get you started.   [1]: http://wiki.apache.org/solr/ExtractingRequestHandler     Cheers   -Original message- From: Leonardo Azize Martins laz...@gmail.com Sent: Fri 07-05-2010 15:37 To: solr-user@lucene.apache.org; Subject: Help

Re: Help indexing PDF files

2010-05-07 Thread Leonardo Azize Martins
I am using this page, but in my downloaded version there is no site directory. Thanks 2010/5/7 Markus Jelsma markus.jel...@buyways.nl Hi, The wiki page [1] on this subject will get you started. [1]: http://wiki.apache.org/solr/ExtractingRequestHandler Cheers -Original

RE: Re: Help indexing PDF files

2010-05-07 Thread Markus Jelsma
You don't need it, you can use any PDF file.   -Original message- From: Leonardo Azize Martins laz...@gmail.com Sent: Fri 07-05-2010 15:45 To: solr-user@lucene.apache.org; Subject: Re: Help indexing PDF files I am using this page, but in my downloaded version there is no site directory.

Re: increase(change) relevancy

2010-05-07 Thread MitchK
Hi Ramzesua, take a look at the example of the function query that influences relvancy by the popular-field of the example-directory. http://wiki.apache.org/solr/FunctionQuery#Using_FunctionQuery Kind regards - Mitch -- View this message in context:

Re: Example of using stream.file to post a binary file to solr

2010-05-07 Thread Chris Hostetter
: Sorry. That is what I meant. But, I put it wrongly. I have not been : able to find examples of using solrj, for this. did you look at the link i included? : To POST a raw stream using SolrJ you need to use the : ContentStreamUpdateRequest... : :

Re: Example of using stream.file to post a binary file to solr

2010-05-07 Thread Sandhya Agarwal
Yes, I did. But, I don't find a solrj example there. The example in the doc uses curl. - Sent from iPhone On 07-May-2010, at 8:12 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : Sorry. That is what I meant. But, I put it wrongly. I have not been : able to find examples of using

RE: How to load Core Properties after Core creation?

2010-05-07 Thread Ying Huang
Thanks for your reply, Ahkit. I'm adding properties like masterEnabled/slaveEnabled, pollInterval, autoCommitTime and etc., so that I can easily configure these properties respectively for each Core and use them in solrconfig.xml. I'm also using persistent = true, and that's exactly the

Re: Example of using stream.file to post a binary file to solr

2010-05-07 Thread Praveen Agrawal
Sandhya, Chris's link (with anchor name) directly goes to solrj example On Fri, May 7, 2010 at 8:15 PM, Sandhya Agarwal sagar...@opentext.comwrote: Yes, I did. But, I don't find a solrj example there. The example in the doc uses curl. - Sent from iPhone On 07-May-2010, at 8:12 PM, Chris

RE: Help indexing PDF files

2010-05-07 Thread caman
Take a look at Tika library From: Leonardo Azize Martins [via Lucene] [mailto:ml-node+783677-325080270-124...@n3.nabble.com] Sent: Friday, May 07, 2010 6:37 AM To: caman Subject: Help indexing PDF files Hi, I am new in Solr. I would like to index some PDF files. How can I do using

RE: Embedded Solr search query

2010-05-07 Thread caman
I would just look at SOLR source code and see how standard search handler and dismaxSearchHandler are implemented. Look under package 'org.apache.solr. http://hudson.zones.apache.org/hudson/job/Solr-trunk/clover/org/apache/solr /handler/pkg-summary.html handler' From: Eric Grobler [via

Re: Help indexing PDF files

2010-05-07 Thread Leonardo Azize Martins
I had Solr in machine A. In machine B I run the command below: curl http://10.33.19.201:8983/solr/update/extract?extractOnly=true; --data-binary @VPSX_V1_R10.pdf and I get the response: java.lang.IllegalStateException: Form too large What I and doing wrong? Is it the right or best way to send

CommonsHttpSolrServer vs EmbeddedSolrServer

2010-05-07 Thread Blargy
Can someone please explain to me the use cases when one would use one over the other. All I got from the wiki was: (In reference to Embedded) If you need to use solr in an embedded application, this is the recommended approach. It allows you to work with the same interface whether or not you

Re: Sanity check on numeric types and which of them to use

2010-05-07 Thread wojtekpia
3) The only reason to use a sint field is for backward compatibility and/or to use sortMissingFirst/SortMissingLast, correct? I'm using sint so I can facet and sort facets numerically. -- View this message in context:

Re: Help indexing PDF files

2010-05-07 Thread Leonardo Azize Martins
Hi, Sorry, I am newbie. Using these two commands it works. curl http://10.33.19.201:8983/solr/update/extract?stream.file=C:\\temp\\VPSX_V1_R10.pdfstream.contentType=application/pdfliteral.id=M4968\\C$\\temp\\VPSX_V1_R10.pdfcommit=true curl '

Re: Can I use per field analyzers and dynamic fields?

2010-05-07 Thread Chris Hostetter
: : The source of my problems is the fact that I do not know in advance the : field names. Users are allowed to decide they own field names, they can, : at runtime, add new fields and different Lucene documents might have : different field names. I would suggest you abstract away the field names

Re: Re: schema.xml question

2010-05-07 Thread Antonello Mangone
Thank you very much for your suggestions, I'll study immediatly ... 2010/5/7 Markus Jelsma markus.jel...@buyways.nl I forgot, there is actually a proper wiki page on this subject: http://wiki.apache.org/solr/SolrRequestHandler -Original message- From: Antonello Mangone

Re: schema.xml question

2010-05-07 Thread Chris Hostetter
: group name=group_name : field name=a type=. / : field name=b type=. / : field name=c type=. / : /group : : defaultSearchFieldgroup_name/defaultSearchField at first glance, it seems like what you want is to use copyField... field name=a ... / field name=b ... / field

Re: schema.xml question

2010-05-07 Thread Antonello Mangone
It's seems like a copyField but is a group that I want ... and in your version is not a group, I want the possibility to search in a group of field using AND or OR 2010/5/7 Chris Hostetter hossman_luc...@fucit.org : group name=group_name : field name=a type=. / : field name=b

Re: Short DismaxRequestHandler Question

2010-05-07 Thread Chris Hostetter
: The StopWordFilter (my implementation) removes specific types of words *and* : all markers from all words. : : This leads to a deletion of some parts of sentences. Ah, yes i think you're running into the same confusion people have with dismax and stopwords -- there was a blog about this

Re: Highlighting Performance On Large Documents

2010-05-07 Thread Lance Norskog
Do you have these options turned on when you index the text field: termVectors/termPositions/termOffsets ? Highlighting needs the information created by these anlysis options. If they are not turned on, Solr has load the document text and run the analyzer again with these options on, uses that

Re: caching repeated OR'd terms

2010-05-07 Thread Lance Norskog
I would suggest benchmarking this before doing any more complex design. A field with only 10k unique integer or string values will search very very quickly. On Thu, May 6, 2010 at 7:54 AM, Nagelberg, Kallin knagelb...@globeandmail.com wrote: Hey everyone, I'm having some difficulty figuring

Re: Custom DIH variables

2010-05-07 Thread Lance Norskog
Using a core via the Embedded front and the HTTP front end seems dangerous. SOLR-1499 does an HTTP call for the same info. https://issues.apache.org/jira/browse/SOLR-1499 On Thu, May 6, 2010 at 8:18 PM, Blargy zman...@hotmail.com wrote: So I came up with the following class. public class

Re: Custom DIH variables

2010-05-07 Thread Blargy
Thanks for the tip Lance. Just for reference, why is it dangerous to use the HTTP method? I realized that the embedded method is probably not the way to go (obviously since I was getting that SEVERE: java.util.concurrent.RejectedExecutionException) -- View this message in context: