Regarding Response Builder

2009-07-13 Thread Amandeep Singh09
The responsebuiilder class has SolrQueryRequest as public type. Using SolrQueryRequest we can get a list of SolrParams like SolrParams params = req.getParams(); Now I want to get the values of those params. What should be the approach as SolrParams is an abstract class and its get(String)

Re: Boosting certain documents dynamically at query-time

2009-07-13 Thread Shalin Shekhar Mangar
On Sat, Jul 11, 2009 at 11:25 PM, Michael Lugassy mlu...@gmail.com wrote: Hi guys -- Using solr 1.4 functions at query-time, can I dynamically boost certain documents which are: a) not on the same range, i.e. have very different document ids, Yes. b) have different boost values, Yes.

Does semi-colon still works as special character for sorting?

2009-07-13 Thread Gargate, Siddharth
I read somewhere that it is deprecated

Re: Does semi-colon still works as special character for sorting?

2009-07-13 Thread Koji Sekiguchi
Gargate, Siddharth wrote: I read somewhere that it is deprecated Yeah, as long as you explicitly use 'lucenePlusSort' parser via defType parameter: q=*:*;id descdefType=lucenePlusSort Koji

Re: Deleting index containg a perticular pattern in 'url' field

2009-07-13 Thread Mark Miller
On Mon, Jul 13, 2009 at 6:34 AM, Beats tarun_agrawal...@yahoo.com wrote: HI, i m using nutch to crawl and solr to index the document. i want to delete the index containing a perticular word or pattern in url field. Is there something like Prune Index tool in solr? thanx in advance

Behaviour when we get more than 1 million hits

2009-07-13 Thread Rakhi Khatwani
Hi, If while using Solr, what would the behaviour be like if we perform the search and we get more than one million hits Regards, Raakhi

Re: Does semi-colon still works as special character for sorting?

2009-07-13 Thread Erik Hatcher
On Jul 13, 2009, at 4:58 AM, Gargate, Siddharth wrote: I read somewhere that it is deprecated see the 2nd paragraph in CHANGES.txt: http://svn.apache.org/repos/asf/lucene/solr/trunk/CHANGES.txt Erik

Re: Deleting index containg a perticular pattern in 'url' field

2009-07-13 Thread Erik Hatcher
You can delete by query - deletequeryurl:some-word/query/delete Erik On Jul 13, 2009, at 6:34 AM, Beats wrote: HI, i m using nutch to crawl and solr to index the document. i want to delete the index containing a perticular word or pattern in url field. Is there something like

Solrj, tomcat and a proxy

2009-07-13 Thread Schilperoort , René
Hello, I'm using SolrJ on a Tomcat environment with a proxy configured in the catalina.properties http.proxySet=true http.proxyPort=8080 http.proxyHost=XX.XX.XX.XX My CommonsHttpSolrServer does not seem to use the configured proxy, this results in a java.net.ConnectException: Connection

Re: Behaviour when we get more than 1 million hits

2009-07-13 Thread Erick Erickson
It depends (tm) on what you try to do with the results. You really need togive us some more details on what you want to *do* with 1,000,000 hits before any meaningful response is possible. Best Erick On Mon, Jul 13, 2009 at 8:47 AM, Rakhi Khatwani rkhatw...@gmail.com wrote: Hi, If while

Faceting

2009-07-13 Thread gwk
Hi, I'm in the process of making a javascriptless web interface to Solr (the nice ajax-version will be built on top of it unobtrusively). Our database has a lot of fields and so I've grouped those with similar characteristics to make several different 'widgets' (like a numerical type which

Re: Aggregating/Grouping Document Search Results on a Field

2009-07-13 Thread Bradford Stephens
Thanks for this -- we're also trying out bobo-browse for Lucene, and early results look pretty enticing. They greatly sped up how fast you read in documents from disk, among other things: http://bobo-browse.wiki.sourceforge.net/ On Sat, Jul 11, 2009 at 12:10 AM, Shalin Shekhar

Re: Aggregating/Grouping Document Search Results on a Field

2009-07-13 Thread Jason Rutherglen
SOLR 1.4 has a new feature https://issues.apache.org/jira/browse/SOLR-475that speeds up faceting on fields with many terms by adding an UnInvertedField. Bobo uses a custom field cache as well. It may be useful to benchmark the 3 different approaches (bitsets, SOLR-475, Bobo). This could be a good

Get TermVectors for query hits only

2009-07-13 Thread Walter Ravenek
Hi all, When I'm using the TermVectorComponent I receive term vectors with all tokens in the documents that meet my search criteria. I would be interested in getting the offsets for just those terms in the documents that meet the search citeria. My documents are about 200 K and are in XML.

Are subqueries possible in Solr? If so, are they performant?

2009-07-13 Thread Edoardo Marcora
Does Solr have the ability to do subqueries, like this one (in SQL): SELECT id, first_name FROM student_details WHERE first_name IN (SELECT first_name FROM student_details WHERE subject= 'Science'); If so, how performant is this kind of queries? -- View this message in context:

Improve indexing time

2009-07-13 Thread Gurjot Singh
Hi, We have a solr index of size 626 MB and number of douments indexed are 141810. We have configured index based spellchecker with buildOnCommit option set to true. Spellcheck index is of size 8.67 MB. We use data import handler to create the index from scratch and also to update the index

Re: Faceting

2009-07-13 Thread Shalin Shekhar Mangar
On Mon, Jul 13, 2009 at 7:56 PM, gwk g...@eyefi.nl wrote: Is there a good way to select the top X facets and include some terms you want to include as well something like facet.field=countryf.country.facet.limit=Xf.country.facet.includeterms=Narnia,Guilder or is there some other way to

Re: Select tika output for extract-only?

2009-07-13 Thread Peter Wolanin
Ok, thanks. I played with it enough to to get plain text out at least, but I'll wait for the resolution of SOLR-284 -Peter On Sun, Jul 12, 2009 at 9:20 AM, Yonik Seeleyyo...@lucidimagination.com wrote: Peter, I'm hacking up solr cell right now, trying to simplify the parameters and fix some

Re: Get TermVectors for query hits only

2009-07-13 Thread Grant Ingersoll
I seem to recall that the Highlighter in Solr is pluggable, so you may want to work at that level instead of the client side. Otherwise, you likely would have to implement your own TermVectorMapper and add that to the TermVectorComponent capability which then feeds your client. For an

lucene or Solr bug with dismax?

2009-07-13 Thread Peter Wolanin
I have been getting exceptions thrown when users try to send boolean queries into the dismax handler. In particular, with a leading 'OR'. I'm really not sure why this happens - I thought the dsimax parser ignored AND/OR? I'm using rev 779609 in case there were recent changes to this. Is this a

Merge Policy

2009-07-13 Thread Jason Rutherglen
SolrIndexConfig accepts a mergePolicy class name, however how does one inject properties into it?

Implementing Solr for the first time

2009-07-13 Thread Kevin Miller
I am new to Solr and trying to get it set up to index files from a directory structure on a server. I have a few questions. 1.) Is there an application that will return the search results in a user friendly format? 2.) How do I move Solr from the example environment into a production

Re: lucene or Solr bug with dismax?

2009-07-13 Thread Mark Miller
It doesn't ignore OR and AND, though it probably should. I think there is a JIRA issue for it somewhere. On Mon, Jul 13, 2009 at 4:10 PM, Peter Wolanin peter.wola...@acquia.comwrote: I can still generate this error with Solr built from svn trunk just now.

Re: Aggregating/Grouping Document Search Results on a Field

2009-07-13 Thread John Wang
Hi Brad: We have since (Bobo) added some perf tests which allows you to do some benchmarking very quickly: http://code.google.com/p/bobo-browse/wiki/BoboPerformance Let me know if you need help setting up. -John On Mon, Jul 13, 2009 at 10:41 AM, Jason Rutherglen

Re: lucene or Solr bug with dismax?

2009-07-13 Thread Peter Wolanin
Indeed - I assumed that only the + and - characters had any special meaning when parsing dismax queries and that all other content would be treated just as keywords. That seems to be how it's described in the dismax documentation? Looks like this is a relevant issue (is there another)?

Sharded Index Creation Magic?

2009-07-13 Thread Nick Dimiduk
Hello! I'm working with Solr-1.3.0 using a sharded index for distributed, aggregated search. I've successfully run through the example described in the DistributedSearch wiki page. I have built an index from a corpus of some 50mil documents in an HBase table and created 7 shards using the

Re: Trying to run embedded server from unit test...but getting configuration error

2009-07-13 Thread Mark Miller
I believe that constructor expects to find an alternate format solr config that specifies the cores, eg like the one you can find in example/multicore/solr.xml http://svn.apache.org/repos/asf/lucene/solr/trunk/example/multicore/solr.xml Looks like that error is not finding the root solr node, so

Availability during merge

2009-07-13 Thread Charlie Jackson
The wiki page for merging solr cores (http://wiki.apache.org/solr/MergingSolrIndexes) mentions that the cores being merged cannot be indexed to during the merge. What about the core being merged *to*? In terms of the example on the wiki page, I'm asking if core0 can add docs while core1 and core2

Re: Get TermVectors for query hits only

2009-07-13 Thread Walter Ravenek
Thanks Grant, I think I get the idea. Grant Ingersoll wrote: I seem to recall that the Highlighter in Solr is pluggable, so you may want to work at that level instead of the client side. Otherwise, you likely would have to implement your own TermVectorMapper and add that to the

Re: Caching per segmentReader?

2009-07-13 Thread Jason Rutherglen
Shall we create an issue for this so we can list out desirable features? On Sun, Jul 12, 2009 at 7:01 AM, Yonik Seeley ysee...@gmail.com wrote: On Sat, Jul 11, 2009 at 7:38 PM, Jason Rutherglenjason.rutherg...@gmail.com wrote: Are we planning on implementing caching (docsets, documents,

Re: Trying to run embedded server from unit test...but getting configuration error

2009-07-13 Thread Reuben Firmin
Thanks. I should have googled first. I came across: http://www.nabble.com/EmbeddedSolrServer-API-usage-td19778623.html For reference, my code is now: final File dir = FileUtils.createTmpSubdir(); System.setProperty(solr.solr.home, dir.getAbsolutePath()); final File conf =

allowDocsOutOfOrder support?

2009-07-13 Thread Jason Rutherglen
Is there a way to set this in SOLR 1.3 using solrconfig? Otherwise one needs to instantiate a class that statically calls BooleanQuery.setAllowDocsOutOfOrder?

Spell checking: Is there a way to exclude words known to be wrong?

2009-07-13 Thread Jay Hill
We're building a spell index from a field in our main index with the following configuration: searchComponent name=spellcheck class=solr.SpellCheckComponent str name=queryAnalyzerFieldTypetextSpell/str lst name=spellchecker str name=namedefault/str str name=fieldspell/str

Re: Spell checking: Is there a way to exclude words known to be wrong?

2009-07-13 Thread Mark Miller
I don't think there is a way currently, but it might make a nice patch. Or you could just implement a custom SolrSpellChecker - both FileBasedSpellChecker and IndexBasedSpellChecker are actually like maybe 50 lines of code or less. It would be fairly quick to just plug a custom version in as a

Re: Improve indexing time

2009-07-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
considering the fact that there are only 20 to 30 docs changed the indexing is not the bottleneck. Bottleneck is probably the db and the time taken for the query to run. Are there deltaQueries in the sub-entities? if you can create a 'VIEW' in DB to identify the delta it could be faster On Tue,

Re: Solr 1.4 Release Date

2009-07-13 Thread pof
Any updates on this? Cheers. Gurjot Singh wrote: Hi, I am curious to know when is the scheduled/tentative release date of Solr 1.4. Thanks, Gurjot -- View this message in context: http://www.nabble.com/Solr-1.4-Release-Date-tp23260381p24473570.html Sent from the Solr - User