Re: Solr Score threshold 'reasonably', independent of results returned

2012-08-22 Thread Ravish Bhagdev
Commercial solutions often have a percentage that is meant to signify the quality of a match. Solr has a relative score, and you cannot tell just by looking at this value whether a result is relevant enough to be on the first page or not. The score depends on "what else is in the index", so it is not easy to normalize in the way

Re: Solr - case-insensitive search do not work

2012-08-22 Thread Ravish Bhagdev
Did you see my message about debugging parameters? Try that and see what's happening behind the scenes. I can confirm that by default the queries are NOT case sensitive. Ravish On Wed, Aug 22, 2012 at 2:45 PM, meghana wrote: > Hi Ravish , the definition for text_en_splitting in solr default s

Re: Solr - case-insensitive search do not work

2012-08-22 Thread Ravish Bhagdev
Also, try comparing your field configuration to Solr's default text field and see if you can spot any differences. Ravish On Wed, Aug 22, 2012 at 1:09 PM, Ravish Bhagdev wrote: > OK. Try without quotes like myfield:cloud+university and see if it has > any effect. > > Also, try both

Re: Solr - case-insensitive search do not work

2012-08-22 Thread Ravish Bhagdev
causing some analyzers to not work on your query. Hope this helps. Ravish On Wed, Aug 22, 2012 at 12:11 PM, meghana wrote: > @Ravish Bhagdev , Yes I am adding double quotes around my search , as shown > in my post. Like, > > myfield:"cloud university" > > >

Re: Solr - case-insensitive search do not work

2012-08-22 Thread Ravish Bhagdev
is already present in your field type definition (it's there twice now). Are you adding quotes around your query by any chance? Ravish On Wed, Aug 22, 2012 at 11:31 AM, meghana wrote: > I want to apply case-insensitive search for field *myfield* in solr. > > I googled a bit for that , and i found tha

Re: Use a different folder for schema.xml

2012-08-22 Thread Ravish Bhagdev
You can include one xml file into another, something like 1. 2. ]> 4. 5. &resourcedb; 6. - Ravish On Wed, Aug 22, 2012 at 8:56 AM, Alexander Cougarman wrote: > Thanks, Lance. Please forgive my ignorance, but what do you mean by soft > links/XML include feature? Can you prov

Re: No Effect of omitNorms and omitTermFreqAndPositions when using MLT handler?

2012-05-21 Thread Ravish Bhagdev
at 11:03 AM, Ravish Bhagdev wrote: > Ahh, this is because I have to override DefaultSimilarity to turn off > tf/idf scoring? But this will apply to all the fields and general search > on text fields as well? Is there a way to apply custom similarity to > specific field types or field

Re: No Effect of omitNorms and omitTermFreqAndPositions when using MLT handler?

2012-05-21 Thread Ravish Bhagdev
this? Thanks, Ravish On Mon, May 21, 2012 at 10:24 AM, Ravish Bhagdev wrote: > Hi All, > > I was wondering if omitNorms will have any effect on MLT handler at all? > > I'm using schema version 1.2 with Solr 1.4 and have defined couple of > fields, which I want to use for ML

Re: A tool for frequent re-indexing...

2012-04-17 Thread Ravish Bhagdev
Thanks. This is useful to know as well. I was actually after SolrEntityProcessor which I failed to notice until pointed out by previous reply because I'm using 1.4 still. Cheers, Ravish On Fri, Apr 6, 2012 at 11:01 AM, Valeriy F

Re: Search for "library" returns 0 results, but search for "marion library" returns many results

2012-04-04 Thread Ravish Bhagdev
Yes, can you check if results you get with "marion library" match on marion or library? By default solr uses OR between words (specified in solrconfig.xml). You can also easily check this by enabling highlighting. Ravish On Wed, Apr 4, 2012 at 4:11 PM, Joshua Sumali wrote: > Did you try to ap

Re: Incremantally updating a VERY LARGE field - Is this possibe ?

2012-04-04 Thread Ravish Bhagdev
ed to modify. But I guess if it's going out of memory you might have already done this? Ravish On Wed, Apr 4, 2012 at 1:34 PM, Mikhail Khludnev wrote: > There is https://issues.apache.org/jira/browse/LUCENE-3837 but I suppose > it's too far from completion. > > On Wed, Apr 4, 2

Re: Incremantally updating a VERY LARGE field - Is this possibe ?

2012-04-04 Thread Ravish Bhagdev
Updating a single field is not possible in Solr. The whole record has to be rewritten. 300 MB is still not that big a file. Have you tried doing the indexing (if it's only a one-time thing) by giving it ~2 GB of heap (-Xmx)? A single file of that size is strange! May I ask what it is? Rav On Tue, A

Re: pagerank??

2012-04-04 Thread Ravish Bhagdev
You might want to look into Nutch and its LinkRank instead of Solr for this. For obtaining such information, you need a crawler to crawl through the links. Not what Solr is meant for. Rav On Wed, Apr 4, 2012 at 8:46 AM, Bing Li wrote: > According to my knowledge, Solr cannot support this. > >

Re: Position Solr results

2012-04-03 Thread Ravish Bhagdev
Hi, I don't believe Solr has anything built in that will do this for you. You will likely have to just get the IDs and look up at what position the ID you are referring to occurs (using Java or another programming language/script). Rav On Sun, Apr 1, 2012 at 5:54 PM, Manuel Antonio Novoa Proenza
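
A minimal sketch of the client-side lookup suggested above, written in Python. It assumes the document IDs have already been fetched from Solr in ranked order (for example by requesting only the id field with fl=id); the function name and the sample IDs are hypothetical.

```python
from typing import Optional, Sequence


def position_of(doc_id: str, ranked_ids: Sequence[str]) -> Optional[int]:
    """Return the 1-based rank of doc_id in ranked_ids, or None if absent."""
    for rank, current in enumerate(ranked_ids, start=1):
        if current == doc_id:
            return rank
    return None


# Hypothetical IDs, in the order a query returned them.
ranked_ids = ["doc-17", "doc-03", "doc-42", "doc-08"]
print(position_of("doc-42", ranked_ids))
print(position_of("doc-99", ranked_ids))
```

If the document may sit deep in the result list, the IDs would have to be fetched page by page, and the rank offset by the start of each page.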

Re: ExtractingRequestHandler

2012-04-03 Thread Ravish Bhagdev
(Bit off-topic but...) I understand that Solr isn't meant to 'store' everything, but because highlighting matches requires a field to be stored, I would expect most people to end up storing full document content in their indexes? I can't think of any good workaround for this...

Re: Tags and Folksonomies

2012-04-03 Thread Ravish Bhagdev
OK, yes that's true. Although I'd expect term vectors to just increment term count when a tag is re-applied (if you have term vectors enabled), increasing a boost stored as a payload with each tag, each time an existing tag is re-tagged, may be a more sensible approach if this is the case. You'll s

Re: Apache solr not indexing complete pdf file using tikka

2012-04-03 Thread Ravish Bhagdev
I'd also suggest trying to extract text using tika-app (shipped with the Tika distribution as an executable jar) on the PDF(s) in question to see if the problem is with extraction or with indexing. Rav On Mon, Apr 2, 2012 at 1:55 PM, Erick Erickson wrote: > You can index 2B tokens, so upping maxFieldLength

Re: Tags and Folksonomies

2012-04-03 Thread Ravish Bhagdev
Hi Hoss, I am not sure why you suggest Payload for ranking documents with more frequent tags above those with fewer tags. Won't the term frequency part of the relevancy score ensure this by default? If you make tags a 'lowercase' field (with full value tokenisation), the frequency of tags in multival

Highlighting matched interesting terms in MoreLikeThisHandler...

2012-03-19 Thread Ravish Bhagdev
Hi All, I wonder if anyone else has had a requirement similar to this: I'm using MLT handler to return matching documents, matched on a specific field which works perfectly. But I want to be able to show which interesting terms matched for a given result set. If there was a way of listing these

Fwd: Using MLT Handler to find similar documents but also filter similar documents by a keyword.

2012-03-10 Thread Ravish Bhagdev
I will appreciate any comments or help on this. Thanks. Rav -- Forwarded message -- From: Ravish Bhagdev Date: Fri, Mar 2, 2012 at 12:12 AM Subject: Using MLT Handler to find similar documents but also filter similar documents by a keyword. To: solr-user@lucene.apache.org Hi

Using MLT Handler to find similar documents but also filter similar documents by a keyword.

2012-03-01 Thread Ravish Bhagdev
Hi, Apologies if this has been answered before, I tried searching for it and didn't find anything answering this exactly. I want to find similar documents using MLT Handler using some specified fields but I want to filter down the returned matches with some keywords as well. I looked at the exam

Re: highlight issue

2011-12-02 Thread Ravish Bhagdev
Also, not entirely sure wild-cards are supported in text based fields, only on strings. Although things may have changed in recent versions of Solr, I am not sure. R On Thu, Dec 1, 2011 at 3:55 PM, Koji Sekiguchi wrote: > Suppose my search query is "*Rak*".In my database i have "*Rakesh >> Cha

Re: Solr messing up the UK GBP (pound) symbol in response, even though Java environment variabe has file encoding is set to UTF 8....

2011-09-28 Thread Ravish Bhagdev
Thanks Chris. Yes, changing connector settings not just in solr but also in all webapps that were sending queries into it solved the problem! Appreciate the help. R On Tue, Sep 13, 2011 at 6:11 PM, Chris Hostetter wrote: > > : Any idea why solr is unable to return the pound sign as-is? > : > :

Solr messing up the UK GBP (pound) symbol in response, even though Java environment variabe has file encoding is set to UTF 8....

2011-09-11 Thread Ravish Bhagdev
Any idea why Solr is unable to return the pound sign as-is? I tried typing in £ 1 million in the Solr admin GUI and got the following response. 0 5 on 0 £ 1 million 10 2.2 Here are the Java properties I also got from the admin interface: java.runtime.name = Java(TM) SE Runtime Environment sun.boot.li
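
One common way a pound sign gets mangled, offered here as an assumption rather than a diagnosis of this thread: the client sends the character as UTF-8 bytes and some layer in between decodes those bytes as ISO-8859-1. A short Python illustration of that mismatch:

```python
text = "£ 1 million"

utf8_bytes = text.encode("utf-8")          # '£' becomes the two bytes C2 A3
misread = utf8_bytes.decode("iso-8859-1")  # each byte is read as one character

print(utf8_bytes[:2].hex())
print(misread)

# Reversing the wrong decode recovers the original string.
repaired = misread.encode("iso-8859-1").decode("utf-8")
print(repaired == text)
```

When this is the cause, the durable fix is to make every hop agree on UTF-8 rather than to repair strings after the fact.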

Re: Getting sum of all terms count in dataset instead of document count using TermsComponent....(and TermsComponent vs Facets)

2011-02-27 Thread Ravish Bhagdev
Yes, you are right. Ignore the query (document:*); it won't matter if I have it for TermsComponent, I guess. I've compiled current source from head, but also tried on 1.4.1. Any idea how to go about finding a solution to this? Thanks, Ravish On Sun, Feb 27, 2011 at 1:56 PM, Ahmet Arslan wrote:

Getting sum of all terms count in dataset instead of document count using TermsComponent....(and TermsComponent vs Facets)

2011-02-27 Thread Ravish Bhagdev
olrResp = conf._solr.executeQuery(solrQuery, 0, 10); TermsResponse termsResp = solrResp.getTermsResponse(); List terms = termsResp.getTerms("document"); Ignore the conf object and _solr variable; that's just my internal singleton object. Thanks, Ravish Bhagdev

Re: Are there any restrictions on what kind of how many fields you can use in Pivot Query? I get ClassCastException when I use some of my string fields, and don't when I use some other sting fields

2011-02-15 Thread Ravish Bhagdev
Looks like it's a bug? Is it not? Ravish On Tue, Feb 15, 2011 at 4:03 PM, Ravish Bhagdev wrote: > When I include some of the fields in my search query: > > SEVERE: java.lang.ClassCastException: [Ljava.lang.Object; cannot be cast to > [Lorg.apache.solr.common.util.ConcurrentLRUCach

Re: very quick question that will help me greatly... OR query syntax when using fields for solr dataset....

2011-02-15 Thread Ravish Bhagdev
r you want, but this is often > a point of confusion. > > Best > Erick > > On Tue, Feb 15, 2011 at 9:00 AM, Ravish Bhagdev > wrote: > > Arghhh.. > > > > I think it's the regexp parser messing things up (just looked at the > > debugQuery output and it's parsing in

Re: very quick question that will help me greatly... OR query syntax when using fields for solr dataset....

2011-02-15 Thread Ravish Bhagdev
at 1:54 PM, Ravish Bhagdev wrote: > Hi Jan, > > Thanks for reply. > > I have tried the first variation in your example (and again after reading > your reply). > > It returns no results! > > Note: it is not a multivalued field, I think when you use example 1 below, > it l

Re: very quick question that will help me greatly... OR query syntax when using fields for solr dataset....

2011-02-15 Thread Ravish Bhagdev
> Jan Høydahl, search solution architect > Cominvent AS - www.cominvent.com > > On 15. feb. 2011, at 14.39, Ravish Bhagdev wrote: > > > Hi Guys, > > > > I've been trying various combinations but unable to perform a "OR" query > for > > a sp

very quick question that will help me greatly... OR query syntax when using fields for solr dataset....

2011-02-15 Thread Ravish Bhagdev
Hi Guys, I've been trying various combinations but am unable to perform an "OR" query for a specific field in my Solr schema. I have a string field called myfield and I want to return all documents where this field matches either "abc" or "xyz". So all records that have myfield=abc and all
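
For reference, standard Lucene query syntax allows either spelling out the field for each alternative (myfield:abc OR myfield:xyz) or grouping the values in parentheses after the field name. The Python sketch below only builds such a query string; the helper name is hypothetical, and the values are quoted so that each is treated as a single term.

```python
from typing import Iterable


def or_query(field: str, values: Iterable[str]) -> str:
    """Build field:("v1" OR "v2" ...) in Lucene query syntax."""
    quoted = []
    for value in values:
        # Escape backslashes and double quotes inside the quoted value.
        escaped = value.replace("\\", "\\\\").replace('"', '\\"')
        quoted.append(f'"{escaped}"')
    if not quoted:
        raise ValueError("at least one value is required")
    return f"{field}:({' OR '.join(quoted)})"


print(or_query("myfield", ["abc", "xyz"]))
```

How the string is interpreted still depends on the query parser configured for the request, so checking the debugQuery output remains the quickest way to confirm what was actually parsed.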

Re: Why solr relies on solr.solr.home???

2008-08-09 Thread Ravish Bhagdev
wrote: > Hi Ravi, > > > On Sat, Aug 9, 2008 at 4:48 AM, Ravish Bhagdev <[EMAIL PROTECTED] > >wrote: > > > Hi, > > > > This may be a naive question but do we really need to have solr.solr.home > > variable for solr installation? It is a bit annoying m

Re: Why solr relies on solr.solr.home???

2008-08-09 Thread Ravish Bhagdev
or-an-application.html after following your lead. Cheers, Ravish On Sat, Aug 9, 2008 at 1:16 AM, Alexander Ramos Jardim < [EMAIL PROTECTED]> wrote: > you can configure its home dir at web.xml > > 2008/8/8 Ravish Bhagdev <[EMAIL PROTECTED]> > > > Hi, > > > > Thi

Why solr relies on solr.solr.home???

2008-08-08 Thread Ravish Bhagdev
Hi, This may be a naive question but do we really need to have solr.solr.home variable for solr installation? It is a bit annoying modifying tomcat settings in automated install. If I create a packaged application, how do I ensure a normal user would be able to install it without having to modif

Re: Incremental indexing of database

2008-07-22 Thread Ravish Bhagdev
Can't you write triggers for your database/tables you want to index? That way you can keep track of all kinds of changes and updates and not just addition of a new record. Ravish On Tue, Jul 22, 2008 at 8:15 PM, anshuljohri <[EMAIL PROTECTED]> wrote: > > Hi, > > In my project i have to index whol

Re: Is it possible to add synonyms run time?

2008-01-25 Thread Ravish Bhagdev
;re right. Injecting synonyms at index time really requires a fixed > synonym list that doesn't vary by user. So if you want synonym > lists on a per-user basis, you're probably going to have to inject synonyms > at query time. > > Best > Erick > > > On Jan 25, 2008

Re: Is it possible to add synonyms run time?

2008-01-25 Thread Ravish Bhagdev
get the match. > > Alternately, use the synonym dictionary at run-time to expand a > user's query terms, like a thesaurus. > > That said, I'm new to the tool, and not clear on how synonyms are implemented. > > Jon > = > From: Ravish Bhagde

Is it possible to add synonyms run time?

2008-01-25 Thread Ravish Bhagdev
As I understood from the available documentation, synonyms need to be defined before starting the indexing process. Is it possible to add synonyms at run time such that all index fields of all documents get updated? Does it work for newly added documents at least? Also, how to make each user of appli

Re: solr.home via getServletContext().getInitParameter("solr.home")

2007-12-15 Thread Ravish Bhagdev
But how do I maintain multiple copies of Solr on the same machine? For instance, if I have two webapps using two independent Solr indexes? Thanks, R On Dec 14, 2007 11:04 PM, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > : Recently, I had to set up a Jetty with multiple Solr homes (not > multi-core

Re: SOLR X FAST

2007-12-11 Thread Ravish Bhagdev
Could you please elaborate on what you mean by ingestion pipeline and horizontal scalability? I apologize if this is a stupid question everyone else on the forum is familiar with. Thanks, Ravi On Dec 12, 2007 1:09 AM, Nuno Leitao <[EMAIL PROTECTED]> wrote: > Depends, if you are looking for a sma

Re: SOLR X FAST

2007-12-11 Thread Ravish Bhagdev
Stability and better Support (at great cost obviously) On Dec 11, 2007 10:20 PM, William Silva <[EMAIL PROTECTED]> wrote: > Hi, > Why use FAST and not use SOLR ? For example. > What will FAST offer that will justify the investment ? > I would like a matrix comparing both. > Thanks, > William. > >

Re: SOLR 1.2 - Updates sent containing fields that are not on the Schema fail silently

2007-11-28 Thread Ravish Bhagdev
Yup, I do remember that happening to me before. Is this intentionally so? Ravish On Nov 28, 2007 1:41 PM, Daniel Alheiros <[EMAIL PROTECTED]> wrote: > Hi > > I experienced a very unpleasant problem recently, when my search indexing > adaptor was changed to add some new fields. The problem is my

Re: solr, snippets and stored field in nutch...

2007-10-11 Thread Ravish Bhagdev
ing summaries > without storing doc contents, I would pee my pants with happiness and > it would be in Solr faster than you can say "diaper". > > cheers, > -Mike > > On 11-Oct-07, at 3:48 PM, Ravish Bhagdev wrote: > > > Hey guys, > > > > Checkout thi

Fwd: solr, snippets and stored field in nutch...

2007-10-11 Thread Ravish Bhagdev
amount of data being pushed over the network. Dennis Kubes Ravish Bhagdev wrote: > Ah, I see, didn't know that, Thanks! > > Interesting that nutch stores it in a different structure (segments) > and doesn't reuse Lucene strategy of storing within index. Any > par

Re: index size

2007-10-11 Thread Ravish Bhagdev
Hi All, I'm facing similar problem. I want to index entire document as a field. But I also want to be able to retrieve snippets (like Google/Nutch return in results page below the links). To achieve this I have to keep the document field to "stored" right? When I do this my index becomes huge 1

Re: unable to figure out nutch type highlighting in solr....

2007-10-05 Thread Ravish Bhagdev
Thanks all for very valuable contributions, I understand these aspects of Solr much better now but... >But a different use-case might be for the highlighting to encompass the markup rather than >just the text, e.g. > Paris >which would have to be accomplished some other way. Yes, exactly. And

Re: unable to figure out nutch type highlighting in solr....

2007-10-05 Thread Ravish Bhagdev
ut HTML as is, the indexer throws exceptions (as having tags within XML tags is obviously not valid). How do I do this part? Ravish On 10/5/07, Adrian Sutton <[EMAIL PROTECTED]> wrote: > On 05/10/2007, at 4:07 PM, Ravish Bhagdev wrote: > > (Query esp. Adrian): > > > > If you ar

Re: unable to figure out nutch type highlighting in solr....

2007-10-04 Thread Ravish Bhagdev
Thanks all for the help. Just to make sure I understand correctly, am I right in summarizing it this way, then? No significance of using HTML: Unlike Nutch, Solr doesn't parse HTML, so it ignores the anchors, titles etc. and is not good for PageRank-esque indexing. HTMLAnalyser (by which you probably mean

Re: Indexing HTML

2007-10-03 Thread Ravish Bhagdev
Hi Erik, All, I escaped the HTML text into entities before sending it to Solr, and indexing went fine. The problem now is that when I get back a snippet with highlighted text for the html field, it's not well formed, as the highlighting sometimes doesn't include the entire tag if present. For example:

unable to figure out nutch type highlighting in solr....

2007-10-02 Thread Ravish Bhagdev
I have tried very hard to follow documentation and forums that try to answer questions about how to return snippets with highlights for relevant searched terms using Solr (as Nutch does with such ease). I will be really grateful if someone can guide me with the basics. I have made sure that the field t

Re: Indexing longer documents using Solr...memory issue after index grows to about 800 MB...

2007-09-05 Thread Ravish Bhagdev
thanks for your reply, my response below: On 9/5/07, Mike Klaas <[EMAIL PROTECTED]> wrote: > On 4-Sep-07, at 4:50 PM, Ravish Bhagdev wrote: > > > - I have about 11K html documents to index. > > - I'm trying to index these documents (along with 3 more small string >

Indexing longer documents using Solr...memory issue after index grows to about 800 MB...

2007-09-04 Thread Ravish Bhagdev
Hi, The problem: - I have about 11K html documents to index. - I'm trying to index these documents (along with 3 more small string fields) so that when I search within the "doc" field (field with the html file content), I can get results with snippets or highlights as I get when using nutch. - Wh

Processing solr response....

2007-09-04 Thread Ravish Bhagdev
Hi, Apologies if this has been asked before but I couldn't find anything when I searched... I have been looking at SolJava examples. I've been using Nutch/Lucene before, which returns results from a query nicely in a class with url, title and snippet (summary). While Solr seems to return XML with

Re: Indexing HTML content... (Embed HTML into XML?)

2007-08-22 Thread Ravish Bhagdev
, '<>&"' ); > > Also you need to make sure your Html is encoded in UTF-8 . To comply > with solr need for UTF-8 encoded xml. > > I hope it helps. > > J. > > On 8/22/07, Ravish Bhagdev <[EMAIL PROTECTED]> wrote: > > Hello, > >
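
The escaping advice above can be sketched in Python as follows. This is an illustration only, not the code from the thread: the helper name and the field names id and doc are assumptions, and the payload follows Solr's XML update message format with the HTML escaped as ordinary XML text and encoded as UTF-8.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def solr_add_xml(doc_id: str, html: str) -> bytes:
    """Wrap raw HTML in a Solr <add> message, escaping it as XML text."""
    body = (
        "<add><doc>"
        f'<field name="id">{escape(doc_id)}</field>'
        f'<field name="doc">{escape(html)}</field>'
        "</doc></add>"
    )
    return body.encode("utf-8")


html = '<p class="intro">Fish &amp; chips</p>'
payload = solr_add_xml("page-1", html)
print(payload.decode("utf-8"))

# Parsing the message back yields the original HTML string unchanged.
fields = ET.fromstring(payload).findall("./doc/field")
print(fields[1].text == html)
```

Because the markup is escaped rather than stripped, the stored value still contains the tags, which is why highlighted snippets can cut through the middle of a tag.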

Indexing HTML content... (Embed HTML into XML?)

2007-08-22 Thread Ravish Bhagdev
Hello, Sorry for stupid question. I'm trying to index html file as one of the fields in Solr, I've setup appropriate analyzer in schema but I'm not sure how to add html content to Solr. Encapsulating HTML content within field tag is obviously not valid. How do I add html content? Hope the query