Re: Search within words

2012-01-22 Thread jawedshamshedi
Hi Thanks for the reply.. I am using NGramFilterFactory for this. But it's not working as desired. Like I have a field article_type that has been indexed using the below mentioned field type. The field definition for indexing is : now the problem is that I have a

Solr Cores

2012-01-22 Thread Sujatha Arun
Hello, We have in production a number of individual solr Instnaces on a single JVM.As a result ,we see that the permgenSpace keeps increasing with each additional instance added. I would Like to know ,if we can have solr cores , instead of individual instances. - Is there any limit to the nu

Re: Phonetic search for portuguese

2012-01-22 Thread Gora Mohanty
On Mon, Jan 23, 2012 at 9:21 AM, Anderson vasconcelos wrote: > Thanks a lot Gora. > I need to delivery the first release for my client on 25 january. > With your explanation, i can negociate better the date to delivery of > this feature for next month, because i have other business rules for > del

Re: Phonetic search for portuguese

2012-01-22 Thread Anderson vasconcelos
Thanks a lot Gora. I need to delivery the first release for my client on 25 january. With your explanation, i can negociate better the date to delivery of this feature for next month, because i have other business rules for delivery and this features is more complex than i thought. I could help you

Re: Phonetic search for portuguese

2012-01-22 Thread Gora Mohanty
On Mon, Jan 23, 2012 at 5:58 AM, Anderson vasconcelos wrote: > Hi Gora, thanks for the reply. > > I'm interesting in see how you did this solution. But , my time is not > to long and i need to create some solution for my client early. If > anyone knows some other simple and fast solution, please p

Re: Improving Solr Spell Checker Results

2012-01-22 Thread Erick Erickson
David: There's some good info here: http://wiki.apache.org/solr/HowToContribute#Working_With_Patches But the short form is to go into solr_home and issue this command: 'svn diff > SOLR-2585.patch'. IDE's may also have a "create patch" feature, but I find the straight SVN command more reliable. N

Re: Improving Solr Spell Checker Results

2012-01-22 Thread David Radunz
Hey Erick, Sure, can you explain the process to create the patch and upload it and i'll do it first thing tomorrow. Thanks again for your help, David On 23/01/2012 12:51 PM, Erick Erickson wrote: I can't help with your *real* problem, but when looking at patches, if the "resolution" fie

Re: Improving Solr Spell Checker Results

2012-01-22 Thread Erick Erickson
I can't help with your *real* problem, but when looking at patches, if the "resolution" field isn't set to something like "fixed" it means that the patch has NOT been applied to any code lines. There also should be commit revisions specified in the comments. If "Fix Versions" has values, that does

Re: Failure noticed from new...@zju.edu.cn

2012-01-22 Thread David Radunz
Hey, That seems to have helped, I didn't get a failure notice re-sending the message. I'll have to keep that in mind. Thanks very much, David On 23/01/2012 12:41 PM, Erick Erickson wrote: I've seen the spam filter be pretty aggressive with HTML formatting etc, what happens when you just

Re: Improving Solr Spell Checker Results

2012-01-22 Thread David Radunz
Hey, I am trying to send this again as 'plain-text' to see if it delivers ok this time. All of the previous messages I sent should be below.. Cheers, David On 22/01/2012 11:42 PM, David Radunz wrote: Hey James, I have played around a bit more with the settings and tried setting sp

Re: Tika0.10 language identifier in Solr3.5.0

2012-01-22 Thread Erick Erickson
Would "doing the right thing" include firing the results at different fields based on the language detected? Your answer to Jan seems to indicate not, in which case my original comments stand. The main point is that mixing all the *results* of the analysis chains for multiple languages into a singl

Re: Failure noticed from new...@zju.edu.cn

2012-01-22 Thread Erick Erickson
I've seen the spam filter be pretty aggressive with HTML formatting etc, what happens when you just send them as "plain text"? Best Erick On Sat, Jan 21, 2012 at 7:24 AM, David Radunz wrote: > Hey, > >    Every time I send a reply to the list I get a failure for > new...@zju.edu.cn. Should I jus

Re: Getting a word count frequency out of a page field

2012-01-22 Thread solr user
See comments inline below. On Sun, Jan 22, 2012 at 8:27 PM, Erick Erickson wrote: > Faceting won't work at all. Its function is to return the count > of the *documents* that a value occurs in, so that's no good > for your use case. > > "I don't know how to issue a proper SOLR query that returns a

Re: Validating solr user query

2012-01-22 Thread Erick Erickson
Good luck on that If you allow free-form input, bad queries are just going to happen. To prevent this from getting to Solr, you essentially have to reproduce the entire Solr/Lucene parser. So why not just let the parser to it for you and present some pretty message to the user? The other thi

Re: Getting a word count frequency out of a page field

2012-01-22 Thread Erick Erickson
Faceting won't work at all. Its function is to return the count of the *documents* that a value occurs in, so that's no good for your use case. "I don't know how to issue a proper SOLR query that returns a word count for a paragraph of text such as the term "amplifier" for a field. For some reason

Re: Sort for Retrieved Data

2012-01-22 Thread Erick Erickson
See belowl On Fri, Jan 20, 2012 at 10:42 AM, Bing Li wrote: > Dear all, > > I have a question when sorting retrieved data from Solr. As I know, Lucene > retrieves data according to the degree of keyword matching on text field > (partial matching). > > If I search data by string field (complete ma

Re: Phonetic search for portuguese

2012-01-22 Thread Anderson vasconcelos
Hi Gora, thanks for the reply. I'm interesting in see how you did this solution. But , my time is not to long and i need to create some solution for my client early. If anyone knows some other simple and fast solution, please post on this thread. Gora, you could talk how you implemented the Custo

facet pivot and range

2012-01-22 Thread Antoine LE FLOC'H
Hello, I can't find anything related to what I would like to do: a facet.pivot but have ranges on the second level, something like facet.pivot=cat,price where price is a range facet facet.range=price&facet.range.start=0&facet.range.end=1000&facet.range.gap=10 Is it doable with Solr4 ? How did

Re: Trying to understand SOLR memory requirements

2012-01-22 Thread Dave
I take it from the overwhelming silence on the list that what I've asked is not possible? It seems like the suggester component is not well supported or understood, and limited in functionality. Does anyone have any ideas for how I would implement the functionality I'm looking for. I'm trying to i

Re: "index-time" over boosted

2012-01-22 Thread remi tassing
Hi, I got wrong in beginning but putting omitNorms in the query url. Now following your advice, I merged the schema.xml from Nutch and Solr and made sure omitNorms was set to "true" for the content, just as you said. Unfortunately the problem remains :-( On Thursday, January 19, 2012, Jan Høyda

Re: How to Sort in a Different Way

2012-01-22 Thread yunfei wu
what kind of new sorting ways you want? If you want to change Lucene's score of how relevant the result is, you may play with the boosting. If you just want to sort on fields, you can use "sort=fieldname" to sort on string, integer, date fields. Yunfei On Sat, Jan 21, 2012 at 8:39 AM, Bing Li

Re: Phonetic search for portuguese

2012-01-22 Thread Gora Mohanty
On Sun, Jan 22, 2012 at 5:47 PM, Anderson vasconcelos wrote: > Anyone could help? > > Thanks > > 2012/1/20, Anderson vasconcelos : >> Hi >> >> The phonetic filters (DoubleMetaphone, Metaphone, Soundex, RefinedSoundex, >> Caverphone) is only for english language or works for other languages? Have >

Re: Improving Solr Spell Checker Results

2012-01-22 Thread David Radunz
Hey James, I have played around a bit more with the settings and tried setting spellcheck.maxResultsForSuggest=100 and spellcheck.maxCollations=3. This yields 'Sigourney Weaver' as ONE of the corrections, but it's the second one and not the first. Which is wrong if this is a patch for 'con

Re: Phonetic search for portuguese

2012-01-22 Thread Anderson vasconcelos
Anyone could help? Thanks 2012/1/20, Anderson vasconcelos : > Hi > > The phonetic filters (DoubleMetaphone, Metaphone, Soundex, RefinedSoundex, > Caverphone) is only for english language or works for other languages? Have > some phonetic filter for portuguese? If dont have, how i can implement >

RE: Tika0.10 language identifier in Solr3.5.0

2012-01-22 Thread nibing
Hi, This is exactly what I hope you can elaborate on - analyzer that detects the language and then analyze accordingly. How to do that? Thank you. Best Regards Ni, Bing > From: ted.dunn...@gmail.com > Date: Fri, 20 Jan 2012 09:15:30 -0800 > Subject: Re: Tika0.10 language identifier in Sol

Re: Improving Solr Spell Checker Results

2012-01-22 Thread David Radunz
James, I worked out that I actually needed to 'apply' patch SOLR-2585, whoops. So I have done that now and it seems to return 'correctlySpelled=true' for 'Sigorney Wever' (when Sigorney isn't even in the dictionary). Could something have changed in the trunk to make your patch no longer w

Re: How to Sort By a PageRank-Like Complicated Strategy?

2012-01-22 Thread Bing Li
Dear Shashi, Thanks so much for your reply! However, I think the value of PageRank is not a static one. It must update on the fly. As I know, Lucene index is not suitable to be updated too frequently. If so, how to deal with that? Best regards, Bing On Sun, Jan 22, 2012 at 12:43 PM, Shashi Kan