Re: NGramFilterFactory for auto-complete that matches the middle of multi-lingual tags?

2010-10-03 Thread Andy
--- On Sat, 10/2/10, Ahmet Arslan wrote: > > I don't understand. Many tags like "electric吉他" > or > > "古典吉他" have no whitespace at all, so how does > > WhitespaceTokenizer help? > > It makes sense for tags having more than one words. i.e. > "electric guitar" > > If you tokenize this using whi

Re: NGramFilterFactory for auto-complete that matches the middle of multi-lingual tags?

2010-10-03 Thread Gert Brinkmann
On 03.10.2010 09:20, Andy wrote: NGramFilterFactory would then take that one toke ("electric guitar") and generate N-grams out of it. One of the ngrams would be "guit" because "guit" is a substring of "electric guitar". AFAIK it only produces prefix-strings like gui guit guita guitar etc. So

RE: Is Solr right for our project?

2010-10-03 Thread Ephraim Ofir
The shards parameter can be added by the search handler if you configure it to do so, then the client doesn't have to know about it. You can put your replicated shards behind a proxy/balancer which will check their health and that way fallback will be automatic. Ephraim Ofir -Original Mes

Re: NGramFilterFactory for auto-complete that matches the middle of multi-lingual tags?

2010-10-03 Thread Ahmet Arslan
> But I thought NGramFilterFactory would generate substrings > that start in the "middle", hence ensuring autocomplete > matching in the middle. > > So in the case of "electric guitar", keywordtokenizer would > create one token - "electric guitar" > > NGramFilterFactory would then take that one t

How to use different solrconfig.xml for nodes in SolrCloud?

2010-10-03 Thread newsam
Hi all, I am trying to set up a two-shards solr cluster with dev cloud branch (http://wiki.apache.org/solr/SolrCloud). We'd like to use different solrconfig.xml for the two shards. The main difference is the replication information. However, the cluster model will use a single solrconfig.xml an

Re: TermVector filter

2010-10-03 Thread Grant Ingersoll
I think you'd have to hook in your own TermVectorMapper. On Oct 1, 2010, at 1:30 AM, Scott Yeadon wrote: > Hi, > > With the TermVector component, is there a means of limiting/filtering the > returned information to only those terms found in a query? > > Scott. -- Grant

Re: LocalSolr, Spatial Search, LatLonType clarification

2010-10-03 Thread Grant Ingersoll
On Sep 30, 2010, at 11:40 AM, webdev1977 wrote: > > I have been reading through all the jira issues and patches, as well as the > wikis and I still have a few things that are not clear to me. > > I am currently running with Solr 1.4.1 and using Nutch for my crawling. > Everything is working g

Re: Autosuggest with inner phrases

2010-10-03 Thread Arunkumar Ayyavu
I had the same question few days back. You can look at the solution suggested by Chantal in this link. http://www.lucidimagination.com/search/document/9bbce5302bd3940e/autocomplete_match_words_anywhere_in_the_token#cec7133bbaf5b49c On Sat, Oct 2, 2010 at 3:44 PM, sivaprasad wrote: > > Hi , > I im

RE: NGramFilterFactory for auto-complete that matches the middle of multi-lingual tags?

2010-10-03 Thread Jonathan Rochkind
Huh, the NGramFilterFactory itself isn't listed on the the analyzers wiki at: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters That wiki page seems to be protected to certain users only. Anyone know if there's a way to send a 'patch' to the maintainers for the wiki, or if there's a p

Multiple masters and replication between masters?

2010-10-03 Thread Arunkumar Ayyavu
I'm looking at setting up multiple masters for redundancy (for index updates). I found the thread in this link (http://www.lucidimagination.com/search/document/68ac303ce8425506/multiple_masters_solr_replication_1_4) discussed this subject more than a year back. Does Solr support such configuration

Re: NGramFilterFactory for auto-complete that matches the middle of multi-lingual tags?

2010-10-03 Thread Robert Muir
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters?action=newaccount On Sun, Oct 3, 2010 at 2:40 PM, Jonathan Rochkind wrote: > Huh, the NGramFilterFactory itself isn't listed on the the analyzers wiki > at: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters > > That wiki page

Re: NGramFilterFactory for auto-complete that matches the middle of multi-lingual tags?

2010-10-03 Thread Dennis Gearon
What's the difference between the filter/anayzers that have 'factory' in their name, and the ones that don't? Dennis Gearon Signature Warning EARTH has a Right To Life, otherwise we all die. Read 'Hot, Flat, and Crowded' Laugh at http://www.yert.com/film.php --- On Sun, 10

Re: Upgrade to Solr 1.4, very slow at start up when loading all cores

2010-10-03 Thread Lance Norskog
Could Solr just load cores one at a time, waiting for loader events to finish? Or continuously stage 2 or three simultaneously? On Sat, Oct 2, 2010 at 7:08 AM, Yonik Seeley wrote: > On Fri, Oct 1, 2010 at 5:42 PM, Renee Sun wrote: >> Hi Yonik, >> >> I attached the solrconfig.xml to you in previo

Re: timestamp column

2010-10-03 Thread Lance Norskog
The "date" type in schema.xml does this. It is a Trie type, meaning it stores very efficiently. http://search.lucidimagination.com/search/out?u=http%3A%2F%2Fwiki.apache.org%2Fsolr%2FSolrQuerySyntax On Sat, Oct 2, 2010 at 11:08 AM, Dennis Gearon wrote: > Is there a timestamp column in Solr,i.e. I

Re: NGramFilterFactory for auto-complete that matches the middle of multi-lingual tags?

2010-10-03 Thread Andy
Ah Thanks for clearing that up. Does anyone know how to deal with these 2 issues when using NGramFilterFactory for autocomplete? 1) hyphens - if user types "ema" or "e-ma" I want to suggest "email" 2) accents - if user types "herme" want to suggest "Hermès" Thanks. --- On Sun, 10/3/10, Ahmet

How to do MoreLikeThis with documents in seperate indexes?

2010-10-03 Thread Savannah Beckett
Is it possible to do MoreLikeThis with documents that are in seperate indexes?  If so, how?  Thanks.

Re: NGramFilterFactory for auto-complete that matches the middle of multi-lingual tags?

2010-10-03 Thread Lance Norskog
Start a new thread. Dennis Gearon wrote: What's the difference between the filter/anayzers that have 'factory' in their name, and the ones that don't? Dennis Gearon Signature Warning EARTH has a Right To Life, otherwise we all die. Read 'Hot, Flat, and Crowded' Laugh at

MultiValued compound types

2010-10-03 Thread Lance Norskog
From another thread: Spake Grant Ingersoll: You can have multiple lat/lons per document, you just can't have multiple per field. Is this a temporary limitation, a quirk of the LatLon type, or is it an architectural limitation in the compound type design? Lance Norskog

Re: Autosuggest with inner phrases

2010-10-03 Thread Bhavnik Gajjar
Hi, This thread can be useful http://www.lucidimagination.com/search/document/9edc01a90a195336/enhancing_auto_complete#d1340d7715162608 Regards, Bhavnik On 10/3/2010 11:51 PM, Arunkumar Ayyavu wrote: > I had the same question few days back. You can look at the solution > suggested by Chantal