SOLR 1.4: how to configure the improved chinese analyzer?

2009-12-09 Thread Fer-Bj
Hello, is there any existing FAQ or HowTo on how to setup the improved (and new?) chinese analyzer on Solr 1.4? I'd appreciate any help you may provide on this. Thanks, -- View this message in context: http://old.nabble.com/SOLR-1.4%3A-how-to-configure-the-improved-chinese-analyzer--tp267067

using q= , adding fq=

2009-12-11 Thread Fer-Bj
We're running a 14M documents index. For each document we have: (and a few other fields). Our most usual query is something like this: q=cat_id:xxx AND geo_id:&sort=id desc where cat_id = which "category" (cars,sports,toys,etc) the item belongs to, and geo_id = whic

Using Chinese / How to ?

2009-05-31 Thread Fer-Bj
Hello, is there any "how to" already created to get me up using SOLR 1.3 running for a chinese based website? Currently our site is using SOLR 1.2, and we tried to move into 1.3 but we couldn't complete our reindex as it seems like 1.3 is more strict when it comes to special chars. I would app

Re: Using Chinese / How to ?

2009-06-01 Thread Fer-Bj
think we have a > specific how to, but I wouldn't think it would be much different from > 1.2 > > -Grant > On May 31, 2009, at 10:31 PM, Fer-Bj wrote: > >> >> Hello, >> >> is there any "how to" already created to get me up using SOLR 1.3 >&

Re: Using Chinese / How to ?

2009-06-02 Thread Fer-Bj
ese? > > Update problem? > > On Tuesday, June 2, 2009, Fer-Bj wrote: >> >> I'm sending 3 files: >> - schema.xml >> - solrconfig.xml >> - error.txt (with the error description) >> >> I can confirm by now that this error is due to invalid cha

Re: indexing Chienese langage

2009-06-03 Thread Fer-Bj
We are trying SOLR 1.3 with Paoding Chinese Analyzer , and after reindexing the index size went from 1.5 Gb to 2.7 Gb. Is that some expected behavior ? Is there any switch or trick to avoid having a double + index file size? Koji Sekiguchi-2 wrote: > > CharFilter can normalize (convert) tradit

Re: Field Compression

2009-06-04 Thread Fer-Bj
Is it correct to assume that using field compression will cause performance issues if we decide to allow search over this field? ie: if I decide to add "compressed=true" to the BODY field... and a I allow search on body... would that be a problem? At the same time: if I add compress

Re: indexing Chienese langage

2009-06-04 Thread Fer-Bj
netic wrote: > > > I can't tell what that analyzer does, but I'm guessing it uses n-grams? > Maybe consider trying https://issues.apache.org/jira/browse/LUCENE-1629 > instead? > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > >

Re: Field Compression

2009-06-04 Thread Fer-Bj
Here is what we have: for all the documents we have a field called "small_body" , which is a 60 chars max text field that were we store the "abstract" for each article. We have about 8,000,000 documents indexed, and usually we display this small_body on our "listing pages". For each listing pa