Re: Solr performance is slow with just 1GB of data indexed

2015-08-23 Thread Jimmy Lin
unsubscribe On Sat, Aug 22, 2015 at 9:31 PM, Zheng Lin Edwin Yeo wrote: > Hi, > > I'm using Solr 5.2.1, and I've indexed about 1GB of data into Solr. > > However, I find that clustering is exceeding slow after I index this 1GB of > data. It took almost 30 seconds to return the cluster results wh

Optimal setup for multiple tools

2014-04-26 Thread Jimmy Lin
;views" of the data? Also any tips on how to ensure that these two collections are "in sync" (meaning any documents indexed into the documents collection are also properly indexed in places)? Thanks alot, Jimmy Lin

SOLR and Kerberos enabled HDFS

2014-03-03 Thread Jimmy
Hello, I am trying to connect SOLR (tried 4.4 and 4.7) to kerberos enabled HDFS - I am using Cloudera CDH 4.2.1 http://maven-repository.com/artifact/com.cloudera.cdh/cdh-root/4.2.1/pom_effective the keytab and principal is valid (I tested it with flume as well as simple hdfs cli) did anobody su

Re: Adding attributes to Solr fields ?

2013-10-04 Thread jimmy nguyen
Hello, and thank you for your answer Shawn. I tried to simplify my problem but I realize I chose a bad example : I don't process phone numbers, and I do process unstructured documents. My GATE application might return several annotations for the same group of words (because I'm using an ontology

Adding attributes to Solr fields ?

2013-10-03 Thread jimmy nguyen
Hi all, is it possible to add attributes to our Solr fields ? I'm indexing GATE-annotated documents into solr. The annotations produced by my GATE application usually have several features (for example, Person.title, Person.name, Person.phoneNumber...). Now each of my documents may contain more

Re: Indexing several sub-fields in one solr field

2013-09-19 Thread jimmy nguyen
p 19, 2013 at 4:05 PM, Jack Krupansky wrote: > There is no such fieldType attribute as "subSuffix". Solr is just > complaining about extraneous, junk attributes. Delete the crap. > > -- Jack Krupansky > > -Original Message- From: jimmy nguyen > Sent: Thursda

Indexing several sub-fields in one solr field

2013-09-19 Thread jimmy nguyen
Hello, I'd like to index into Solr (4.4.0) documents that I previously annotated with GATE (7.1). I use Behemoth to be able to run my GATE application on a corpus of documents on Hadoop, and then Behemoth allows me to directly send my annotated documents to solr. But my question is not about the B

Solr Fuzzy search on short string

2013-03-26 Thread Jimmy Dean
I did a fuzzy search on solr. The result is a little strange to me. Query "carj~" can match "carl". But "cari" can't match "carl". As a matter of fact, car[x]~, [x]>"i" can match "carl". Is this the correct behavior? Jimmey

Re: complex keywords, hierarchical data, Solr representation problem

2012-01-09 Thread jimmy
Thanks for the fast reply. I went with your suggestion and saved the full category path as well the category_id as integer. I also tested the index space consumption and it was less than I thought. So, if i only store the category_id as an integer I have a full index size of 246MB. With the full c

complex keywords, hierarchical data, Solr representation problem

2012-01-08 Thread jimmy
Hi, I'm new to Solr and already highly impressed about its possibilities and speed. Until now, I only have used a relational database (MySQL) and programmed so far everything in php or Java. Now, I'm stuck and don't know how to represent my data in a Solr Index. To simplify things, first I want

Re: add CJKTokenizer to solr

2007-01-30 Thread zha jimmy
Thank you all, it's works now:). 2007/1/30, James liu <[EMAIL PROTECTED]>: he now is ok. -- regards jl

add CJKTokenizer to solr

2007-01-28 Thread zha jimmy
hi, all I am try to config solr to support chinese tokenize。 I saw the tips in schema.xml: Then I modified schema.xml : When I start the solr there is some error Caused by: java.lang.ClassNotFoundException: org.apache.lucene.analysis.cjk.CJ