Re: Question about Simple Post tool

2016-08-01 Thread Scott Chu
I don't think it's possible purely using the out-of-box post.jar. But why not disassemble post.jar (or get the source from internet) and modify it yourself. It seems not that hard. Scott Chu,scott@udngroup.com 2016/8/1 (週一) - Original Message - From: Jamal, Sarfaraz To:

Re(2): Are there issues with the use of SolrCloud / embedded Zookeeperin non-HA deployments?

2016-07-28 Thread Scott Chu
e thing as Markus did. No matter single or multiple Solr instances, I always uses SolrCloud 5 although we have some old Solr 3.5 using HA proxy and master/slave configuration. But they all run under Linux. Scott Chu,scott@udngroup.com 2016/7/29 (週五) - Original Message - From: Mar

Re: Recommended api/lib to search Solr using PHP

2016-06-02 Thread Scott Chu
ecking value of $result->getNumFound();. 7. If there're any document(s) returned, $result will essentially be an array of document object. Use 'foreach' to iterate it. 8. document object is essentially array of field. Say its variable name is $doc. You can access field by 3 ways:

What if adding 3rd node exceeds replication Factor? [scottchu]

2016-05-25 Thread Scott Chu
Do I just start node3 and join in same zk ensemble, Solrcloud will automatically create replica on 3rd node? Or do I have to manually call some API to add replica to 3rd node? Either way, doesn't this exceed the replicationFactor? Scott Chu,scott@udngroup.com 2016/5/25 (週三)

Re: Import html data in mysql and map schemas using onlySolrCELL+TIKA+DIH [scottchu]

2016-05-24 Thread Scott Chu
Justa let everybody know. I use DIH+template (without TIKA and Solr Cell, I really don't understand that part in reference guide) to achieve what I want. But still need to test more various form of HTML source. Scott Chu,scott@udngroup.com 2016/5/24 (週二) p.s. There're really

What to do best when expaning from 2 nodes to 4 nodes? (fix typo) [scottchu]

2016-05-23 Thread Scott Chu
the best adequate experienced values of numShards and replicationFactor? Lastly, If I add new shard(s), I think there's no other way but reindex if I want my data to be evenly distributed into every shard, right? Scott Chu,scott@udngroup.com 2016/5/23 (週一)

What to do best when expaning from 2 nodes to 4 nodes? [scottchu]

2016-05-23 Thread Scott Chu
Shards and replicationFactor? Lastly, I think there's no other way but reindex if I want my data to be evenly distributed into every shard I create, right? Scott Chu,scott@udngroup.com 2016/5/23 (週一) P.S. For those who are curious of why I add [scottchu] in subject, the reason is that I want my

Re: How to use "fq"

2016-05-23 Thread Scott Chu
Yonik has a very well article about term qp: Solr Terms Query for matching many terms - Solr 'n Stuff http://yonik.com/solr-terms-query/ Scott Chu,scott@udngroup.com 2016/5/23 (週一) - Original Message - From: Erik Hatcher To: solr-user CC: Date: 2016/5/23 (週一) 21:14 Subjec

Re: Import html data in mysql and map schemas using only SolrCELL+TIKA+DIH [scottchu]

2016-05-20 Thread Scott Chu
For this project, I intend to use Solr 5.5 or Solr 6. I know how to modify config to go back to use ClassicIndex, ie. manual schema.xml. Scott Chu,scott@udngroup.com 2016/5/21 (週六) - Original Message - From: Siddhartha Singh Sandhu To: solr-user ; scott.chu CC: Date: 2016/5/21

Re: Switching zk node cause load conf error

2016-05-20 Thread Scott Chu
Is it possible to repair the config data? Scott Chu,scott@udngroup.com 2016/5/21 (週六) - Original Message - From: scott.chu To: solr-user CC: Date: 2016/5/21 (週六) 00:44 Subject: Switching zk node cause load conf error I intially start up 3 zk nodes and upload config 'cugn

Switching zk node cause load conf error

2016-05-20 Thread Scott Chu
config on co-op and standalone seem to have difference? Scott Chu,scott@udngroup.com 2016/5/21 (週六)

Re: [scottchu] Cab I migrate solrcloud by just copying whole package folder?

2016-05-16 Thread Scott Chu
Thanks to Binoy and Erick. I'll go use external zk tomorrow and do what you suggest. Scott Chu,scott@udngroup.com 2016/5/16 (週一) - Original Message - From: Erick Erickson To: solr-user ; scott.chu CC: Date: 2016/5/16 (週一) 23:41 Subject: Re: [scottchu] Cab I migrate solrclo

Re: [scottchu] Cab I migrate solrcloud by just copying whole package folder?

2016-05-16 Thread Scott Chu
But I'm using embeded zk nodes provided by solr start command. I thought they are all under d:\solr5. How can I run that embedded zk node independently? Scott Chu,scott@udngroup.com 2016/5/16 (週一) - Original Message - From: Binoy Dalal To: scott.chu ; solr-user CC: Date: 2

[scottchu] Cab I migrate solrcloud by just copying whole package folder?

2016-05-16 Thread Scott Chu
On my office pc, I install Solr 5 on d:\solr5 and create myconfigsets and mynodes under it. Then run a solrcloud with 2 nodes and embedded zk nodes by executing these commands: cd /d d:\solr5 bin\solr start -c -s mynode\node1 bin\solr start -c -s mynode\node2 -p 7973 -z localhost:998

Re: Kate Winslet vs Winslet Kate

2015-11-03 Thread scott chu
solr-user,妳好 With repsect to querying, Dismax makes solr query syntax quite like Google's, you type simple keywords, you can boost them, you can use +/- just like Google's. Meaning they give users a lot of covenince and less boolean knowlege to establish intended query string. Normal Lucene sea

Re: Highlighting content field problem when using JiebaTokenizerFactory

2015-10-26 Thread Scott Chu
ojects to close)! If you find the solution or workaround, pls. let us know. Good luck again! Scott Chu,scott@udngroup.com 2015/10/27 - Original Message ----- From: Scott Chu To: solr-user Date: 2015-10-27, 10:27:45 Subject: Re: Highlighting content field problem when using JiebaTokeni

Re: Highlighting content field problem when using JiebaTokenizerFactory

2015-10-26 Thread Scott Chu
ng is disabled. - AS... https://issues.apache.org/jira/browse/SOLR-4722 This one is modified from FastVectorHighLighter, so ensure those 3 term* attributes are on. Scott Chu,scott@udngroup.com 2015/10/27 - Original Message - From: Zheng Lin Edwin Yeo To: solr-user Date: 2015-10-23,

Re: Highlighting content field problem when using JiebaTokenizerFactory

2015-10-22 Thread Scott Chu
xing process causes wrong position info. for that field and when Solr do highlighting, it retrieves wrong position info. and mark wrong position of highlight target terms. Scott Chu,scott@udngroup.com 2015/10/23 - Original Message - From: Zheng Lin Edwin Yeo To: solr-user Date: 2015-1

Re: Is it possible to specigfy only one-character term synonymfor2-gram tokenizer?

2015-10-22 Thread Scott Chu
C1 in their articles become C2 since some term containing C1 are proper nouns or terminologies. I'll talk to them to see if this is ok. Thanks anyway. Scott Chu,scott@udngroup.com 2015/10/23 - Original Message - From: Emir Arnautovic To: solr-user Date: 2015-10-22, 18:20:38 Subje

Re: Is it possible to specigfy only one-character term synonym for2-gram tokenizer?

2015-10-22 Thread Scott Chu
rule. But it is kinda ugly, especially when applying highlight, e.g. search "C1C2" Solr returns highlight snippet such as "...C1C2...". Scott Chu,scott@udngroup.com 2015/10/22 - Original Message - From: Emir Arnautovic To: solr-user Date: 2015-10-22, 17:08:26 Su

Is it possible to specigfy only one-character term synonym for 2-gram tokenizer?

2015-10-22 Thread Scott Chu
SynonymFilter, I only have to query "C1Cm..." (say Cm is arbitrary Chinese character) and Solr will return documents that matche whether "C1Cm" or "C2Cm"? Scott Chu,scott@udngroup.com 2015/10/22

Re: Highlighting content field problem when using JiebaTokenizerFactory

2015-10-22 Thread Scott Chu
d, url, score on id, title, content, author, tag true true html 200 true signature true 100 WORD en SG Meanwhile, I'll take a look at the articles too. Thank you. Regards, Edwin On 20 October 2015 at 11:32, Scott Chu wrote: >

Re: Highlighting content field problem when using JiebaTokenizerFactory

2015-10-19 Thread Scott Chu
cles, maybe the answer or workaround can be deducted from them. https://issues.apache.org/jira/browse/SOLR-3390 http://qnalist.com/questions/661133/solr-is-highlighting-wrong-words http://qnalist.com/questions/667066/highlighting-marks-wrong-words Good luck! Scott Chu,scott@udngroup

Re: concept and choice: custom sharding or auto sharding?

2015-09-23 Thread scott chu
to get you started with SolrJ: http://lucidworks.com/blog/indexing-with-solrj/ Best, Erick On Thu, Sep 3, 2015 at 10:26 AM, Toke Eskildsen wrote: > scott chu wrote: >> No, both. But first I have to face the indexing performance problem. >> Where can I see information about concurrent/parallel i

Re: Re: Re: Re: Re: Re: concept and choice: custom sharding or autosharding?

2015-09-03 Thread scott chu
, 01:26:38 Subject: Re: Re: Re: Re: Re: concept and choice: custom sharding or autosharding? scott chu wrote: > No, both. But first I have to face the indexing performance problem. > Where can I see information about concurrent/parallel indexing on Solr? Depends on how you index. If you

Re: Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread scott chu
: Re: Re: Re: concept and choice: custom sharding or auto sharding? scott chu wrote: ? > I keep forgeting to mention one thing along the discussion session. > Our data is Chinese news articles and we use CJK tokenizer > (i.e. 2-gram) currently. The time spent to indexing is quite slow, &

Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread scott chu
27;ll actually do a complete replication from the leader to the followers, but thereafter incremental updates will be sent to all the nodes in the cluster rather than the older style master/slave occasional replication. Best, Erick On Thu, Sep 3, 2015 at 8:54 AM, scott chu wrote: > >

Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread scott chu
--- Original Message - From: Toke Eskildsen To: solr-user Date: 2015-09-03, 18:33:39 Subject: Re: Re: concept and choice: custom sharding or auto sharding? On Thu, 2015-09-03 at 18:24 +0800, Scott Chu wrote: > Do you use master-slave or SolrCloud for that single shard? Due to legacy reasons we

Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Scott Chu
Do you use master-slave or SolrCloud for that single shard? Erick suggest that I can still can use SolrCloud for HA/DR purpose cause Zookeeper can do the work for me. Should I just give up master-slave choice even there's only one single shard? Scott Chu,scott@udngroup.com 201

Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-02 Thread scott chu
solr-user,妳好 Sorry ,wrong again. Auto sharding is not implicit router. - Original Message - From: scott chu To: solr-user Date: 2015-09-02, 23:50:20 Subject: Re: Re: Re: concept and choice: custom sharding or auto sharding? solr-user,妳好 Thanks! I'll go back to check m

Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-02 Thread scott chu
elp: https://wiki.apache.org/solr/SolrPerformanceProblems Best, Erick On Wed, Sep 2, 2015 at 8:19 AM, scott chu wrote: > > solr-user,妳好 > > Do you mean I only have to put 10M documents in one index and copy it to > many slaves in a classic Solr master-slave architecture to

Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-02 Thread scott chu
ud and have it manage multiple _replicas_ of a single shard for HA/DR. So this seems like an XY problem, you're asking specific questions about shard routing because you think it'll solve some problem without telling us what the problem is. Best, Erick On Wed, Sep 2, 2015 at 7:47 AM,

concept and choice: custom sharding or auto sharding?

2015-09-02 Thread scott chu
I post a question on Stackoverflow http://stackoverflow.com/questions/32343813/custom-sharding-or-auto-sharding-on-solrcloud: However, since this is a mail-list, I repost the question below to request for suggestion and more subtle concept of SolrCloud's behavior on document routing. I want to es

custom shard or auto shard for SolrCloud?

2015-09-01 Thread Scott Chu
I post this question on Stackoverflow and would like some suggestion: solr - Custom sharding or auto Sharding on SolrCloud? - Stack Overflow http://stackoverflow.com/questions/32343813/custom-sharding-or-auto-sharding-on-solrcloud Scott Chu,scott@udngroup.com 2015/9/2

Does anyone notice this site?

2010-10-25 Thread scott chu
I happen to bump into this site: http://www.solr.biz/ They said they are also developing a search engine? Is this any connection to open source "Solr"?

Re: Hardware Specs Question

2010-09-03 Thread scott chu
well balanced system = Agree. Here we'll start a performance & load test this month. I've defined a test criteria of 'qps', 'RTpQ' & worse case according to our use case & past experience. Our goal is pursuing this criteria & adjust hardware & system configuration to find a well

Re: Doing Shingle but also keep special single word

2010-08-22 Thread scott chu
Isn't set outputUnigrams="true" will make index size about twice than when it's set to false? Scott - Original Message - From: "Ahmet Arslan" To: Sent: Saturday, August 21, 2010 1:15 AM Subject: Re: Doing Shingle but also keep special single word I am building index with Shingle

Re: Doing Shingle but also keep special single word

2010-08-20 Thread scott chu
g able to boost on the special word field if you wanted. HTH Brendan On Aug 20, 2010, at 6:19 AM, scott chu (朱炎詹) wrote: I am building index with Shingle filter. We know it's minimum 2-gram but I also want keep some special single word, e.g. IBM, Microsoft, etc. i.e. I want to do a minim

How can I do this in Solr?

2010-03-25 Thread scott chu
ng. But in my case, it's a field with mutiple value of another paired name-title values. How can I configure Solr to deal with this case? Best Regards, Scott Chu