Question: How do I run the solr analysis tool programtically ?

2009-09-03 Thread Yatir
Form java code I want to contact solr through Http and supply a text buffer (or a url that returns text, whatever is easier) and I want to get in return the final list of tokens (or the final text buffer) after it went through all the query time filters defined for this solr instance (stemming, st

Howto? Applying a filter across schema fileds using state information

2009-05-18 Thread Yatir
Hi, I need to write a filter that extracts information from the content of one filed (say the Body field) and then applies some transformation based on this content, to a *different* filed (say: the Title field) is this possible ? Example: I will find certain keywords in the body and then locate

Howto? Obtain the IndexReader from within a solr filter

2009-05-18 Thread Yatir
Hi, I am writing a solr filter that needs the DocFreq of each Token in order to decide what to do with it. What is the easiest way to obtain this information from within the filter code ? thanks, Yatir -- View this message in context: http://www.nabble.com/Howto--Obtain-the-IndexReader-from

Re: Howto? Obtain the IndexReader from within a solr filter

2009-05-18 Thread Yatir
filter >> >> what is a solr filter? >> >> On Mon, May 18, 2009 at 2:18 PM, Yatir wrote: >>> >>> Hi, >>> >>> I am writing a solr filter that needs the DocFreq of each Token in >>> order to >>> decide what to do with

Re: Howto? Obtain the IndexReader from within a solr filter

2009-05-18 Thread Yatir
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters Noble Paul നോബിള്‍ नोब्ळ्-2 wrote: > > I am writing a solr filter > > what is a solr filter? > > On Mon, May 18, 2009 at 2:18 PM, Yatir wrote: >> >> Hi, >> >> I am writing a solr filt

Re: changing the root directory where solrCloud stores info inside zookeeper File system

2011-08-02 Thread yatir
Thanks A lot mark, Since My SolrCloud code was old I tried downloading and building the newest code from here https://svn.apache.org/repos/asf/lucene/dev/trunk/ I am using tomcat6 I manually created the sc sub-directory in my zooKeeper ensemble file-system I used this connection String to my ZK ens

error when using the term vector component in solrCloud

2011-08-07 Thread yatir
IS the term vector component supported with solr cloud ? this works fine: http://scmb.nydc1.outbrain.com:8080/solr1/ENPwlMaster/select/?q=id:195652782&fl=*,score&qt=&tv=true&tv.fl=content&tv.tf=true&distrib=true reponse: 195652782http://www.slccglobelink.com/opinion/color-blindness-1.2526237948137

question: solrCloud with multiple cores on each machine

2010-07-27 Thread Yatir Ben Shlomo
one at a time so the IO intensity at any given moment will be low and will not degrade the online performance Is there a way to configure my solr.xml so that when I am doing a distributed search (distrib=true) it will know to query all 8 cores ? Thanks Yatir

question: havnig multiple solrCloud configuration on the same machine

2010-08-15 Thread Yatir Ben Shlomo
Yatir Ben Shlomo Outbrain Engineering yat...@outbrain.com<mailto:yat...@outbrain.com> tel: +972-73-223912 fax: +972-9-8350055 www.outbrain.com<http://www.outbrain.com/>

solrCloud zookeepr related excpetions

2010-08-25 Thread Yatir Ben Shlomo
Hi I am running a zookeeper ensemble of 3 zookeeper instances and established a solrCloud to work with it (2 masters , 2 slaves) on each master machine I have 2 shards (4 shards in total) on one of the masters I keep noticing ZooKeeper related exceptions which I can't understand: One appears to be

possible bug in zookeeper / solrCloud ?

2010-09-14 Thread Yatir Ben Shlomo
currently is using zookeeper-3.2.2.jar Is this a know bug that was fixed in later versions ?( 3.3.1) Thanks in advance, Yatir Logs: Sep 14, 2010 9:02:20 AM org.apache.log4j.Category warn WARNING: Ignoring exception during shutdown input java.nio.channels.ClosedChannelException at

help required: how to design a large scale solr system

2008-09-23 Thread Ben Shlomo, Yatir
Do you agree that the bottle neck is the indexing time? Any help is appreciated Thanks in advance yatir

RE: help required: how to design a large scale solr system

2008-09-24 Thread Ben Shlomo, Yatir
ch will be missing 1 doc. 10 mil each on 3 machines, a *:* search will be missing 30. Not a big deal, but could be a concern for some with picky, look at everything customers. - Mark Ben Shlomo, Yatir wrote: > Hi! > > I am already using solr 1.2 and happy with it. > > In a new pro

*Very* slow Commit after upgrading to solr 1.3

2008-09-29 Thread Ben Shlomo, Yatir
as to check? Thanks. Here is part of my solrConfig file: - < - - false 1000 1000 2147483647 1 1000 1 - - false 1000 1000 2147483647 1 - true Yatir Ben-shlomo | eBay, Inc. | Classificati

RE: *Very* slow Commit after upgrading to solr 1.3

2008-10-07 Thread Ben Shlomo, Yatir
sync'd It may help to try the IndexWriter settings from the 1.3 example setup... the important changes being: 10 32 -Yonik On Mon, Sep 29, 2008 at 5:33 AM, Ben Shlomo, Yatir <[EMAIL PROTECTED]> wrote: > Hi! > > > > I am running on widows 64 bit ... > I

RE: *Very* slow Commit after upgrading to solr 1.3

2008-10-07 Thread Ben Shlomo, Yatir
So other than me doing trial & error, do you have any guidance on how to configure the merge factor (and ramBufferSizeMB ? ). any "formula" that supplies the optimal value ? Thanks, Yatir -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Yon

question: how to divide the indexing into sperate domains

2007-08-09 Thread Ben Shlomo, Yatir
Hi! say I have 300 csv files that I need to index. Each one holds millions of lines (each line is a few fields separated by commas) Each csv file represents a different domain of data (e,g, file1 is computers, file2 is flowers, etc) There is no indication of the domain ID in the data insid

RE: question: how to divide the indexing into sperate domains

2007-08-11 Thread Ben Shlomo, Yatir
Thanks yonik! I do have some unused fields inside the csv file. But they are not empty. They are numeric they can be anything between 0 to 10,000 Can I do something like f.unused.map=*:98765 yatir -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Yonik

problem with quering solr after indexing UTF-8 encoded CSV files

2007-08-20 Thread Ben Shlomo, Yatir
&rows=10&indent=on##> 1 desc is a very good camera display is יתיר ABC res123 1 1 ABC res123 C123 123456 72900010123 yatir

solved: quering UTF-8 encoded CSV files

2007-08-21 Thread Ben Shlomo, Yatir
tions (in catalina.bat) yatir From: Ben Shlomo, Yatir [mailto:[EMAIL PROTECTED] Sent: Monday, August 20, 2007 6:40 PM To: solr-user@lucene.apache.org Subject: problem with quering solr after indexing UTF-8 encoded CSV files Hi! I have utf-8 encoded data in

I can't delete, why?

2007-09-25 Thread Ben Shlomo, Yatir
turer:123456 Thanks yatir

RE: solr not finding all results

2007-10-15 Thread Ben Shlomo, Yatir
Did you try to add a backslash to escape the "-" in Geckoplp4-M (Geckoplp4\-M) -Original Message- From: Kevin Lewandowski [mailto:[EMAIL PROTECTED] Sent: Friday, October 12, 2007 9:40 PM To: solr-user@lucene.apache.org Subject: solr not finding all results I've found an odd situation wh

changing the root directory where solrCloud stores info inside zookeeper File system

2011-07-26 Thread Yatir Ben Shlomo
Hi! I am using solrCloud with a zookeeper ensamble of 3. I noticed that solcOuld stores information direclt under the root dir in the ZooKeepr file system: \config \live_nodes \ collections In my setup Zookeepr is also used by other modules so I would like solrCLoud to store everything under /s

RE: changing the root directory where solrCloud stores info inside zookeeper File system

2011-08-02 Thread Yatir Ben Shlomo
Thanks A lot mark, Since My SolrCloud code was old I tried downloading and building the newest code from here https://svn.apache.org/repos/asf/lucene/dev/trunk/ I am using tomcat6 I manually created the sc sub-directory in my zooKeeper ensemble file-system I used this connection String to my ZK ens