Re: Is there a way for SOLR / SOLRJ to index files directly bypassing HTTP streaming?

2012-03-18 Thread vybe3142
I'm going to try the approach described here and see what happens http://lucene.472066.n3.nabble.com/Fastest-way-to-use-solrj-td502659.html

Re: Is there a way for SOLR / SOLRJ to index files directly bypassing HTTP streaming?

2012-03-18 Thread vybe3142
Thanks much. I plan to try this tomorrow. Can someone describe how to use remote streaming programmatically with SolrJ? For example, see the basic clients described here: http://androidyou.blogspot.com/2010/05/client-integration-with-solr-by-using.html and observe that the data is transferred in

Invalid version (expected 2, but 60) or the data in not in 'javabin' format

2012-03-18 Thread 怪侠
Hi, all. I want to update the file's index. The following is my code: ContentStreamUpdateRequest up = new ContentStreamUpdateRequest( "/update/extract"); up.addFile(file); up.setParam("uprefix", "attr_"); up.setParam("fmap.content", "attr_content"); up.setParam("literal.id",

Does Solr provide the highlight token position in the field?

2012-03-18 Thread neosky
Can the highlighter provide the exact position of the query match? For instance: MSAQLRKPTA*RVCES*CGRAEHWDDDLEAWQIARTDGTKQVGSPHCLHEWDINGNFNPVAMDD I want to know the position of "R" in the highlighted token, so that I can run a secondary query based on that position. Thanks!

Re: Any way to get reference to original request object from within Solr component?

2012-03-18 Thread SUJIT PAL
Thanks Russell, that's a good idea, I think this would work too... I will try this and update the thread with details once. -sujit On Mar 18, 2012, at 7:11 AM, Russell Black wrote: > One way to do this is to register a servlet filter that places the current > request in a global static ThreadLoc

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
That idea was short lived. I excluded the document. The cluster isn't syncing even after shutting everything down and restarting. On Sun, Mar 18, 2012 at 2:58 PM, Matthew Parker < mpar...@apogeeintegration.com> wrote: > I had tried importing data from Manifold, and one document threw a Tika > Exc

Re: Boosting terms

2012-03-18 Thread Ahmet Arslan
> Is there any possibility to boost > terms during indexing? Searching > that using google I found information that there is no such > feature in > Solr (we can only boost fields). Is it true? Yes, only field and document boosting exist. You might find this article interesting. http://www.luc

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
I had tried importing data from Manifold, and one document threw a Tika Exception. If I shut everything down and restart SOLR cloud, the system sync'd on startup. Could extraction errors be the issue? On Sun, Mar 18, 2012 at 2:50 PM, Matthew Parker < mpar...@apogeeintegration.com> wrote: > I h

Re: which mergePolicy

2012-03-18 Thread Tirthankar Chatterjee
Hi, Do you see any issues with the default one? On Mar 18, 2012, at 6:10 AM, Messpero wrote: > hi everyone, > > i have a big index (~100 GB, ~55 documents) with 200 fields per > document. I search with large queries, that's the reason why i must change > the value of maxBooleanClauses to 8

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
I have nodes running on ports 8081-8084. A couple of the other SOLR cloud nodes were complaining about not being able to talk with 8081, which is the first node brought up in the cluster. The startup process is: 1. start 3 zookeeper nodes 2. wait until complete 3. start first solr node. 4. wait until c

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Darren Govoni
I think he's asking if all the nodes (same machine or not) return a response. Presumably you have different ports for each node since they are on the same machine. On Sun, 2012-03-18 at 14:44 -0400, Matthew Parker wrote: > The cluster is running on one machine. > > On Sun, Mar 18, 2012 at 2:07 PM

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
The cluster is running on one machine. On Sun, Mar 18, 2012 at 2:07 PM, Mark Miller wrote: > From every node in your cluster you can hit http://MACHINE1:8084/solr in > your browser and get a response? > > On Mar 18, 2012, at 1:46 PM, Matthew Parker wrote: > > > My cloud instance finally tried to

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Mark Miller
From every node in your cluster you can hit http://MACHINE1:8084/solr in your browser and get a response? On Mar 18, 2012, at 1:46 PM, Matthew Parker wrote: > My cloud instance finally tried to sync. It looks like it's having connection > issues, but I can bring the SOLR instance up in the brow

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
This might explain another thing I'm seeing. If I take a node down, clusterstate.json still shows it as active. Also if I'm running 4 nodes, take one down and assign it a new port, clusterstate.json will show 5 nodes running. On Sat, Mar 17, 2012 at 10:10 PM, Mark Miller wrote: > Nodes talk to Z

which mergePolicy

2012-03-18 Thread Messpero
Hi everyone, I have a big index (~100 GB, ~55 documents) with 200 fields per document. I search with large queries, which is why I must change the value of maxBooleanClauses to 8192. I use a queryResultCache of size 20, because a search takes over 30 sec without the cache. I insert

RE: mailto: scheme aware tokenizer

2012-03-18 Thread Steven A Rowe
Hi Kai, I have created an issue for this: https://issues.apache.org/jira/browse/LUCENE-3880 Thanks for reporting! Steve -Original Message- From: Kai Gülzau [mailto:kguel...@novomind.com] Sent: Friday, March 16, 2012 9:59 AM To: solr-user@lucene.apache.org Subject: mailto: scheme aware

Re: Any way to get reference to original request object from within Solr component?

2012-03-18 Thread Russell Black
One way to do this is to register a servlet filter that places the current request in a global static ThreadLocal variable, thereby making it available to your Solr component. It's kind of a hack but would work. Sent from my phone On Mar 17, 2012, at 6:53 PM, "SUJIT PAL" wrote: > Thanks Pra
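The ThreadLocal approach Russell describes could be sketched roughly as below. This is only an illustration using the plain JDK: the class name CurrentRequestHolder and the method callWith are hypothetical, not part of Solr or the servlet API. In a real deployment the body of callWith would live in a servlet filter's doFilter, wrapping chain.doFilter(...), and the stored object would be the HttpServletRequest rather than a String.

```java
import java.util.function.Supplier;

// Hypothetical holder; the name and API are illustrative, not part of Solr.
final class CurrentRequestHolder {
    // One slot per thread: each container worker thread sees only its own request.
    private static final ThreadLocal<Object> CURRENT = new ThreadLocal<>();

    private CurrentRequestHolder() {}

    // What a Solr component would call to reach the original request.
    static Object get() {
        return CURRENT.get();
    }

    // Stand-in for a filter's doFilter: publish the request, run the rest
    // of the chain, then always clear the slot.
    static <T> T callWith(Object request, Supplier<T> downstream) {
        CURRENT.set(request);
        try {
            return downstream.get();
        } finally {
            // Container threads are pooled and reused, so a stale request
            // must never leak into the next call on the same thread.
            CURRENT.remove();
        }
    }

    public static void main(String[] args) {
        String seen = callWith("GET /solr/select?q=foo",
                () -> String.valueOf(get()));
        System.out.println("inside: " + seen);
        System.out.println("after: " + get());
    }
}
```

Clearing the slot in a finally block is the important design choice here. Note also that the value is only visible on the thread that handles the request, so code that hands work to another thread would not see it.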

Re: Too many connections in CLOSE_WAIT state on master solr server

2012-03-18 Thread samarth s
Hi Ranveer, You can try '-Dhttp.maxConnections'; it may resolve the issue. But the root cause, I figured, may lie with some queries made to solr that are too heavy to have decent turnaround times. As a result the client may close the connection abruptly, resulting in half-closed connections. Y
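For reference, http.maxConnections is a standard JDK networking property: it caps the number of idle keep-alive connections kept per destination, and it defaults to 5. It only affects clients built on the JDK's own HttpURLConnection, so whether it helps here is an assumption about how the connecting side talks to the master. A small sketch for checking which value a JVM will actually use (the class name KeepAliveConfig is hypothetical):

```java
// Hypothetical helper; only the property name and its default come from the JDK.
final class KeepAliveConfig {
    static final String PROP = "http.maxConnections";
    static final int JDK_DEFAULT = 5;

    private KeepAliveConfig() {}

    // Returns the value passed via -Dhttp.maxConnections=N, or the JDK
    // default when the property is unset or not a valid integer.
    static int effectiveMaxConnections() {
        return Integer.getInteger(PROP, JDK_DEFAULT);
    }

    public static void main(String[] args) {
        System.out.println(PROP + " = " + effectiveMaxConnections());
    }
}
```

Running it with, for example, java -Dhttp.maxConnections=50 KeepAliveConfig confirms that the flag reached the JVM.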