Re: SOLR 4.4 - Slave always replicates full index

2014-07-21 Thread Robin Woods
I did observe the same.. 1. updated an existing document.. means potentially marking the previous document as "deleted" and adding a new version of it.. posted the JSON doc using the Documents interface on the Admin UI.. left the default commit within "1000" ms there on the Documents UI.. 2. NOT

Re: Solr Cassandra MySQL Best Practice Indexing

2014-07-21 Thread Yavar Husain
Thanks Jack for your guidance on DSE. However it would be great if somebody could help me solving my use case: So my full text data lies on Cassandra along with an ID. Now I have a lot of structured data linked to the ID which lies on an RDBMS (read MySQL). I need this structured data as it would

Re: DocValues without re-index?

2014-07-21 Thread Mikhail Khludnev
Michael, What's "first re-indexing"? I'm sure you are aware about binary/number DocValues updates, but it works for existing column strides. I can guess you are talking about something like sidecar index http://www.youtube.com/watch?v=9h3ax5Wmxpk On Tue, Jul 22, 2014 at 6:50 AM, Michael Ryan w

DocValues without re-index?

2014-07-21 Thread Michael Ryan
Is it possible to use DocValues on an existing index without first re-indexing? -Michael

Re: SolrCloud extended warmup support

2014-07-21 Thread Erick Erickson
I've never seen it necessary to run "thousands of queries" to warm Solr. Usually less than a dozen will work fine. My challenge would be for you to measure performance differences on queries after running, say, 12 well-chosen queries as opposed to hundreds/thousands. I bet that if 1> you search acr

Re: How do I disable distributed search feature when I have only one shard

2014-07-21 Thread Erick Erickson
Are you using CloudSolrServer in your SolrJ program? No matter what, the &distrib=false should be keeping the query from going to more than one shard So I'd check the logs and see if the suspect query appears in more than one node. FWIW, Erick On Mon, Jul 21, 2014 at 4:13 PM, pramodEbay w

Re: text search problem

2014-07-21 Thread Erick Erickson
Try escaping the hyphen as \-. Or enclosing it all in quotes. But you _really_ have to spend some time with the debug option an admin/analysis page or you will find endless surprises. Best, Erick On Mon, Jul 21, 2014 at 11:12 AM, EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions) wrote

Re: Edit Example Post.jar to read ALL file types

2014-07-21 Thread Erick Erickson
So how do you expect these to be indexed? I mean what happens if you run across a Word document? How about an mp3? Just blasting all files up seems chancy. And doesn't just 'java -jar post.jar * ' do what you ask? This seems like an XY problem, _why_ do you want to do this? Because unless the file

Re: SolrCloud extended warmup support

2014-07-21 Thread Jeff Wartes
On 7/21/14, 4:50 PM, "Shawn Heisey" wrote: >On 7/21/2014 5:37 PM, Jeff Wartes wrote: >> I¹d like to ensure an extended warmup is done on each SolrCloud node >>prior to that node serving traffic. >> I can do certain things prior to starting Solr, such as pump the index >>dir through /dev/null to

Re: SolrCloud extended warmup support

2014-07-21 Thread Shawn Heisey
On 7/21/2014 5:37 PM, Jeff Wartes wrote: > I’d like to ensure an extended warmup is done on each SolrCloud node prior to > that node serving traffic. > I can do certain things prior to starting Solr, such as pump the index dir > through /dev/null to pre-warm the filesystem cache, and post-start I

SolrCloud extended warmup support

2014-07-21 Thread Jeff Wartes
I’d like to ensure an extended warmup is done on each SolrCloud node prior to that node serving traffic. I can do certain things prior to starting Solr, such as pump the index dir through /dev/null to pre-warm the filesystem cache, and post-start I can use the ping handler with a health check f

Re: SolrCloud replica dies under high throughput

2014-07-21 Thread Mark Miller
Looks like you probably have to raise the http client connection pool limits to handle that kind of load currently. They are specified as top level config in solr.xml: maxUpdateConnections maxUpdateConnectionsPerHost -- Mark Miller about.me/markrmiller On July 21, 2014 at 7:14:59 PM, Darren

SolrCloud replica dies under high throughput

2014-07-21 Thread Darren Lee
Hi, I'm doing some benchmarking with Solr Cloud 4.9.0. I am trying to work out exactly how much throughput my cluster can handle. Consistently in my test I see a replica go into recovering state forever caused by what looks like a timeout during replication. I can understand the timeout and fa

How do I disable distributed search feature when I have only one shard

2014-07-21 Thread pramodEbay
Hi there, We have a solr cloud set up with only one shard. There is one leader and 15 followers. So the data is replicated on 15 nodes. When we run a solr query, only one node should handle the request and we do not need any distributed search feature as all the nodes are exact copies of each othe

RE: Multiterm analysis in complexphrase query

2014-07-21 Thread Gopal Agarwal
That would be really useful. Can you upload the jar and its requirements? It also makes it pluggable with diff versions of solr. On Jul 1, 2014 9:01 PM, "Allison, Timothy B." wrote: > If there's enough interest, I might get back into the code and throw a > standalone src (and jar) of the SpanQ

Re: Solr schema.xml query analyser

2014-07-21 Thread Jack Krupansky
That sounds more like a "reverse" query - trying to match documents against the query rather than matching the query against the documents. Solr doesn't have that feature currently. Although I'm not absolutely sure what your "textExactMatch" is. I'm guessing that it is a document field in your

RE: text search problem

2014-07-21 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)
Thanks for the reply Erick, I will try as you suggested. There I have another question related to this lines. When I have "-" in my description , name then the search results are different. For e.g. "ABC-123" , it look sofr ABC or 123, I want to treat this search as exact match, i.e if my do

Re: Solr schema.xml query analyser

2014-07-21 Thread newBie88
My apologies Jack. But there was a mistake in my question. I actually switched "query" and "textExactMatch" in my question. I would be really helpful if you could have a look at the scenario once again: My task is to provide a match when the search term contains the indexed field. For example:

Re: Solr schema.xml query analyser

2014-07-21 Thread Jack Krupansky
Based on your stated requirements, there is no obvious need to use the keyword tokenizer. So fix that and then quoted phrases or escaped spaces should work. -- Jack Krupansky -Original Message- From: prashantc88 Sent: Monday, July 21, 2014 11:51 AM To: solr-user@lucene.apache.org Sub

Re: Solr schema.xml query analyser

2014-07-21 Thread prashantc88
Thanks Jack for the reply. I did not mention the query time analyzer in my post because I wasn't sure what should be put there. With regards to your reply, If I put the query term in quotes, would I get a match for the following: Indexed field value: foo bar Query term: foo bar xyz/xyz foo bar

Re: Solr schema.xml query analyser

2014-07-21 Thread Jack Krupansky
If you don't specify a "query" analyzer, Solr will use the "index" analyzer at query time. But... at query time there is something called a "query parser" which typically breaks the query into separate terms, delimited by white space, and then calls the analyzer for each term, separately. Yo

Solr schema.xml query analyser

2014-07-21 Thread prashantc88
0 down vote favorite I am a complete beginner to Solr and need some help. My task is to provide a match when the search term contains the indexed field. For example: If query= foo bar and textExactMatch= foo, I should not get a MATCH If query= foo bar and textExactMatch= foo b

Re: Query about Solr

2014-07-21 Thread Jack Krupansky
Set the field type for such a field to "ignored". Or set it to "string" and then you can still examine or query the data even if it is not properly formatted. -- Jack Krupansky -Original Message- From: Ameya Aware Sent: Monday, July 21, 2014 11:12 AM To: solr-user@lucene.apache.org

Re: Query about Solr

2014-07-21 Thread Ameya Aware
Hi, The data coming into Solr is different metadata such as author, created time, last modified time etc along with content of the file. So indexing content is giving me different errors, so i just simply want to skip indexing content part. Thanks, Ameya On Mon, Jul 21, 2014 at 11:07 AM, Alex

Re: Query about Solr

2014-07-21 Thread Alexandre Rafalovitch
Nothing gets indexed automatically. So you must be doing something (e.g. Nutch). Tell us what that something is first so we know your baseline setup. Regards, Alex On 21/07/2014 9:43 pm, "Ameya Aware" wrote: > Hi, > > How can i stop content of file from being getting indexed?? > > Will remo

Edit Example Post.jar to read ALL file types

2014-07-21 Thread jrusnak
I am working with Solr 4.8.1 to set up an enterprise search system. The file system I am working with has numerous files with unique extension types (ex .20039 .20040 .20041 etc.) I am using the post.jar file included in the binary download (src: SimplePostTool.java

Query about Solr

2014-07-21 Thread Ameya Aware
Hi, How can i stop content of file from being getting indexed?? Will removing content field from schema.xml do that job? Thanks, Ameya

RE: SolrCloud performance issues regarding hardware configuration

2014-07-21 Thread Toke Eskildsen
search engn dev [sachinyadav0...@gmail.com] wrote: > Yes, You are right my facet queries are for text analytic purpose. Does this mean that facet calls are rare (at most one at a time)? > Users will send boolean and spatial queries. current performance for spatial > queries is 100qps with 150 con

Re: Solr Cassandra MySQL Best Practice Indexing

2014-07-21 Thread Jack Krupansky
Solandra is not a supported product. DataStax Enterprise (DSE) supersedes it. With DSE, just load your data into a Solr-enabled Cassandra data center and it will be indexed automatically in the embedded Solr within DSE, as per a Solr schema that you provide. Then use any of the nodes in that So

Solr Cassandra MySQL Best Practice Indexing

2014-07-21 Thread Yavar Husain
So my full text data lies on Cassandra along with an ID. Now I have a lot of structured data linked to the ID which lies on an RDBMS (read MySQL). I need this structured data as it would help me with my faceting and other needs. What is the best practice in going about indexing in this scenario. My

Re: faceting within facets

2014-07-21 Thread Yonik Seeley
On Mon, Jul 21, 2014 at 8:08 AM, David Flower wrote: > Is it possible to create a facet within another facet in a single query For simple field facets, there's pivot faceting. For more complex nested facets, there are sub-facets in heliosearch (a solr fork): http://heliosearch.org/solr-subfacets/

faceting within facets

2014-07-21 Thread David Flower
Hi Is it possible to create a facet within another facet in a single query, currently I'm having to filter the query with facet.query=type:foo and running the query multiple times to return the number and type of object created on a given date. Is it even possible to return this in a single qu

AUTO: Nicholas M. Wertzberger is out of the office (returning 07/23/2014)

2014-07-21 Thread Nicholas M. Wertzberger
I am out of the office until 07/23/2014. I'm out of town for the next few days. I am reachable by Blackberry, if needed. Please contact Jason Brown for anything JAS Team related. Note: This is an automated response to your message "Re: questions on Solr WordBreakSolrSpellChecker and WordDelim

Re: stats.facet with multi-valued field in Solr 4.9

2014-07-21 Thread Yonik Seeley
On Mon, Jul 21, 2014 at 7:32 AM, Nico Kaiser wrote: > Yonik, thanks for your reply! I also found > https://issues.apache.org/jira/browse/SOLR-1782 which also sees to deal with > this, but I did not find out wether there is a workaround. > > For our use case the previous behaviour was ok and seem

Re: stats.facet with multi-valued field in Solr 4.9

2014-07-21 Thread Nico Kaiser
Yonik, thanks for your reply! I also found https://issues.apache.org/jira/browse/SOLR-1782 which also sees to deal with this, but I did not find out wether there is a workaround. For our use case the previous behaviour was ok and seemed (!) to be consistent. However I understand that this featur

Re: stats.facet with multi-valued field in Solr 4.9

2014-07-21 Thread Yonik Seeley
On Mon, Jul 21, 2014 at 7:09 AM, Nico Kaiser wrote: > After the upgrade to Solr 4.9 (from 3.6) this seems not to be possible > anymore: > > "Stats can only facet on single-valued fields, not: instrumentIds" https://issues.apache.org/jira/browse/SOLR-3642 It looks like perhaps it never did work

stats.facet with multi-valued field in Solr 4.9

2014-07-21 Thread Nico Kaiser
Hi! I am storing aggregated article click statistics for a website in a Lucene database. Website "articles" (i.e., pages in this case) can have multiple associated financial instruments, which – for statistics reasons – I also copy to Lucene. So basically this data is stored (and regularly upda

Re: Plugin init failure for custom analysis filter

2014-07-21 Thread ssivakumaran
Hi, I am not able to find anything in the log or rather not that specific. This error is being thrown when I add a string argument to my filter in schema. If I remove the same, I donot get any error. I tried changing the datatype but still same error. A little more detail regarding the filter argum