Attributes in fieldType/ and field/

2015-04-28 Thread Steven White
Hi Everyone, Looking at the out-of-the box schema.xml of Solr 5.1, I see this: fieldType name=payloads stored=false indexed=true class=solr.TextField analyzer Is it valid to have stored and indexed on fieldType/? My understanding is that those are on field/ only. If not, is the

Re: Overseer role in solrCloud

2015-04-28 Thread Shalin Shekhar Mangar
Comments inline: On Tue, Apr 28, 2015 at 3:00 PM, Gopal Jee zgo...@gmail.com wrote: I am trying to understand the role of overseer and solrCloud stateChange mechanism. I tried finding resources on web, but with not much luck. Can someone point me to some relevant doc or explain. Few doubts i

Expected mime type application/octet-stream but got text/html

2015-04-28 Thread Vijaya Narayana Reddy Bhoomi Reddy
Hi, I am suddenly seeing this error message when I try to index documents using SolrJ client. The same piece of code was working fine last time when I indexed the documents. But now, this is the error message being thrown on the SolrJ client. Request your urgent help as this is very high priority

Re: New to SolrCloud

2015-04-28 Thread Erick Erickson
Yeah, it took me a few tries to get it all straight in my head. Perhaps this will help. Whether or not to install Zookeeper on the same node as Solr is entirely your decision. And I'm assuming that you're NOT talking about the embedded Zookeeper BTW. The only problem with running ZK on the same

RE: Solr + RDF = SolRDF

2015-04-28 Thread Davis, Daniel (NIH/NLM) [C]
Both cool and interesting. Andrea, does your Solr RDF indexing project support inference? If so, is inference done by Jena or ahead of time before indexing by Solr? -Original Message- From: Andrea Gazzarini [mailto:a.gazzar...@gmail.com] Sent: Tuesday, April 28, 2015 7:31 AM To:

Re: Why Solr default to multivalued true

2015-04-28 Thread Shawn Heisey
On 4/28/2015 3:58 AM, balmydrizzle wrote: Just happen to have same issue as this question posted on stackoverflow site: Why Solr default Multivalued to true? - Stack Overflow http://stackoverflow.com/questions/21933032/why-solr-default-multivalued-to-true It has been my experience that if I

Re: Mutli term synonyms

2015-04-28 Thread Kaushik
Hi there, I tried the solution provided in https://lucidworks.com/blog/solution-for-multi-term-synonyms-in-lucenesolr-using-the-auto-phrasing-tokenfilter/ .The mentioned solution works when the indexed data does not have alpha numerics or special characters. But in my case the synonyms are

Re: Why Solr default to multivalued true

2015-04-28 Thread Ahmet Arslan
Hi, I checked comments in the example schema.xml it says false by default. How do you figure out that it is true? Ahmet On Tuesday, April 28, 2015 1:05 PM, balmydrizzle balmydriz...@gmail.com wrote: Just happen to have same issue as this question posted on stackoverflow site: Why Solr

How to improve the performance of query with expand query

2015-04-28 Thread yliu
Hi, I am using Solr to do some complex querying. Some queries require me to return one main document and some expanded documents at the same time. When I run the query without extracting any expanded document, the performance is good. But once I added the expanded conditions, the performance

AW: Odp.: solr issue with pdf forms

2015-04-28 Thread Steve.Scholl
Thanks a lot for being patient with me. Unfortunately there is no button load term info. :-( Can you may be help me using the TermsComponent instead? I read it is per default configured. Thanks a lot Best Steve -Ursprüngliche Nachricht- Von: Erick Erickson

Re: Async deleteshard commands?

2015-04-28 Thread Anshum Gupta
Yes, that's because DELETEREPLICA doesn't support async at this time. It's expected and documented. The reason why it's not supported is because when ASYNC mode was introduced, it was only added for tasks that could end up running longer than the http timeout. It might be a good thing to have for

Re: Solr VS Google Mini Search Appliance

2015-04-28 Thread Alexandre Rafalovitch
Facets! I believe Google Search Appliance does not support facets. Which means it supports search, but not post-search results tuning. In general, custom metadata was problematic with GSA Cost. I know at least one - very large, international, company moving to Solr from Google Search Appliance

On the fly reloading of solr core properties

2015-04-28 Thread KNitin
Hi In Solrcloud (4.6.1) every time a property/value is changed in solrcore.properties file, a core/collection reload is needed to pick up the new values. Core/Collection reloads for large collections (example 100 shards) is very expensive (performance wise) and can pose a threat to the

Re: Odp.: solr issue with pdf forms

2015-04-28 Thread Erick Erickson
There better be. 1 go to the admin UI 2 select a core 3 select schema browser 4 select a field from the drop-down Until you do step 4 the window will be pretty blank. Here's the info for TermsComponent, what have you tried? https://cwiki.apache.org/confluence/display/solr/The+Terms+Component

Re: Support of solr in Spark

2015-04-28 Thread Chris Hostetter
: I am thinking to index these companies name in solr since all the functionality already there? : : Do we have support for spark? https://github.com/LucidWorks/spark-solr Also of possible interest... http://lucidworks.com/blog/solr-yarn/ https://github.com/LucidWorks/yarn-proto

Re: Expected mime type application/octet-stream but got text/html

2015-04-28 Thread Vijaya Narayana Reddy Bhoomi Reddy
Just to add, my solrconfig.xml is the standard one, with no modifications. It was taken directly from the collection1 core from 4.10.2 installation. However, in schema.xml, I have added my own fields. Hope it has got nothing to do with schema.xml Thanks Regards Vijay

Off-top: Solr with language detection

2015-04-28 Thread LAFK
Shani, Off topic: that footer of yours may collide with list policy. All content here is publicly available, in case you missed it. @LAFK_PL   Oryginalna wiadomość   Od: Chaushu, Shani Wysłano: wtorek, 28 kwietnia 2015 14:59 Do: solr-user@lucene.apache.org Odpowiedz: solr-user@lucene.apache.org

Re: Attributes in fieldType/ and field/

2015-04-28 Thread Steve Rowe
Hi Steve, From https://cwiki.apache.org/confluence/display/solr/Field+Type+Definitions+and+Properties: The properties that can be specified for a given field type fall into three major categories: • Properties specific to the field type's class. • General Properties Solr supports for

Re: Solr + RDF = SolRDF

2015-04-28 Thread Andrea Gazzarini
Hi Charlie, definitely cool and interesting. Best, Andrea On 04/28/2015 10:20 AM, Charlie Hull wrote: On 27/04/2015 21:41, Andrea Gazzarini wrote: Hi guys, I'd like to share with you a project (actually a hobby for me) where I'm spending my free time, maybe someone could get some idea or

Solr with language detection

2015-04-28 Thread Chaushu, Shani
Hi, I'm trying to use the Solr Tika language detection. I added to the Solrconfig.xml: updateRequestProcessorChain name=langid processor class=org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessorFactory str name=langid.flcontent/str str

Re: Custom Query Implementation?

2015-04-28 Thread Doug Turnbull
Johannes, If you just want to implement a custom search syntax, Solr can be great for this. You just need a Solr query parser which takes a search string and allows you to translate that into any number of Lucene queries. These are fairly straightforward to implement with a small amount of Lucene

RE: Expected mime type application/octet-stream but got text/html

2015-04-28 Thread Stephan Schubert
Hi, just a wild guess: you are calling /solr/update instead of /solr/collection/update Regards Stephan Von:Vijaya Narayana Reddy Bhoomi Reddy vijaya.bhoomire...@whishworks.com An: solr-user@lucene.apache.org, Datum: 28.04.2015 13:57 Betreff:Expected mime type

Re: how to store _text field

2015-04-28 Thread Mirko Torrisi
Hi guys, I used the Erick's suggestions (thanks again!!) to create a new field and copy in it the _text content. curl -X POST -H 'Content-type:application/json' --data-binary '{ add-field : { name:content, type:string, indexed:true, stored:true}, add-copy-field : { source:_text, dest: [

Re: Expected mime type application/octet-stream but got text/html

2015-04-28 Thread Vijaya Narayana Reddy Bhoomi Reddy
Thanks Stephen, that was the issue. In the URL, I missed the solr part. Thanks for your help. Functionality is working fine now. Thanks Regards Vijay On 28 April 2015 at 13:05, Stephan Schubert stephan.schub...@sick.de wrote: Hi, just a wild guess: you are calling /solr/update instead of

Re: Solr + RDF = SolRDF

2015-04-28 Thread Andrea Gazzarini
Hi Daniel, no, unfortunately not...it is definitely one of the interesting challenges of this wedding: inference in responses or crazy things like inferential faceting. It's all in the grocery list :) but I never thought about the concrete implementation. Thanks for your suggestions. Best,

Re: Solr VS Google Mini Search Appliance

2015-04-28 Thread Erick Erickson
Well, frankly if what they have already serves their needs, I see no reason they should switch. Investing time and effort in more modern technology without a compelling reason is a waste. Personally, I'd just leave the argument there. Talk to the stake-holders in the product and get them to list

Support of solr in Spark

2015-04-28 Thread Jeetendra Gangele
Hi All I have around 20 million company name and I want to index them. Currently What I am doing I am tokenizing and for each token I am applying Metaphone 3 and then Stroring each token in Hbase. When I get new query(company to match) I will again tokenize and apply metaphone3 as I did when I

Re: New to SolrCloud

2015-04-28 Thread Shawn Heisey
On 4/28/2015 4:40 AM, shacky wrote: I'm using Solr for 3 years and now I want to move to a SolrCloud configuration on 3 nodes which would make my infrastructure highly available. But I am very confused about it. I read that ZooKeeper should not be installed on the same Solr nodes, but I

Re: Async deleteshard commands?

2015-04-28 Thread Shalin Shekhar Mangar
This is a bug. Can you please open a Jira issue? On Tue, Apr 28, 2015 at 8:35 PM, Ian Rose ianr...@fullstory.com wrote: Is it possible to run DELETESHARD commands in async mode? Google searches seem to indicate yes, but not definitively. My local experience indicates otherwise. If I start

Solr VS Google Mini Search Appliance

2015-04-28 Thread Branko Simic
Hi, We have a client that will have a website in Ektron CMS. As you may know Ektron has good integration with Solr and that is primary reason for us developers to use it. But the client does not want to switch from their Google Mini Search appliance even though Google stopped support for it

Re: TIKA OCR not working

2015-04-28 Thread trung.ht
Hi Uwe, Today, I downloaded Solr 5.1 and it worked fine. It seems that this bug fix SOLR-7139 is only included in 5.1, not 5.0. Thank everyone for your support. Trung. On Tue, Apr 28, 2015 at 10:21 AM, trung.ht trung...@anlab.vn wrote: Hi Uwe, Thanks for the answer, but it looks like it

Re: Async deleteshard commands?

2015-04-28 Thread Ian Rose
Done! https://issues.apache.org/jira/browse/SOLR-7481 On Tue, Apr 28, 2015 at 11:09 AM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: This is a bug. Can you please open a Jira issue? On Tue, Apr 28, 2015 at 8:35 PM, Ian Rose ianr...@fullstory.com wrote: Is it possible to run

Re: Async deleteshard commands?

2015-04-28 Thread Anshum Gupta
Hi Ian, DELETESHARD doesn't support ASYNC calls officially. We could certainly do with a better response but I believe with most of the Collections API calls at this time in Solr, you could send random params which would get ignored. Therefore, in this case, I believe that the async param gets

Solr Highlighting

2015-04-28 Thread Vijaya Narayana Reddy Bhoomi Reddy
Hi, When I perform a query, the matching document related field information is displayed separate from the highlighting information. Is there a way to merge these two so that highlighting for each document appears within the document level information itself. That way, it would be easier to find

Re: Load balancer for indexing?

2015-04-28 Thread Chris Hostetter
: I would still use ConcurrentUpdateSolrServer as it is good for catching up : when my indexing has fallen behind. I know it swallows exceptions. I feel like you are missing the point of when/why ConcurrentUpdateSolrServer compared to your goal of load balancing updates. The *only* feature

Re: How to improve the performance of query with expand query

2015-04-28 Thread Joel Bernstein
Could you provide a few more details? 1) Version of Lucene/Solr 2) A sample slow query 3) Number of unique values in the collapse field 4) Number of search results before the collapse 5) Number of results fetched in the page 6) Performance numbers for the query Joel Bernstein

Custom Query Implementation?

2015-04-28 Thread Johannes Ruscheinski
Hi, I am entirely new to the world of SOLR programming and I have the following questions: In addition to our regular searches we need to implement a specialised form of range search and ranking. What I mean by this is that users can search for one or more numeric ranges like 17:85,205:303 etc.

Re: Async deleteshard commands?

2015-04-28 Thread Anshum Gupta
Hi Ian, What do you mean by *my testing shows* ? Can you elaborate on the steps and how did you confirm that the call was indeed *async* ? I may be wrong but I think what you're seeing is a normal DELETEREPLICA call succeeding behind the scenes. It is not treated or processed as an async call.

Re: Async deleteshard commands?

2015-04-28 Thread Ian Rose
Hi Anshum, FWIW I find that page is not entirely accurate with regard to async params. For example, my testing shows that DELETEREPLICA *does* support the async param, although that is not listed here: https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-api9 Cheers,

Re: Choosing order of fields in response with fl=field_1, field_2

2015-04-28 Thread Chris Hostetter
because of th enature of the CSV format, the order of the fields *has* to be deterministic and consistent for all documents, so the response writer sorts them into the approrpaite columns. for JSON XML formats this consistency isn't required, so instead Solr writes out hte fields of each

Re: New to SolrCloud

2015-04-28 Thread shacky
2015-04-28 19:45 GMT+02:00 Erick Erickson erickerick...@gmail.com: I think you're over-thinking the problem though. How often does a machine fail? If it's more often than once in an blue moon, you have _other_ problems. My needs are not only high availability (for which 2 nodes would be

Re: New to SolrCloud

2015-04-28 Thread shacky
Yeah, it took me a few tries to get it all straight in my head. Thanks Erick for your fast answer! The only problem with running ZK on the same node as Solr is that if the node goes down, it takes _both_ zookeeper and Solr with it. If running the embedded zookeeper, then you can't even

Re: New to SolrCloud

2015-04-28 Thread Erick Erickson
Your last comment really answered. A ZK quorum is explicitly ((num zk instances)/2) + 1. So no, you don't need 6 nodes at all. It's perfectly reasonable to run a Solr instance on each node and a ZK instance (not embedded) on the same three nodes. I think you're over-thinking the problem though.

Re: Overseer role in solrCloud

2015-04-28 Thread Gopal Jee
Thanks a ton shalin. Now i have a very clear view of state change. Will certainly help me stabilize my cluster issues. Thanks a lot. Gopal On Tue, Apr 28, 2015 at 8:16 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: Comments inline: On Tue, Apr 28, 2015 at 3:00 PM, Gopal Jee

Re: Load balancer for indexing?

2015-04-28 Thread spillane
Shawn / Hoss, I see that CloudSolrServer appeared in SolrJ 4.5, will that work with a 4.2 SolrCloud? If so I'll upgrade my client and point the CloudSolrServer constructor at my 5 ZK hosts. I would still use ConcurrentUpdateSolrServer as it is good for catching up when my indexing has fallen

Re: Multiple index.timestamp directories using up disk space

2015-04-28 Thread Ramkumar R. Aiyengar
SolrCloud does need up to twice the amount of disk space as your usual index size during replication. Amongst other things, this ensures you have a full copy of the index at any point. There's no way around this, I would suggest you provision the additional disk space needed. On 20 Apr 2015 23:21,

New to SolrCloud

2015-04-28 Thread shacky
Hi. I'm using Solr for 3 years and now I want to move to a SolrCloud configuration on 3 nodes which would make my infrastructure highly available. But I am very confused about it. I read that ZooKeeper should not be installed on the same Solr nodes, but I also read another guide that installs

Re: Solr node going to recovering state during heavy reindexing

2015-04-28 Thread Gopal Jee
Thanks Shawn for the insight. WIll try your recommendations . Gopal On Mon, Apr 27, 2015 at 9:46 PM, Rajesh Hazari rajeshhaz...@gmail.com wrote: thanks, i am sure that we have missed this command line property, this gives me more information on how to use latest solr scripts more

Antwort: Start Solr with multiple external zookeepers on Windows Server?

2015-04-28 Thread Stephan Schubert
Hi there, editing the solr.in.cmd and list the zookeeper hosts there instead of passing them via parameter in the console worked. I'm using Solr 5.1 btw. Von:Stephan Schubert stephan.schub...@sick.de An: solr-user@lucene.apache.org, Datum: 27.04.2015 19:06 Betreff:Start

Overseer role in solrCloud

2015-04-28 Thread Gopal Jee
I am trying to understand the role of overseer and solrCloud stateChange mechanism. I tried finding resources on web, but with not much luck. Can someone point me to some relevant doc or explain. Few doubts i have: 1. In doc, it says overseer updates clusterstate.json when a new node joins. How

Why Solr default to multivalued true

2015-04-28 Thread balmydrizzle
Just happen to have same issue as this question posted on stackoverflow site: Why Solr default Multivalued to true? - Stack Overflow http://stackoverflow.com/questions/21933032/why-solr-default-multivalued-to-true -- View this message in context:

Re: Load balancer for indexing?

2015-04-28 Thread Shawn Heisey
On 4/28/2015 1:14 PM, spillane wrote: I see that CloudSolrServer appeared in SolrJ 4.5, will that work with a 4.2 SolrCloud? If so I'll upgrade my client and point the CloudSolrServer constructor at my 5 ZK hosts. I would still use ConcurrentUpdateSolrServer as it is good for catching up

Re: Async deleteshard commands?

2015-04-28 Thread Ian Rose
Sure. Here is an example of ADDREPLICA in synchronous mode: http://localhost:8983/solr/admin/collections?action=addreplicacollection=293shard=shard1_1 response: response lst name=responseHeader int name=status0/int int name=QTime1168/int /lst lst name=success lst lst name=responseHeader int

Re: Solr + RDF = SolRDF

2015-04-28 Thread Charlie Hull
On 27/04/2015 21:41, Andrea Gazzarini wrote: Hi guys, I'd like to share with you a project (actually a hobby for me) where I'm spending my free time, maybe someone could get some idea or benefit from it. https://github.com/agazzarini/SolRDF I called it SolRDF (Solr + RDF): It is a set of Solr

Re: Multiple index.timestamp directories using up disk space

2015-04-28 Thread Mark Miller
If copies of the index are not eventually cleaned up, I'd fill a JIRA to address the issue. Those directories should be removed over time. At times there will have to be a couple around at the same time and others may take a while to clean up. - Mark On Tue, Apr 28, 2015 at 3:27 AM Ramkumar R.

Async deleteshard commands?

2015-04-28 Thread Ian Rose
Is it possible to run DELETESHARD commands in async mode? Google searches seem to indicate yes, but not definitively. My local experience indicates otherwise. If I start with an async SPLITSHARD like so:

Choosing order of fields in response with fl=field_1, field_2

2015-04-28 Thread Raphaël Tournoy
Hi Everyone, if add fl=field_1, field_2 in a query the order of fields in the response is good for my needs : { field_1 : value X, field_2 : value Y } *however* it works only with the CSV reponse format :-( How do i get the same functionnality with for instance XML and JSON

RE: Solr VS Google Mini Search Appliance

2015-04-28 Thread Davis, Daniel (NIH/NLM) [C]
Branko, Your client's existing enterprise search isn't supported now, so I'm not saying that you should stick with Google Mini Search appliance. However, there are some things that Solr doesn't do well, and you should be aware of these before advocating Solr as a pure developer.Here are