Solr 6.1.x Release Date ??

2016-06-06 Thread Ramesh Shankar
Hi, Any idea of Solr 6.1.X Release Date ?? I am interested in the [subquery] transformer and like to know the release date since its available only in 6.1.x Thanks & Regards Ramesh

Using Solr to index zip files

2016-06-06 Thread anupama . gangadhar
Hi, I have an use case where I need to search zip files quickly in HDFS. I intend to use Solr but not finding any relevant information about whether it can be done for zip files. These are nested zip files i.e. zips within a zip file. Any help/information is much appreciated. Thank you,

Concern of large amount daily update

2016-06-06 Thread scott.chu
We recently plan to replace a old-school lucene that has 50M docs with Solrcloud but the daily update, according to the responsive colleague, could be around 100 thousands docs. Its data source is a bunch of mysql tables. When implementing the updating workflow, what solud I do so that I can

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-06-06 Thread MaryJo Sminkey
On Mon, Jun 6, 2016 at 7:36 PM, Joe Lawson < jlaw...@opensourceconnections.com> wrote: > > We were thinking, as you experimented with, that the 0.5 and 2.0 boosts > were no match for the product name and keyword field boosts so that would > influence your search as well. Yeah I definitely will

Re: NRT updates

2016-06-06 Thread Chris Vizisa
Hi, Any pointers, suggestions, experiences ... please.. Thanks! Chris. On Mon, Jun 6, 2016 at 10:27 AM, Chris Vizisa wrote: > Hi, > > Does number of fields in a document affect NRT updates? > I have around 1.6 million products. Each product can be available in about >

using spell check on phrases

2016-06-06 Thread kaveh minooie
Hi everyone I am using solr 6 and DirectSolrSpellChecker, and edismax parser. the problem that I am having is that when the query is a phrase, every single word in the phrase need to be misspelled for the spell checker to gets activated and gives suggestions. if only one of the word is

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-06-06 Thread Joe Lawson
Yeah I thought the scale of the boosts were off as well but got caught up verifying that the plugin was working. My colleague suggested that it could be that because small block is a phrase that it would get a higher score in matching because you basically get a phrase match each time which causes

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-06-06 Thread MaryJo Sminkey
Oh thanks, yeah I did miss that one field which had a parent type with the normal synonym filter. However, that's our product SKU field so really doesn't even come into play. I verified that none of the other fields have a synonym filter set and even removed the productumbertext just to make sure

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-06-06 Thread Joe Lawson
> > Advice: make sure on the schema that none of the fields your are running > queries against do any complex query operations, especially make sure they > aren't doing additional synonym resolution against the same file. > BTW. I'd do this first before messing with MM

Re: Solr Cloud and Multi-word Synonyms :: synonym_edismax parser

2016-06-06 Thread Joe Lawson
Mary Jo. It appears to be working correctly but you have a very complex query going on so it can be confusing. Assuming you are using the queryParser as provided in examples your query would look like "+sbc" when it enters the queryParser and would look like "+((sbc)^2.0 (sb)^0.5 (small

Re: Solr 6 fail to index images

2016-06-06 Thread Shawn Heisey
On 6/6/2016 10:56 AM, Jeferson dos Anjos wrote: > I'm trying to index images on SOLR, but I get the following error: > ERROR: [doc=5b36cb2b78072e41] Error adding field > 'media_black_point'='(0.012054443, 0.012496948, 0.010314941)' msg=For > input string: "(0.012054443" It looks like it's a

RE: Help needed on Solr Streaming Expressions

2016-06-06 Thread Hui Liu
The only difference between document3 and document5 is document3 has no data in 'shard2', after loading some data into shard2, the http command also worked: http://localhost:8988/solr/document3/stream?expr=search(document3,zkHost="127.0.0.1:2181",q="*:*",fl="document_id, sender_msg_dest",

RE: Help needed on Solr Streaming Expressions

2016-06-06 Thread Hui Liu
Joel, Thank you very much for your help, I tried the http command below with my existing 2 shards collection 'document3' (sorry I have a typo below should be document3 instead of document2), this time I got much better error: {"result-set":{"docs":[ {"EXCEPTION":"Unable to construct instance

Solr 6 fail to index images

2016-06-06 Thread Jeferson dos Anjos
I'm trying to index images on SOLR, but I get the following error: ERROR: [doc=5b36cb2b78072e41] Error adding field 'media_black_point'='(0.012054443, 0.012496948, 0.010314941)' msg=For input string: "(0.012054443" It looks like it's a problem of field types, but these fields are extracted

Re: Can a DocTransformer access the whole results tree?

2016-06-06 Thread Upayavira
:-) On Sat, 4 Jun 2016, at 06:50 PM, Mikhail Khludnev wrote: > I'm sorry for thinking sooo slow. > > On Sat, Jun 4, 2016 at 7:19 PM, Upayavira wrote: > > > Ahhh, seen it now in your SubQueryAugmenterFactory, via the threadLocal. > > Somewhat scary code, but I think I can work

Re: Getting a list of matching terms and offsets

2016-06-06 Thread Justin Lee
Thank you very much! That JIRA entry led me to https://issues.apache.org/jira/browse/SOLR-4722, which still works against Solr 6 with a couple of modifications and should serve as the basis for what I want to do. You saved me a bunch of work, so thanks very much. (Also, it is always nice to

Re: Index time Dates format when time is not needed

2016-06-06 Thread Erick Erickson
That padding is just fine, you're effectively indexing everything exactly at midnight. Best, Erick On Sun, Jun 5, 2016 at 12:48 PM, Steven White wrote: > Hi everyone, > > I'm using "solr.DateRangeField" data type to index my dates data and based > on [1] the format of the

Re: SolrCloud 5.2.1 nodes are out of sync - how to handle

2016-06-06 Thread Erick Erickson
Sure, the routing doesn't matter to the ADDREPLICA command, you give it a shard ID. I'm more worried about how the nodes got out of sync in the first place. Are _both_ Solr noded on a particular machine out of sync? And what is the evidence that they are? You can issue something like

Re: Zookeeper hanging after a commit

2016-06-06 Thread Erick Erickson
Zookeeper hanging? If it was truly unresponsive I would think your entire SolrCloud would be down. I guess you could test this by, say, creating a new collection and seeing if it goes live, if Zookeeper is truly unresponsive that would fail. Are you sure it's not just that the merging that's

Re: solr 5.4.1

2016-06-06 Thread Erick Erickson
It's unclear what you're asking. You want your own schema file? Or your own configuration for parsing your documents? Have you read through the reference guide section here: https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika and if so, what parts are

NRT updates

2016-06-06 Thread Chris Vizisa
Hi, Does number of fields in a document affect NRT updates? I have around 1.6 million products. Each product can be available in about 3000 stores. In addition to around 50 fields related to a product I am storing product_store info in each product document like: 1. Quantity of that product in

Re: Solr /export and dates (Solr 5.5.1)

2016-06-06 Thread Erick Erickson
Sorry, it dropped off my radar somehow. Just opened SOLR-9187. I have a patch that I'm testing now, we'll see how that goes. On Wed, Jun 1, 2016 at 7:54 PM, Ronald Wood wrote: > > Thanks! I'm glad to find out I'm not going crazy. > > I'll keep a lookout for that enhancement. >

Re: Multilingual Solr

2016-06-06 Thread Johannes Riedl
Hi Alessandro, hi Alexandre, Thanks a lot for your reply and your considerations and hints. We use a web front end that comes bundled with Solr. It currently uses a single core approach. We would like to stick to the original setup as closely as possible to avoid administrative overhead and

Re: Not (!) operator

2016-06-06 Thread Anil
This is good idea Thanks Alex. On May 28, 2016 12:59 AM, "Alexandre Rafalovitch" wrote: > If you are worried about performance, bake the present/absent as a > signal in a separate field during the document processing as a special > UpdateRequestProcessor sequence. > >

Re: Help needed on Solr Streaming Expressions

2016-06-06 Thread Joel Bernstein
Hi, To eliminate any issues that might be happening due to curl, try running the command from your browser. http://localhost:8988/solr/document2/stream?expr=search(document3,zkHost=" 127.0.0.1:2181",q="*:*",fl="document_id, sender_msg_dest", sort="document_id asc",qt="/export") I think most

RE: find stores with sales of > $x in last 2 months ?

2016-06-06 Thread Allison, Timothy B.
Thank you, Alex. > Sorry, your question a bit confusing. Y. Sorry. > Also, is this last month as in 'January' (rolling monthly) or as in 'last 30 > days' (rolling daily). Ideally, the latter, if this is possible to calculate dynamically in response to a query. My backoff method (if the

clustering in solr(carrot2)

2016-06-06 Thread Mugeesh Husain
Hello everyone, For clustering I tried to implement some test using official document https://cwiki.apache.org/confluence/display/solr/Result+Clustering. I am getting the result as below DDR 3.9599865057283354 TWINX2048-3200PRO

Re: Multilingual Solr

2016-06-06 Thread Alexandre Rafalovitch
There is a language auto-detect UpdateRequestProcessor to route indexed content to differently suffixed fields. You have Google's algorithm: http://www.solr-start.com/info/update-request-processors/#LangDetectLanguageIdentifierUpdateProcessorFactory or a Tika one:

solr5.4.1 : data import handler for index rich data

2016-06-06 Thread kostali hassan
I am looking to add new field to extract they value from the field text: for example the field links to extract all links from the field text of each file. I define in tika.config.xml a regex for the expression of links but when the prossesor of indexation is finish I get just one value even

Re: language configuration in update extract request handler

2016-06-06 Thread Reth RM
This question should be posted on tika mailing list. It is not related to index or search but about parsing content of image. On Sun, Jun 5, 2016 at 10:20 PM, SIDDHAST® Roshan wrote: > Hi All, > > we are using the application for indexing and searching text using > solr. we

Multiple dictionary and affix for HunspellStemFilterFactory

2016-06-06 Thread Zheng Lin Edwin Yeo
Hi, I would like to check, is it possible to reference to multiple dictionaries for HunspellStemFilterFactory? i am trying to add more records to the default en_GB.dic file, but the file size has exceeded 1024KB, and ZooKeeper doesn't allow files that are larger than 1024KB to be loaded in.

Solr highlights

2016-06-06 Thread Anil
HI , As per my understanding, there will be a on highlighter for all fields of solr document. Is there a way to apply different highlighters for different fields ? Thanks. Cheers, Anil

Re: Indexing logs in Solr

2016-06-06 Thread Anil
Hi Eric and Benedetti Alessandro*,* do you have any inputs on the solution given the following link ? http://lucene.472066.n3.nabble.com/Highlighting-for-non-stored-fields-td1773015.html Thanks, Anil On 5 June 2016 at 11:56, Anil wrote: > Thanks IIan. I will look into

Re: Multilingual Solr

2016-06-06 Thread Alessandro Benedetti
Hi Johannes, nothing out of the box unfortunately but could be a nice idea and contribution. If having a multi-core setup is not an option ( out of curiousity, can I ask why ?) you could proceed in this way : 1) you define in the schema N field variation per field you are interested in. N is the