Re: Design optimal Solr Schema

2014-12-11 Thread tomas.kalas
Thanks for help, but how wrote Alex, I used synonm filter and it is what i want. When i wrote to synonym for example Hello, Hi. And sentence is Hello how are you and my query is Hi how are you, so that find it too. -- View this message in context:

Alternative synonymum

2014-12-11 Thread tomas.kalas
Hello, i want to searching in between transcripts of phone conversations. And the machine which is make transcript the conversation to text is making some alternatives. For example If we have sentence. Hello how are you. 1. Segment Hello Halo Hollow 2.Segment How Bow When i

Re: Length norm not functioning in solr queries.

2014-12-11 Thread S.L
Mikhail, Thank you for confirming this , however Ahmet's proposal seems more simpler to implement to me . On Wed, Dec 10, 2014 at 5:07 AM, Mikhail Khludnev mkhlud...@griddynamics.com wrote: S.L, I briefly skimmed Lucene50NormsConsumer.writeNormsField(), my conclusion is: if you supply own

Highlighting integer field

2014-12-11 Thread Pawel Rog
Hi, Is it possible to highlight int (TrieLongField) or long (TrieLongField) field in Solr? -- Paweł

Re: Length norm not functioning in solr queries.

2014-12-11 Thread S.L
Ahmet, Thank you , as the configurations in SolrCloud are uploaded to zookeeper , are there any special steps that need to be taken to make this work in SolrCloud ? On Wed, Dec 10, 2014 at 4:32 AM, Ahmet Arslan iori...@yahoo.com.invalid wrote: Hi, Or even better, you can use your new field

Re: Priority in search an synonyms

2014-12-11 Thread Antoine REBOUL
Hello, First of all thank you for your answers ! In my schema.xml file: - I created this field : fieldType name=tmp_libelle class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.StandardTokenizerFactory//analyzer analyzer type=querytokenizer

Histogram Facet and Aggregation Solr

2014-12-11 Thread Ankit Jain
Hi All, We have an usecase where we want to perform histogram on 10 minutes time period and then each 10 mins time frame we have to perform facet on some field. We are currently using 4.7.2 version of Solr. Please suggest how we nested facet with histogram. -- Thanks, Ankit Jain

Re: Priority in search an synonyms

2014-12-11 Thread Ahmet Arslan
Hi Antoine, By saying The problem I have now is that ebc_libelle synonyms reported for the field are not show, you mean you have synonym entry for the word Castaroma, and documents containing those synonym entries do not show up in fist 100 documents? If yes, play with boost values (5 versus

Re: Length norm not functioning in solr queries.

2014-12-11 Thread Ahmet Arslan
Hi, No special steps to be taken for cloud setup. Please note that for both solutions, re-index is mandatory. Ahmet On Thursday, December 11, 2014 12:15 PM, S.L simpleliving...@gmail.com wrote: Ahmet, Thank you , as the configurations in SolrCloud are uploaded to zookeeper , are there any

Suspicious message with attachment

2014-12-11 Thread help
The following message addressed to you was quarantined because it likely contains a virus: Subject: Inconsistent doc value across two nodes - very simple test - what's the expected behavior? From: Gili Nachum gilinac...@gmail.com However, if you know the sender and are expecting an attachment,

Re: Design optimal Solr Schema

2014-12-11 Thread Alexandre Rafalovitch
Tomas, You have a difficult use case. You seem to have a speech recognition domain and you want to be able to search that transcribed text with reference back to timing. It's an interesting problem, but not an easy one. Certainly not something one can give you the answer all at once. The issue

Re: Highlighting integer field

2014-12-11 Thread Tomoko Uchida
Hi Pawel, Essentially, highlighting is a feature to show fragments of documents that matche user queries. With that, he/she can find occurrence of their query in long documents and can understand their results well. For tint or tlong fields (or other non-text field types), fragments usually have

Re: Design optimal Solr Schema

2014-12-11 Thread tomas.kalas
Oh no, i want to answered to this topic, where you help me with the synonym filter: http://lucene.472066.n3.nabble.com/Alternative-searching-td4172339.html but i was opened this topic too and i checking my answer in google translator and copy it here. Now, i have a edit task, i do not have to

Is it possible in Solr to have document field value, based on context during query time, by request parameter ?

2014-12-11 Thread Nenko Ivanov
The Use Case: Very large and sharded index with articles with different categorization fields, pre populated with algorithmic estimated values (simple type, mostly Integer values). The index is accessed from multiple “clients” and each client can override article property based on his

Re: Length norm not functioning in solr queries.

2014-12-11 Thread S.L
Yes, I understand that reindexing is neccesary , however for some reason I was not able to invoke the js script from the updateprocessor, so I ended up using Java only solution at index time. Thanks. On Thu, Dec 11, 2014 at 7:18 AM, Ahmet Arslan iori...@yahoo.com.invalid wrote: Hi, No

Re: Design optimal Solr Schema

2014-12-11 Thread Alexandre Rafalovitch
Ok. Make sure to post in the right topics. People get super confused when the conversation thread changes. Maybe ignore this last couple of messages and post the new one as appropriate (separate or in another thread). That way the right people will see it. Regards, Alex. Personal:

Re: Is it possible in Solr to have document field value, based on context during query time, by request parameter ?

2014-12-11 Thread Alexandre Rafalovitch
So, what did not work for you with the External File Field approach? What is the next gap you are trying to close? You seem to be aware of the possible extension points for Solr, so you are not looking for just a pointer to custom search components or whatever. Regards, Alex. Personal:

RE: WordBreakSolrSpellChecker Usage

2014-12-11 Thread Dyer, James
My first guess here, is seeing it works some of the time but not others, is that these values are too low: str name=spellcheck.maxCollationTries5/str str name=spellcheck.count5/str You know spellcheck.count is too low if the suggestion you want is not in the suggestions part of the response,

Re: Details on why ConccurentUpdateSolrServer is reccommended for maximum index performance

2014-12-11 Thread Tom Burton-West
Thanks Eric, That is helpful. We already have a process that works similarly. Each thread/process that sends a document to Solr waits until it gets a response in order to make sure that the document was indexed successfully (we log errors and retry docs that don't get indexed successfully),

Re: Details on why ConccurentUpdateSolrServer is reccommended for maximum index performance

2014-12-11 Thread Erick Erickson
I don't think so, it uses SolrInputDocuments and lists thereof. So if you parse the xml and then put things in SolrInputDocuments.. Or something like that. Erick On Thu, Dec 11, 2014 at 9:43 AM, Tom Burton-West tburt...@umich.edu wrote: Thanks Eric, That is helpful. We already have a

Re: Details on why ConccurentUpdateSolrServer is reccommended for maximum index performance

2014-12-11 Thread Michael Della Bitta
Tom: ConcurrentUpdateSolrServer isn't magic or anything. You could pretty trivially write something that takes batches of your XML documents and combines them into a single document (multiple doc tags in the add section) and sends them up to Solr and achieve some of the same speed benefits.

Inconsistent doc value across two nodes - very simple test - what's the expected behavior?

2014-12-11 Thread Gili Nachum
I know Solr CAP properties are CP, but I don't see it happening over a very basic test - doing something wrong? With two Solr nodes, I index doc1 to both, stop node2, update doc1, stop node1, start node2, start node1, and I get two different versions of the doc depending on which replica I query.

Re: Details on why ConccurentUpdateSolrServer is reccommended for maximum index performance

2014-12-11 Thread Mikhail Khludnev
Agree with Erick. However, I suppose you can try to provide your own RequestWriter, and let it stream XML. btw, what's in them? How Solr handles them right now? Why don't you want to start from the test? On Thu, Dec 11, 2014 at 7:04 PM, Erick Erickson erickerick...@gmail.com wrote: I don't

Help with a Join Query

2014-12-11 Thread Darin Amos
Hello, I am trying to execute a join query that I am not 100% sure how to execute. Lets say I have a bunch of parent and child documents and every one of my child documents has a single value field “color”. If I want to search all parents that have a “red” child, tis is very easy: {!join

Re: Details on why ConccurentUpdateSolrServer is reccommended for maximum index performance

2014-12-11 Thread Alexandre Rafalovitch
On 11 December 2014 at 11:40, Yonik Seeley yo...@heliosearch.com wrote: So to Solr (server side), it looks like a single update request (assuming 1 thread) with a batch of multiple documents... but it was never actually batched on the client side. Does Solr also indexes them one-by-one as it

Re: Details on why ConccurentUpdateSolrServer is reccommended for maximum index performance

2014-12-11 Thread Yonik Seeley
On Thu, Dec 11, 2014 at 11:52 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: On 11 December 2014 at 11:40, Yonik Seeley yo...@heliosearch.com wrote: So to Solr (server side), it looks like a single update request (assuming 1 thread) with a batch of multiple documents... but it was never

Re: WordBreakSolrSpellChecker Usage

2014-12-11 Thread Matt Mongeau
Is there a suggested value for this. I bumped them up to 20 and still nothing has seemed to change. On Thu, Dec 11, 2014 at 9:42 AM, Dyer, James james.d...@ingramcontent.com wrote: My first guess here, is seeing it works some of the time but not others, is that these values are too low: str

Re: Solr Error when making GeoPrefixTree polygon filter search

2014-12-11 Thread mathaix
Thank you. That was the issue. Is am running solr with Jetty. Is there are recommended way for including those jars in the jetty configuration? -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Error-when-making-GeoPrefixTree-polygon-filter-search-tp4173629p4173807.html

Have anyone used Automatic Phrase Tokenization (AutoPhrasingTokenFilterFactory) ?

2014-12-11 Thread shamik
Hi, I'm trying to use AutoPhrasingTokenFilterFactory which seems to be a great solution to our phrase query issues. But doesn't seem to work as mentioned in the blog :

Re: Help with a Join Query

2014-12-11 Thread Kydryavtsev Andrey
How about something like ({!join from=parent to=id}color:red) AND ({!join from=parent to=id}color:blue) ? 11.12.2014, 19:48, Darin Amos dari...@gmail.com: Hello, I am trying to execute a join query that I am not 100% sure how to execute. Lets say I have a bunch of parent and child

RE: WordBreakSolrSpellChecker Usage

2014-12-11 Thread Dyer, James
Matt, There is no exact number here, but I would think most people would want count to be maybe 10-20. Increasing this incurs a very small performance penalty for each term it generates suggestions for, but you probably won't notice a difference. For maxCollationTries, 5 is a reasonable

Re: Help with a Join Query

2014-12-11 Thread Darin Amos
Thanks, That looks like a viable option, I could do something like the following: q={!join from=parent to=id}my main search term query fq={!join from=parent to=id}color:red fq={!join from=parent to=id}color:blue With all these joins happening like this, what kind of performance concern is

Re: Help with a Join Query

2014-12-11 Thread Kydryavtsev Andrey
11.12.2014, 21:24, Darin Amos dari...@gmail.com: Thanks, That looks like a viable option, I could do something like the following: q={!join from=parent to=id}my main search term query fq={!join from=parent to=id}color:red fq={!join from=parent to=id}color:blue With all these joins

Re: Help with a Join Query

2014-12-11 Thread Kydryavtsev Andrey
According to may experience, query time join has relatively poor performance. If you can cache this joins effectively (not so many unique color values in requests, cache doesn't invalidate) - it's ok. If not, it may be interesting to try block join instead -

Re: Inconsistent doc value across two nodes - very simple test - what's the expected behavior?

2014-12-11 Thread Shalin Shekhar Mangar
Hi Gili, Great question! A write in Solr, by default, is only guaranteed to exist in 1 place i.e. the leader and the safety valves that we have to preserve these writes are: 1. The leaderVoteWait time for which leader election is suspended until enough live replicas are available 2. The two-way

Re: Solr 4.10.2 Found core but I get No cores available in dashboard page

2014-12-11 Thread solr-user
my apologies for the lack of clarity our internal name for the project to upgrade solr from 4.0 to 4.10.2 is helios and so we named our test folder heliosearch. I was not even aware of the github project Heliosearch, and nothing we are doing is related to it. to simplify things for this post,

Re: Solr 4.10.2 Found core but I get No cores available in dashboard page

2014-12-11 Thread solr-user
small correction; coreX (the one with the unmodified schema.xml and solrconfig.xml) IS seen by solr and appears on the solr admin page, but coreA (which has our modified schema and solrconfig) is found by solr but is not shown in the solr admin page: 1494 [main] INFO

RE: Help with a Join Query

2014-12-11 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)
May be you can try using AND condition in the single join something like q={!join from=parent to=id}(Id:xxx AND (Color:red OR Color:Blue)), I don't think this will give bigger performance issue. Thanks Ravi -Original Message- From: Darin Amos [mailto:dari...@gmail.com] Sent:

Re: Solr 4.10.2 Found core but I get No cores available in dashboard page

2014-12-11 Thread Alexandre Rafalovitch
And the XML is valid, lib references in solrconfig.xml point to the right libraries (if any), you don't have duplicate definitions of types, you don't have missing definitions of types? And you didn't disable the admin handler? And it's not just admin that's failing to find the core, right? If

Re: Solr 4.10.2 Found core but I get No cores available in dashboard page

2014-12-11 Thread Chris Hostetter
: can you please include the *exact* solrconfig.xml schema.xml you are : using for coreA ... you've given us an overview of what you changed, but : that's not enough for anyone to actally try and reproduce your problem. if it helps (since hte list doesn't allow attachments) feel free to open a

Re: Solr 4.10.2 Found core but I get No cores available in dashboard page

2014-12-11 Thread Chris Hostetter
: coreA (which has our modified schema and solrconfig) is found by solr but is : not shown in the solr admin page: can you please include the *exact* solrconfig.xml schema.xml you are using for coreA ... you've given us an overview of what you changed, but that's not enough for anyone to

Re: Solr 4.10.2 Found core but I get No cores available in dashboard page

2014-12-11 Thread solr-user
yes, have triple checked the schema and solrconfig XML; various tools have indicated the XML is valid no missing types or dupes, and have not disabled the admin handler as mentioned in my most recent response, I can see the coreX core (the renamed and unmodified collection1 core from the

Re: Solr 4.10.2 Found core but I get No cores available in dashboard page

2014-12-11 Thread solr-user
Chris, will get the schema and solrconfig ready for uploading. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-4-10-2-Found-core-but-I-get-No-cores-available-in-dashboard-page-tp4173602p4173840.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Inconsistent doc value across two nodes - very simple test - what's the expected behavior?

2014-12-11 Thread Shalin Shekhar Mangar
I opened https://issues.apache.org/jira/browse/SOLR-6837 Probably best to have further conversations on the Jira issue. On Thu, Dec 11, 2014 at 6:46 PM, Shalin Shekhar Mangar shalinman...@gmail.com wrote: Hi Gili, Great question! A write in Solr, by default, is only guaranteed to exist in

Re: Solr Error when making GeoPrefixTree polygon filter search

2014-12-11 Thread david.w.smi...@gmail.com
As in the layout shipped with Solr? Try putting the JTS ‘jar’ in lib/ext and let us know if that worked. I think it will but I forget. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Thu, Dec 11, 2014 at 12:40 PM, mathaix

different fields for user-supplied phrases in edismax

2014-12-11 Thread Michael Sokolov
I'd like to supply a different set of fields for phrases than for bare terms. Specifically, we'd like to treat phrases as more exact - probably turning off stemming and generally having a tighter analysis chain. Note: this is *not* what's done by configuring pf which controls fields for the

Mutli Lengual Suggester Solr 4.8

2014-12-11 Thread alaa.abuzaghleh
I am trying create suggester handler using solr 4.8, everything work fine but when I try to get suggestion using different language Arabic, or Japanese for example I got result in mixed language, but I am trying to search only using Japanese, I got Arabic with that too. the following is my

Re: different fields for user-supplied phrases in edismax

2014-12-11 Thread Ahmet Arslan
Hi Mike, If I am not wrong, you are trying to simulate google behaviour. If you use quotes, google return exact matches. I think that makes perfectly sense and will be a valuable addition. I remember some folks asked/requested this behaviour in the list. Ahmet On Thursday, December 11, 2014

Re: different fields for user-supplied phrases in edismax

2014-12-11 Thread alaa.abuzaghleh
requestHandler name=/suggest class=solr.SearchHandler lst name=defaults str name=echoParamsexplicit/str str name=defTypeedismax/str str name=rows10/str str name=flfull_name,job_tree,

Re: Highlighting integer field

2014-12-11 Thread Pawel
Hi, Thanks for response. It is quite important to me for example to highlight multivalued field with many int or long tokens. -- Paweł On Thu, Dec 11, 2014 at 3:08 PM, Tomoko Uchida tomoko.uchida.1...@gmail.com wrote: Hi Pawel, Essentially, highlighting is a feature to show fragments of

Re: Highlighting integer field

2014-12-11 Thread Michael Sokolov
So the short answer to your original question is no. Highlighting is designed to find matches *within* a tokenized (text) field only. That is difficult because text gets processed and there are all sorts of complications, but for integers it should be pretty easy to match the values in the

To understand SolrCloud configurations

2014-12-11 Thread E S J
Hello Team, I would like to get clarified where to place schema.xml on SolrCloud set-up. My Solr cloud set-up , 3 nodes, 3 shards and 3 replications, 3 ZooKeeper What I have done is, 1. Taken a solr.war from solr default download ( solr-4.10.2/example/webapps/solr.war - 4.10.2) and placed

To understand SolrCloud configurations

2014-12-11 Thread E S J
Hello Team, I would like to get clarified where to place schema.xml on SolrCloud set-up. My Solr cloud set-up , 3 nodes, 3 shards and 3 replications, 3 ZooKeeper What I have done is, 1. Taken a solr.war from solr default download ( solr-4.10.2/example/webapps/solr.war - 4.10.2) and placed

Browse interface

2014-12-11 Thread tharpa
Is it possible to boost a query using the browse interface? How would one do this? -- View this message in context: http://lucene.472066.n3.nabble.com/Browse-interface-tp4173897.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: To understand SolrCloud configurations

2014-12-11 Thread Erick Erickson
bq: 1. Like to know what I have done is correct ? Looks fine to me. bq: 2. Where to place the schema.xml's and other configurations. Because for the moment it's are under collection1/conf folder and collection1 is not an active collection for me. ( i'm using only c-ins core) I think you're a bit

Re: To understand SolrCloud configurations

2014-12-11 Thread E S J
Thanks Eric, I understand your explanation. Quick question, Are configurations sits under /configs/defaults because -configname specified as default when I execute the following command? Can I specify -configname as /c-ins/ zkcli.sh -zkhost zoo1.internal:2183,zoo2.internal:2183,zoo3.internal:2183

Re: Details on why ConccurentUpdateSolrServer is reccommended for maximum index performance

2014-12-11 Thread Shawn Heisey
On 12/11/2014 9:19 AM, Michael Della Bitta wrote: Only thing you have to worry about (in both the CUSS and the home grown case) is a single bad document in a batch fails the whole batch. It's up to you to fall back to writing them individually so the rest of the batch makes it in. With CUSS,

Re: To understand SolrCloud configurations

2014-12-11 Thread Shawn Heisey
On 12/11/2014 6:31 PM, E S J wrote: Thanks Eric, I understand your explanation. Quick question, Are configurations sits under /configs/defaults because -configname specified as default when I execute the following command? Can I specify -configname as /c-ins/ zkcli.sh -zkhost

Re: To understand SolrCloud configurations

2014-12-11 Thread E S J
Thanks, I thought only option is default or schemaless because , When we run bin/solr -e cloud you will get prompt like , To begin, how many Solr nodes would you like to run in your local cluster? (specify 1-4 nodes) [2] 3 Ok, let's start up 3 Solr nodes for your example SolrCloud cluster.

Re: To understand SolrCloud configurations

2014-12-11 Thread Shawn Heisey
On 12/11/2014 8:09 PM, E S J wrote: Thanks, I thought only option is default or schemaless because , When we run bin/solr -e cloud you will get prompt like , snip *Please choose a configuration for the gettingstarted collection, available options are: default or schemaless [default]* I have

Documents with SOLR function sort are NOT sorted by score

2014-12-11 Thread eakarsu
I am having difficulty with my sort function. With the following sort, documents are not sorted by score if you can see. Why sort function is not able to sort it properly? I appreciate your prompt answer This is my sort function.

Join in SOLR

2014-12-11 Thread Rajesh
I'm using Solr 4.10. While importing through DIH, I've configured 3 separate entities. I'm facing some problems for indexing and retrieval. 1) How can I give the unique key, as the 3 entities will have different fields. 2) Is there a join query, from which I can join all the 3 tables. -- View

[Hep] tab delimited gz file indexing steps

2014-12-11 Thread Sithik
Team, I have a compressed text file (gz) which holds tab delimited data. Is it possible for me to index this file directly without doing any pre processing of uncompressing the file on my own? if so, can you please tell me the steps/config changes I am supposed to follow. BTW, I am using

Re: Join in SOLR

2014-12-11 Thread Tomoko Uchida
Hi, I cannot guess what is 'entities' in your context, but do you want some kind of join functionality like RDBs on Solr? Basically, Solr is not relational. So at first, you should consider denormalize your RDB tables to one table/view (or issue SQL JOIN query in DIH) to import data to Solr. If