Regarding delta-import and full-import

2012-09-28 Thread darshan
Hi All, Can anyone refer me few number blogs that explains both imports in little bit more detail and with examples. Thanks, Darshan

Merge Policy Recommendation for 3.6.1

2012-09-28 Thread Sujatha Arun
Hello, In the case where there are over 200+ cores on a single node , is it recommended to go with Tiered MP with segment size of 4 ? Our Index size vary from a few MB to 4 GB . Will there be any issue with Too many open files and the number of indexes with respect to MP ? At the moment we are

Re: Query filtering

2012-09-28 Thread Finotti Simone
Hi Amit, thank you for your answer, but I did know how to do it with two distinct queries: I hoped for some way to do it with a single query :-) (maybe using some advanced functionality with nested queries...) S Inizio: Amit Nithian [anith...@gmail.com]

RE: Indexing in Solr: invalid UTF-8

2012-09-28 Thread Patrick Oliver Glauner
Thank you. I will check our textification process and see how to improve it. Patrick From: Michael McCandless [luc...@mikemccandless.com] Sent: Wednesday, September 26, 2012 5:45 PM To: solr-user@lucene.apache.org Subject: Re: Indexing in Solr: invalid

how to boost query term after tokenizer

2012-09-28 Thread 曹霖
if the query word is ABCD,then after being tokenized it is A BC D , l want to boost term BC ,so the query word is like this: A BC^10 D and phrase query ABCD . all query words users typing in will be processed like that automaticly. l guess l can custom a filter to do it ,but l don't know if it is

Re: Problem with Special Characters in SOLR Query

2012-09-28 Thread Erick Erickson
There's a zillion of them, see: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters. but note that this isn't a complete list, there's a link at the top of the page to the javadocs. For you latest example you probably want to see AsciiFoldingFilterFactory But you're now conflating

Boosting in query level the relevance based in content of any fields

2012-09-28 Thread Claudio Ranieri
Hello, How can I boost in query level the relevance of documents based in content of any fields? Example, I have 5 documents with fields: name and city: name | city Jose | Campinas Jose | São Paulo Jose | Rio de Janeiro Jose | Rio Branco Jose | Ourinhos In search by Jose, I wish return on top

Re: Boosting in query level the relevance based in content of any fields

2012-09-28 Thread Miguel Ángel Martín
Hi c. Recently, i was testing that issue, i use boost query in solr 3.6.1 with edixmax and works fine for me Regards El 28/09/2012 14:44, Claudio Ranieri claudio.rani...@estadao.com escribió: Hello, How can I boost in query level the relevance of documents based in content of any fields?

RES: Boosting in query level the relevance based in content of any fields

2012-09-28 Thread Claudio Ranieri
With dismax / edixmax I can boost a field, but not boosting based on content of field (in my example, São Paulo and Rio de Janeiro must have boost, but Campinas, Rio Branco and Ourinhos not). Is it possible? -Mensagem original- De: Miguel Ángel Martín

Solrj and server shutdown

2012-09-28 Thread Marc SCHNEIDER
Hi, I'm using SolrJ along with EmbeddedServer component in a Tomcat server. I was wondering why Tomcat couldn't be gracefully shut down and discovered it was a Solr issue. When Tomcat is stopped I do call the shutdown methods of both CoreContainer and EmbeddedSolrServer instances. But this didn't

Re: Can SOLR Index UTF-16 Text

2012-09-28 Thread Shawn Heisey
On 9/27/2012 2:55 PM, vybe3142 wrote: Our SOLR setup (4.0.BETA on Tomcat 6) works as expected when indexing UTF-8 files. Recently, however, we noticed that it has issues with indexing certain text files eg. UTF-16 files. I'd wait for a yes/no vote on this from one of the actual experts on

Re: Merge Policy Recommendation for 3.6.1

2012-09-28 Thread Shawn Heisey
On 9/28/2012 12:43 AM, Sujatha Arun wrote: Hello, In the case where there are over 200+ cores on a single node , is it recommended to go with Tiered MP with segment size of 4 ? Our Index size vary from a few MB to 4 GB . Will there be any issue with Too many open files and the number of

Fallout from the deprecation of setQueryType

2012-09-28 Thread Shawn Heisey
I am planning and building up a test system with Solr 4.0, for my eventual upgrade. I have not made a lot of progress so far, but I have come across a potential problem. I have seen that setQueryType is being deprecated in favor of setRequestHandler, and that the recommendation now is to use

configure saxon in 4.x

2012-09-28 Thread tech.vronk
hi, i am unable to configure saxon as the xslt-transformer of choice in solr 4.x (ALPHA and BETA) On startup, I keep getting the error: null:javax.xml.transform.TransformerFactoryConfigurationError: Pr ovider net.sf.saxon.TransformerFactoryImpl not found even though the log before says, it is

DIH import transaction issue

2012-09-28 Thread Steve Cerny
Hello, I have a simple Solr core setup to index 500 store locations with their corresponding latitude / longitude. The core is setup, running, and importing correctly however I'm seeing odd behavior when monitoring Sql. Here are the steps I'm going through and the behavior we're seeing. We have

Re: How to retrive value from float field in custom request handler?

2012-09-28 Thread Chris Hostetter
: But why Solr Field is not having a method to retrieve values for all data : types? : something similar to : Object obj = doc.getField(Field1); : : Why only stringvalue is exposed in this Field class? : : doc.getField(Field1).stringValue() Lucene's Document objecta is a very low level

Re: How to run Solr Cloud using Tomcat?

2012-09-28 Thread Chris Hostetter
: Subject: How to run Solr Cloud using Tomcat? : References: 1348661945059-4010478.p...@n3.nabble.com : 1348667092.74317.yahoomailclas...@web121704.mail.ne1.yahoo.com : 1348740007490-4010707.p...@n3.nabble.com : In-Reply-To: 1348740007490-4010707.p...@n3.nabble.com

Re: Boosting in query level the relevance based in content of any fields

2012-09-28 Thread Erick Erickson
How can Solr/Lucene distinguish between that which must be boosted and that which must not? There must be some rule here, what is it? Until one understands the rule, there's not much one can suggest... Best Erick On Fri, Sep 28, 2012 at 9:00 AM, Claudio Ranieri claudio.rani...@estadao.com

Re: Filter query not null or in list

2012-09-28 Thread Kiran J
Thank you Jack, that works. Kiran On Thu, Sep 27, 2012 at 5:18 PM, Jack Krupansky j...@basetechnology.comwrote: Add a *:* before the negative query. (*:* -Group:[* TO *]) OR Group:(X OR Y OR Z) -- Jack Krupansky -Original Message- From: Kiran J Sent: Thursday, September 27, 2012

Re: Problem configuring Solr plugin for Liferay 6.1 GA2 Ver 6.1.20

2012-09-28 Thread Chris Hostetter
: As solr-web plugin still not available I wanted to configure Liferay 6.1 : GA2 to use solr-web-6.1.10.1 throwing following error when deployed, : appreciate if someone through some light how to resolve. Spent almost a : couple weeks could not find any resolution. Your error message does not

Trie Based field (long) value parsing on query time

2012-09-28 Thread Clecio Varjao
HI All, I was wondering if someone could point to a direction on how to implement a rewrite for the value on trie field (long) on query time. Example, considering the query: q=reference:4-1.2 the value is a text, but the following is indexed as a number (e.g.: 004001002, where 4 becomes 004,

Re: Trie Based field (long) value parsing on query time

2012-09-28 Thread Chris Hostetter
: q=reference:4-1.2 : : the value is a text, but the following is indexed as a number (e.g.: : 004001002, where 4 becomes 004, and 1 becomes 001, and 2 002), depnding on how you look at it, you could implment this as one of two plugins: 1) if you consider this a special form of query syntax,

Re: httpSolrServer and exyternal load balancer

2012-09-28 Thread Chris Hostetter
: The issue we face is the f5 balancer is returning a cookie which the client : is hanging onto. resulting in the same slave being hit for all requests. ... : My question is can I configure the solr server to ignore client state ? We : are on solr 3.4 I'm not an expert on HTTP session

Re: DIH import from MySQL results in garbage text for special chars

2012-09-28 Thread Chris Hostetter
This is what i see in your original email... I am attempting to import documents to Solr from MySQL using DIH. One of the field contains the text - =E2=80=9CFuture of Mobile Value Added Service=s (VAS) in Australia=E2=80=9D .Notice the character =E2=80=9C and =E2=80=9D. E2 80 9C and E2

Re: Can SOLR Index UTF-16 Text

2012-09-28 Thread Chris Hostetter
: Our SOLR setup (4.0.BETA on Tomcat 6) works as expected when indexing UTF-8 : files. Recently, however, we noticed that it has issues with indexing : certain text files eg. UTF-16 files. See attachment for an example : (tarred+zipped) : : tesla-utf16.txt :

Re: need best solution for indexing and searching multiple, related database tables

2012-09-28 Thread Biff Baxter
Hi jimtronic, I think you have the answers I need. I have several questions. As a rough example, when you create your view, if you have one parent record and three child records, do you end up with three denormalized records or one record that has data for three child records consolidated into

Re: need best solution for indexing and searching multiple, related database tables

2012-09-28 Thread Biff Baxter
Hi Jack, The queries we need to do have a lot in common with SQL queries in that they are field specific, but we need lots of fuzzy stuff so SQL won't do. Speed is also an issue. I will provide sample queries and data if needed. For now, I am waiting to hear back from jimtronic. Thanks for

Re: need best solution for indexing and searching multiple, related database tables

2012-09-28 Thread Walter Underwood
Denormalize. Think about a view that shows your results page. Put that view in Solr. Query and get ready for speed. wunder On Sep 28, 2012, at 5:34 PM, Biff Baxter wrote: Hi Jack, The queries we need to do have a lot in common with SQL queries in that they are field specific, but we need

Re: need best solution for indexing and searching multiple, related database tables

2012-09-28 Thread Biff Baxter
Hi Walter, I have bought into the denormalize approach. My remaining questions are around how to construct the denormlized view and any solr functions that would support issues related to a) minimizing the denormalization explosion for 3 or more tables and b) handling many to many relationships.

how to boost query term after tokenizer

2012-09-28 Thread 曹霖
if the query word is ABCD,then after being tokenized it is A BC D , l want to boost term BC ,so the query word is like this: A BC^10 D and phrase query ABCD . all query words users typing in will be processed like that automaticly. l guess l can custom a filter to do it ,but l don't know if it is

Re: Boosting in query level the relevance based in content of any fields

2012-09-28 Thread 曹霖
u can use boost function query 2012/9/29 Erick Erickson erickerick...@gmail.com How can Solr/Lucene distinguish between that which must be boosted and that which must not? There must be some rule here, what is it? Until one understands the rule, there's not much one can suggest... Best