Re: Problem with AND clause in multi core search query

2012-05-14 Thread Tommaso Teofili
The latter is supposed to work: http://localhost:8983/solr/core0/select?shards=localhost:8983/solr/core0,localhost:8983/solr/core1&q=column1 :"A" OR column2:"B" The first query cannot work as there is no document neither in core0 nor in core1 which has A in field column1 and B in field column2 but

Re: problem with date searching.

2012-05-14 Thread ayyappan
select/?defType=dismax&q=+ibrahim.hamid+2012-02-02T04:00:52Z&qf=+userid+scanneddate&version=2.2&start=0&rows=50&indent=on&wt=json&&debugQuery=on -- View this message in context: http://lucene.472066.n3.nabble.com/problem-with-date-searching-tp3961761p3983802.html Sent from the Solr - User mailing

Re: problem with date searching.

2012-05-14 Thread ayyappan
In fact I am able to see "scanneddate" field when i added query like this "responseHeader":{ - - "q":" ibrahim.hamid 2012-02-02T04:00:52Z", "qf":" userid scanneddate", "wt":"json", "defType":"dismax", "version":"2.2", "rows":"50"}}, "response":{"numF

Problem with AND clause in multi core search query

2012-05-14 Thread ravicv
Hi, I have 2 cores configured in my solr instance. Both cores are using same schema. I have indexed column1 in core0 and column2 in core1 My search query is http://localhost:8983/solr/core0/select?shards=localhost:8983/solr/core0,localhost:8983/solr/core1&q=column1:"A"; AND column2:"B" No re

Re: Getting payloads for matching term in search result

2012-05-14 Thread s . hermann
Hello, On 05/14/2012 10:33 PM, Mikhail Khludnev wrote: It's not really out-of-the-box, but not a big deal http://www.lucidimagination.com/blog/2010/04/18/refresh-getting-started-with-payloads/ yeah I know, but I do not know where to put/plugin the code on solrs server side. For testing purpo

document cache

2012-05-14 Thread shinkanze
hi , I want to know the internal mechanism how document cache works . specifically its flushing cycle ... i.e does it gets flushed on every commit /replication . regards Rajat Rastogi -- View this message in context: http://lucene.472066.n3.nabble.com/document-cache-tp3983796.html Sent f

Re: socket timeout

2012-05-14 Thread Jon Kirton
Here is the stacktrace for the timeout: 2012-05-09 13:08:30,521 [http-8080-62] DEBUG solr.SolrService - org.apache.solr.client.solrj.SolrServerException: java.net.SocketTimeoutException: Read timed out org.apache.solr.client.solrj.SolrServerException: java.net.SocketTimeoutException: Read timed

socket timeout

2012-05-14 Thread Jon Kirton
Intermittently, a socket timeout occurs from a search request. Is there a config param I can set in solrconfig.xml to specify socket timeouts for version 1.4.1 ?

Re: Unexpected query rewrite from WordDelimiterFilterFactory and SynonymFilterFactory

2012-05-14 Thread Jack Krupansky
If it is important enough for you, you could expand multi-word and compound word synonyms as a preprocessing step and generate an "OR" expression in the query. -- Jack Krupansky -Original Message- From: Chung Wu Sent: Monday, May 14, 2012 8:25 PM To: solr-user@lucene.apache.org Subje

Re: facet range query question

2012-05-14 Thread andy
THANKS for your relay -- View this message in context: http://lucene.472066.n3.nabble.com/facet-range-query-question-tp3976026p3983783.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Unexpected query rewrite from WordDelimiterFilterFactory and SynonymFilterFactory

2012-05-14 Thread Chung Wu
Thanks Jack! It's too bad I can't have catenate and generateParts both set to "1" at query time. If I set catenate to "0", then I miss the case where "wifi" is indexed but "wi-fi" is queried. If I set generateParts to "0", then I miss the case where "wi fi" is queried but "wi-fi" is canceled.

Re: adding an OR to a fq makes some doc that matched not match anymore

2012-05-14 Thread Jack Krupansky
Don't forget to uuencode the spaces as "+" or "%20". Playing around, I noticed that putting parens around the negative term changed the results: I'm not sure whether that is a bug or not. In any case, try: /suggest?q=suggest_terms:lap*&fq=type:P&fq=((-type:B)+OR+name:aa) -- Jack Krupansky

Re: Unexpected query rewrite from WordDelimiterFilterFactory and SynonymFilterFactory

2012-05-14 Thread Jack Krupansky
The extra terms are okay at index time - they simply overlap the base words and make composite terms more searchable, but you need to have a separate query analyzer that sets the various catenate options to "0" since the query generator doesn't know what to do with the extra terms. Synonyms are

Re: Boosting on field empty or not

2012-05-14 Thread Jack Krupansky
Reading more closely, I see that there is a workaround: add a space after any left parenthesis. So, try this: q=chairs+AND+(+regularprice:*^5+OR+(+*:*+-regularprice:*)^5) Here's an example of my own: http://localhost:8983/solr/select/?q=the+AND+(+price:*^5+OR+(+*:*+-price:*)^0.5)&defType=edis

adding an OR to a fq makes some doc that matched not match anymore

2012-05-14 Thread jmlucjav
Hi, I am trying to understand this scenario (Solr3.6): - /suggest?q=suggest_terms:lap*&fq=type:P&fq=(-type:B) numFound=1 - I add a OR to the second fq. That fq is already fulfilled by the found doc, so adding a doc will also fulfill right? /suggest?q=suggest_terms:lap*&fq=type:P&fq=(-type:B OR n

Re: Boosting on field empty or not

2012-05-14 Thread Jack Krupansky
Oh well, it looks like my suggestion is running into "SOLR-3377 - eDismax: A fielded query wrapped by parens is not recognized". See: https://issues.apache.org/jira/browse/SOLR-3377 That issue has a patch, but not yet committed. That explains why it works in the traditional Solr/Lucene query p

Re: Index an xml filed that is saved in a database

2012-05-14 Thread Jack Krupansky
Granted, a proper XML parse of the input field is better. I didn't see an obvious solution at first, but I did run across this: "Use a fielddatasource for reading field from database and then use xpathentityprocessor. Field datasource will give you the stream that is needed by xpathentity proc

Re: Index an xml filed that is saved in a database

2012-05-14 Thread Michael Della Bitta
That answer may serve the OP well, but I can't help but propagate this link when the idea of parsing XML with regex comes up: http://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags/1732454#1732454 :) Michael On Mon, 2012-05-14 at 17:03 -0400, Jack Kr

- Solr 4.0 - How do I enable JSP support ? ...

2012-05-14 Thread Naga Vijayapuram
Hello, How do I enable JSP support in Solr 4.0 ? Thanks Naga

Re: Urgent! Highlighting not working as expected

2012-05-14 Thread Jack Krupansky
The highlighting will be based only on the fields in which matching occurred. Are you using edismax and with multiple fields in qf, or the traditional Solr (Lucene) query parser that only matches in the default field or an explicit field? -- Jack Krupansky -Original Message- From: TJ

Re: Solr Shards multi core slower then single big core

2012-05-14 Thread Otis Gospodnetic
Aha!  See, Kuli, I wasn't making it up! ;) Otis  Performance Monitoring for Solr / ElasticSearch / HBase - http://sematext.com/spm  > > From: Robert Stewart >To: solr-user@lucene.apache.org >Sent: Monday, May 14, 2012 11:23 AM >Subject: Re: Solr Shards mu

Re: Update JSON not working for me

2012-05-14 Thread rjain15
I haven't modified any schema or config. I am going to do it all over...clean install. I tried with 3.6 and I have the same issue. I am going to try with 4.x one more time, its been painful, I am so excited to use Solr for my project, and seems I am stuck on the basics. Thanks Rajesh -- Vie

Re: Index an xml filed that is saved in a database

2012-05-14 Thread Jack Krupansky
A regex transformer should do the trick: http://wiki.apache.org/solr/DataImportHandler#RegexTransformer -- Jack Krupansky -Original Message- From: Ramo Karahasan Sent: Monday, May 14, 2012 4:54 PM To: solr-user@lucene.apache.org Subject: Index an xml filed that is saved in a database

Index an xml filed that is saved in a database

2012-05-14 Thread Ramo Karahasan
Hi, I have an XML document saved in a column of a database table. Is it possible to index just one part of that xml string, e.g. . with the DIH handler or is it necessary to extract this information previously? Thanks, Ramo

Urgent! Highlighting not working as expected

2012-05-14 Thread TJ Tong
Dear all, I queried Solr (3.5) with this: q=text:"G-Money"&hl=true&hl.fl=*, where text is a "text" field and all the other fields were copied to it. I got three records returned, however, only one field (also "text" field) was highlighted: G-MONEY HETZEL But the other two also have match

Re: Boosting on field empty or not

2012-05-14 Thread Jack Krupansky
It may not matter, but the spaces in the query should be uuencoded as "+". I have the query working with Solr query, but it is giving me no docs for edismax for some reason. But, it does seem to work if I reverse the order of the query terms to be: http://localhost:8983/solr/select/?q=*:*+AND

Re: Getting payloads for matching term in search result

2012-05-14 Thread Mikhail Khludnev
It's not really out-of-the-box, but not a big deal http://www.lucidimagination.com/blog/2010/04/18/refresh-getting-started-with-payloads/ On Mon, May 14, 2012 at 5:13 PM, wrote: > Good day > > currently I have a field defined as can be seen below: > > class="solr.TextField"> > > > d

Re: Boosting on field empty or not

2012-05-14 Thread Donald Organ
OK thats giving me only documents that have the field populated On Mon, May 14, 2012 at 4:28 PM, Donald Organ wrote: > OK i think i found the proper way to do what i was trying to do: > > &q=chairs AND (regularprice:[0 TO *]^5) > > > On Mon, May 14, 2012 at 4:25 PM, Donald Organ wrote: > >> I've

Re: Boosting on field empty or not

2012-05-14 Thread Donald Organ
OK i think i found the proper way to do what i was trying to do: &q=chairs AND (regularprice:[0 TO *]^5) On Mon, May 14, 2012 at 4:25 PM, Donald Organ wrote: > I've even tried upping the boost to 10 and the de-boost to 1but yet > its still applying the boost to all the documents returned.

Re: Boosting on field empty or not

2012-05-14 Thread Donald Organ
I've even tried upping the boost to 10 and the de-boost to 1but yet its still applying the boost to all the documents returned. So it matter if this is a money field? On Mon, May 14, 2012 at 4:19 PM, Donald Organ wrote: > &q=chairs AND (regularprice:*^5 OR (*:* -regularprice:*)^0.5)&sort=sco

Re: Boosting on field empty or not

2012-05-14 Thread Donald Organ
&q=chairs AND (regularprice:*^5 OR (*:* -regularprice:*)^0.5)&sort=score desc Same effect. On Mon, May 14, 2012 at 4:12 PM, Jack Krupansky wrote: > Change the second boost to 0.5 to de-boost doc that are missing the field > value. You had them the same. > > -- Jack Krupansky > > -Original

Re: Boosting on field empty or not

2012-05-14 Thread Jack Krupansky
Change the second boost to 0.5 to de-boost doc that are missing the field value. You had them the same. -- Jack Krupansky -Original Message- From: Donald Organ Sent: Monday, May 14, 2012 4:01 PM To: solr-user@lucene.apache.org Subject: Re: Boosting on field empty or not OK it looks l

Re: Update JSON not working for me

2012-05-14 Thread Yonik Seeley
On Mon, May 14, 2012 at 3:11 PM, Rajesh Jain wrote: > Hi Yonik > > i tried without the json in the URL, the result was same but in XML format Interesting... the XML response is fine (just not ideal). When I tried it, I did get a JSON response (perhaps I'm running a later version of trunk... the

Re: Boosting on field empty or not

2012-05-14 Thread Donald Organ
OK it looks like the query change is working but it looks like it boosting everything even documents that have that field empty On Mon, May 14, 2012 at 3:41 PM, Donald Organ wrote: > OK i must be missing something: > > > defType=edismax&start=0&rows=24&facet=true&qf=nameSuggest^10 name^10 >

Re: Update JSON not working for me

2012-05-14 Thread Jack Krupansky
I just tried update/json myself with example (no changes) for both 3.6 and the same trunk build as you used, and it works fine for me - I get 4 docs for cat:book. Did you modify the schema or config? -- Jack Krupansky -Original Message- From: Rajesh Jain Sent: Monday, May 14, 2012 3

Re: Boosting on field empty or not

2012-05-14 Thread Donald Organ
OK i must be missing something: defType=edismax&start=0&rows=24&facet=true&qf=nameSuggest^10 name^10 codeTXT^2 description^1 brand_search^0 cat_search^10&spellcheck=true&spellcheck.collate=true&spellcheck.q=chairs&facet.mincount=1&fl=code,score&q=chairs AND (regularprice:*^5 OR (*:* -regularprice

Re: Boosting on field empty or not

2012-05-14 Thread Jack Krupansky
"(*:* -regularprice:*)5" should be "(*:* -regularprice:*)^0.5" - the missing boost operator. -- Jack Krupansky -Original Message- From: Donald Organ Sent: Monday, May 14, 2012 3:31 PM To: solr-user@lucene.apache.org Subject: Re: Boosting on field empty or not Still doesnt appear to b

Re: Boosting on field empty or not

2012-05-14 Thread Donald Organ
Still doesnt appear to be working. Here is the full Query string: defType=edismax&start=0&rows=24&facet=true&qf=nameSuggest^10 name^10 codeTXT^2 description^1 brand_search^0 cat_search^10&spellcheck=true&spellcheck.collate=true&spellcheck.q=chairs&facet.mincount=1&fl=code,score&q=chairs AND (reg

Re: Boosting on field empty or not

2012-05-14 Thread Jack Krupansky
Sorry, make that: &q=chairs AND (regularprice:*^5 OR (*:* -regularprice:*)^0.5) I forgot that pure negative queries are broken again, so you need the *:* in there. I noticed that you second boost operator was missing as well. -- Jack Krupansky -Original Message- From: Donald Organ

Re: Boosting on field empty or not

2012-05-14 Thread Donald Organ
OK i just tried: &q=chairs AND (regularprice:*^5 OR (-regularprice:*)5) And that gives me 0 results On Mon, May 14, 2012 at 2:51 PM, Jack Krupansky wrote: > foo AND (field:*^2.0 OR (-field:*)^0.5) > > So, if a doc has anything in the field, it gets boosted, and if the doc > does not have anyt

Re: Kernel methods in SOLR

2012-05-14 Thread Lance Norskog
Lucene provides these vectors as 'term vectors' or 'term frequency vectors'. The MoreLikeThis feature does queries against these (I think). http://www.lucidimagination.com/search/?q=term+vectors http://www.lucidimagination.com/search/?q=MoreLikeThis On Mon, May 14, 2012 at 11:07 AM, Dmitry Kan w

Re: Update JSON not working for me

2012-05-14 Thread Rajesh Jain
Hi Yonik i tried without the json in the URL, the result was same but in XML format C:\Tools\Solr\apache-solr-4.0-2012-05-04_08-23-31\example\exampledocs>C:\tools\curl\curl http://localhost:8983/solr/update?commit=true --data-binary @money.json -H 'Content-type:application/json' 045 On Mon

Re: Update JSON not working for me

2012-05-14 Thread Rajesh Jain
Jack I tried with cat=book which was in books.json and in my smaller demo file, I had it as books, in either case it doesn't seem to work. >From the example on http://wiki.apache.org/solr/UpdateJSON http://localhost:8983/solr/select?q=title:monsters&wt=json&indent=true This should result in out b

Re: Update JSON not working for me

2012-05-14 Thread Jack Krupansky
The books.json in example/exampledocs has: "cat" : ["book","hardcover"], That is "book" singular, not "books" plural as in your query. There is no stemming since it is a string field, not text. -- Jack Krupansky -Original Message- From: Rajesh Jain Sent: Monday, May 14, 2012 2:42 P

Re: Update JSON not working for me

2012-05-14 Thread Yonik Seeley
I think this may be due to https://issues.apache.org/jira/browse/SOLR-2857 JIRA is down right now so I can't check, but I thought the intent was to have some back compat. Try changing the URL from /update/json to just /update in the meantime -Yonik http://lucidimagination.com On Mon, May 14, 20

Re: Boosting on field empty or not

2012-05-14 Thread Jack Krupansky
foo AND (field:*^2.0 OR (-field:*)^0.5) So, if a doc has anything in the field, it gets boosted, and if the doc does not have anything in the field, de-boost it. Choose the boost factors to suit your desired boosting effect. -- Jack Krupansky -Original Message- From: Donald Organ S

Re: Problems with field names in solr functions

2012-05-14 Thread Yonik Seeley
In trunk, see: * SOLR-2335: New 'field("...")' function syntax for refering to complex field names (containing whitespace or special characters) in functions. The schema in trunk also specifies: -Yonik http://lucidimagination.com On Thu, May 10, 2012 at 11:28 AM, Iker Huerga wrote: > Hi

Re: Update JSON not working for me

2012-05-14 Thread Rajesh Jain
Hi Jack I am following the http://wiki.apache.org/solr/UpdateJSON tutorials. The first example is of books.json, which I executed, but I dont see any books http://localhost:8983/solr/collection1/browse?q=cat%3Dbooks 0 results found in 26 ms Page 0 of 0 I modified the books.json to add my own

Re: Boosting on field empty or not

2012-05-14 Thread Donald Organ
OK maybe i need to describe this a little more. Basically I want documents that have a given field populated to have a higher score than the documents that dont. So if you search for foo I want documents that contain foo, but i want the documents that have field a populated to have a higher score

Re: Update JSON not working for me

2012-05-14 Thread Jack Krupansky
Check the examples of update/json here: http://wiki.apache.org/solr/UpdateJSON In your case, either leave out the "add" level or add a "doc" level below it. For example: curl http://localhost:8983/solr/update/json -H 'Content-type:application/json' -d ' { "add": {"doc": {"id" : "TestDoc1",

Re: Boosting on field empty or not

2012-05-14 Thread Jack Krupansky
In a query or filter query you can write +field:* to require that a field be populated or +(-field:*) to require that it not be populated -- Jack Krupansky -Original Message- From: Donald Organ Sent: Monday, May 14, 2012 2:10 PM To: solr-user Subject: Boosting on field empty or not I

Boosting on field empty or not

2012-05-14 Thread Donald Organ
Is there a way to boost a document based on whether the field is empty or not. I am looking to boost documents that have a specific field populated.

Re: Kernel methods in SOLR

2012-05-14 Thread Dmitry Kan
Peyman, Did you have a look at this? https://issues.apache.org/jira/browse/LUCENE-2959 the pluggable ranking functions. Can be a good starting point for you. Dmitry On Mon, Apr 23, 2012 at 7:29 PM, Peyman Faratin wrote: > Hi > > Has there been any work that tries to integrate Kernel methods [

Re: not getting expected results when doing a delta import via full import

2012-05-14 Thread geeky2
update on this: i also tried manipulating the timestamps in the dataimport.properties file to advance the date so that no records could be older than last_index_time example: #Mon May 14 12:42:49 CDT 2012 core1-model.last_index_time=2012-05-15 14\:38\:55 last_index_time=2012-05-15 14\:38\:55 ~

relative path for xsl:include failing in Solr 3.6

2012-05-14 Thread pramila_tha...@ontla.ola.org
Hi Everyone, I am sure some one might have encountered this problem. using xsl:include works when the file is in conf\xslt directory But fails if the file is at different location. The same thing worked for solr 1.4. Can someone share their experience, if they have encounteres this please. th

not getting expected results when doing a delta import via full import

2012-05-14 Thread geeky2
hello all, i am not getting the expected results when trying to set up delta imports according to the wiki documentation here: http://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport?highlight=%28delta%29|%28import%29 i have the following set up in my DIH, query="select [complic

Update JSON not working for me

2012-05-14 Thread Rajesh Jain
Hi, I am using the 4.x version of Solr, and following the UpdateJSON Solr Wiki 1. When I try to update using : curl 'http://localhost:8983/solr/update/json?commit=true' --data-binary @books.json -H 'Content-type:application/json' I don't see any Category as Books in Velocity based Solr Browser

Re: Solr Import Handler Custom Transformer not working

2012-05-14 Thread dboychuck
Thank you for your input. With your help I was able to solve my problem. Although I could find no good example of how to handle multivalued fields with a custom transformer online your comments helped me to find a solution. Here is the code that handles both multi-valued and single valued fields.

Re: Solr Shards multi core slower then single big core

2012-05-14 Thread arjit
Robert can you tell what you mean when you say "We do a lot of faceting so maybe that is why since facets can be built in parallel on different threads/cores". I am novice in solr. Can you tell me where Can i read about it ? Thanks , Arjit On Mon, May 14, 2012 at 8:54 PM, Robert Stewart [via Luc

Re: Getting payloads for matching term in search result

2012-05-14 Thread Jack Krupansky
The "stored" value of your payload field does in fact have the original payload value, albeit formatted as you have shown. Is that not sufficient? There doesn't appear to be any Solr support for returning term payload directly. I see a Jira issue for adding query support, but I don’t see a Ji

Re: Solr Shards multi core slower then single big core

2012-05-14 Thread Robert Stewart
We used to have one large index - then moved to 10 shards (7 million docs each) - parallel search across all shards, and we get better performance that way. We use a 40 core box with 128GB ram. We do a lot of faceting so maybe that is why since facets can be built in parallel on different thre

Re: Date format in the schema.xml

2012-05-14 Thread Jack Krupansky
At least in this case where dates have a precision of day, the total number of unique values should be relatively low (3,650 for a 10-year period or even 18,250 for a 50-year period), so precision step probably won't matter in this case much at all. The big benefit with tdate over old date here

RE: Relicating a large solr index

2012-05-14 Thread Rohit
Hi Erick, Yes I have enabled the following setting, internal 5000 1 Will try with higher timeouts. I tried scp command and the link didn’t break once, I was able to copy the entire 300Gb files, so am not too sure if this is a network problem. Regards, Rohit Mobile: +91

Re: Date format in the schema.xml

2012-05-14 Thread Bruno Mannina
Ok Thanks ! Le 14/05/2012 16:16, Ahmet Arslan a écrit : is it mandatory to use the date format -mm-ddThh:mm:ssZ ? Yes. I have a date with this format: mmdd in my xml source file. Where can I find more information, I found only these definitions in the schema.xml In schema.xml ther

Re: Relicating a large solr index

2012-05-14 Thread Erick Erickson
Have you tried modifying the timeout parameters? See: http://wiki.apache.org/solr/SolrReplication, the "Slave" section.. Best Erick On Mon, May 14, 2012 at 10:30 AM, Rohit wrote: > The size of index is about 300GB, I am seeing the following error in the > logs, > > java.net.SocketTimeoutExceptio

Re: Documents With large number of fields

2012-05-14 Thread Jack Krupansky
Indexing should be fine - depending on your total document count. I think the potential issue is the FieldCache at query time. I think it should be linear based on number of documents, fields, and unique terms per field for string values, so if you do two tests, index with 1,000 docs and then 2,

Re: Solr Shards multi core slower then single big core

2012-05-14 Thread Michael Della Bitta
Hi, all, I've been running into murmurs about this idea elsewhere: http://stackoverflow.com/questions/8698762/run-multiple-big-solr-shard-instances-on-one-physical-machine http://java.dzone.com/articles/optimizing-solr-or-how-7x-your?mz=33057-solr_lucene Michael On Mon, May 14, 2012 at 10:29 A

Re: slave index not cleaned

2012-05-14 Thread Bill Bell
This is a known issue in 1.4 especially in Windows. Some of it was resolved in 3x. Bill Bell Sent from mobile On May 14, 2012, at 5:54 AM, Erick Erickson wrote: > Hmmm, replication will require up to twice the space of the > index _temporarily_, just checking if that's what you're seeing

Re: Kernel methods in SOLR

2012-05-14 Thread Otis Gospodnetic
Hi Peyman, I never saw this mentioned on Lucene/Solr MLs, so if anyone has done any work on this, I don't think it was shared. Otis  Performance Monitoring for Solr / ElasticSearch / HBase - http://sematext.com/spm  > > From: Peyman Faratin >To: solr-use

Re: Documents With large number of fields

2012-05-14 Thread Otis Gospodnetic
Nitin, I meant to reply, but I think the thing to watch out for are Lucene segment merges.  I think this is another thing I saw in a client engagement where the client had a crazy number of fields.  If I recall correctly, it was segment merges that were painfully slow. So try creating a n

RE: Relicating a large solr index

2012-05-14 Thread Rohit
The size of index is about 300GB, I am seeing the following error in the logs, java.net.SocketTimeoutException: Read timed out at java.net.SocketInputStream.socketRead0(Native Method) at java.net.SocketInputStream.read(SocketInputStream.java:129) at java.io.BufferedInputStr

Re: Solr Shards multi core slower then single big core

2012-05-14 Thread Otis Gospodnetic
Hi Kuli, As long as there are enough CPUs with spare cycles and disk IO is not a bottleneck, this works faster.  This was 12+ months ago. Otis  Performance Monitoring for Solr / ElasticSearch / HBase - http://sematext.com/spm  > > From: Michael Kuhlmann

RE: Documents With large number of fields

2012-05-14 Thread Keswani, Nitin - BLS CTR
Unfortunately I never got any response. However I did a POC with a Document containing 400 fields and loaded around 1000 docs to my local machine. I didn’t see any issue but then again the document set was very small. Hopefully as mentioned below providing enough memory should help alleviate any

Re: Solr Shards multi core slower then single big core

2012-05-14 Thread Michael Kuhlmann
Am 14.05.2012 16:18, schrieb Otis Gospodnetic: Hi Kuli, In a client engagement, I did see this (N shards on 1 beefy box with lots of RAM and CPU cores) be faster than 1 big index. I want to believe you, but I also want to understand. Can you explain why? And did this only happen for single

Re: Solr Shards multi core slower then single big core

2012-05-14 Thread Otis Gospodnetic
Hi Kuli, In a client engagement, I did see this (N shards on 1 beefy box with lots of RAM and CPU cores) be faster than 1 big index. Otis  Performance Monitoring for Solr / ElasticSearch / HBase - http://sematext.com/spm  > > From: Michael Kuhlmann >To:

Re: Relicating a large solr index

2012-05-14 Thread Otis Gospodnetic
Rohit, Sure.  You can set up old style Solr index replication (uses rsync - all documented on the Wiki) and then enable Java replication. Of course, you could also try sharing more details about how your current replication breaks and maybe people can help you fix that so you don't have to use

Re: Date format in the schema.xml

2012-05-14 Thread Ahmet Arslan
> is it mandatory to use the date format -mm-ddThh:mm:ssZ > ? Yes. > I have a date with this format: > mmdd > in my xml source file. > > Where can I find more information, I found only these > definitions in the schema.xml In schema.xml there is a xml comment about dates, starting with

Date format in the schema.xml

2012-05-14 Thread Bruno Mannina
Dear, is it mandatory to use the date format -mm-ddThh:mm:ssZ ? I have a date with this format: mmdd in my xml source file. Where can I find more information, I found only these definitions in the schema.xml positionIncrementGap="0"/> positionIncrementGap="0"/> Could you explain m

Getting payloads for matching term in search result

2012-05-14 Thread s . hermann
Good day currently I have a field defined as can be seen below: Basically the content for that field has the following form: "Wiedersehn|x1062y1755 macht|x1340y1758 Freude|x1502y1758" where the stuff after the pipe is the payload data (some coordinates). What I want i

Re: New scoring models in LUCENE/SOLR (LUCENE-2959)

2012-05-14 Thread Erick Erickson
See LUCENE-2959 is relevant, but can't link there right now, the site is under maintenance. Bottom line: This has already been done in trunk although I confess I haven't made use of it. It is NOT in any of the 3.x code however. Best Erick On Mon, May 14, 2012 at 8:40 AM, ilay raja wrote: > Hi >

Re: New scoring models in LUCENE/SOLR (LUCENE-2959)

2012-05-14 Thread ilay raja
Hi I was going through flexscoring implementations of lucene to experiment the new scoring models. I am into structured search where I feel BM25F is more relevant to use/experiement with. But there is no implementation of BM25FSimilarity .. I have also come across BM25F implemenatation i

Re: Lucene FieldCache doesn' get cleaned up and OOM occurs

2012-05-14 Thread Erick Erickson
Patches welcome ... But yeah, the fieldCache is pretty much controlled by Lucene. I rather doubt there's a lot of interest in flushing it as you're asking, just because of the concentration on making Solr/Lucene _fast_. Especially if you think about running 10 simultaneous queries sorting on the 1

Re: Relicating a large solr index

2012-05-14 Thread Erick Erickson
What do your logs show? Solr replication should be robust. How large is "large"? You might review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Mon, May 14, 2012 at 3:11 AM, Rohit wrote: > Hi, > > > > I have a large solr index which needs to be replicated, solr replication > star

Re: searching when in a solr-component?

2012-05-14 Thread Erick Erickson
I think something like SolrQueryRequest.getCore().getSearcher() does what you want Best Erick On Fri, May 11, 2012 at 5:46 PM, Paul Libbrecht wrote: > Hello SOLR experts, > > can I see the same index while responding another query? > If yes how? > > thanks in advance > > Paul

Re: Editing long Solr URLs - Chrome Extension

2012-05-14 Thread Erick Erickson
Cool! On Fri, May 11, 2012 at 10:56 AM, Jan Høydahl wrote: > I've been testing > https://chrome.google.com/webstore/detail/mbnigpeabbgkmbcbhkkbnlidcobbapff?hl=en > but I don't think it's great. > > Great work on this one. Simple and straight forward. A few wishes: > * Sticky mode? This tool wou

Re: Lucene FieldCache doesn' get cleaned up and OOM occurs

2012-05-14 Thread Mathias Hodler
Hi Erick, I'm sorting on 10 different fields (string, date and floats) with 90% - 100% unique values and 50k indexed documents. You're right - cleaning the cache would slow down the next queries until the fields are cached again. But it would be nice to run Solr on systems with limited memory and

Re: How detect slave replication termination

2012-05-14 Thread Erick Erickson
What is it you're trying to accomplish with this knowledge? I _think_ that you can use the HTTP request indexversion here: http://wiki.apache.org/solr/SolrReplication#HTTP_API and I _think_ this is not updated until after replication... It should match that of the master. Best Erick On Fri, May

Re: Solr Shards multi core slower then single big core

2012-05-14 Thread Michael Kuhlmann
Am 14.05.2012 13:22, schrieb Sami Siren: Sharding is (nearly) always slower than using one big index with sufficient hardware resources. Only use sharding when your index is too huge to fit into one single machine. If you're not constrained by CPU or IO, in other words have plenty of CPU cores

Re: slave index not cleaned

2012-05-14 Thread Erick Erickson
Hmmm, replication will require up to twice the space of the index _temporarily_, just checking if that's what you're seeing But that should go away reasonably soon. Out of curiosity, what happens if you restart your server, do the extra files go away? But it sounds like your index is growing o

Re: Multi-words synonyms matching

2012-05-14 Thread elisabeth benoit
Just for the record, I'd like to conclude this thread First, you were right, there was no behaviour difference between fq and q parameters. I realized that: 1) my synonym (hotel de ville) has a stopword in it (de) and since I used tokenizerFactory="solr.KeywordTokenizerFactory" in my synonyms de

Re: Lucene FieldCache doesn' get cleaned up and OOM occurs

2012-05-14 Thread Erick Erickson
But consider what would happen if the cache was cleaned up the next query in would require that the terms be re-loaded. I guess it's possible that some people would be willing to sacrifice speed in constrained situations... Meanwhile, you have two options 1> increase memory 2> sort on fewer un

Re: Solr Import Handler Custom Transformer not working

2012-05-14 Thread Erick Erickson
Nothing jumps out on a quick look. So I'd try a couple of things: 1> you can debug this via "remote debugging" in an IDE, see if your code is a> reached and b> does what you expect. 2> Look at your logs. Anything coming out that's unexpected? 3> Try some simple logging, maybe dump out the docu

Multicore file handling

2012-05-14 Thread Aleksander Akerø
Hi I’m having problems with the file handling when using multicore setup in solr 3.6. The same issue that is described in Solr-1894 . In Jira it says that it has been fixed in 3.1, doesn’t that mean that it should work also for 3.6? The prob

Re: Solr Shards multi core slower then single big core

2012-05-14 Thread Sami Siren
> Sharding is (nearly) always slower than using one big index with sufficient > hardware resources. Only use sharding when your index is too huge to fit > into one single machine. If you're not constrained by CPU or IO, in other words have plenty of CPU cores available together with for example se

Re: Solr Shards multi core slower then single big core

2012-05-14 Thread Michael Kuhlmann
Am 14.05.2012 05:56, schrieb arjit: Thanks Erick for the reply. I have 6 cores which doesn't contain duplicated data. every core has some unique data. What I thought was when I read it would read parallel 6 cores and join the result and return the query. And this would be efficient then reading o

Hunspell stemmer solr 3.4

2012-05-14 Thread search engn dev
I am currently using solr 3.4 in my application, Currently i cant upgrade solr to 3.5 due to some problem. I want to use hunspell stemmer in solr 3.4 , for doing this which all changes i need to make. -- View this message in context: http://lucene.472066.n3.nabble.com/Hunspell-stemmer-solr-3-4-tp

Re: How add custom field to Nutch1.4?

2012-05-14 Thread Markus Jelsma
Please ask Nutch related questions only on the Nutch users mailing list. Thanks. On Sun, 13 May 2012 20:18:37 -0700 (PDT), forwardswing wrote: who can help me ? -- View this message in context: http://lucene.472066.n3.nabble.com/How-add-custom-field-to-Nutch1-4-tp3983549p3983597.html Sent f

Re: Show a portion of searchable text in Solr

2012-05-14 Thread Ahmet Arslan
> I have indexed very large documents, In some cases these > documents has > 100.000 characters. Is there a way to return a portion of > the documents > (lets say the 300 first characters) when i am querying > "Solr"?. Is there any > attribute to set in the schema.xml or solrconfig.xml to > achieve

Relicating a large solr index

2012-05-14 Thread Rohit
Hi, I have a large solr index which needs to be replicated, solr replication start but then keeps breaking and starting from 0. Is there another way to achieve this, I was thinking of using scp to copy the index from master to slave and then enable replication, will this work? Reg

Re: Index Corruption

2012-05-14 Thread Shubham Srivastava
I am using 3.5 . - Original Message - From: Lance Norskog [mailto:goks...@gmail.com] Sent: Monday, May 14, 2012 11:08 AM To: solr-user@lucene.apache.org Subject: Re: Index Corruption "Index corruption" usually means data structure problems. There is a Lucene program 'org.apache.lucene.

  1   2   >