Re: how to boost exact match

2012-08-10 Thread Li Li
Create a field for exact match. It is an optional boolean clause. On 2012-8-11 at 1:42 PM, "abhayd" wrote: > hi > > I have documents like > iphone 4 - white > iphone 4s - black > ipone4 - black > > when user searches for iphone 4 i would like to show iphone 4 docs first > and > iphone 4s after that. > Simil
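The reply above suggests a separate exact-match field used as an optional boolean clause. A minimal sketch of how such a query string could be assembled, assuming two hypothetical fields: `name` (normally analyzed) and `name_exact` (indexed so that only the whole value matches). Neither field name nor the boost value of 10 comes from the thread.

```python
import re

# Characters that have special meaning in the Lucene query syntax.
_SPECIAL = re.compile(r'([+\-&|!(){}\[\]^"~*?:\\/])')


def escape_terms(text):
    """Backslash-escape Lucene query syntax characters in free text."""
    return _SPECIAL.sub(r'\\\1', text)


def build_exact_boost_query(user_input, text_field="name",
                            exact_field="name_exact", boost=10):
    """Required clause on the analyzed field plus an optional, boosted
    clause on the exact-match field (field names are hypothetical)."""
    text = user_input.strip()
    # Inside a quoted phrase only backslash and double quote need escaping.
    phrase = text.replace("\\", "\\\\").replace('"', '\\"')
    required = f"+{text_field}:({escape_terms(text)})"
    optional = f'{exact_field}:"{phrase}"^{boost}'
    return f"{required} {optional}"


print(build_exact_boost_query("iphone 4"))
```

Because the second clause is optional, documents that match only the analyzed field are still returned; documents that also match the exact field simply score higher.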

how to boost exact match

2012-08-10 Thread abhayd
hi I have documents like iphone 4 - white iphone 4s - black ipone4 - black when user searches for iphone 4 i would like to show iphone 4 docs first and iphone 4s after that. Similarly when user is searching for iphone 4s i would like to show iphone 4s docs first then iphone 4 docs. At present i

Re: termFrequncy off and still use fastvector highlighter?

2012-08-10 Thread abhayd
that helps -- View this message in context: http://lucene.472066.n3.nabble.com/termFrequncy-off-and-still-use-fastvector-highlighter-tp3998590p4000575.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Tuning caching of geofilt queries

2012-08-10 Thread Lance Norskog
In other computations I found exactly zero performance difference between floats & doubles, even with long arrays of numbers, which you would expect to be sensitive to locality effects. On Fri, Aug 10, 2012 at 11:20 AM, David Smiley (@MITRE.org) wrote: > Yeah it is... I rather like this write-up: > ht

Re: Solr 4.0 schedule

2012-08-10 Thread Jack Krupansky
The beta appears imminent, unless somebody suggests that there are problems with the current RC. You can try it here: http://people.apache.org/~rmuir/staging_area/lucene-solr-4.0bRC0-rev1370099/solr/ The official Solr 4.0 wiki (a little outdated) is here: http://wiki.apache.org/solr/Solr4.0 As

Solr 4.0 schedule

2012-08-10 Thread sausarkar
Does anyone have any clue when the beta version of Solr 4.0 will be released? Also, is there any timeframe for the first GA release of Solr 4.0?

Re: Updateing Solr

2012-08-10 Thread Chris Collins
On Aug 10, 2012, at 3:05 PM, Nana wrote: > Hi > > Thanks for you answer. > My Solr running under Tomcat, do you mean I have to restart Solr for > updating? > > -Nader > > > > On Fri, Aug 10, 2012 at 2:50 PM, Jack Krupansky-2 [via Lucene] < > ml-node+s472066n4000471...@n3.nabble.com> wrote:

Re: Updateing Solr

2012-08-10 Thread Jack Krupansky
No, I meant that you must resend any new or changed documents to Solr, the same way you sent documents to Solr initially. Were scripts or shell/curl commands or data import handler (DIH) or custom Java programs used for the initial load of data into Solr? -- Jack Krupansky -Original Mess

Re: Updateing Solr

2012-08-10 Thread Nana
Hi Thanks for your answer. My Solr is running under Tomcat; do you mean I have to restart Solr for updating? -Nader On Fri, Aug 10, 2012 at 2:50 PM, Jack Krupansky-2 [via Lucene] < ml-node+s472066n4000471...@n3.nabble.com> wrote: > Solr itself doesn't crawl or re-crawl - you have to do that exter

Distributed Searching + unique Ids

2012-08-10 Thread Eric Khoury
hey guys, the spec mentions the following: The unique key field must be unique across all shards. If docs with duplicate unique keys are encountered, Solr will make an attempt to return valid results, but the behavior may be non-deterministic. I'm actually looking to dupli

Re: Documentation on the new updateLog transaction log feature?

2012-08-10 Thread Yonik Seeley
On Fri, Aug 10, 2012 at 2:31 PM, David Smiley (@MITRE.org) wrote: > Is there any documentation on the updateLog transaction log feature in Solr > 4? Not much beyond what's in solrconfig.xml > I started a quick prototype using Solr 4 alpha with a fairly structured > schema; no big text. I disabl

Documentation on the new updateLog transaction log feature?

2012-08-10 Thread David Smiley (@MITRE.org)
Is there any documentation on the updateLog transaction log feature in Solr 4? I started a quick prototype using Solr 4 alpha with a fairly structured schema; no big text. I disabled auto-commit which came pre-enabled and there's no soft-commit either. With curl I posted a 1.8GB CSV file. After

Re: Tuning caching of geofilt queries

2012-08-10 Thread David Smiley (@MITRE.org)
Yeah it is... I rather like this write-up: https://sites.google.com/site/trescopter/Home/concepts/required-precision-for-gps-calculations#TOC-Precision-of-Float-and-Double -- which also arrives at 2.37m worst case. Aside from RAM savings, I wonder if there is any noticeable performance differenc

Tabs with Apache Solr using Drupal 7

2012-08-10 Thread Valentin, AJ
Hello Solr Experts, Does anyone know how to activate tabs (or scope bar) for the Solr search mechanism within Drupal 7? Thank you.

Re: Tuning caching of geofilt queries

2012-08-10 Thread Yonik Seeley
On Fri, Aug 10, 2012 at 1:47 PM, David Smiley (@MITRE.org) wrote: > Information I've read vary on exactly what is the accuracy of float > vs double but at a kilometer there's no question a double is overkill. Back of the envelope: 23 mantissa bits + 1 implied bit == 24 effective mantissa bits in
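The back-of-the-envelope reasoning about 24 effective mantissa bits can be checked numerically. A rough sketch, assuming coordinates are stored in degrees, that one degree is about 111.32 km (an approximation that only holds for latitude, or for longitude at the equator), and that the value is positive and finite; other write-ups make different assumptions and arrive at different figures.

```python
import struct

# Assumed rough length of one degree, in meters (not from the thread).
METERS_PER_DEGREE = 111_320.0


def float32_spacing(value):
    """Gap between `value` (rounded to float32) and the next larger
    float32. Only meaningful for positive, finite values."""
    bits = struct.unpack("<I", struct.pack("<f", value))[0]
    current = struct.unpack("<f", struct.pack("<I", bits))[0]
    following = struct.unpack("<f", struct.pack("<I", bits + 1))[0]
    return following - current


def worst_case_rounding_error_m(degrees):
    """Half the float32 spacing at `degrees`, converted to meters."""
    return float32_spacing(degrees) / 2 * METERS_PER_DEGREE


print(float32_spacing(180.0))              # spacing in degrees near 180
print(worst_case_rounding_error_m(180.0))  # rounding error in meters
```

Near 180 degrees the float32 spacing is 2^-16 of a degree, so under these assumptions the rounding error per coordinate stays below a meter, far inside a 1 km tolerance.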

Re: Tuning caching of geofilt queries

2012-08-10 Thread David Smiley (@MITRE.org)
Chris's response is quite good, and I have a couple things to add: 1. Since you can tolerate 1km slop, try defining the dynamic field *_coordinate as tfloat instead of tdouble. This will halve your memory requirements, but I'm not sure if it will be any faster -- it's worth a shot since you've al

Re: DataImportHandler WARNING: Unable to resolve variable

2012-08-10 Thread Jon Drukman
That column does not allow NULL. It's definitely an empty string, but I'm using MySQL IF() to catch it and make sure it always has something. On Thu, Aug 9, 2012 at 8:45 PM, Swati Swoboda wrote: > Ah, my bad. I was incorrect - it was not actually indexing. > > @Jon - is there a possibility that

Re: Connect to SOLR over socket file

2012-08-10 Thread Jon Drukman
On Fri, Aug 10, 2012 at 2:44 AM, Jason Axelson wrote: > You're correct that there is an underlying problem I'm trying to > solve. The underlying problem is that due to the security policies I > cannot run another service that listens on a TCP port, but a unix > domain socket would be okay. It look

Re: AW: Indexing wildcard patterns

2012-08-10 Thread Jack Krupansky
"Doc1 has the pattern "AB%CD%" associated with it (somehow?!)." You need to clarify what you mean by that. To be clear, Solr support for wildcards is a superset of the SQL LIKE operator, and the patterns used in the LIKE operator are NOT stored in the table data, but used at query time - same

Re: Tlog vs. buffer + softcommit.

2012-08-10 Thread Bing Hua
I remember I did set the 15sec autocommit and still saw the Tlogs growing unboundedly. But it sounds like theoretically they should not if I index at a constant rate. I'll probably try it again sometime. For the peersync, I think solr cloud now uses push-replication over pull. Hmm, it makes sense to ke

Re: Tlog vs. buffer + softcommit.

2012-08-10 Thread Yonik Seeley
On Fri, Aug 10, 2012 at 11:19 AM, Bing Hua wrote: > Thanks for the information. It definitely helps a lot. There're > numDeletesToKeep = 1000; numRecordsToKeep = 100; in UpdateLog so this should > probably be what you're referring to. > > However when I was doing indexing the total size of TLogs k

Re: Tlog vs. buffer + softcommit.

2012-08-10 Thread Bing Hua
Thanks for the information. It definitely helps a lot. There're numDeletesToKeep = 1000; numRecordsToKeep = 100; in UpdateLog so this should probably be what you're referring to. However when I was doing indexing the total size of TLogs kept on increasing. It doesn't sound like the case where the

Re: Updating a field on a document on index

2012-08-10 Thread Jack Krupansky
One simple example is in the testUpdateField method of: http://svn.apache.org/viewvc/lucene/dev/trunk/solr/solrj/src/test/org/apache/solr/client/solrj/SolrExampleTests.java It creates a HashMap with a "set" command, and then adds that Map as the value for the field HashMap oper = new HashM
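The same "set" idea can be expressed in Solr 4's JSON update format, where the field value is a map whose key is the operation. A minimal sketch that only builds the request body; the document id `doc1`, the field `price`, and the assumption that the body is later posted to the core's update handler as JSON are illustrative, not taken from the message.

```python
import json


def atomic_set_payload(doc_id, field, value, id_field="id"):
    """Build a JSON body that asks Solr to replace one field's value.

    The field value is wrapped in {"set": ...}, mirroring the Map that
    the SolrJ test adds as the field value.
    """
    document = {id_field: doc_id, field: {"set": value}}
    return json.dumps([document])


# Hypothetical document id and field name, for illustration only.
print(atomic_set_payload("doc1", "price", 100))
```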

RE: multi-searching problem

2012-08-10 Thread Ahmet Arslan
> It seems more complicate than i > need. > I just want, if the user specify nothing, to search in all > my fields that I declared in my schema.xml > like that : > article_nom > but not only article_nom but all fields. > There should be some simple way to do that without using all > of this..? > O

RE: solrcloud and facet_pivot

2012-08-10 Thread Markus Jelsma
This may be your problem: https://issues.apache.org/jira/browse/SOLR-2894 -Original message- > From:giovanni.bricc...@banzai.it > Sent: Fri 10-Aug-2012 16:08 > To: solr-user@lucene.apache.org > Subject: solrcloud and facet_pivot > > I'm moving my first steps with solr4 alpha and solr

solrcloud and facet_pivot

2012-08-10 Thread giovanni.bricc...@banzai.it
I'm taking my first steps with Solr 4 alpha and SolrCloud, and I'm having trouble with the facet.pivot parameter. Starting Solr without cloud mode, I am able to use this parameter in some queries: ...facet.pivot=pidlevel_0,pidlevel_1,pidlevel_2,pidlevel_3 obtaining something like this ..."facet_pivot"

RE: solr indexing problem

2012-08-10 Thread Videnova, Svetlana

Re: solr indexing problem

2012-08-10 Thread Gora Mohanty
On 10 August 2012 14:35, Videnova, Svetlana wrote: > > This is schema.xml As requested, please also share the DIH configuration file, data-config.xml. Regards, Gora

Re: multi-searching problem

2012-08-10 Thread Michael Della Bitta
Either you need to use dismax/edismax or have a catch-all text field and copyField directives that copy the contents of your other fields into it, and make that the default search field. Michael Della Bitta Appinions | 18 East 41st St., Suite 1806
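The first option (dismax/edismax) comes down to passing the list of fields in the `qf` request parameter. A small sketch of the request parameters, using the two field names that appear elsewhere in this thread (`article_id`, `article_nom`); the search term and the idea of sending these to the select handler are assumptions.

```python
from urllib.parse import urlencode


def edismax_params(user_query, fields):
    """Request parameters that search several fields at once via edismax."""
    return {
        "q": user_query,
        "defType": "edismax",    # choose the extended dismax query parser
        "qf": " ".join(fields),  # space-separated list of fields to search
    }


# Field names come from the thread; the search term is made up.
print(urlencode(edismax_params("solr", ["article_id", "article_nom"])))
```

The same parameters can also be set as defaults on a request handler in solrconfig.xml, so that clients only need to send `q`.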

Re: Updateing Solr

2012-08-10 Thread Jack Krupansky
Solr itself doesn't crawl or re-crawl - you have to do that external to Solr. How did you index the data originally? You simply need to perform those steps again, hopefully via a script that can be re-executed to refresh Solr whenever the external data has changed. -- Jack Krupansky -Orig

Re: Designing an index with multiple entity types, sharing field names across entity-types.

2012-08-10 Thread Erick Erickson
I'd recommend you ignore search speed for the time being. First, your index isn't that large from what you've described. I see clients with 40-50M documents on a single machine (admittedly with some iron under the hood)... Instead, I'd concentrate on designing the best user experience I could, and

FieldCollapsing/Grouping

2012-08-10 Thread mechravi25
Hi, I am planning to use the Field Collapsing/Grouping feature in Solr 3.6.1. I found the details on the Solr wiki page. I have a question about it: can we change the way the groups are formed? If so, can you please guide me? I have a scenario wh

RE: multi-searching problem

2012-08-10 Thread Videnova, Svetlana
It seems more complicated than I need. I just want, if the user specifies nothing, to search in all the fields that I declared in my schema.xml, like this: article_nom, but not only article_nom: all fields. There should be some simple way to do that without using all of this? Or am I wrong? -

Re: Solr index storage strategy on FileSystem

2012-08-10 Thread Erick Erickson
You probably want a mergepolicy, here's a good blog: http://juanggrande.wordpress.com/2011/02/07/merge-policy-internals/ Best Erick On Tue, Aug 7, 2012 at 5:24 PM, Bing Hua wrote: > Hi folks, > > With StandardDirectoryFactory, index is stored under data/index in forms of > frq, tim, tip and a f

AW: Indexing wildcard patterns

2012-08-10 Thread Lochschmied, Alexander
Thank you Toke, your comments made a lot of sense to me. Luckily we do not have many patterns and we just decided to consider only the prefixes up to the first wildcard. So we will no longer have to deal with patterns. Alexander -Original Message- From: Toke Eskildsen [mailto:t...@
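The decision described above, keeping only the prefix up to the first wildcard, is simple to express in code. A sketch assuming the two wildcard characters from the original question (`%` and `_`); the function name is made up for illustration.

```python
def literal_prefix(pattern, wildcards="%_"):
    """Return the part of a LIKE-style pattern before its first wildcard."""
    for index, char in enumerate(pattern):
        if char in wildcards:
            return pattern[:index]
    return pattern  # no wildcard: the whole pattern is literal


for pattern in ("AB%CD", "AB%CD%", "AB_CD"):
    print(pattern, "->", literal_prefix(pattern))
```

All three example patterns reduce to the prefix `AB`, which could then be indexed as an ordinary value and matched with a prefix query, at the cost of being less selective than the full pattern.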

Re: Stopping replication?

2012-08-10 Thread Erick Erickson
Thanks for wrapping that up. Erick On Mon, Aug 6, 2012 at 2:52 PM, csscouter wrote: > Erick, > > Thank you for the courtesy of your reply. > > I was able to figure out the problem, and for the benefit of the list, I > list the analysis. Judging by the caliber of those on this list, this is >

Re: Indexing wildcard patterns

2012-08-10 Thread Toke Eskildsen
On Fri, 2012-08-10 at 10:07 +0200, Lochschmied, Alexander wrote: > Coming from a SQL database based search system, we already have a set of > defined patterns associated with our searchable documents. > > % matches no or any number of characters > _ matches one character > > Example: > Doc 1: 'A

Updateing Solr

2012-08-10 Thread Nana
I have OpenCMS on one of my servers and use Solr to search its index files. Each night the OpenCMS index files are updated, but Solr still uses the old index files. How can I configure Solr to be updated shortly after the index files are updated?

Re: AW: Indexing wildcard patterns

2012-08-10 Thread Ahmet Arslan
> So in the example I provided below, a user might enter " > ABCDXYZ " and I need Solr to return Doc1, as Doc1 has the > pattern "AB%CD%" associated with it (somehow?!). OK understood now. You can use Lucene's MemoryIndex for this. http://lucene.apache.org/core/3_6_1/api/contrib-memory/org/apache

Re: multi-searching problem

2012-08-10 Thread Ahmet Arslan
> I couldn't find none tutorials who explain properly how to > use dismix :s > Can somebody please explain here, or maybe give to me better > solution of my multisearching base problem? http://searchhub.org/dev/2010/05/23/whats-a-dismax/ Plus example solrconfig.xml has an example. Search for keyw

Re: Connect to SOLR over socket file

2012-08-10 Thread Jason Axelson
You're correct that there is an underlying problem I'm trying to solve. The underlying problem is that due to the security policies I cannot run another service that listens on a TCP port, but a unix domain socket would be okay. It looks like I might have to go with mysql full-text search or someth

multi-searching problem

2012-08-10 Thread Videnova, Svetlana
Currently I'm using this in order to search: ### article_id article_nom ### But I would like to be able to add more parameters to my defaultSearchField. I saw in some forums that it is not possible with defaultSearchField to do multi requests and add parameters, but that I can use a dismax request. I couldn't fin

AW: Indexing wildcard patterns

2012-08-10 Thread Lochschmied, Alexander
I thought my question might be confusing... I know about Solr providing wildcards in queries, but my problem is different. I have those patterns associated with my searchable documents before any actual search is done. I need Solr to return the document which is associated with matching patterns

RE: solr indexing problem

2012-08-10 Thread Videnova, Svetlana
This is schema.xml ###

Re: Connect to SOLR over socket file

2012-08-10 Thread Michael Kuhlmann
On 10.08.2012 04:42, Jason Axelson wrote: Hi Michael, Thanks for the information. Unfortunately I'm having a hard time finding any servlet containers that can serve over a unix domain socket. Also it looks like EmbeddedSolr won't work since I am not writing the application in Java (it's in Ruby

Re: solr indexing problem

2012-08-10 Thread Gora Mohanty
On 10 August 2012 14:09, Videnova, Svetlana wrote: > Hi everybody, someone can help me please with this: > > > > I have a problem to index the part named "contenusection". > > > -Data-config.xml : > > > Section.id Is ok => It is a little difficult to understand what you are saying (please shar

solr indexing problem

2012-08-10 Thread Videnova, Svetlana
Hi everybody, can someone please help me with this? I have a problem indexing the part named "contenusection". -Data-config.xml : Section.id Is ok => -Schema.xml In the Schema browser section_id is filled, but after reindexing, which completes, there is nothing in contenusect

Re: Indexing wildcard patterns

2012-08-10 Thread Ahmet Arslan
--- On Fri, 8/10/12, Lochschmied, Alexander wrote: > From: Lochschmied, Alexander > Subject: Indexing wildcard patterns > To: "solr-user@lucene.apache.org" > Date: Friday, August 10, 2012, 11:07 AM > Coming from a SQL database based > search system, we already have a set of defined patterns

Indexing wildcard patterns

2012-08-10 Thread Lochschmied, Alexander
Coming from a SQL database based search system, we already have a set of defined patterns associated with our searchable documents. % matches no or any number of characters _ matches one character Example: Doc 1: 'AB%CD', 'AB%CD%' Doc 2: 'AB_CD' ... Thus Doc 1 matches ABXYZCD ABCD ABCDXYZ ...
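To make the pattern semantics concrete, here is a plain-Python sketch (not a Solr feature) that translates the two wildcards into a regular expression and reports which documents have a matching pattern. The document names and patterns are the ones from the message; the function names are made up.

```python
import re


def like_to_regex(pattern):
    """Translate a LIKE-style pattern: % = any run of characters
    (including none), _ = exactly one character."""
    parts = []
    for char in pattern:
        if char == "%":
            parts.append(".*")
        elif char == "_":
            parts.append(".")
        else:
            parts.append(re.escape(char))
    return re.compile("".join(parts), re.DOTALL)


def matching_docs(value, doc_patterns):
    """Names of documents with at least one pattern matching `value`."""
    return [
        doc
        for doc, patterns in doc_patterns.items()
        if any(like_to_regex(p).fullmatch(value) for p in patterns)
    ]


DOC_PATTERNS = {"Doc 1": ["AB%CD", "AB%CD%"], "Doc 2": ["AB_CD"]}

for value in ("ABXYZCD", "ABCD", "ABCDXYZ", "ABXCD"):
    print(value, "->", matching_docs(value, DOC_PATTERNS))
```

With these patterns, `ABXYZCD`, `ABCD` and `ABCDXYZ` match only Doc 1, while `ABXCD` matches both documents. Scanning every stored pattern per query is linear in the number of patterns, which is fine for a small set but is the reason the thread looks for an index-based alternative.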