By all means, please go ahead. Contributions are always welcome :)
On Fri, May 30, 2008 at 12:56 AM, Jonathan Ariel <[EMAIL PROTECTED]> wrote:
> Ok. So I have a version of solr with a small modification to the SimpleFacet
> class where you can send a parameter to tell that you want some more info.
I see. That patch is not in Lucene yet, and it looks like *nobody* voted for
it. If you like it, please vote for it.
Personally seeing a mention of higher memory usage in that patch's javadoc
worries me a little large index, lots of docs, lots of memory..
Otis
--
Sematext -- http://semate
Right, the only actively developed Solr client is really Solrj. All other ones
are not well maintained - I don't recall seeing any patches for any of them in
the recent months. In other words, if your Python client offers everything
(and more) that the reptile client in Solr's repo offers, tha
Hi,
I am using function queries to rank the results,
if some/ allthe fields (used in the function ) are missing from the document
what will be the ranking behavior for such documents?
thanks
-umar
Sorry I forgot to mention that.
http://wiki.apache.org/solr/DataImportHandler#head-a6916b30b5d7605a990fb03c4ff461b3736496a9
--Noble
On Fri, May 30, 2008 at 11:37 AM, Shalin Shekhar Mangar
<[EMAIL PROTECTED]> wrote:
> You need to enable TemplateTransformer for your entity. For example:
>
>
> On Fr
You need to enable TemplateTransformer for your entity. For example:
On Fri, May 30, 2008 at 11:31 AM, Julio Castillo
<[EMAIL PROTECTED]> wrote:
> Noble,
> I tried the template setting for the "id" field, but I didn't notice any
> different behavior. I also didn't see where this would be reflecte
Noble,
I tried the template setting for the "id" field, but I didn't notice any
different behavior. I also didn't see where this would be reflected.
I looked at the fields and the debug output for the dataImporter and
couldn't see any reference to a modified id name (per the template
instructions).
Consider constructing the id concatenating an extra string for each
document . You can construct that field using the TeplateTransformer.
in the entity owners keep the id as
and in vets
or anything else which can make it unique
--Noble
On Fri, May 30, 2008 at 10:05 AM, Shalin Shekhar Mangar
That will happen only if id is the uniqueKey in Solr and the id coming
from both your tables have same values. In that case, they will
overwrite each other. You will need a separate uniqueKey (on other
than id field).
On Fri, May 30, 2008 at 6:34 AM, Julio Castillo <[EMAIL PROTECTED]> wrote:
> Tha
This comment for the benefit of who is using distributed search:
The protocol of communication has been xml for distributed search. For
a good part of 1.3.
It is now changed to a custom binary format (SOLR-486 ). So each shard
participating in a distributed search must be using the same protocol.
Looking further at the java error, those crashes are mostly related to GC.
VM_Operation (0x41b429e0): parallel gc failed allocation, mode:
safepoint, requested by thread 0x2aab1988c400
I'm following the
http://java.sun.com/javase/6/webnotes/trouble/TSG-VM/html/gbyzo.html
and see if
It's most likely a
1) hardware issue: bad memory
OR
2) incompatible libraries (most likely libc version for the JVM).
If you have another box around, try that.
-Yonik
On Thu, May 29, 2008 at 9:51 PM, Gaku Mak <[EMAIL PROTECTED]> wrote:
>
> Hi Yonik and others,
>
> I'm getting this java error af
On Thu, May 29, 2008 at 9:44 PM, Tim Christensen <[EMAIL PROTECTED]> wrote:
> Yonik,
>
> Thank you for the response. You are correct, regular (non-accessory)
> products are boosted '2.0' at index time. However both items the non ipod
> item and the ipod would have received the initial boost on the
Hi Yonik and others,
I'm getting this java error after switching to JVM 1.6.0_3. This error
occurs after the stress test has been going for a while and failed at 12K
docs level and at 18K again. Am I doing something wrong? Please help!
Thanks!
#
# An unexpected error has been detected by Jav
Yonik,
Thank you for the response. You are correct, regular (non-accessory)
products are boosted '2.0' at index time. However both items the non
ipod item and the ipod would have received the initial boost on the
same fields since they are both non-accessory items.
Is your comment still r
field norms of un-boosted fields are normally less than 1 (it's a
factor that weights larger fields less).
The index-time boost is also multiplied into this factor though.
Given that your first doc had a huge norm, it looks like the document
or field was boosted at index time?
-Yonik
On Thu, May
Hi,
This is my first post. I have been working with Lucene for about 4
weeks and Solr for just about 10 days. We are going to convert our
site search over to Solr as soon as we figure out some of the nuances.
As I was testing out the synonyms features to decide how we could best
use it, I
Thanks Shalin,
I tried putting everything under the same document (two different unrelated
entities), and got a bit further.
My problem now appears to be both of them stepping on each other due to "id"
conflicts. Currently my id is defined in my schema as
Do I have to create a new "id" field?
T
Thanks for that, I looked into fq and it will definatly help when I
drill into zip codes.
However I'm still having some issues, facet.prefix only got me so far
because sometimes the facet is the second word in the field.
Also I have another question with this example:
Company A
1
Car
a
The people working on Lucene are pretty smart, and this sort of
query optimization is a well-known trick, so I would not worry
about it.
A dozen years ago at Infoseek, we checked the count of matches
for each term in an AND, and evaluated the smallest one first.
If any of them had zero matches, we
Hi Yonik,
Thanks for your quick reply. I'm very new to the lucene source code.
Can you give me a little more detail explaination about this.
Do you think it will save some memory if docnum = find_match("A") >
docnum = find_match("B") and put B in the front of the AND query like "B
AND A AND C"? H
On Thu, May 29, 2008 at 4:05 PM, Yongjun Rong <[EMAIL PROTECTED]> wrote:
> I have a question about how the lucene query parser. For example, I
> have query "A AND B AND C". Will lucene extract all documents satisfy
> condition A in memory and then filter it with condition B and C?
No, Lucene will
Hi,
I have a question about how the lucene query parser. For example, I
have query "A AND B AND C". Will lucene extract all documents satisfy
condition A in memory and then filter it with condition B and C? or only
the documents satisfying "A AND B AND C" will be put into memory? Is
there any art
Ok. So I have a version of solr with a small modification to the SimpleFacet
class where you can send a parameter to tell that you want some more info.
It'll bring back a list with the max and min values as well as the SD, CV
and Mean for the facet values.
If you are interested I could generate a p
>(the first parameter of my request wasn't 'shards', and this produced the
bug)
Wrong. The problem was that I was pointing, in the 'shards' parameter, to a
Solr 1.2 installation (which is furthermore sharing a single index with the
new Solr 1.3)
2008/5/29 Grégoire Neuville <[EMAIL PROTECTED]>:
>
Hi Julio,
The first data-config is correct.
You're running DataImportHandler in debug mode which creates only the
first 10 documents by default. You can also add count=N to index only
the first N documents. But this is intended only for debugging
purposes. If you want to do a full-import just use
Otis Gospodnetic schrieb:
I just had a look at the demo and reeeally like it!
I didn't pay enough attention to this thread, though. Is the main concern that
by having a Solr search webapp that is really all in UI and uses your JS
library, the backend Solr server is directly exposed and thus s
Greg Ludington schrieb:
Building on a library like jQuery (which is a great lib) opens the door to
some hairy namespacing conflicts with existing libraries (prototype and moo,
for instance), or handcoded javascript that may exist on the current site.
This is actually one of the areas where jQ
Hi all,
I must now apologize ; the fault was entirely mine : I was shaping the Solr
interrogation URL the wrong way (the first parameter of my request wasn't
'shards', and this produced the bug). All is working fine now.
Thanks for your quick answers,
Grégoire.
2008/5/29 Noble Paul നോബിള് नो
On Thu, May 29, 2008 at 6:40 PM, Otis Gospodnetic
<[EMAIL PROTECTED]> wrote:
> I haven't been paying close attention to the uniformity of URL parameters,
> but if there is room for making them more uniform
> (e.g. always use singular, always use comma as a delimiting character, etc.)
> without hu
I have 2 dB tables unrelated to each other that I want to index.
I have tried 2 approaches for specifying them in my data-config.xml file.
None of them seem to work (it seems I can only get data for the first one
listed).
CASE 1)
CASE 2)
On Thu, May 29, 2008 at 12:22 PM, Rusli Ruslakall
<[EMAIL PROTECTED]> wrote:
> searched forever before posting and of course I found it shortly after :)
>
> Can use facet.prefix, beautiful!
You can also constrain both results and facets to any arbitrary query
via fq=myquery
-Yonik
> On Thu, May
Thanks for the hints.
I have been aware of the Collator. Actually a colleague of mine has written
a Collator based sorting Class for lucene. See:
https://issues.apache.org/jira/browse/LUCENE-943. This was almost 2 years
ago and I only wanted to know if there is already a solution in Solr 1.3 or
Lu
I have done something similar and I am using a search servlet that will
forward the request to solr tru commons htclient.
Maybe it could be a solution to DoS, although it is still possible.
Best.
-Cam Bazz
On Thu, May 29, 2008 at 8:04 PM, Otis Gospodnetic <
[EMAIL PROTECTED]> wrote:
> I just h
I just had a look at the demo and reeeally like it!
I didn't pay enough attention to this thread, though. Is the main concern that
by having a Solr search webapp that is really all in UI and uses your JS
library, the backend Solr server is directly exposed and thus somebody could
peek in the w
Wow. This is really pretty cool. You're much further along than I
thought you were! I'd love to see this in as an 'official' Solr client.
Thanks!
Matthew Runo
Software Developer
Zappos.com
702.943.7833
On May 29, 2008, at 8:15 AM, Matthias Epheser wrote:
The server was rebooted yesterday wit
I haven't been paying close attention to the uniformity of URL parameters, but
if there is room for making them more uniform (e.g. always use singular, always
use comma as a delimiting character, etc.) without hurting anything, I'm for it.
As for pysolr - I had a quick look the other day and saw
Hi,
I don't have a very concrete suggestion for this, but maybe this will lead you
in the right direction:
http://java.sun.com/javase/6/docs/api/java/text/Collator.html
http://java.sun.com/javase/6/docs/api/java/text/spi/CollatorProvider.html
You may also wish to bring this up on the Lucene jav
> Building on a library like jQuery (which is a great lib) opens the door to
> some hairy namespacing conflicts with existing libraries (prototype and moo,
> for instance), or handcoded javascript that may exist on the current site.
This is actually one of the areas where jQuery offers capabilit
Hi again,
searched forever before posting and of course I found it shortly after :)
Can use facet.prefix, beautiful!
On Thu, May 29, 2008 at 3:43 PM, Rusli Ruslakall
<[EMAIL PROTECTED]> wrote:
> Hi,
>
> I index something like this:
>
>
>Company A
>123
>456
>789
>
Hi,
I index something like this:
Company A
123
456
789
Company B
129
123
987
So I ONLY want to display all category names starting with '12' and
how many companies are in each one.
In this example it should output:
name count
Original-Nachricht
> Datum: Thu, 29 May 2008 09:36:37 -0500
> Von: Nik Krimm <[EMAIL PROTECTED]>
> An: "solr-user@lucene.apache.org"
> Betreff: Re: Announcement of Solr Javascript Client
> Hi Matthias:
> Glad to hear of your efforts. A couple of initial comments...
>
> I'm ca
Hi Matthias:
Glad to hear of your efforts. A couple of initial comments...
I'm cautious about your decision to build on top of jQuery.
My understanding is that you're planning to build a set of client-side
widgets that would be easily embeddable in an existing web-site.
Building on a librar
bram,
you'll want to look at the KeywordTokenizerFactory (which doesn't
actually tokenize), and then use the LowerCaseFilterFactory. the
schema in the example has a fieldType called 'alphaOnlySort' that
should get you started.
cheers,
rob
On Thu, May 29, 2008 at 6:21 AM, Bram de Jong <[EMAIL PR
hello all,
a while ago I was looking into making the schema for my (rather rich)
data set, and I wanted to have a field for username.
I would need a case insensitive string for that, but literal (no
tokenizing, ...).
how can I do this within the Solr schema definition?
- bram
--
http://freeso
Hello Eric (and others)
>>
>> repeating the parameter:
>> sort=field1,field2 desc,field3
>> but
>> facet.field=field1&facet.field=field2
>> This is pretty confusing to first-hand users! :-)
>
> Yeah, it is confusing. But we have to be careful with order. I don't
> believe you can rely on the or
No, I don't think it is possible to do that with one query. You'll
need to make two calls to Solr:
1. Without fq=type:B -- just to get all type facets
2. With fq to get the results.
On Thu, May 29, 2008 at 2:12 PM, Umar Shah <[EMAIL PROTECTED]> wrote:
> Hi,
>
> I have a problem wherein
> I have f
Hi,
I have a problem wherein
I have field 'type' which can have value A, B C,
I want to return facet count for each type but need only show one type of
result ( say with max count)
so if i have following counts
type:A = 300
type:B = 400
type:D = 100
I should only show type:B results (fq=type
Thanks for your answer
I nearly update my Solr everyday
It was always OK
But when I updated Solr 2 days ago , the errors came out
btw, I have set the heap size to 1G,but the problem remains
在08-5-28,Shalin Shekhar Mangar <[EMAIL PROTECTED]> 写道:
>
> Ok, I see you're getting a OutOfMemoryError
On Wed, May 28, 2008 at 11:41 PM, Alexander Ramos Jardim <
[EMAIL PROTECTED]> wrote:
> Well,
>
> One solution that I can see for this problem is having different indexes
> for
> each language.
>
>
In which way would that solve the sorting problem?
50 matches
Mail list logo