Re: WELCOME to solr-user@lucene.apache.org

2019-10-20 Thread Erick Erickson
In short, nothing that’s maintained as part of the Apache project. There may be 
commercial products, but I haven’t had occasion to look for one.

Best,
Erick

> On Oct 20, 2019, at 7:42 AM, Wasim S Kazi  wrote:
> 
> Good day
> 
> I would like to get some info or confirmation about configuring Solr 8+ to 
> get content from WCM (WebSphere Content Management).
> 
> Essentially, we have manually indexed data from WCM into Solr, and this all 
> works fine. We now want to automate this process, so we are checking whether 
> there is any well-established integration method between WCM and Solr. This 
> integration should allow content to be indexed automatically or periodically, 
> without human intervention.
> 
> Regards
> Wasim Kazi
> 
> -Original Message-
> From: solr-user-h...@lucene.apache.org 
> Sent: Sunday, October 20, 2019 2:39 PM
> To: Wasim S Kazi 
> Subject: WELCOME to solr-user@lucene.apache.org
> 
> Hi! This is the ezmlm program. I'm managing the solr-user@lucene.apache.org 
> mailing list.
> 
> I'm working for my owner, who can be reached at 
> solr-user-ow...@lucene.apache.org.
> 
> Acknowledgment: I have added the address
> 
>   wasim.s.k...@za.ey.com
> 
> to the solr-user mailing list.
> 
> Welcome to solr-user@lucene.apache.org!
> 
> Please save this message so that you know the address you are subscribed 
> under, in case you later want to unsubscribe or change your subscription 
> address.
> 
> 

RE: WELCOME to solr-user@lucene.apache.org

2019-10-20 Thread Wasim S Kazi
Good day

I would like to get some info or confirmation about configuring Solr 8+ to get 
content from WCM (WebSphere Content Management).

Essentially, we have manually indexed data from WCM into Solr, and this all 
works fine. We now want to automate this process, so we are checking whether 
there is any well-established integration method between WCM and Solr. This 
integration should allow content to be indexed automatically or periodically, 
without human intervention.

Regards
Wasim Kazi


Re: WELCOME to solr-user@lucene.apache.org

2018-06-25 Thread Erick Erickson
First, understand that this list is maintained by volunteers, so
answers aren't guaranteed.

If you require dedicated support, there are various organizations that
provide it, but you'll have to contact them.

That said, the community is quite responsive; just post questions to
solr-user, like this one.

Best,
Erick

On Sun, Jun 24, 2018 at 11:35 PM, Srinivas Muppu (US)
 wrote:
> Hi Solr Team,
>
> We are facing Solr system configuration issues and need help. Please let
> us know where to post our questions.
>
> Thanks,
> Srinivas
>
> On Mon, Jun 25, 2018 at 2:22 AM,  wrote:
>
>> Hi! This is the ezmlm program. I'm managing the
>> solr-user@lucene.apache.org mailing list.
>>
>> I'm working for my owner, who can be reached
>> at solr-user-ow...@lucene.apache.org.
>>
>> Acknowledgment: I have added the address
>>
>>srinivas.mu...@pwc.com
>>
>> to the solr-user mailing list.
>>
>> Welcome to solr-user@lucene.apache.org!
>>
>> Please save this message so that you know the address you are
>> subscribed under, in case you later want to unsubscribe or change your
>> subscription address.
>>
>>

Re: WELCOME to solr-user@lucene.apache.org

2018-06-24 Thread Srinivas Muppu (US)
Hi Solr Team,

We are facing Solr system configuration issues and need help. Please let
us know where to post our questions.

Thanks,
Srinivas


Re: WELCOME to solr-user@lucene.apache.org

2011-05-24 Thread Lord Khan Han
Hi,

Can I limit the terms that the HighlightComponent uses? My query is
generally long, and I want only specific terms to be highlighted, not the
rest. Is there an option like the one in the SpellCheckComponent, which uses q
unless spellcheck.q is specified? Is an hl.q parameter possible?


Or is there any other trick to work around this?


PS: I need this by tomorrow (hopefully) to show my boss, who is insisting on
some other stupid well-known commercial search engines.


Regards


Re: WELCOME to solr-user@lucene.apache.org

2010-11-12 Thread Ahmet Arslan
> /spell/?q=built+to+last
> 
> so that we can check the spelling. We are not using
> /select?q=built+to+last
> 
> Can I use dismax with /spell?

Yes you can.

> I understood from your reply that I need to change my
> schema.xml and modify
> the field types.

Correct. Make them full-text searchable; the string type is not tokenized.

> Do I need to still use the searchFields field and what do I
> need to specify
> in the defaultSearchField tag?

Delete searchFields; you don't need it. defaultSearchField does not matter 
with dismax, so you can set it to any of your fields, for example title.
Experiment with the other dismax parameters too. In short, dismax is the way to 
go if you are searching multiple fields.
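
As a rough illustration of the advice above, the sketch below builds a dismax request against the /spell handler. It assumes the handler accepts the usual defType and qf parameters; the base URL, the helper name spell_dismax_url, and the choice of title and author as qf fields are hypothetical.

```python
from urllib.parse import urlencode


def spell_dismax_url(base_url, user_query, qf):
    """Build a /spell request that asks for the dismax parser.

    base_url and qf are illustrative assumptions, not values from the thread.
    """
    params = {
        "q": user_query,      # raw user text, no field prefix needed with dismax
        "defType": "dismax",  # select the dismax query parser
        "qf": qf,             # fields to search, each with an optional boost
    }
    # urlencode escapes spaces as '+' and '^' as '%5E'
    return f"{base_url}/spell?{urlencode(params)}"


url = spell_dismax_url(
    "http://localhost:8983/solr",  # hypothetical Solr location
    "built to last",
    "title^20 author^5",
)
print(url)
```

Because dismax reads the fields from qf, the value of defaultSearchField plays no part in this request.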


  


Re: WELCOME to solr-user@lucene.apache.org

2010-11-12 Thread Solr User
Ahmet,

In our production system we are using

/spell/?q=built+to+last

so that we can check the spelling. We are not using /select?q=built+to+last

Can I use dismax with /spell?

I understood from your reply that I need to change my schema.xml and modify
the field types.

Do I need to still use the searchFields field and what do I need to specify
in the defaultSearchField tag?

searchFields is one of the field names that we provided.

Thanks,
Solr User




Re: WELCOME to solr-user@lucene.apache.org

2010-11-12 Thread Ahmet Arslan
> select/?q=built+to+last&defType=dismax&qf=searchFields^0.2+title^20&debugQuery=on
> 
> For some reason if I use title field in my query I don't
> get any results.
> 
> I am copying all searchable fields into searchFields field.
> So I am able to
> search only in the searchFields field not in any other
> fields.
> 
> I request you all to clarify if anything wrong with my
> schema.xml. The
> schema.xml is at the bottom of this email.
> 
> I am not able to get the boosting working on the title
> field. Please help me
> here too.

Change the type of your title field. It is string now; make it solr.TextField. 
Actually, you don't need a catch-all copy field with dismax. 
Just change the field types from string to text and add the fields to the qf= parameter.
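
To make the last point concrete, here is a minimal sketch of assembling a qf value from per-field boosts instead of relying on a catch-all copy field. The helper name build_qf and the particular fields and weights are assumptions chosen for illustration; only the field^boost syntax comes from the query shown in this thread.

```python
def build_qf(boosts):
    """Turn {"field": boost} into a dismax qf string such as "title^20 author^5"."""
    # ":g" drops a trailing ".0" so 20.0 is written as 20
    return " ".join(f"{field}^{boost:g}" for field, boost in boosts.items())


# Hypothetical weights: title matters most, the description least.
qf = build_qf({"title": 20, "author": 5, "desc": 0.2})
print(qf)  # title^20 author^5 desc^0.2
```

Each field listed this way must be a tokenized text type, otherwise only exact whole-value matches will hit.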


  


Re: WELCOME to solr-user@lucene.apache.org

2010-11-12 Thread Solr User
Ahmet,

Thanks for the reply.

select/?q=built+to+last&defType=dismax&qf=searchFields^0.2+title^20&debugQuery=on

For some reason, if I use the title field in my query, I don't get any results.

I am copying all searchable fields into the searchFields field, so I am able to
search only in the searchFields field and not in any other fields.

Please let me know if anything is wrong with my schema.xml. The
schema.xml is at the bottom of this email.

I am also not able to get boosting to work on the title field. Please help me
here too.

Thanks,
Solr User

On Thu, Nov 11, 2010 at 5:11 PM, Ahmet Arslan  wrote:

> There are several mistakes in your approach:
>
> copyField just copies data. Index time boost is not copied.
>
> There is no such boosting syntax. /select?q=Each&title^9&fl=score
>
> You are searching on your default field.
>
> This is not the cause of your problem, but omitNorms="true" disables index
> time boosts.
>
> http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need.
>
>

Re: WELCOME to solr-user@lucene.apache.org

2010-11-11 Thread Ramavtar Meena
Hi,

If you are looking for query-time boosting on the title field, you can do
the following:
/select?q=title:android^10

Also, unless you have a very good reason to use string for date data
(in your case pubdate and reldate), you should be using
solr.DateField.
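
If pubdate and reldate are moved to a date field type, the values need to be sent in the ISO 8601 UTC form that Solr date fields expect (for example 2005-03-01T00:00:00Z). A minimal sketch, assuming the source values are plain YYYY-MM-DD strings and that midnight UTC is an acceptable time of day; the helper name to_solr_date is hypothetical:

```python
from datetime import datetime, timezone


def to_solr_date(day):
    """Convert 'YYYY-MM-DD' to the 'YYYY-MM-DDThh:mm:ssZ' form used by Solr date fields."""
    # Assumption: the date has no time component, so midnight UTC is used.
    parsed = datetime.strptime(day, "%Y-%m-%d").replace(tzinfo=timezone.utc)
    return parsed.strftime("%Y-%m-%dT%H:%M:%SZ")


pubdate = to_solr_date("2005-03-01")
print(pubdate)  # 2005-03-01T00:00:00Z
```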

regards,
Ram
On Fri, Nov 12, 2010 at 3:41 AM, Ahmet Arslan  wrote:
> There are several mistakes in your approach:
>
> copyField just copies data. Index time boost is not copied.
>
> There is no such boosting syntax. /select?q=Each&title^9&fl=score
>
> You are searching on your default field.
>
> This is not the cause of your problem, but omitNorms="true" disables index 
> time boosts.
>
> http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need.
>
>

Re: WELCOME to solr-user@lucene.apache.org

2010-11-11 Thread Ahmet Arslan
There are several mistakes in your approach:

copyField just copies data. Index time boost is not copied.

There is no such boosting syntax as /select?q=Each&title^9&fl=score

With that URL you are searching on your default field. 

This is not the cause of your problem, but omitNorms="true" disables index-time 
boosts.

http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need.
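
The point about the malformed boost can be checked with a small sketch. It uses Python's standard query-string parser as a stand-in, on the assumption that Solr splits request parameters in the same way; the helper name split_params is hypothetical. In the broken URL, title^9 ends up as a separate, valueless parameter, so q contains only the bare term and the default field is searched.

```python
from urllib.parse import parse_qs


def split_params(query_string):
    """Split a query string into parameters, keeping ones that have no value."""
    return parse_qs(query_string, keep_blank_values=True)


# The URL from the question: "title^9" is its own parameter, not part of q.
broken = split_params("q=Each&title^9&fl=score")

# A field-qualified, boosted term written in the standard query syntax.
fixed = split_params("q=title:Each^9&fl=score")

print(broken["q"])  # the bare term, with no field and no boost
print(fixed["q"])   # field, term, and boost all travel inside q
```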



Re: WELCOME to solr-user@lucene.apache.org

2010-11-11 Thread Solr User
Eric,

Thank you so much for the reply, and I apologize for not providing all the
details.

The following are the field definitions in my schema.xml:

Copy Fields:

[copyField definitions missing from the archived message]

searchFields



Before creating the indexes I feed an XML file to the Solr job to create the
index files. I added a boost attribute to the title field before indexing;
an example is below:

1785440Each Little
Bird That Sings16.001520511399780152051136Hardcover2005-03-0120052005-02-22272ActiveSpring
2005Children's8.0-12.03-6Marla FrazeeJacket
IllustratorDeborah WilesAuthorSocial
Issues/FriendshipSocial Issues/General (see
also headings under Family)GeneralGirls &
WomenFiction/Middle GradeFiction/Award WinnersComing
of AgeSocial Situations/Death &
DyingSocial
Situations/Friendship/assets/product/0152051139.gif
Ten-year-old Comfort Snowberger has attended 247 funerals. But that's not surprising, considering that her family runs the town funeral home. And even though Great-uncle Edisto keeled over with a heart attack and Great-great-aunt Florentine dropped dead--just like that--six months later, Comfort knows how to deal with loss, or so she thinks. She's more concerned with avoiding her crazy cousin Peach and trying to figure out why her best friend, Declaration, suddenly won't talk to her. Life is full of surprises. And the biggest one of all is learning what it takes to handle them.

Deborah Wiles has created a unique, funny, and utterly real cast of characters in this heartfelt, and quintessentially Southern coming-of-age novel. Comfort will charm young readers with her wit, her warmth, and her struggles as she learns about life, loss, and ultimately, triumph.
Ten-year-old Comfort Snowberger learns about life's surprises in this funny, poignant, and very Southern coming-of-age story.1195443Baby Bear's Chairs16.001520511479780152051143Hardcover2005-09-0120052005-08-0140ActiveFall 2005Children's2.0-5.0P-KJane YolenAuthorMelissa SweetIllustratorBedtime & DreamsAnimals/BearsFamily/General (see also headings under Social Issues)Social Issues/Emotions & FeelingsFamily/ParentsAnimals/BearsBedtime BooksFamily Relationships/Parent-Child/assets/product/0152051147.gif
Baby Bear is the littlest bear in his family, and sometimes that's not so easy. Mama and Papa Bear get to stay up late in their great big chairs. Big brother gets to play fun games in his middle-sized chair. And Baby Bear only seems to cause trouble in his own tiny chair. But at the end of the day, he finds the one perfect chair that's comfier and cozier than all the rest.

Bestselling author Jane Yolen and popular illustrator Melissa Sweet have come together to create a lyrical bedtime tale about a baby bear trying to find his place in a family. With a playful rhyming text and adorable, fun illustrations, here is a book for parents and their own baby bears to treasure.
In this sweet, bedtime story, Baby Bear discovers that Papa's lap is the best chair of all!

I am trying to boost the title field so that an exact title match comes back
as the first item in the search results. Adding the boost attribute to the
title field (index-time boosting) did not change the search results.

I also tried query-time boosting as shown below, but with no luck:

/select?q=Each+Little+Bird+That+Sings&title^9&fl=score

Any help to fix this issue would be really helpful.

Thanks,
Solr User

On Thu, Nov 11, 2010 at 10:32 AM, Solr User wrote:

> Hi,
>
> I have a question about boosting.
>
> I have the following fields in my schema.xml:
>
> 1. title
> 2. description
> 3. ISBN
>
> etc
>
> I want to boost the field title. I tried index time boosting but it did not
> work. I also tried Query time boosting but with no luck.
>
> Can someone help me on how to implement boosting on a specific field like
> title?
>
> Thanks,
> Solr User
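
For reference, here is a minimal sketch of how a query-time field boost is commonly expressed with the dismax query parser: the boost goes into the qf parameter (title^9) rather than being appended as a bare &title^9 parameter. The host, port, handler path, and the choice of description as a second query field are assumptions for illustration, not details taken from this thread.

```python
from urllib.parse import urlencode

def boosted_query_url(user_query, base="http://localhost:8983/solr/select"):
    """Build a dismax query URL that weights title matches above description.

    The base URL and the field weights are illustrative assumptions.
    """
    params = {
        "q": user_query,
        "defType": "dismax",          # select the dismax query parser
        "qf": "title^9 description",  # per-field boosts applied at query time
        "fl": "*,score",              # return stored fields plus the score
    }
    return base + "?" + urlencode(params)

print(boosted_query_url("Each Little Bird That Sings"))
```

Comparing the score column with and without the title^9 weight is a quick way to check whether the boost is being applied at all.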

Re: WELCOME to solr-user@lucene.apache.org

2010-11-11 Thread Erick Erickson
There's not much to go on here. Boosting works,
and index time as opposed to query time boosting
addresses two different needs. Could you add some
detail? All you've really said is "it didn't work", which
doesn't allow a very constructive response.

Perhaps you could review:
http://wiki.apache.org/solr/HowToContribute

Best
Erick



On Thu, Nov 11, 2010 at 10:32 AM, Solr User  wrote:

> Hi,
>
> I have a question about boosting.
>
> I have the following fields in my schema.xml:
>
> 1. title
> 2. description
> 3. ISBN
>
> etc
>
> I want to boost the field title. I tried index time boosting but it did not
> work. I also tried Query time boosting but with no luck.
>
> Can someone help me on how to implement boosting on a specific field like
> title?
>
> Thanks,
> Solr User
>
>
>


Re: WELCOME to solr-user@lucene.apache.org

2010-11-11 Thread Solr User
Hi,

I have a question about boosting.

I have the following fields in my schema.xml:

1. title
2. description
3. ISBN

etc

I want to boost the field title. I tried index time boosting but it did not
work. I also tried Query time boosting but with no luck.

Can someone help me on how to implement boosting on a specific field like
title?

Thanks,
Solr User


Re: WELCOME to solr-user@lucene.apache.org

2009-12-08 Thread Chris Hostetter

(FYI: in the future please start a new thread with an appropriate subject 
line when you ask questions -- you probably would have gotten a lot more 
responses from people interested in Tika and SolrCell if they could tell 
that this email was about SolrCell)

: I found that Tika read the html and extract metadata like  from my htmls but my documents has an already an id setted by
: literal.id=10.
: 
: I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my
: literal.id

Hmm, yeah: that seems like an odd order of operations, but it's 
documented on the wiki so evidently it's intentional...

http://wiki.apache.org/solr/ExtractingRequestHandler#Order_of_field_operations

my best suggestions:

 * use the capture param to restrict what gets extracted (it's probably
possible to write an XPath query that selects everything *except* 
metadata[id])
 * change the name of your uniqueKey field to be something other than "id" 
so it's less likely to collide with a value from the document.

I also opened two Jira issues that you may want to post comments in...

https://issues.apache.org/jira/browse/SOLR-1633
https://issues.apache.org/jira/browse/SOLR-1634


-Hoss
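
A rough sketch of the second suggestion, assuming the uniqueKey has been renamed to a hypothetical doc_id and that the schema has an ignored_* dynamic field: the document's own key is sent as literal.doc_id, so an fmap.id mapping for Tika's "id" metadata no longer touches it. The host, port, and field names are assumptions, and this has not been tested against Solr 1.4.

```python
from urllib.parse import urlencode

def extract_params(doc_id, unique_key="doc_id"):
    """Query parameters for /update/extract when the uniqueKey is not "id".

    "doc_id" and "ignored_id" are hypothetical field names.
    """
    return {
        f"literal.{unique_key}": str(doc_id),  # our own key, supplied explicitly
        "fmap.id": "ignored_id",               # move Tika's "id" metadata aside
    }

def extract_url(doc_id, base="http://localhost:8983/solr/update/extract"):
    """Assemble the full extract request URL (base URL is an assumption)."""
    return base + "?" + urlencode(extract_params(doc_id))

print(extract_url(10))
```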



Re: WELCOME to solr-user@lucene.apache.org

2009-12-05 Thread khalid y
Thanks a lot for your response!

For the first solution:

I need to index all the content of my websites, and I just want Tika to ignore
the id meta tag because I already have an id.
I'll try on Monday and tell you if it works.

The second solution:
Are you sure Tika uses the HTML tokenizer? I'll check.

2009/12/5 Raghuveer Kancherla 

> 2 ways I can think of ...
>
>   - ExtractingRequestHandler (this is what I am guessing you are using now)
>
> Set extractOnly=true while making a request to the extractingRequestHandler
> and get the parsed content back. Now make a post request on update request
> handler with what ever fields and field values you want.
>



>
>   - Use HTMLStripWhiteSpaceTokenizer factory. This article may be helpful
>   to explain what I mean.
>
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripWhitespaceTokenizerFactory
> .
>
>
>
> - Raghu
>
>
>
> On Sat, Dec 5, 2009 at 3:44 AM, khalid y  wrote:
>
> > Hi,
> >
> > I have a problem with solr. I'm indexing some html content and solr crash
> > because my id field is multivalued.
> > I found that Tika read the html and extract metadata like an id meta tag
> > (content="12") from my htmls but my documents has an already an id setted by
> > literal.id=10.
> >
> > I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my
> > literal.id
> >
> > I'm using solr 1.4 and tika 0.5
> >
> > Someone can explain to me how I can ignore this the Tika id metadata ??
> >
> > Thanks
> >
>


Re: WELCOME to solr-user@lucene.apache.org

2009-12-05 Thread Raghuveer Kancherla
2 ways I can think of ...

   - ExtractingRequestHandler (this is what I am guessing you are using now)

Set extractOnly=true when making a request to the ExtractingRequestHandler
and get the parsed content back. Then make a POST request to the update request
handler with whatever fields and field values you want.


   - Use HTMLStripWhitespaceTokenizerFactory. This article may help
   explain what I mean:

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripWhitespaceTokenizerFactory



- Raghu
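
The first approach above could be sketched roughly as follows: one request asks the extracting handler only to parse the document (extractOnly=true), and a second step wraps the fields you actually want in a standard add message for the update handler. The host, port, and the "text" field name are assumptions; the sketch only builds the URL and the XML body and does not send anything.

```python
from urllib.parse import urlencode
from xml.etree import ElementTree as ET

def extract_only_url(base="http://localhost:8983/solr/update/extract"):
    """URL for a parse-only request; nothing is indexed by this call."""
    return base + "?" + urlencode({"extractOnly": "true"})

def build_add_xml(fields):
    """Wrap chosen field values in an <add><doc>...</doc></add> update message."""
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for name, value in fields.items():
        field = ET.SubElement(doc, "field", name=name)
        field.text = str(value)
    return ET.tostring(add, encoding="unicode")

# Step 1: fetch parsed content from extract_only_url() (not done here).
# Step 2: post only the fields you want, with your own id.
xml_body = build_add_xml({"id": 10, "text": "parsed page content"})
print(xml_body)
```

Because the id is set by hand in the second step, any id metadata Tika found in the HTML never reaches the index.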



On Sat, Dec 5, 2009 at 3:44 AM, khalid y  wrote:

> Hi,
>
> I have a problem with solr. I'm indexing some html content and solr crash
> because my id field is multivalued.
> I found that Tika read the html and extract metadata like an id meta tag (content="12") from my htmls but my documents has an already an id setted by
> literal.id=10.
>
> I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my
> literal.id
>
> I'm using solr 1.4 and tika 0.5
>
> Someone can explain to me how I can ignore this the Tika id metadata ??
>
> Thanks
>


Re: WELCOME to solr-user@lucene.apache.org

2009-12-04 Thread khalid y
Hi,

I have a problem with Solr. I'm indexing some HTML content and Solr crashes
because my id field becomes multivalued.
I found that Tika reads the HTML and extracts metadata such as an id meta tag (content="12") from my HTML files, but my documents already have an id set by
literal.id=10.

I tried to map the id from Tika with fmap.id=ignored_ but that also ignores my
literal.id

I'm using Solr 1.4 and Tika 0.5

Can someone explain to me how I can ignore the Tika id metadata?

Thanks


Re: WELCOME to solr-user@lucene.apache.org

2008-01-08 Thread bjorkgre

There are some instructions about integrating Nutch with Solr here:

http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html

Joakim


"Otis Gospodnetic" <[EMAIL PROTECTED]> wrote on 9.1.2008:
> Nutch and Solr work nicely in tandem.  We've used Nutch for its distributed
> fetching + parsing and related functionality and have used Solr to index
> the resulting text.  What glued them together was Solrj, actually.
> 
> Otis
> 
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> 
> - Original Message 
> From: Jan Buelens <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
> Sent: Tuesday, January 8, 2008 3:37:12 AM
> Subject: Re: WELCOME to solr-user@lucene.apache.org
> 
> Hi,
> 
> We are currently using Solr as search engine.
> To add an existing website to our search engine, we are investigating
>  Nutch.
> 
> Does anyone have more information / experience about an integration
>  between
> Solr and Nutch?
> 
> Thanks in advance !
> 
> 
> Best regards,
> Jan
> 
> 
> 


Re: WELCOME to solr-user@lucene.apache.org

2008-01-08 Thread Otis Gospodnetic
Nutch and Solr work nicely in tandem.  We've used Nutch for its distributed
fetching + parsing and related functionality and have used Solr to index the
resulting text.  What glued them together was Solrj, actually.

Otis 

--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

- Original Message 
From: Jan Buelens <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Tuesday, January 8, 2008 3:37:12 AM
Subject: Re: WELCOME to solr-user@lucene.apache.org

Hi,

We are currently using Solr as search engine.
To add an existing website to our search engine, we are investigating
 Nutch.

Does anyone have more information / experience about an integration
 between
Solr and Nutch?

Thanks in advance !


Best regards,
Jan





Re: WELCOME to solr-user@lucene.apache.org

2008-01-08 Thread Ryan McKinley

currently two approaches:

http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html
and:
https://issues.apache.org/jira/browse/NUTCH-442

I have had experience with the former... you may have more luck on the 
nutch-user list for help


ryan


Jan Buelens wrote:

Hi,

We are currently using Solr as search engine.
To add an existing website to our search engine, we are investigating Nutch.

Does anyone have more information / experience about an integration between
Solr and Nutch?

Thanks in advance !


Best regards,
Jan





Re: WELCOME to solr-user@lucene.apache.org

2008-01-08 Thread Jan Buelens
Hi,

We are currently using Solr as search engine.
To add an existing website to our search engine, we are investigating Nutch.

Does anyone have more information / experience about an integration between
Solr and Nutch?

Thanks in advance !


Best regards,
Jan