Re: WELCOME to solr-user@lucene.apache.org
In short, nothing that’s maintained as part of the Apache project. There may be commercial products, but I haven’t had occasion to look for one. Best, Erick > On Oct 20, 2019, at 7:42 AM, Wasim S Kazi wrote: > > Good day > > I would like to get some info or confirmation about configuring Solr 8+ to > get content from WCM (Websphere Content Management) > > Essentially, we have manually index data from WCM into Solr and this all > works fine. We want to now automate this process, so checking is there is any > well established integration method between WCM and Solr. This integration > should allow content being indexed automatically, or periodically without > human intervention. > > Regards > Wasim Kazi > > -Original Message- > From: solr-user-h...@lucene.apache.org > Sent: Sunday, October 20, 2019 2:39 PM > To: Wasim S Kazi > Subject: WELCOME to solr-user@lucene.apache.org > > Hi! This is the ezmlm program. I'm managing the solr-user@lucene.apache.org > mailing list. > > I'm working for my owner, who can be reached at > solr-user-ow...@lucene.apache.org. > > Acknowledgment: I have added the address > > wasim.s.k...@za.ey.com > > to the solr-user mailing list. > > Welcome to solr-user@lucene.apache.org! > > Please save this message so that you know the address you are subscribed > under, in case you later want to unsubscribe or change your subscription > address. > > > --- Administrative commands for the solr-user list --- > > I can handle administrative requests automatically. Please do not send them > to the list address! Instead, send your message to the correct command > address: > > To subscribe to the list, send a message to: > > > To remove your address from the list, send a message to: > > > Send mail to the following for info and FAQ for this list: > > > > Similar addresses exist for the digest list: > > > > To get messages 123 through 145 (a maximum of 100 per request), mail: > > > To get an index with subject and author for messages 123-456 , mail: > > > They are always returned as sets of 100, max 2000 per request, so you'll > actually get 100-499. > > To receive all messages with the same subject as message 12345, send a short > message to: > > > The messages should contain one line or word of text to avoid being treated > as sp@m, but I will ignore their content. > Only the ADDRESS you send to is important. > > You can start a subscription for an alternate address, for example > "john@host.domain", just add a hyphen and your address (with '=' instead of > '@') after the command word: > > > To stop subscription for this address, mail: > > > In both cases, I'll send a confirmation message to that address. When you > receive it, simply reply to it to complete your subscription. > > If despite following these instructions, you do not get the desired results, > please contact my owner at solr-user-ow...@lucene.apache.org. Please be > patient, my owner is a lot slower than I am ;-) > > --- Enclosed is a copy of the request I received. > > Return-Path: > Received: (qmail 96582 invoked by uid 99); 20 Oct 2019 11:38:52 - > Received: from pnap-us-west-generic-nat.apache.org (HELO > spamd1-us-west.apache.org) (209.188.14.142) >by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Oct 2019 11:38:52 + > Received: from localhost (localhost [127.0.0.1]) >by spamd1-us-west.apache.org (ASF Mail Server at > spamd1-us-west.apache.org) with ESMTP id 81232C0C8E >for > ; > Sun, 20 Oct 2019 11:38:51 + (UTC) > X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org > X-Spam-Flag: NO > X-Spam-Score: -4.8 > X-Spam-Level: > X-Spam-Status: No, score=-4.8 tagged_above=-999 required=6.31 >tests=[HTML_FONT_LOW_CONTRAST=0.001, HTML_MESSAGE=0.2, >KAM_SHORT=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001, >SPF_PASS=-0.001] autolearn=disabled > Received: from mx1-he-de.apache.org ([10.40.0.8]) >by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, > port 10024) >with ESMTP id Kbk25gxC2elm >for > ; >Sun, 20 Oct 2019 11:38:50 + (UTC) > Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=199.49.1.52; > helo=em01.ey.com; envelope-from=wasim.s.k...@za.ey.com; receiver= > Received: from em01.ey.com (em01.ey.com [199.49.1.52]) >by mx1-he-de.apache.org (ASF Mail Server at mx1-he-de.apache.org) with > ESMTPS id 86E307DDFA >for > ; > Sun, 20 Oct 2019 11:38:49 + (UTC) &g
RE: WELCOME to solr-user@lucene.apache.org
Good day I would like to get some info or confirmation about configuring Solr 8+ to get content from WCM (Websphere Content Management) Essentially, we have manually index data from WCM into Solr and this all works fine. We want to now automate this process, so checking is there is any well established integration method between WCM and Solr. This integration should allow content being indexed automatically, or periodically without human intervention. Regards Wasim Kazi -Original Message- From: solr-user-h...@lucene.apache.org Sent: Sunday, October 20, 2019 2:39 PM To: Wasim S Kazi Subject: WELCOME to solr-user@lucene.apache.org Hi! This is the ezmlm program. I'm managing the solr-user@lucene.apache.org mailing list. I'm working for my owner, who can be reached at solr-user-ow...@lucene.apache.org. Acknowledgment: I have added the address wasim.s.k...@za.ey.com to the solr-user mailing list. Welcome to solr-user@lucene.apache.org! Please save this message so that you know the address you are subscribed under, in case you later want to unsubscribe or change your subscription address. --- Administrative commands for the solr-user list --- I can handle administrative requests automatically. Please do not send them to the list address! Instead, send your message to the correct command address: To subscribe to the list, send a message to: To remove your address from the list, send a message to: Send mail to the following for info and FAQ for this list: Similar addresses exist for the digest list: To get messages 123 through 145 (a maximum of 100 per request), mail: To get an index with subject and author for messages 123-456 , mail: They are always returned as sets of 100, max 2000 per request, so you'll actually get 100-499. To receive all messages with the same subject as message 12345, send a short message to: The messages should contain one line or word of text to avoid being treated as sp@m, but I will ignore their content. Only the ADDRESS you send to is important. You can start a subscription for an alternate address, for example "john@host.domain", just add a hyphen and your address (with '=' instead of '@') after the command word: To stop subscription for this address, mail: In both cases, I'll send a confirmation message to that address. When you receive it, simply reply to it to complete your subscription. If despite following these instructions, you do not get the desired results, please contact my owner at solr-user-ow...@lucene.apache.org. Please be patient, my owner is a lot slower than I am ;-) --- Enclosed is a copy of the request I received. Return-Path: Received: (qmail 96582 invoked by uid 99); 20 Oct 2019 11:38:52 - Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Oct 2019 11:38:52 + Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 81232C0C8E for ; Sun, 20 Oct 2019 11:38:51 + (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -4.8 X-Spam-Level: X-Spam-Status: No, score=-4.8 tagged_above=-999 required=6.31 tests=[HTML_FONT_LOW_CONTRAST=0.001, HTML_MESSAGE=0.2, KAM_SHORT=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-he-de.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id Kbk25gxC2elm for ; Sun, 20 Oct 2019 11:38:50 + (UTC) Received-SPF: Pass (mailfrom) identity=mailfrom; client-ip=199.49.1.52; helo=em01.ey.com; envelope-from=wasim.s.k...@za.ey.com; receiver= Received: from em01.ey.com (em01.ey.com [199.49.1.52]) by mx1-he-de.apache.org (ASF Mail Server at mx1-he-de.apache.org) with ESMTPS id 86E307DDFA for ; Sun, 20 Oct 2019 11:38:49 + (UTC) IronPort-SDR: 0i+SrmLgncBfCsgonKDgt+Ll+5TCuN/hbDHsUS1V98D3LWk4dgqQE9qJPrbcZyYjLWRYXieztn Fjky8vaAREXw== X-IronPort-AV: E=Sophos;i="5.67,319,1566864000"; d="gif'147?scan'147,208,217,147";a="240843155" Received: from unknown (HELO DERUSRMPEXTP02.ey.net) ([10.151.33.58]) by defrakaeyip01.eurw.ey.net with ESMTP; 20 Oct 2019 11:38:42 + ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Em+4qSC0AqZ4Ei+nYLvNi3BwVnwrjtXdFD2W5lnj3CNDBO0x9JJBOn5yWMUj4JNnCnhg4R524D5O+lX6dYrYut/tTe09g0pnRemmla9J7icpboVqK6i5gXJLHLFA9dERNQwRDieNKqKEkei0eIbCzLMJeVld1lvj7CJiXIZPZIySU5hHZI7N5+Q9i1eb4GRYxATio7ibfxNknvf3/2298wyUhY9EuQEEuTWNrylkhMtQORgdlgv+mEdpzGJO+FaiG0fv1MQ0TO8JcgybSjJ14hG7xYlhkGEO39qzV7Q9EDbsPwJuupwZg/r4XAIIZ0Bjc0f7YX11S2BhnV8mdm+T+A== ARC-Message-
Re: WELCOME to solr-user@lucene.apache.org
First, understand that this list is maintained by volunteers, so answers aren't guaranteed. If you require dedicated support there are various organizations that provide same, but you'll have to contact them. That said, the community is quite responsive, just post questions to solr-user like this one. Best, Erick On Sun, Jun 24, 2018 at 11:35 PM, Srinivas Muppu (US) wrote: > Hi Solr Team, > > We are facing Solr System Configuration issues which needs help. Please let > us know whom to post our Questions/Queries. > > Thanks, > Srinivas > > On Mon, Jun 25, 2018 at 2:22 AM, wrote: > >> Hi! This is the ezmlm program. I'm managing the >> solr-user@lucene.apache.org mailing list. >> >> I'm working for my owner, who can be reached >> at solr-user-ow...@lucene.apache.org. >> >> Acknowledgment: I have added the address >> >>srinivas.mu...@pwc.com >> >> to the solr-user mailing list. >> >> Welcome to solr-user@lucene.apache.org! >> >> Please save this message so that you know the address you are >> subscribed under, in case you later want to unsubscribe or change your >> subscription address. >> >> >> --- Administrative commands for the solr-user list --- >> >> I can handle administrative requests automatically. Please >> do not send them to the list address! Instead, send >> your message to the correct command address: >> >> To subscribe to the list, send a message to: >> >> >> To remove your address from the list, send a message to: >> >> >> Send mail to the following for info and FAQ for this list: >> >> >> >> Similar addresses exist for the digest list: >> >> >> >> To get messages 123 through 145 (a maximum of 100 per request), mail: >> >> >> To get an index with subject and author for messages 123-456 , mail: >> >> >> They are always returned as sets of 100, max 2000 per request, >> so you'll actually get 100-499. >> >> To receive all messages with the same subject as message 12345, >> send a short message to: >> >> >> The messages should contain one line or word of text to avoid being >> treated as sp@m, but I will ignore their content. >> Only the ADDRESS you send to is important. >> >> You can start a subscription for an alternate address, >> for example "john@host.domain", just add a hyphen and your >> address (with '=' instead of '@') after the command word: >> >> >> To stop subscription for this address, mail: >> >> >> In both cases, I'll send a confirmation message to that address. When >> you receive it, simply reply to it to complete your subscription. >> >> If despite following these instructions, you do not get the >> desired results, please contact my owner at >> solr-user-ow...@lucene.apache.org. Please be patient, my owner is a >> lot slower than I am ;-) >> >> --- Enclosed is a copy of the request I received. >> >> Return-Path: >> Received: (qmail 84164 invoked by uid 99); 25 Jun 2018 06:22:12 - >> Received: from pnap-us-west-generic-nat.apache.org (HELO >> spamd1-us-west.apache.org) (209.188.14.142) >> by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Jun 2018 06:22:12 >> + >> Received: from localhost (localhost [127.0.0.1]) >> by spamd1-us-west.apache.org (ASF Mail Server at >> spamd1-us-west.apache.org) with ESMTP id 63CB9CA4A5 >> for > pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:12 + (UTC) >> X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org >> X-Spam-Flag: NO >> X-Spam-Score: -1 >> X-Spam-Level: >> X-Spam-Status: No, score=-1 tagged_above=-999 required=6.31 >> tests=[HTML_MESSAGE=2, KAM_BADIPHTTP=2, KAM_SHORT=0.001, >> NORMAL_HTTP_TO_IP=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001, >> SPF_PASS=-0.001] autolearn=disabled >> Received: from mx1-lw-us.apache.org ([10.40.0.8]) >> by localhost (spamd1-us-west.apache.org [10.40.0.7]) >> (amavisd-new, port 10024) >> with ESMTP id NuBVNjDIIyqW >> for > pwc@lucene.apache.org>; >> Mon, 25 Jun 2018 06:22:10 + (UTC) >> Received: from lxsmpr20.pwc.com (lxsmpr20.pwc.com [155.201.248.112]) >> by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) >> with ESMTPS id 500895F1B4 >> for > pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:10 + (UTC)
Re: WELCOME to solr-user@lucene.apache.org
Hi Solr Team, We are facing Solr System Configuration issues which needs help. Please let us know whom to post our Questions/Queries. Thanks, Srinivas On Mon, Jun 25, 2018 at 2:22 AM, wrote: > Hi! This is the ezmlm program. I'm managing the > solr-user@lucene.apache.org mailing list. > > I'm working for my owner, who can be reached > at solr-user-ow...@lucene.apache.org. > > Acknowledgment: I have added the address > >srinivas.mu...@pwc.com > > to the solr-user mailing list. > > Welcome to solr-user@lucene.apache.org! > > Please save this message so that you know the address you are > subscribed under, in case you later want to unsubscribe or change your > subscription address. > > > --- Administrative commands for the solr-user list --- > > I can handle administrative requests automatically. Please > do not send them to the list address! Instead, send > your message to the correct command address: > > To subscribe to the list, send a message to: > > > To remove your address from the list, send a message to: > > > Send mail to the following for info and FAQ for this list: > > > > Similar addresses exist for the digest list: > > > > To get messages 123 through 145 (a maximum of 100 per request), mail: > > > To get an index with subject and author for messages 123-456 , mail: > > > They are always returned as sets of 100, max 2000 per request, > so you'll actually get 100-499. > > To receive all messages with the same subject as message 12345, > send a short message to: > > > The messages should contain one line or word of text to avoid being > treated as sp@m, but I will ignore their content. > Only the ADDRESS you send to is important. > > You can start a subscription for an alternate address, > for example "john@host.domain", just add a hyphen and your > address (with '=' instead of '@') after the command word: > > > To stop subscription for this address, mail: > > > In both cases, I'll send a confirmation message to that address. When > you receive it, simply reply to it to complete your subscription. > > If despite following these instructions, you do not get the > desired results, please contact my owner at > solr-user-ow...@lucene.apache.org. Please be patient, my owner is a > lot slower than I am ;-) > > --- Enclosed is a copy of the request I received. > > Return-Path: > Received: (qmail 84164 invoked by uid 99); 25 Jun 2018 06:22:12 - > Received: from pnap-us-west-generic-nat.apache.org (HELO > spamd1-us-west.apache.org) (209.188.14.142) > by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 25 Jun 2018 06:22:12 > + > Received: from localhost (localhost [127.0.0.1]) > by spamd1-us-west.apache.org (ASF Mail Server at > spamd1-us-west.apache.org) with ESMTP id 63CB9CA4A5 > for pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:12 + (UTC) > X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org > X-Spam-Flag: NO > X-Spam-Score: -1 > X-Spam-Level: > X-Spam-Status: No, score=-1 tagged_above=-999 required=6.31 > tests=[HTML_MESSAGE=2, KAM_BADIPHTTP=2, KAM_SHORT=0.001, > NORMAL_HTTP_TO_IP=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_PASS=-0.001, > SPF_PASS=-0.001] autolearn=disabled > Received: from mx1-lw-us.apache.org ([10.40.0.8]) > by localhost (spamd1-us-west.apache.org [10.40.0.7]) > (amavisd-new, port 10024) > with ESMTP id NuBVNjDIIyqW > for pwc@lucene.apache.org>; > Mon, 25 Jun 2018 06:22:10 + (UTC) > Received: from lxsmpr20.pwc.com (lxsmpr20.pwc.com [155.201.248.112]) > by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) > with ESMTPS id 500895F1B4 > for pwc@lucene.apache.org>; Mon, 25 Jun 2018 06:22:10 + (UTC) > Received: from mail-vk0-f71.google.com (mail-vk0-f71.google.com > [209.85.213.71]) > by lxsmpr20.nam.pwcinternal.com (8.16.0.21/8.16.0.21) with ESMTPS > id w5P6M3MF054491 > (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 > verify=OK) > for pwc@lucene.apache.org>; Mon, 25 Jun 2018 02:22:03 -0400 > Received: by mail-vk0-f71.google.com with SMTP id j123-v6so5886670vkc.4 > for pwc@lucene.apache.org>; Sun, 24 Jun 2018 23:22:03 -0700 (PDT) > X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; > d=1e100.net; s=20161025; > h=x-gm-message-state:mime-version:in-reply-to:references:from:date > :message-id:subject:to; > bh=+MKXiCktrcuycddIpUqd9ljQ2oLqYBsgU3qPgb6oZ2M=; > b=q4Vku4HdqSxx2NyQ1G2GtPG7
Re: WELCOME to solr-user@lucene.apache.org
Hi , Can I limit the terms that the HighlightComponent uses. My query is generally long and I want specific ones to be highlighted and the rest is not highlighted. Is there an option like the SpellCheckComponent. it uses q unless spellcheck.q if specified. Is a hl.q parameter possible? Or any other tricky way to workaround .. PS: I need this tomorrow (hopefully) to show my boss insisting some other stupid well known commercial search engines.. Regards
Re: WELCOME to solr-user@lucene.apache.org
> /spell/?q=built+to+last > > so that we can check the spelling. We are not using > /select?q=built+to+last > > Can I use dismax with /spell? Yes you can. > I understood from your reply that I need to change my > schema.xml and modify > the field types. Correct. Make them full-text searchable. string type is not tokenized. > Do I need to still use the searchFields field and what do I > need to specify > in the defaultSearchField tag? Delete searchFields, you don't need it. Regarding defaultSearchField, it does not matter with dismax. Write any of your fields. For example title. And play with other dismax parameters. In short dismax is the way to go if you are searching multiple fields.
Re: WELCOME to solr-user@lucene.apache.org
Ahmet, In production system we are using /spell/?q=built+to+last so that we can check the spelling. We are not using /select?q=built+to+last Can I use dismax with /spell? I understood from your reply that I need to change my schema.xml and modify the field types. Do I need to still use the searchFields field and what do I need to specify in the defaultSearchField tag? searchFields is one of the field names that we provided. Thanks, Solr User On Fri, Nov 12, 2010 at 10:26 AM, Ahmet Arslan wrote: > > > select/?q=built+to+last&defType=dismax&qf=searchFields^0.2+title^20&debugQuery=on > > > > For some reason if I use title field in my query I don't > > get any results. > > > > I am copying all searchable fields into searchFields field. > > So I am able to > > search only in the searchFields field not in any other > > fields. > > > > I request you all to clarify if anything wrong with my > > schema.xml. The > > schema.xml is at the bottom of this email. > > > > I am not able to get the boosting working on the title > > field. Please help me > > here too. > > Change type of your title field. It is string now. Make it solr.TextField. > Actually you dont need cath-all copy field with dismax. > Just change their types string to text and append them qf= parameter. > > > >
Re: WELCOME to solr-user@lucene.apache.org
> select/?q=built+to+last&defType=dismax&qf=searchFields^0.2+title^20&debugQuery=on > > For some reason if I use title field in my query I don't > get any results. > > I am copying all searchable fields into searchFields field. > So I am able to > search only in the searchFields field not in any other > fields. > > I request you all to clarify if anything wrong with my > schema.xml. The > schema.xml is at the bottom of this email. > > I am not able to get the boosting working on the title > field. Please help me > here too. Change type of your title field. It is string now. Make it solr.TextField. Actually you dont need cath-all copy field with dismax. Just change their types string to text and append them qf= parameter.
Re: WELCOME to solr-user@lucene.apache.org
Ahmet, Thanks for the reply. select/?q=built+to+last&defType=dismax&qf=searchFields^0.2+title^20&debugQuery=on For some reason if I use title field in my query I don't get any results. I am copying all searchable fields into searchFields field. So I am able to search only in the searchFields field not in any other fields. I request you all to clarify if anything wrong with my schema.xml. The schema.xml is at the bottom of this email. I am not able to get the boosting working on the title field. Please help me here too. Thanks, Solr User On Thu, Nov 11, 2010 at 5:11 PM, Ahmet Arslan wrote: > There are several mistakes in your approach: > > copyField just copies data. Index time boost is not copied. > > There is no such boosting syntax. /select?q=Each&title^9&fl=score > > You are searching on your default field. > > This is not your cause of your problem but omitNorms="true" disables index > time boosts. > > http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need. > > > --- On Thu, 11/11/10, Solr User wrote: > > > From: Solr User > > Subject: Re: WELCOME to solr-user@lucene.apache.org > > To: solr-user@lucene.apache.org > > Date: Thursday, November 11, 2010, 11:54 PM > > Eric, > > > > Thank you so much for the reply and apologize for not > > providing all the > > details. > > > > The following are the field definitons in my schema.xml: > > > > > stored="true" > > omitNorms="false" /> > > > > > stored="true" > > multiValued="true" omitNorms="true" /> > > > > > stored="true" > > multiValued="true" omitNorms="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" > > multiValued="true" omitNorms="true" /> > > > > > stored="true" /> > > > > > stored="true" > > multiValued="true" omitNorms="true" /> > > > > > stored="true" > > multiValued="true" omitNorms="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" /> > > > > > stored="true" > > omitNorms="true"/> > > > > > stored="true"/> > > > > > indexed="true" stored="true" > > multiValued="true" omitNorms="true"/> > > > > Copy Fields: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > searchFields > > > > > > > > Before creating the indexes I feed XML file to the Solr job > > to create index > > files. I added Boost attribute to the title field before > > creating indexes > > and an example is below: > > > > > standalone="no"?> > name="material">1785440 > boost="10.0" name="title">Each Little > > Bird That Sings > name="price">16.0 > name="isbn10">0152051139 > name="isbn13">9780152051136 > name="format">Hardcover > name="pubdate">2005-03-01 > name="pubyear">2005 > name="reldate">2005-02-22 > name="pages">272 > name="bisacstatus">Active > name="season">Spring > > 2005 > name="imprint">Children's > name="age">8.0-12.0 > name="grade">3-6 > name="author">Marla Frazee > name="authortype">
Re: WELCOME to solr-user@lucene.apache.org
Hi, If you are looking for query time boosting on title field you can do the following: /select?q=title:android^10 Also unless you have a very good reason to use string for date data (in your case pubdate and reldate), you should be using solr.DateField. regards, Ram On Fri, Nov 12, 2010 at 3:41 AM, Ahmet Arslan wrote: > There are several mistakes in your approach: > > copyField just copies data. Index time boost is not copied. > > There is no such boosting syntax. /select?q=Each&title^9&fl=score > > You are searching on your default field. > > This is not your cause of your problem but omitNorms="true" disables index > time boosts. > > http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need. > > > --- On Thu, 11/11/10, Solr User wrote: > >> From: Solr User >> Subject: Re: WELCOME to solr-user@lucene.apache.org >> To: solr-user@lucene.apache.org >> Date: Thursday, November 11, 2010, 11:54 PM >> Eric, >> >> Thank you so much for the reply and apologize for not >> providing all the >> details. >> >> The following are the field definitons in my schema.xml: >> >> > stored="true" >> omitNorms="false" /> >> >> > stored="true" >> multiValued="true" omitNorms="true" /> >> >> > stored="true" >> multiValued="true" omitNorms="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" >> multiValued="true" omitNorms="true" /> >> >> > stored="true" /> >> >> > stored="true" >> multiValued="true" omitNorms="true" /> >> >> > stored="true" >> multiValued="true" omitNorms="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" /> >> >> > stored="true" >> omitNorms="true"/> >> >> > stored="true"/> >> >> > indexed="true" stored="true" >> multiValued="true" omitNorms="true"/> >> >> Copy Fields: >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> searchFields >> >> >> >> Before creating the indexes I feed XML file to the Solr job >> to create index >> files. I added Boost attribute to the title field before >> creating indexes >> and an example is below: >> >> > standalone="no"?>> name="material">1785440> boost="10.0" name="title">Each Little >> Bird That Sings> name="price">16.0> name="isbn10">0152051139> name="isbn13">9780152051136> name="format">Hardcover> name="pubdate">2005-03-01> name="pubyear">2005> name="reldate">2005-02-22> name="pages">272> name="bisacstatus">Active> name="season">Spring >> 2005> name="imprint">Children's> name="age">8.0-12.0> name="grade">3-6> name="author">Marla Frazee> name="authortype">Jacket >> IllustratorDeborah >> Wiles> name="authortype">Author> name="bisacsub">Social >> Issues/Friendship> name="bisacsub">Social Issues/General (see >> also headings under Family)> name="bisacsub">General> name="bisacsub">Girls & >> Women> name="category">Fiction/Middle >> Grade> name="category">Fiction/Award &
Re: WELCOME to solr-user@lucene.apache.org
There are several mistakes in your approach: copyField just copies data. Index time boost is not copied. There is no such boosting syntax. /select?q=Each&title^9&fl=score You are searching on your default field. This is not your cause of your problem but omitNorms="true" disables index time boosts. http://wiki.apache.org/solr/DisMaxQParserPlugin can satisfy your need. --- On Thu, 11/11/10, Solr User wrote: > From: Solr User > Subject: Re: WELCOME to solr-user@lucene.apache.org > To: solr-user@lucene.apache.org > Date: Thursday, November 11, 2010, 11:54 PM > Eric, > > Thank you so much for the reply and apologize for not > providing all the > details. > > The following are the field definitons in my schema.xml: > > stored="true" > omitNorms="false" /> > > stored="true" > multiValued="true" omitNorms="true" /> > > stored="true" > multiValued="true" omitNorms="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" > multiValued="true" omitNorms="true" /> > > stored="true" /> > > stored="true" > multiValued="true" omitNorms="true" /> > > stored="true" > multiValued="true" omitNorms="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" /> > > stored="true" > omitNorms="true"/> > > stored="true"/> > > indexed="true" stored="true" > multiValued="true" omitNorms="true"/> > > Copy Fields: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > searchFields > > > > Before creating the indexes I feed XML file to the Solr job > to create index > files. I added Boost attribute to the title field before > creating indexes > and an example is below: > > standalone="no"?> name="material">1785440 boost="10.0" name="title">Each Little > Bird That Sings name="price">16.0 name="isbn10">0152051139 name="isbn13">9780152051136 name="format">Hardcover name="pubdate">2005-03-01 name="pubyear">2005 name="reldate">2005-02-22 name="pages">272 name="bisacstatus">Active name="season">Spring > 2005 name="imprint">Children's name="age">8.0-12.0 name="grade">3-6 name="author">Marla Frazee name="authortype">Jacket > IllustratorDeborah > Wiles name="authortype">Author name="bisacsub">Social > Issues/Friendship name="bisacsub">Social Issues/General (see > also headings under Family) name="bisacsub">General name="bisacsub">Girls & > Women name="category">Fiction/Middle > Grade name="category">Fiction/Award > WinnersComing > of AgeSocial > Situations/Death & > DyingSocial > Situations/Friendship name="path">/assets/product/0152051139.gif name="desc"><div>Ten-year-old Comfort > Snowberger has attended 247 > funerals. But that's not surprising, considering that her > family runs the > town funeral home. And even though Great-uncle Edisto > keeled over with a > heart attack and Great-great-aunt Florentine dropped > dead--just like > that--six months later, Comfort knows how to deal with > loss, or so she > thinks. She's more concerned with avoiding her crazy cousin > Peach and trying > to figure out why her best friend, Declaration, suddenly > won't talk to her. > Life is full of surprises. And the biggest one of all is > learning what it > takes to handle them.<br> > <br>Deborah Wiles has created a > unique, funny, and utterly real cast of characters in this > heartfelt, and > quintessentially Southern com
Re: WELCOME to solr-user@lucene.apache.org
Eric, Thank you so much for the reply and apologize for not providing all the details. The following are the field definitons in my schema.xml: Copy Fields: searchFields Before creating the indexes I feed XML file to the Solr job to create index files. I added Boost attribute to the title field before creating indexes and an example is below: 1785440Each Little Bird That Sings16.001520511399780152051136Hardcover2005-03-0120052005-02-22272ActiveSpring 2005Children's8.0-12.03-6Marla FrazeeJacket IllustratorDeborah WilesAuthorSocial Issues/FriendshipSocial Issues/General (see also headings under Family)GeneralGirls & WomenFiction/Middle GradeFiction/Award WinnersComing of AgeSocial Situations/Death & DyingSocial Situations/Friendship/assets/product/0152051139.gifTen-year-old Comfort Snowberger has attended 247 funerals. But that's not surprising, considering that her family runs the town funeral home. And even though Great-uncle Edisto keeled over with a heart attack and Great-great-aunt Florentine dropped dead--just like that--six months later, Comfort knows how to deal with loss, or so she thinks. She's more concerned with avoiding her crazy cousin Peach and trying to figure out why her best friend, Declaration, suddenly won't talk to her. Life is full of surprises. And the biggest one of all is learning what it takes to handle them.Ten-year-old Comfort Snowberger learns about life's surprises in this funny, poignant, and very Southern coming-of-age story.1195443Baby Bear's Chairs16.001520511479780152051143Hardcover2005-09-0120052005-08-0140ActiveFall 2005Children's2.0-5.0P-KJane YolenAuthorMelissa SweetIllustratorBedtime & DreamsAnimals/BearsFamily/General (see also headings under Social Issues)Social Issues/Emotions & FeelingsFamily/ParentsAnimals/BearsBedtime BooksFamily Relationships/Parent-Child/assets/product/0152051147.gif
Deborah Wiles has created a unique, funny, and utterly real cast of characters in this heartfelt, and quintessentially Southern coming-of-age novel. Comfort will charm young readers with her wit, her warmth, and her struggles as she learns about life, loss, and ultimately, triumph.Baby Bear is the littlest bear in his family, and sometimes that's not so easy. Mama and Papa Bear get to stay up late in their great big chairs. Big brother gets to play fun games in his middle-sized chair. And Baby Bear only seems to cause trouble in his own tiny chair. But at the end of the day, he finds the one perfect chair that's comfier and cozier than all the rest.In this sweet, bedtime story, Baby Bear discovers that Papa's lap is the best chair of all! I am trying to boost the title field so that the search results brings the actual match with title as the first item in the results. Adding boost attribute to the title field and Index time boosting did not change the search results. I tried Query time boosting also as mentioned below but no luck /select?q=Each+Little+Bird+That+Sings&title^9&fl=score Any help to fix this issue would be really helpful. Thanks, Solr User On Thu, Nov 11, 2010 at 10:32 AM, Solr User wrote: > Hi, > > I have a question about boosting. > > I have the following fields in my schema.xml: > > 1. title > 2. description > 3. ISBN > > etc > > I want to boost the field title. I tried index time boosting but it did not > work. I also tried Query time boosting but with no luck. > > Can someone help me on how to implement boosting on a specific field like > title? > > Thanks, > Solr User > > >
Bestselling author Jane Yolen and popular illustrator Melissa Sweet have come together to create a lyrical bedtime tale about a baby bear trying to find his place in a family. With a playful rhyming text and adorable, fun illustrations, here is a book for parents and their own baby bears to treasure.
Re: WELCOME to solr-user@lucene.apache.org
There's not much to go on here. Boosting works, and index time as opposed to query time boosting addresses two different needs. Could you add some detail? All you've really said is "it didn't work", which doesn't allow a very constructive response. Perhaps you could review: http://wiki.apache.org/solr/HowToContribute Best Erick On Thu, Nov 11, 2010 at 10:32 AM, Solr User wrote: > Hi, > > I have a question about boosting. > > I have the following fields in my schema.xml: > > 1. title > 2. description > 3. ISBN > > etc > > I want to boost the field title. I tried index time boosting but it did not > work. I also tried Query time boosting but with no luck. > > Can someone help me on how to implement boosting on a specific field like > title? > > Thanks, > Solr User > > >
Re: WELCOME to solr-user@lucene.apache.org
Hi, I have a question about boosting. I have the following fields in my schema.xml: 1. title 2. description 3. ISBN etc I want to boost the field title. I tried index time boosting but it did not work. I also tried Query time boosting but with no luck. Can someone help me on how to implement boosting on a specific field like title? Thanks, Solr User On Thu, Nov 11, 2010 at 10:26 AM, wrote: > Hi! This is the ezmlm program. I'm managing the > solr-user@lucene.apache.org mailing list. > > I'm working for my owner, who can be reached > at solr-user-ow...@lucene.apache.org. > > Acknowledgment: I have added the address > > solr...@gmail.com > > to the solr-user mailing list. > > Welcome to solr-u...@lucene.apache.org! > > Please save this message so that you know the address you are > subscribed under, in case you later want to unsubscribe or change your > subscription address. > > > --- Administrative commands for the solr-user list --- > > I can handle administrative requests automatically. Please > do not send them to the list address! Instead, send > your message to the correct command address: > > To subscribe to the list, send a message to: > > > To remove your address from the list, send a message to: > > > Send mail to the following for info and FAQ for this list: > > > > Similar addresses exist for the digest list: > > > > To get messages 123 through 145 (a maximum of 100 per request), mail: > > > To get an index with subject and author for messages 123-456 , mail: > > > They are always returned as sets of 100, max 2000 per request, > so you'll actually get 100-499. > > To receive all messages with the same subject as message 12345, > send a short message to: > > > The messages should contain one line or word of text to avoid being > treated as s...@m, but I will ignore their content. > Only the ADDRESS you send to is important. > > You can start a subscription for an alternate address, > for example "j...@host.domain", just add a hyphen and your > address (with '=' instead of '@') after the command word: > > > To stop subscription for this address, mail: > > > In both cases, I'll send a confirmation message to that address. When > you receive it, simply reply to it to complete your subscription. > > If despite following these instructions, you do not get the > desired results, please contact my owner at > solr-user-ow...@lucene.apache.org. Please be patient, my owner is a > lot slower than I am ;-) > > --- Enclosed is a copy of the request I received. > > Return-Path: > Received: (qmail 48883 invoked by uid 99); 11 Nov 2010 15:26:44 - > Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) >by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Nov 2010 15:26:44 > + > X-ASF-Spam-Status: No, hits=2.2 required=10.0 > > > tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL > X-Spam-Check-By: apache.org > Received-SPF: pass (nike.apache.org: domain of solr...@gmail.comdesignates > 209.85.213.48 as permitted sender) > Received: from [209.85.213.48] (HELO mail-yw0-f48.google.com) > (209.85.213.48) >by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Nov 2010 15:26:35 > + > Received: by ywp4 with SMTP id 4so1394872ywp.35 >for @lucene.apache.org>; Thu, 11 Nov 2010 07:26:14 -0800 (PST) > DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; >d=gmail.com; s=gamma; >h=domainkey-signature:mime-version:received:received:in-reply-to > :references:date:message-id:subject:from:to:content-type; >bh=4KuKRrRVLjzTO4oB9/DNxMdQPfNQH2GnYznzPE6YqOo=; >b=l5lBfUYcyvipJn9SE+5j+t1XUmBjTtbyPYlRVj7jDb6G+W3NzQ21EHOowiD9rNH2L9 > > gc2+6mGEZmRJOZQwpKD7SUQ2bXL9fVm7mVfS21TMAgC+ZsWQ3vvFOHXalWZa8dbtcOY7 > C23KauLY7YH1UfducfXL77J7u0/snEZl5jQ7A= > DomainKey-Signature: a=rsa-sha1; c=nofws; >d=gmail.com; s=gamma; > > h=mime-version:in-reply-to:references:date:message-id:subject:from:to > :content-type; >b=nb9+3a9bOHnjGO5T5BhMlW15adcafr+MPzvpgc5X5NXEUGCI05ViLho0SSoQP2Wp2i > > xp1Mfjrjw05umeKmHX23oeD5Idc2G6xgz8I3ZcJ1bUM+cD7c52cMKG2suE2VvhUHlfah > z52rEtlqd0Q9fk/ZDWwR2DS7GoiVMRmgaWgD0= > MIME-Version: 1.0 > Received: by 10.229.216.201 with SMTP id hj9mr877669qcb.58.1289489174123; > Thu, > 11 Nov 2010 07:26:14 -0800 (PST) > Received: by 10.229.66.165 with HTTP; Thu, 11 Nov 2010 07:26:14 -0800 (PST) > In-Reply-To: <1289489103.46214.ez...@lucene.apache.org> > References: <1289489103.46214.ez...@lucene.apache.org> > Date: Thu, 11 Nov 2010 10:26:14 -0500 > Message-ID: > > > > Subject: Re: confirm subscribe to solr-user@lucene.apache.org > From: Solr User > To: solr-user-sc.1289489103.apfngfdapdhadiahjfln-solrnew=gmail.com@ > lucene.apache.org > Content-Type: multipart/alternative; boundary=0016361e83f82a56590494c898ec > X-Virus-Checked: Checked by ClamAV on apache.org > > --0016361e83f82a56590494c898ec > Content-Type: text/plain; charset=ISO-8859-1 > > Pl
Re: WELCOME to solr-user@lucene.apache.org
(FYI: in the future please start a new thread with an approriate subject line when you ask questions -- you probably would have gotten a lot more responses fro people interested in Tika and SolrCell if they could tell that this email was about SolrCell) : I found that Tika read the html and extract metadata like from my htmls but my documents has an already an id setted by : literal.id=10. : : I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my : literal.id H, yeah: that seems like an odd order of operations, but it's documented on the wiki so evidently it's intentional... http://wiki.apache.org/solr/ExtractingRequestHandler#Order_of_field_operations my best sugguestions: * use the capture param to restrict what gets extracted (it's probably possible to write an XPath query that selects everything *except* metadata[id]) * change the name of your uniqueKey field to be something other then "id" so it's less likely to collide with a value from the document. I also opened two Jira issues that you may want to post comments in... https://issues.apache.org/jira/browse/SOLR-1633 https://issues.apache.org/jira/browse/SOLR-1634 -Hoss
Re: WELCOME to solr-user@lucene.apache.org
Thanks a lot for you response !! For the first solution : I need to index all the content of my websites and I want just tika ignore because I have already an id I'll try monday and tell you if it works The second solution : Are your sure Tika use the HTML Tokenizer ? I'll check 2009/12/5 Raghuveer Kancherla > 2 ways I can think of ... > > - ExtractingRequestHandler (this is what I am guessing you are using now) > > Set extractOnly=true while making a request to the extractingRequestHandler > and get the parsed content back. Now make a post request on update request > handler with what ever fields and field values you want. > > > - Use HTMLStripWhiteSpaceTokenizer factory. This article may be helpful > to explain what I mean. > > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripWhitespaceTokenizerFactory > . > > > > - Raghu > > > > On Sat, Dec 5, 2009 at 3:44 AM, khalid y wrote: > > > Hi, > > > > I have a problem with solr. I'm indexing some html content and solr crash > > because my id field is multivalued. > > I found that Tika read the html and extract metadata like > content="12"> from my htmls but my documents has an already an id setted > by > > literal.id=10. > > > > I tried to map the id from Tika by fmap.id=ignored_ but it ignore also > my > > literal.id > > > > I'm using solr 1.4 and tika 0.5 > > > > Someone can explain to me how I can ignore this the Tika id metadata ?? > > > > Thanks > > >
Re: WELCOME to solr-user@lucene.apache.org
2 ways I can think of ... - ExtractingRequestHandler (this is what I am guessing you are using now) Set extractOnly=true while making a request to the extractingRequestHandler and get the parsed content back. Now make a post request on update request handler with what ever fields and field values you want. - Use HTMLStripWhiteSpaceTokenizer factory. This article may be helpful to explain what I mean. http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripWhitespaceTokenizerFactory. - Raghu On Sat, Dec 5, 2009 at 3:44 AM, khalid y wrote: > Hi, > > I have a problem with solr. I'm indexing some html content and solr crash > because my id field is multivalued. > I found that Tika read the html and extract metadata like content="12"> from my htmls but my documents has an already an id setted by > literal.id=10. > > I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my > literal.id > > I'm using solr 1.4 and tika 0.5 > > Someone can explain to me how I can ignore this the Tika id metadata ?? > > Thanks >
Re: WELCOME to solr-user@lucene.apache.org
Hi, I have a problem with solr. I'm indexing some html content and solr crash because my id field is multivalued. I found that Tika read the html and extract metadata like from my htmls but my documents has an already an id setted by literal.id=10. I tried to map the id from Tika by fmap.id=ignored_ but it ignore also my literal.id I'm using solr 1.4 and tika 0.5 Someone can explain to me how I can ignore this the Tika id metadata ?? Thanks
Re: WELCOME to solr-user@lucene.apache.org
There are some instructions about integrating Nutch with Solr here: http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html Joakim "Otis Gospodnetic" <[EMAIL PROTECTED]> kirjoitti 9.1.2008: > Nutch and Solr work nice in tandem. We've used Nutch for its distributed > fetching + parsing and related functionality and have used Solr to indexed > the resulting text. What glued them together was Solrj, actually. > > Otis > > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > - Original Message > From: Jan Buelens <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Tuesday, January 8, 2008 3:37:12 AM > Subject: Re: WELCOME to solr-user@lucene.apache.org > > Hi, > > We are currently using Solr as search engine. > To add an existing website to our search engine, we are investigating > Nutch. > > Does anyone have more information / experience about an integration > between > Solr and Nutch? > > Thanks in advance ! > > > Best regards, > Jan > > >
Re: WELCOME to solr-user@lucene.apache.org
Nutch and Solr work nice in tandem. We've used Nutch for its distributed fetching + parsing and related functionality and have used Solr to indexed the resulting text. What glued them together was Solrj, actually. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Jan Buelens <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Tuesday, January 8, 2008 3:37:12 AM Subject: Re: WELCOME to solr-user@lucene.apache.org Hi, We are currently using Solr as search engine. To add an existing website to our search engine, we are investigating Nutch. Does anyone have more information / experience about an integration between Solr and Nutch? Thanks in advance ! Best regards, Jan
Re: WELCOME to solr-user@lucene.apache.org
currently two approaches: http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html and: https://issues.apache.org/jira/browse/NUTCH-442 I have had experience with the former... you may have more luck on the nutch-user list for help ryan Jan Buelens wrote: Hi, We are currently using Solr as search engine. To add an existing website to our search engine, we are investigating Nutch. Does anyone have more information / experience about an integration between Solr and Nutch? Thanks in advance ! Best regards, Jan
Re: WELCOME to solr-user@lucene.apache.org
Hi, We are currently using Solr as search engine. To add an existing website to our search engine, we are investigating Nutch. Does anyone have more information / experience about an integration between Solr and Nutch? Thanks in advance ! Best regards, Jan