Re: solr 1872

2012-07-30 Thread Rob Casson
looks like it might actually be a zip file.  try renaming/unzipping it.

cheers,
rob

On Mon, Jul 30, 2012 at 2:50 PM, Sujatha Arun  wrote:
> I am uable to use the rar file from the site
> https://issues.apache.org/jira/browse/SOLR-1872.
>
> When I try to open it,I get the message 'SolrACLSecurity.rar is not RAR
> archive.
>
> Is the file there at this link?
>
> Regards
> Sujatha


Re: Two unrelated questions

2011-09-21 Thread Rob Casson
for #1, i don't use DIH, but is there any possibility of that column
having duplicate keys, with subsequent docs replacing existing ones?

and for #2, for some cases you could use a negative filterquery:

 
http://wiki.apache.org/solr/SimpleFacetParameters#Retrieve_docs_with_facets_missing

so instead of that "fq=-facetField:[* TO *]", something like
"fq=-car_make:Taurus".  picking "negatives" might even make the UI a
bit easier.

anyway, just some thoughts.  cheers,
rob

On Wed, Sep 21, 2011 at 5:17 PM, Olson, Ron  wrote:
> Thanks for the reply. As far as #1, my table that I'm indexing via DIH has a 
> PK field, generated by a sequence, so there are records with ID of 1, 2, 3, 
> etc. That same id is the one I use in my unique id field in the document 
> (ID).
>
> I've noticed that the table has, say, 10 rows. My index only has 8. I don't 
> know why that is, but I'd like to figure out which records are missing and 
> add them (and hopefully understand why they weren't added in the first 
> place). I was just wondering if there was some way to compare the two as part 
> of a sql query, but on reflection, it does seem like an absurd request, so I 
> apologize; I think what I'll have to do is write a solrj program that gets 
> every ID in the table, then does a search on that ID in the index, and add 
> the ones that are missing.
>
> Regarding the second item, yes, it's crazy but I'm not sure what to do; there 
> really are that many options and some searches will be extremely specific, 
> yet broad enough in terms for this to be a problem.
>
> -Original Message-
> From: Erick Erickson [mailto:erickerick...@gmail.com]
> Sent: Wednesday, September 21, 2011 3:55 PM
> To: solr-user@lucene.apache.org
> Subject: Re: Two unrelated questions
>
> for <1> I don't quite get what you're driving at. Your DIH
> query assigns the uniqueKey, it's not like it's something
> auto-generated. Perhaps a concrete example would
> help.
>
> <2> There's a limit you can adjust that defaults to
> 1024 (maxBooleanClauses in solrconfig.xml). You can
>  bump this very high, but you're right, if anyone actually
> does something absurd it'll slow *that* query down. But
> just bumping this query higher won't change performance
> absent someone actually putting a ton of items in it...
>
> Best
> Erick
>
> On Mon, Sep 19, 2011 at 9:12 AM, Olson, Ron  wrote:
>> Hi all-
>>
>> I'm not sure if I should break this out into two separate questions to the 
>> list for searching purposes, or if one is more acceptable (don't want to 
>> flood).
>>
>> I have two (hopefully) straightforward questions:
>>
>> 1. Is it possible to expose the unique ID of a document to a DIH query? The 
>> reason I want to do this is because I use the unique ID of the row in the 
>> table as the unique ID of the Lucene document, but I've noticed that the 
>> counts of documents doesn't match the count in the table; I'd like to add 
>> these rows and was hoping to avoid writing a custom SolrJ app to do it.
>>
>> 2. Is there any limit to the number of conditions in a Boolean search? We're 
>> working on a new project where the user can choose either, for example, 
>> "Ford Vehicles", in which case I can simply search for "Ford", but if the 
>> user chooses specific makes and models, then I have to say something like 
>> "Crown Vic OR Focus OR Taurus OR F-150", etc., where they could 
>> theoretically choose every model of Ford ever made except one. This could 
>> lead to a *very* large query, and was worried both that it was even 
>> possible, but also the impact on performance.
>>
>>
>> Thanks, and I apologize if this really should be two separate messages.
>>
>> Ron
>>
>> DISCLAIMER: This electronic message, including any attachments, files or 
>> documents, is intended only for the addressee and may contain CONFIDENTIAL, 
>> PROPRIETARY or LEGALLY PRIVILEGED information.  If you are not the intended 
>> recipient, you are hereby notified that any use, disclosure, copying or 
>> distribution of this message or any of the information included in or with 
>> it is  unauthorized and strictly prohibited.  If you have received this 
>> message in error, please notify the sender immediately by reply e-mail and 
>> permanently delete and destroy this message and its attachments, along with 
>> any copies thereof. This message does not create any contractual obligation 
>> on behalf of the sender or Law Bulletin Publishing Company.
>> Thank you.
>>
>
>
> DISCLAIMER: This electronic message, including any attachments, files or 
> documents, is intended only for the addressee and may contain CONFIDENTIAL, 
> PROPRIETARY or LEGALLY PRIVILEGED information.  If you are not the intended 
> recipient, you are hereby notified that any use, disclosure, copying or 
> distribution of this message or any of the information included in or with it 
> is  unauthorized and strictly prohibited.  If you have received this message 
> in error, please notify the sender immediately by reply

Re: synonyms.txt: different results on admin and on site..

2011-09-07 Thread Rob Casson
you should probably post your schema.xml and some parts of your
synonyms.txt.  it could be differences between your index and query
analysis chains, synonym expansion errors, etc, but folks will likely
need more details to help you out.

cheers,
rob

On Wed, Sep 7, 2011 at 9:46 PM, deniz  wrote:
> could it be related with analysis issue about synonyms once again?
>
>
>
> -
> Zeki ama calismiyor... Calissa yapar...
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/synonyms-txt-different-results-on-admin-and-on-site-tp3318338p3318464.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: Solr warming when using master/slave replication

2011-08-29 Thread Rob Casson
it's always been my understanding that the caches are discarded, then
rebuilt/warmed:

 
http://wiki.apache.org/solr/SolrCaching#Caching_and_Distribution.2BAC8-Replication

hth,
rob

On Mon, Aug 29, 2011 at 5:30 PM, Mike Austin  wrote:
> How does warming work when a collection is being distributed to a slave.  I
> understand that a temp directory is created and it is eventually copied to
> the live folder, but what happens to the cache that was built in with the
> old index?  Does the cache get rebuilt, can we warm it before it becomes
> live, or can we keep the old cache?
>
> Thanks,
> Mike
>


Re: Exact matching on names?

2011-08-16 Thread Rob Casson
"exact" can mean a lot of things (do diacritics count?, etc), but in
this case, it sounds like you just need to turn off the stemmer you
have on this fieldtype (or create a new one that doesn't include the
stemmer).

hth,
rob

On Tue, Aug 16, 2011 at 11:20 AM, Olson, Ron  wrote:
> Hi all-
>
> I'm missing something fundamental yet I've been unable to find the definitive 
> answer for exact name matching. I'm indexing names using the standard "text" 
> field type and my search is for the name "clarke". My results include 
> "clark", which is incorrect, it needs to match clarke exactly (case 
> insensitive).
>
> I tried textType but that doesn't work because I believe it needs to be 
> *really* exact, whereas I'm looking for things like "clark oil", "bob, frank, 
> and clark", etc.
>
> Thanks for any help,
>
> Ron
>
> DISCLAIMER: This electronic message, including any attachments, files or 
> documents, is intended only for the addressee and may contain CONFIDENTIAL, 
> PROPRIETARY or LEGALLY PRIVILEGED information.  If you are not the intended 
> recipient, you are hereby notified that any use, disclosure, copying or 
> distribution of this message or any of the information included in or with it 
> is  unauthorized and strictly prohibited.  If you have received this message 
> in error, please notify the sender immediately by reply e-mail and 
> permanently delete and destroy this message and its attachments, along with 
> any copies thereof. This message does not create any contractual obligation 
> on behalf of the sender or Law Bulletin Publishing Company.
> Thank you.
>


Re: Searching for strings

2011-07-18 Thread Rob Casson
chip,

gonna need more information about your particular analysis chain,
content, and example searches to give a better answer, but phrase
queries (using quotes) are supported in both the standard and dismax
query parsers

that being said, lots of things may not match a person's idea of an
exact string...stopwords, synonyms, slop, etc.

cheers,
rob

On Mon, Jul 18, 2011 at 5:25 PM, Chip Calhoun  wrote:
> Is there a way to search for a specific string using Solr, either by putting 
> it in quotes or by some other means?  I haven't been able to do this, but I 
> may be missing something.
>
> Thanks,
> Chip
>


Re: Can I invert the inverted index?

2011-07-05 Thread Rob Casson
sounds like the Luke request handler will get what you're after:

 http://wiki.apache.org/solr/LukeRequestHandler
 http://wiki.apache.org/solr/LukeRequestHandler#id

cheers,
rob

On Tue, Jul 5, 2011 at 3:59 PM, Gabriele Kahlout
 wrote:
> Hello,
>
> With an inverted index the term is the key, and the documents are the
> values. Is it still however possible that given a document id I get the
> terms indexed for that document?
>
> --
> Regards,
> K. Gabriele
>
> --- unchanged since 20/9/10 ---
> P.S. If the subject contains "[LON]" or the addressee acknowledges the
> receipt within 48 hours then I don't resend the email.
> subject(this) ∈ L(LON*) ∨ ∃x. (x ∈ MyInbox ∧ Acknowledges(x, this) ∧ time(x)
> < Now + 48h) ⇒ ¬resend(I, this).
>
> If an email is sent by a sender that is not a trusted contact or the email
> does not contain a valid code then the email is not received. A valid code
> starts with a hyphen and ends with "X".
> ∀x. x ∈ MyInbox ⇒ from(x) ∈ MySafeSenderList ∨ (∃y. y ∈ subject(x) ∧ y ∈
> L(-[a-z]+[0-9]X)).
>


Re: [Handling] empty fields

2011-06-15 Thread Rob Casson
i also thought of the lengthFilter stuff, provided it's a
text/KeywordTokenizer field:

 
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.LengthFilterFactory

cheers,
rob

On Wed, Jun 15, 2011 at 12:00 PM, Erick Erickson
 wrote:
> Have you tried setting 'facet.missing="false" '?
>
> See:
> http://wiki.apache.org/solr/SimpleFacetParameters#facet.missing
>
> Best
> Erick
>
> On Wed, Jun 15, 2011 at 11:52 AM, Adam Estrada
>  wrote:
>> All,
>>
>> I have a field "foo" with several thousand blank or non-existing records in
>> it. This is also my faceting field. My question is, how can I deal with this
>> field so that I don't get a blank facet at query time?
>>
>> 5000
>> vs.
>> 1000
>>
>> Adam
>>
>


Re: indexing numbers

2011-05-25 Thread Rob Casson
the default schema.xml provided in the Solr distribution is
well-documented, and a good place to get started (including numeric
fieldTypes):

 http://wiki.apache.org/solr/SchemaXml

Lucid Imagination also provides a nice reference guide:

 
http://www.lucidimagination.com/Downloads/LucidWorks-for-Solr/Reference-Guide

hope that helps,
rob

On Wed, May 25, 2011 at 6:20 PM, antoniosi  wrote:
> Hi,
>
> How does solr index a numeric value? Does it index it as a string or does it
> keep it as a numeric value?
>
> Thanks.
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/indexing-numbers-tp2986424p2986424.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: aliasing?

2011-05-09 Thread Rob Casson
a lot of this stuff is covered in the tutorial, and expanded in the
wiki. still the best places to start in figuring out the fundamentals:

 http://lucene.apache.org/solr/tutorial.html
 
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory

hth,
rc

On Mon, May 9, 2011 at 9:09 PM, deniz  wrote:
> well... if i knew what to do about aliasing, i wouldnt post my question here
> Grijesh :) My idea is this: for some search queries, I need to provide some
> synonyms...
> But it is just an idea...
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/aliasing-tp2917733p2921305.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: *:* query with dismax

2011-05-06 Thread Rob Casson
it does seem a little weird, but q.alt will get what you want:

  http://wiki.apache.org/solr/DisMaxQParserPlugin#q.alt

hth,
rc

On Fri, May 6, 2011 at 7:41 PM, Jason Chaffee  wrote:
> Can you shed some light on what you did to configure it to handle *:*?
> I have the same issue that I need it to work for faceting, but I do need
> the dismax abilities as well.
>
> -Original Message-
> From: Mark Mandel [mailto:mark.man...@gmail.com]
> Sent: Friday, May 06, 2011 4:30 PM
> To: solr-user@lucene.apache.org
> Subject: Re: *:* query with dismax
>
> This is exactly what should be happening, as the dismax parser doesn't
> understand regular query syntax (and for good reason too). This tripped
> me
> up as well when I first started using dismax.
>
> Solution for me was to comfigure the handler to use *:* when the query
> is
> empty, so that you can still get back a full result set if you need it,
> say
> for faceting.
>
> HTH
>
> Mark
> On May 7, 2011 9:22 AM, "Jason Chaffee"  wrote:
>> I am using dismax and trying to use q=*:* to return all indexed
>> documents. However, it is always returning 0 found.
>>
>>
>>
>> If I used the default select (not dismax) handler and try q=*:* then
> it
>> returns all documents.
>>
>>
>>
>> There is nothing in the logs to indicate why this happening.
>>
>>
>>
>> Does anyone have any clues?
>>
>>
>>
>> Thanks,
>>
>>
>>
>> Jason
>>
>


Re: querying in Java

2011-04-29 Thread Rob Casson
copyField should do the trick:

 http://wiki.apache.org/solr/SchemaXml#Copy_Fields

"A common requirement is to copy or merge all input fields into a
single solr field. This can be done as follows:-

 "

hth,
rob

On Fri, Apr 29, 2011 at 2:06 PM, Saler, Jeff  wrote:
> Thanks for the reply.  What I want is for the query to search all fields
> for the specified value.
>
> -Original Message-
> From: Anuj Kumar [mailto:anujs...@gmail.com]
> Sent: Friday, April 29, 2011 1:51 PM
> To: solr-user@lucene.apache.org
> Subject: Re: querying in Java
>
> Hi Jeff,
>
> In that case, it will query w.r.t default field. What is your default
> search
> field in the schema?
>
> Regards,
> Anuj
>
> On Fri, Apr 29, 2011 at 11:10 PM, Saler, Jeff  wrote:
>
>> Is there any way to query for data that is in any field, i.e. not
> using
>> a specific field name?
>>
>>
>>
>> For example, when I use the following statements:
>>
>>
>>
>>                SolrQuery  query = new SolrQuery();
>>
>>                Query.setQuery("ANALYST:John Schummers");
>>
>>      QueryResponse  rsp = server.query(query);
>>
>>
>>
>>
>>
>> I get the documents I'm looking for.
>>
>>
>>
>> But I would like to get the same set of documents without using the
>> specific ANALYST field name.
>>
>> I have tried using just "Schummers" as the query, but no documents are
>> returned.
>>
>> The ANALYST field is an indexed field.
>>
>>
>>
>>
>>
>>
>> This message and any enclosures are intended only for the addressee.
>>  Please
>> notify the sender by email if you are not the intended recipient.  If
> you
>> are
>> not the intended recipient, you may not use, copy, disclose, or
> distribute
>> this
>> message or its contents or enclosures to any other person and any such
>> actions
>> may be unlawful.  Ball reserves the right to monitor and review all
>> messages
>> and enclosures sent to or from this email address.
>
>
>
> This message and any enclosures are intended only for the addressee.  Please
> notify the sender by email if you are not the intended recipient.  If you are
> not the intended recipient, you may not use, copy, disclose, or distribute 
> this
> message or its contents or enclosures to any other person and any such actions
> may be unlawful.  Ball reserves the right to monitor and review all messages
> and enclosures sent to or from this email address.
>


Re: Result order when score is the same

2011-04-13 Thread Rob Casson
you could just explicitly send multiple sorts...from the tutorial:

 &sort=inStock asc, price desc

cheers.

On Wed, Apr 13, 2011 at 2:59 PM, kenf_nc  wrote:
> Is sort order when 'score' is the same a Lucene thing? Should I ask on the
> Lucene forum?
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Result-order-when-score-is-the-same-tp2816127p2817330.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: Facet Query Question

2011-02-27 Thread Rob Casson
if i'm understanding your question, it sounds like
localparams/tagging/exclusion might be what you want:

 
http://wiki.apache.org/solr/SimpleFacetParameters#Multi-Select_Faceting_and_LocalParams

hth,
rob

On Sun, Feb 27, 2011 at 6:41 PM, Tim Christensen
 wrote:
> Hi,
>
> I am trying to do the following:
>
> Where a query might return:
>
> Facet 1
> A
> B
> C
>
> Facet 2
> X
> Y
> Z
>
> User selects Facet 1 option A. Normally this paradigm would contract the 
> results as in a refining paradigm. That would be fine and the obvious UI 
> change. But by doing so, Facet 2 option X is no longer available -- again 
> because of the refining. Let's say I still wanted Facet 2 option X to be 
> available to instead of refining, expands the results.
>
> Normally, my query might look like:
>
> q=query&fq=Facet 1:A (for the first part of my question. What I have done is 
> return two sets of facet results, one for the main query and one for the 
> refined query. That way I can still offer option X. What I don't know how to 
> do is query beyond that. I have tried some ORs and ANDs  in my unit tests, 
> but don't think this is the right way.
>
> My question is whether there is a way in a single query to bring back all the 
> original facets regardless of any facet refining. If not, give that I return 
> two sets of facets - a refined set and the 'original' querys' facet set, how 
> would I fashion this query?
>
> My apologies if this is rookie, I have a few years of Solr under my belt, but 
> can't think outside the refining and then expanding the result set with a 
> facet query that was available in the original query results.
>
> Thank you,
>
> Tim Christensen
>
>
>
>
>
>
>


Re: Copying the index from one solr instance to another

2010-12-15 Thread Rob Casson
just making sure that you're aware of the built-in replication:

 http://wiki.apache.org/solr/SolrReplication

can pull the indexes, along with config files.

cheers,
rob

2010/12/15 Robert Gründler :
> Hi again,
>
> let's say you have 2 solr Instances, which have both exactly the same 
> configuration (schema, solrconfig, etc).
>
> Could it cause any troubles if we import an index from a SQL database on solr 
> instance A, and copy the whole
> index to the datadir of solr instance B (both solr instances run on different 
> servers) ?.
>
> As far as i can tell, this should work and solr instance B should have the 
> exact same index as solr instance A after the copy-process.
>
> Do we miss something, or is this workflow safe to go with?
>
> -robert


Re: command line to check if Solr is up running

2010-10-25 Thread Rob Casson
you could look at the ping stuff:

 http://wiki.apache.org/solr/SolrConfigXml#The_Admin.2BAC8-GUI_Section

cheers,
rob

On Mon, Oct 25, 2010 at 3:56 PM, Xin Li  wrote:
> As we know we can use browser to check if Solr is running by going to 
> http://$hostName:$portNumber/$masterName/admin, say 
> http://localhost:8080/solr1/admin. My questions is: are there any ways to 
> check it using command line? I used "curl http://localhost:8080"; to check my 
> Tomcat, it worked fine. However, no response if I try "curl 
> http://localhost:8080/solr1/admin"; (even when my Solr is running). Does 
> anyone know any command line alternatives?
>
> Thanks,
> Xin
> This electronic mail message contains information that (a) is or
> may be CONFIDENTIAL, PROPRIETARY IN NATURE, OR OTHERWISE
> PROTECTED
> BY LAW FROM DISCLOSURE, and (b) is intended only for the use of
> the addressee(s) named herein.  If you are not an intended
> recipient, please contact the sender immediately and take the
> steps necessary to delete the message completely from your
> computer system.
>
> Not Intended as a Substitute for a Writing: Notwithstanding the
> Uniform Electronic Transaction Act or any other law of similar
> effect, absent an express statement to the contrary, this e-mail
> message, its contents, and any attachments hereto are not
> intended
> to represent an offer or acceptance to enter into a contract and
> are not otherwise intended to bind this sender,
> barnesandnoble.com
> llc, barnesandnoble.com inc. or any other person or entity.
>


Re: Search Results optimization

2010-08-26 Thread Rob Casson
you might find these helpful...similar question came up last week:

 http://ln-s.net/7WpX
 
http://robotlibrarian.billdueber.com/solr-forcing-items-with-all-query-terms-to-the-top-of-a-solr-search/

not exactly the same, as this case wanted to boost if *every* term
matched, but a similar tactic may workhope that helps,
rob

On Thu, Aug 26, 2010 at 4:02 PM, Hasnain  wrote:
>
> perhaps i wasnt clear in my earlier post
>
> if user searches for "swingline red stapler hammer hand rigid", then
> documents that matches max number of words written in query should come
> first
> e.g a document with name field as "swingline stapler" should come later than
> the document with "swingline red stapler"
>
> any suggestions how to achieve this functionality?
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Search-Results-optimization-tp1129374p1359916.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: spellcheck suggestion ordering and distance measures

2010-03-30 Thread Rob Casson
> 2) i'm not entirely clear on the order in which suggestions are
> returned. for a search of "porgrams" against a subset of my data, i
> get these suggestions...i've added my own levenshtein calculations,
> provided they're correct:
>
> program: 3
> pulgram: 3
> porras: 2
> portrait: 3

i should have included the relevant section of the response:


-

-

5
0
8
0
-

-

program
271

-

pulgram
1

-

porras
1

-

portrait
74

-

nomogram
1



false




erikhatcher might be pulling-my-head-out-of-my-ass off-list, fwiw ;)

cheers, and thanks all,
rc


spellcheck suggestion ordering and distance measures

2010-03-30 Thread Rob Casson
poking around at the spellcheck component, and have a couple questions:

1) is there a way to return the distance measure with
spellcheck.extendedResults?  haven't poked too closely at the source,
but it might be useful.

2) i'm not entirely clear on the order in which suggestions are
returned. for a search of "porgrams" against a subset of my data, i
get these suggestions...i've added my own levenshtein calculations,
provided they're correct:

 program: 3
 pulgram: 3
 porras: 2
 portrait: 3

relevant section of my solrconfig.xml:

text


  default
  spell
  ./spellchecker



granted, i like that 'program' is at the top, and it has the highest
frequency in my corpus, but just want to make sure i can reliably
interpret these results.

cheers, and apologies if i'm just being dense,
rob


Re: filter querying working on dynamic int fields but not dynamic string fields?

2010-01-20 Thread Rob Casson
> http://localhost:8983/solr/select?indent=on&version=2.2&q=climate&fq=awardinstrument_s:Continuing+grant
> Continuing grant 

everything that erik already mentioned, but looks like you also have a
trailing space in the document, so even quoting it would require that
last space.


Re: case-insensitive string type

2010-01-13 Thread Rob Casson
from http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters

"On wildcard and fuzzy searches, no text analysis is performed on
the search word."

i'd just lowercase the wildcard-ed search term in your client code,
before you send it to solr.

hth,
rob

On Wed, Jan 13, 2010 at 2:18 PM, Harsch, Timothy J. (ARC-TI)[PEROT
SYSTEMS]  wrote:
> Hi I have a field:
>
>  multiValued="true" />
>
> With type definition:
>                
>                                        sortMissingLast="true" omitNorms="true">
>                        
>                                 class="solr.KeywordTokenizerFactory"/>
>                                
>                        
>                        
>                                 class="solr.KeywordTokenizerFactory"/>
>                                
>                        
>                
>
> When searching that field I can't get a case-insensitive match.  It works as 
> if it is a regular string, for instance I can do a prefix query and so long 
> as the prefix matches the case of the value it works, but if I change the 
> prefix case it doesn't
>
> Essentially I am trying to get case-insensitive matching that supports wild 
> cards...
>
> Tim Harsch
> Sr. Software Engineer
> Dell Perot Systems
> (650) 604-0374
>
>


Re: query ( dynamicField ) in solr

2009-12-22 Thread Rob Casson
sounds like a job for copyField:

 http://wiki.apache.org/solr/FAQ#How_do_I_use_copyField_with_wildcards.3F

add sku_defaultPriceAll to your  and then:



...query on sku_defaultPriceAll.

hth,
rob

On Tue, Dec 22, 2009 at 3:36 PM, Gustavo  wrote:
> Hello All,
>
>  I have been trying to make a query at a dynamicField.
> for example:
> i have a product that can have many sku's. So at my xsd i have
>  stored="true" />
>  stored="true" />
>                                                    .
>                                                    .
>                                                    .
> when a make a query:
> http://localhost:8983/solr/productIdx/select/?q=sku_defaultPrice_2:valueOfThePrice
> or
> http://localhost:8983/solr/productIdx/select/?q=sku_defaultPrice_1:valueOfThePriceor
> sku_defaultPrice_2:valueOfThePrice or
> sku_defaultPrice_3:valueOfThePrice
> it works !!
>
> but i can have many sku_defaultPrice_*
> but i have no idea in how to make the query with this kind of field
> thanks for helping
>
> --
> --
> Atenciosamente,
>
> Gustavo de Araujo Lima Machado
> Telefone: +55 +21 ***
> Celular: +55+21 99034401
> E-Mail: g.macha...@gmail.com.br
>
> Residência
> :
>


Re: How to get Solr 1.4 to replicate spellcheck directories as well?

2009-12-16 Thread Rob Casson
i don't think that's currently supported, but sure others will correct
me if i'm wrong:


http://www.lucidimagination.com/search/document/ac8cf41bdb761069/solr_replication_and_spellcheck_data

cheers,
rob

On Wed, Dec 16, 2009 at 10:08 AM, michael8  wrote:
>
> I'm currently using Solr 1.4 with its built-in solr.ReplicationHandler
> enabled in solrconfig.xml for a master and slave as follows:
>
>  
>    
>      ${enable.master:false}
>      commit
>      startup
>       name="confFiles">schema.xml,protwords.txt,spellings.txt,stopwords.txt,synonyms.txt
>    
>    
>      ${enable.slave:false}
>       name="masterUrl">http://searchhost:8983/solr/items/replication
>      00:00:60
>    
>  
>
> Everything in the index is replicated perfectly except that my spellcheck
> directories are not being replicated.  Here is my spellcheck config in
> solrconfig.xml:
>
>  
>    textSpell
>    
>      default
>      spell
>      ./spellchecker1
>      false
>
>    
>    
>      jarowinkler
>      spell
>      
>       name="distanceMeasure">org.apache.lucene.search.spell.JaroWinklerDistance
>      ./spellchecker2
>      false
>
>    
>
>    
>      solr.FileBasedSpellChecker
>      file
>      spellings.txt
>      UTF-8
>      ./spellcheckerFile
>      false
>    
>  
>
> I have set the buildOnCommit to 'false', but instead have a separate cron to
> build my spellcheck dictionaries on a nightly basis.
>
> Is there a way to tell Solr to also replicate the spellcheck files too?  Is
> my setting 'buildOnCommit' to 'false' causing my spellcheck files to not
> replicate?  I would think after the nightly build is triggered and done (via
> cron) that the spellcheck files would be replicated by that is not the case.
>
> Thanks for any help or info.
>
> Michael
>
> --
> View this message in context: 
> http://old.nabble.com/How-to-get-Solr-1.4-to-replicate-spellcheck-directories-as-well--tp26812569p26812569.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>


Re: Upgrading from 1.2.0 to 1.3.0

2009-05-07 Thread Rob Casson
this isn't advice on how to upgrade, but if you/your-project have a
bit of time to wait, 1.4 sounds like it's getting close to an official
releasefyi.

cheers,
rob


On Tue, May 5, 2009 at 1:05 PM, Francis Yakin  wrote:
>
> What's the best way to upgrade solr from 1.2.0 to 1.3.0 ?
>
> We have the current index that our users search running on 1.2.0 Solr version.
>
> We would like to upgrade it to 1.3.0?
>
> We have Master/Slaves env.
>
> What's the best way to upgrade it without affecting the search? Do we need to 
> do it on master or slaves first?
>
>
>
> Thanks
>
> Francis
>
>
>


Re: Filter query with wildcard, fq=a*

2009-04-30 Thread Rob Casson
it sounds to me like the field you're using (artistText) is tokenized
and lowercased.  it might be good to go over  the wiki pages again:

 http://wiki.apache.org/solr/SolrFacetingOverview

if you keep having problems, post your schema...cheers,
rob

On Thu, Apr 30, 2009 at 10:21 AM, Andrew McCombe  wrote:
> Hi
>
> It has now introduced a new issue.  I am now getting results for my
> query but they are not what I wanted.  I'm using a Dismax handler and
> am trying to filter a result set with a filter query:
>
> http://localhost:8180/solr/select?indent=on&version=2.2&q=i+love&start=0&rows=30&fl=*%2Cscore&qt=dismax&wt=standard&fq=artistText:e*
>
> This is bringing results with 'e' or 'E' as the first letter and with
> 'e' or 'E' as the first letter after whitespace.  e.g, Eric Prydz,
> Marc Et Claude.
>
> Looking at the solr wiki it seems that I am using the fq param to
> boost the results and not filter them.  Is there a way to achieve the
> required result, i.e, filtering the result set for all artistText
> beginning with 'E'?
>
> Regards
> Andrew
>
>
>
> 2009/4/30 Andrew McCombe 
>>
>> Hi
>>
>> I've sorted it.  Wildcard term must be lowercase to get results.
>>
>> Thanks
>> Andrew
>>
>> 2009/4/30 Andrew McCombe 
>>>
>>> Hi
>>>
>>> I have half a million records indexed and need to filter results on a term 
>>> by the first letter.  For Example the search term is 'I love' and returns a 
>>> few thousand records.  I need to filter these results for all artists 
>>> beginning with 'A'.
>>>
>>> I've tried:
>>> 'fq=artistText:A*'
>>>
>>> But then get no results.
>>>
>>> Can someone point me in the right direction please?
>>>
>>> Regards
>>> Andrew
>>
>


Re: new faceting algorithm

2008-12-05 Thread Rob Casson
very similar situation to those already reported.  2.9M bilbiographic
records, with authors being the (previous) bottleneck, and the one
we're starting to test with the new algorithm.

so far, no load tests, but just in single requests i'm seeing the same
improvements...phenomenal improvements, btw, with most example queries
taking less than 1/100th of the time

always very impressed with this project/product, and just thought i'd
add a "me-too" to the list...cheers, and have a great weekend,

rob

On Mon, Nov 24, 2008 at 11:12 PM, Yonik Seeley <[EMAIL PROTECTED]> wrote:
> A new faceting algorithm has been committed to the development version
> of Solr, and should be available in the next nightly test build (will
> be dated 11-25).  This change should generally improve field faceting
> where the field has many unique values but relatively few values per
> document.  This new algorithm is now the default for multi-valued
> fields (including tokenized fields) so you shouldn't have to do
> anything to enable it.  We'd love some feedback on how it works to
> ensure that it actually is a win for the majority and should be the
> default.
>
> -Yonik
>


Re: Compiling Solr 1.3.0 + KStem

2008-12-03 Thread Rob Casson
i've experimented with the KStem stuff in the past, and just pulled a
fresh copy of solr from trunk

it looks like Hoss' suggestion #1 does the trick, by simply commenting
out the super.init call...loaded the example data, tested some
analysis, and it seems to work as before.

just a confirmation, and thanks,
rob

On Fri, Nov 28, 2008 at 6:18 PM, Chris Hostetter
<[EMAIL PROTECTED]> wrote:
>
> : /usr/local/build/apache-solr-1.3.0/src/java/org/apache/solr/analysis/
> : KStemFilterFactory.java:63:
> : cannot find symbol
> : [javac] symbol  : method
> : init(org.apache
> : .solr.core.SolrConfig,java.util.Map)
> : [javac] location: class org.apache.solr.analysis.BaseTokenFilterFactory
> : [javac] super.init(solrConfig, args);
> : [javac]  ^
>
> that KStemFilterFactory seems to be trying to use a method that existed
> for a while on the trunk, but was never released.
>
> i'm not familiary with KStemFilterFactory to know why/if it needs a
> SolrConfig, but a few things you can try...
>
> 1) if there are no references to solrConfig anywhere except the init
> method (and the super.init method it calls) just remove the refrences to
> it (so the methods just deal with the Map)
>
> 2) if there are other refrences to the solrConfig, they *may* just be to
> take advantage of ResourceLoader methods, so after making the changes
> above, make KStemFilterFactory "implements ResourceLoaderAware" and then
> add a method like this...
>
>  public void inform(ResourceLoader loader) {
>// code that used solrConfig should go here, but use loader
>  }
>
> ...it will get called after the init(Map) method and let
> KStemmFilterFactory get access to files on disk.
>
> 3) if that doesn't work ... i don't know what else to try (i'd need to get
> a lot more familiar with KStem to guess)
>
>
>
> -Hoss
>
>


Re: [VOTE] Community Logo Preferences

2008-11-24 Thread Rob Casson
https://issues.apache.org/jira/secure/attachment/12394282/solr2_maho_impression.png
https://issues.apache.org/jira/secure/attachment/12394266/apache_solr_b_red.jpg

thanks to everyone who contributed,
rob

On Sun, Nov 23, 2008 at 11:59 AM, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> Please submit your preferences for the solr logo.
>
> For full voting details, see:
>  http://wiki.apache.org/solr/LogoContest#Voting
>
> The eligible logos are:
>  http://people.apache.org/~ryan/solr-logo-options.html
>
> Any and all members of the Solr community are encouraged to reply to this
> thread and list (up to) 5 ranked choices by listing the Jira attachment
> URLs. Votes will be assigned a point value based on rank. For each vote, 1st
> choice has a point value of 5, 5th place has a point value of 1, and all
> others follow a similar pattern.
>
> https://issues.apache.org/jira/secure/attachment/12345/yourfrstchoice.jpg
> https://issues.apache.org/jira/secure/attachment/34567/yoursecondchoice.jpg
> ...
>
> This poll will be open until Wednesday November 26th, 2008 @ 11:59PM GMT
>
> When the poll is complete, the solr committers will tally the community
> preferences and take a final vote on the logo.
>
> A big thanks to everyone would submitted possible logos -- its great to see
> so many good options.


Re: Simple Searching Question

2008-08-14 Thread Rob Casson
you're likely not copyField-ing *_facet to text, and we'd need to see
what type of field it is to see how it will be analyzed at both
search/index time.

the default schema.xml file is pretty well documented, so you might
want to spend some time looking thru it, and reading the
commentslots of good info in there.

cheers,
rob

On Thu, Aug 14, 2008 at 7:17 PM, Jake Conk <[EMAIL PROTECTED]> wrote:
> Hi Shalin,
>
> "foobar_facet" is a dynamic field. Its defined in my schema like this:
>
> 
>
> I have the default search field set to text. Can I use more than one
> default search field?
>
> text
>
> Thanks,
> - Jake
>
>
> On Thu, Aug 14, 2008 at 2:48 PM, Shalin Shekhar Mangar
> <[EMAIL PROTECTED]> wrote:
>> Hi Jake,
>>
>> What is the type of the foobar_facet field in your schema.xml ?
>> Did you add foobar_facet as the default search field?
>>
>> On Fri, Aug 15, 2008 at 3:13 AM, Jake Conk <[EMAIL PROTECTED]> wrote:
>>
>>> Hello,
>>>
>>> I inserted the following documents into Solr:
>>>
>>>
>>> ---
>>>
>>> 
>>> 
>>>  124
>>>  Jake Conk
>>> 
>>> 
>>>  125
>>>  Jake Conk
>>> 
>>> 
>>>
>>>
>>> ---
>>>
>>> id is the only required integer field.
>>> foobar_facet is a dynamic string field.
>>>
>>> When I try to search for anything with the word Jake in it the
>>> following ways I get no results.
>>>
>>>
>>> select?q=Jake
>>> select?q=Jake*
>>>
>>>
>>> I thought one of those two should work but the only way I got it to
>>> work was by specifying which field "Jake" is in along with a wild
>>> card.
>>>
>>>
>>> select?q=foobar_facet:Jake*
>>>
>>>
>>> 1) Does this mean for each field I would like to search if Jake exists
>>> I would have to add each field like I did above to the query?
>>>
>>> 2) How would I search if I want to find the name Jake anywhere in the
>>> string? The documentation
>>> (http://lucene.apache.org/java/docs/queryparsersyntax.html) states
>>> that I cannot use a wildcard as the first character such as *Jake*
>>>
>>> Thanks,
>>> - Jake
>>>
>>
>>
>>
>> --
>> Regards,
>> Shalin Shekhar Mangar.
>>
>


Re: schema for "literal string, case insensitive"

2008-05-29 Thread Rob Casson
bram,

you'll want to look at the KeywordTokenizerFactory (which doesn't
actually tokenize), and then use the LowerCaseFilterFactory.  the
schema in the example has a fieldType called 'alphaOnlySort' that
should get you started.

cheers,
rob

On Thu, May 29, 2008 at 6:21 AM, Bram de Jong <[EMAIL PROTECTED]> wrote:
> hello all,
>
> a while ago I was looking into making the schema for my (rather rich)
> data set, and I wanted to have a field for username.
> I would need a case insensitive string for that, but literal (no
> tokenizing, ...).
>
> how can I do this within the Solr schema definition?
>
>  - bram
>
> --
> http://freesound.iua.upf.edu
> http://www.smartelectronix.com
> http://www.musicdsp.org
>


Re: How to delete records that don't contain a field?

2007-12-04 Thread Rob Casson
i'm using this:

*:* -[* TO *]

which is what lance suggested..works just fine.

fyi: https://issues.apache.org/jira/browse/SOLR-381

On Dec 3, 2007 8:09 PM, Norskog, Lance <[EMAIL PROTECTED]> wrote:
> Wouldn't this be: *:* AND "negative query"
>
>
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Yonik
> Seeley
> Sent: Monday, December 03, 2007 2:23 PM
> To: solr-user@lucene.apache.org
> Subject: Re: How to delete records that don't contain a field?
>
> On Dec 3, 2007 5:22 PM, Jeff Leedy <[EMAIL PROTECTED]> wrote:
>
> > I was wondering if there was a way to post a delete query using curl
> > to delete all records that do not contain a certain field--something
> > like
> > this:
> >
> > curl http://localhost:8080/solr/update --data-binary
> > '-_title:[* TO *]' -H
> > 'Content-type:text/xml; charset=utf-8'
> >
> > The minus syntax seems to return the correct list of ids (that is, all
>
> > records that do not contain the "_title" field) when I use the Solr
> > administrative console to do the above query, so I'm wondering if Solr
>
> > just doesn't support this type of delete.
>
>
> Not yet... it makes sense to support this in the future though.
>
> -Yonik
>


Re: LowerCaseFilterFactory and spellchecker

2007-11-28 Thread Rob Casson
lance,

thanks for the quick replylooks like 'thorne' is getting added to
the dictionary, as it comes up as a suggestion for 'Thorne'

i could certainly just lowercase in my client, but just confirming
that i'm not just screwing it up in the firstplace :)

thanks again,
rc

On Nov 28, 2007 8:11 PM, Norskog, Lance <[EMAIL PROTECTED]> wrote:
> There are a few parameters for limiting what words are added to the
> dictionary.  You might be trimming out 'thorne'. See this page:
>
> http://wiki.apache.org/solr/SpellCheckerRequestHandler
>
>
> -Original Message-
> From: Rob Casson [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, November 28, 2007 4:25 PM
> To: solr-user@lucene.apache.org
> Subject: LowerCaseFilterFactory and spellchecker
>
> think i'm just doing something wrong...
>
> was experimenting with the spellcheck handler with the nightly checkout
> from 11-28; seems my spellchecking is case-sensitive, even tho i think
> i'm adding the LowerCaseFilterFactory to both the index and query
> analyzers.
>
> here's a brief rundown of my testing steps.
>
> from schema.xml:
>
>  positionIncrementGap="100">
> 
> 
> 
>  class="solr.RemoveDuplicatesTokenFilterFactory"/>
> 
> 
> 
> 
> 
>  class="solr.RemoveDuplicatesTokenFilterFactory"/>
> 
> 
> 
>
>  multiValued="true"/>
>  multiValued="true"/>
>
> 
>
> 
>
> from solrconfig.xml:
>
>  class="solr.SpellCheckerRequestHandler" startup="lazy">
> 
> 1
> 0.5
> 
> spell
> spelling
> 
>
> 
>
> adding the doc:
>
> curl http://localhost:8983/solr/update -H "Content-Type: text/xml"
> --data-binary ' name="title">Thorne'
> curl http://localhost:8983/solr/update -H "Content-Type: text/xml"
> --data-binary ''
>
> 
>
> building the spellchecker:
>
> http://localhost:8983/solr/select/?q=Thorne&qt=spellchecker&cmd=rebuild
>
> 
>
> querying the spellchecker:
>
> results from http://localhost:8983/solr/select/?q=Thorne&qt=spellchecker
>
> 
> 
> 
> 0
> 1
> 
>     Thorne
> false
> 
> thorne
> 
> 
>
> results from http://localhost:8983/solr/select/?q=thorne&qt=spellchecker
>
> 
> 
> 
> 0
> 2
> 
> thorne
> true
> 
> 
>
>
> any pointers as to what i'm doing wrong, misinterpreting?  i suspect i'm
> just doing something bone-headed in the analyzer sections...
>
> thanks as always,
>
> rob casson
> miami university libraries
>


LowerCaseFilterFactory and spellchecker

2007-11-28 Thread Rob Casson
think i'm just doing something wrong...

was experimenting with the spellcheck handler with the nightly
checkout from 11-28; seems my spellchecking is case-sensitive, even
tho i think i'm adding the LowerCaseFilterFactory to both the index
and query analyzers.

here's a brief rundown of my testing steps.

from schema.xml:























from solrconfig.xml:



1
0.5

spell
spelling




adding the doc:

curl http://localhost:8983/solr/update -H "Content-Type: text/xml"
--data-binary 'Thorne'
curl http://localhost:8983/solr/update -H "Content-Type: text/xml"
--data-binary ''



building the spellchecker:

http://localhost:8983/solr/select/?q=Thorne&qt=spellchecker&cmd=rebuild



querying the spellchecker:

results from http://localhost:8983/solr/select/?q=Thorne&qt=spellchecker




0
1

Thorne
false

thorne



results from http://localhost:8983/solr/select/?q=thorne&qt=spellchecker




0
2

thorne
true




any pointers as to what i'm doing wrong, misinterpreting?  i suspect
i'm just doing something bone-headed in the analyzer sections...

thanks as always,

rob casson
miami university libraries


Re: delete by negative query

2007-10-15 Thread Rob Casson
> the work arround is to include *:* in yoru query ...
>*:* -solr
> ... if/when this is fixed
> in Solr that's esentally what solr will do under the covers.
>
> (would you mind opening a bug to track this and mention the work arround
> for other people who encounter it)

will do.

thanks again...adding the *:* to any query (qualified or not) seems to
be working as expected

solr++,
r


Re: delete by negative query

2007-10-15 Thread Rob Casson
piete,

thanks for the quick reply.

> You need to explicitly define the field you are referring to in order to
> achieve this, otherwise the query parser will assume that the minus
> character is part of the query and interpret it as field:"-solr" (where
> "field" is the name of the default field set in your schema).  Try:
>
> curl http://localhost:8983/solr/update --data-binary
> '-field:solr' -H 'Content-type:text/xml;
> charset=utf-8'

does this work for you?:

curl http://localhost:8983/solr/update --data-binary
'-name:solr' -H
'Content-type:text/xml; charset=utf-8'

curl http://localhost:8983/solr/update --data-binary '' -H
'Content-type:text/xml; charset=utf-8'

then, 
http://localhost:8983/solr/select/?q=-name:solr&version=2.2&start=0&rows=10&indent=on

i still get all of those documents returned

> parser will assume that the minus
> character is part of the query and interpret it as field:"-solr" (where
> "field" is the name of the default field set in your schema)

my true use case is to deleteByQuery against my default field, but
even in this case, with the field specified, and using the
exampledocs, i'm not seeing the document get deleted.

thanks again...maybe i should just grab a beer, and go to bed...cheers :)
r


delete by negative query

2007-10-15 Thread Rob Casson
i'm having no luck deleting by a negative query

indexing the example docs from 1.2, these steps work:

curl http://localhost:8983/solr/update --data-binary
'solr' -H 'Content-type:text/xml;
charset=utf-8'

curl http://localhost:8983/solr/update --data-binary '' -H
'Content-type:text/xml; charset=utf-8'

but if i reindex, and change the delete query to a negative, the
non-'solr' docs don't get deleted:

curl http://localhost:8983/solr/update --data-binary
'-solr' -H 'Content-type:text/xml;
charset=utf-8'

curl http://localhost:8983/solr/update --data-binary '' -H
'Content-type:text/xml; charset=utf-8'

good chance i'm missing something obvious

tia,
r