DefaultSearchField ? important

2009-07-15 Thread Jörg Agatz
Hello users, and good morning (in Germany it is morning) :-) I have a really important problem. My field names are really bad, like CUPS_EBENE1_EBENE2_TASKS_CATEGORIE. I have no content field or anything like that, so when I want to search for something, I need to search in ALL fields, but when I

spellcheck with misspelled words in index

2009-07-15 Thread Chris Williams
Hi, I'm having some trouble getting the correct results from the spellcheck component. I'd like to use it to suggest correct product titles on our site; however, some of our products have misspellings in them that are outside of our control. For example, there are 2 products with the misspelled word cusine

Re: Using Multiple fields in UniqueKey

2009-07-15 Thread Fergus McMenemie
Some ideas: - Use copyField to copy fields to the field designated as the uniqueKey (not sure if this will work) - Create the field from existing data before sending docs to Solr - Create a custom UpdateRequestProcessor that adds a field for each document it processes and stuffs it with other

case-sensitive

2009-07-15 Thread Jörg Agatz
I am looking for options to make some or all fields case-sensitive (or key-sensitive). Is there an option for that in the Solr schema.xml? Problem: I search in a field for ri* but the real name is RI-700690. RI* returns results, but sometimes it is NI, and sometimes it is ni or something like that.

Re: case-sensitive

2009-07-15 Thread Avlesh Singh
Even though I could not completely understand your question, my guess is that you want to do case-insensitive searches. If so, you may use a text type for this field, or add the LowerCaseFilterFactory as a filter for this field in your schema.xml. Cheers, Avlesh On Wed, Jul 15, 2009 at 1:42 PM,
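
To see why lowercasing helps here, the following is a minimal Python sketch, not Solr code. It mimics what a lowercase filter does at index time by storing every value in lowercase, and then matches a lowercased prefix against the stored form. The function names and the sample values other than RI-700690 are illustrative assumptions. Whether Solr itself lowercases a wildcard term such as RI* depends on the version and query parser, so a client may need to lowercase the prefix before sending it.

```python
def index_value(value):
    """Mimic a lowercase filter: store the value in lowercase."""
    return value.lower()


def prefix_matches(indexed_values, prefix):
    """Return indexed values that start with the lowercased prefix."""
    needle = prefix.rstrip("*").lower()
    return [v for v in indexed_values if v.startswith(needle)]


# RI-700690 comes from the thread; the other values are made up.
indexed = [index_value(v) for v in ["RI-700690", "ni-123", "Ri-42"]]
matches = prefix_matches(indexed, "RI*")
print(matches)
```

With both sides lowercased, ri* and RI* select the same documents.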

How to filter old revisions

2009-07-15 Thread Reza Safari
Hi, how is it possible to search for max values? E.g. doc1 has revision number 1, doc2 has revision number 2, doc3 has revision number 3; doc1, doc2 and doc3 all have the same root revision id, e.g. 1. I want a search result containing only the docs with the highest revision number. Regards, Reza

Re: How to filter old revisions

2009-07-15 Thread Shalin Shekhar Mangar
On Wed, Jul 15, 2009 at 3:19 PM, Reza Safari r.saf...@lukkien.com wrote: Hi, how is it possible to search for max values? E.g. doc1 has revision number 1, doc2 has revision number 2, doc3 has revision number 3; doc1, doc2 and doc3 all have the same root revision id, e.g. 1. I want a search result

Highlight arbitrary text

2009-07-15 Thread Anders Melchiorsen
Is it possible to have Solr highlight an arbitrary text that is posted at request time? Currently, we are storing an unindexed HTML field in Solr, just to have it highlighted. We would prefer to generate the HTML from the database at presentation time, in order to keep the Solr index smaller and

Re: Segments_2 and segments.gen under Index folder and spellchecker1, spellchecker2, spellcheckerFile folder

2009-07-15 Thread Michael McCandless
Actually, segments.gen is created by current Lucene versions. It records the current generation (the _N in segments_N) in the index, as a fallback in case directory listing of the files fails to locate the segments_N file (eg on filesystems, like NFS, where the directory listing may come from a
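
The generation described here can be read straight off a file name. The sketch below is plain Python, not Lucene code. It assumes the suffix after segments_ is a base-36 number (which is how Lucene encodes the generation), and it uses a made-up directory listing. The helper names are hypothetical.

```python
def generation_of(filename):
    """Return the generation encoded in a segments_N file name, or None."""
    prefix = "segments_"
    if not filename.startswith(prefix):
        return None  # e.g. segments.gen or unrelated index files
    try:
        return int(filename[len(prefix):], 36)
    except ValueError:
        return None


def latest_segments_file(listing):
    """Pick the segments_N file with the highest generation, if any."""
    candidates = [(generation_of(f), f) for f in listing]
    candidates = [(g, f) for g, f in candidates if g is not None]
    return max(candidates)[1] if candidates else None


# Illustrative listing; a stale listing might be missing the newest file.
listing = ["segments.gen", "segments_2", "segments_a", "_0.cfs"]
print(latest_segments_file(listing))
```

If the directory listing is stale, this approach picks an old generation, which is the case the segments.gen fallback is meant to cover.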

Re: How to filter old revisions

2009-07-15 Thread Reza Safari
Revision is a field. Sorting is not an option because then I sort all documents! I want to filter a subset of documents with the same root revision number (a field) and get only the one document of the subset with the highest revision number. In other words, the root revision number of all documents
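
The requested result can be illustrated outside Solr. The sketch below is a client-side post-processing step in Python, not a Solr feature: it groups documents by root revision id and keeps the one with the highest revision number. The field names root_revision_id and revision are assumptions made for illustration.

```python
def latest_revisions(docs):
    """Keep, per root revision id, only the doc with the highest revision."""
    best = {}
    for doc in docs:
        root = doc["root_revision_id"]
        if root not in best or doc["revision"] > best[root]["revision"]:
            best[root] = doc
    return list(best.values())


# The three documents described in the question.
docs = [
    {"id": "doc1", "root_revision_id": 1, "revision": 1},
    {"id": "doc2", "root_revision_id": 1, "revision": 2},
    {"id": "doc3", "root_revision_id": 1, "revision": 3},
]
result = latest_revisions(docs)
print([d["id"] for d in result])
```

Doing this on the client only works when all candidate documents are fetched, so it is a way to state the requirement rather than a scalable answer.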

Re: DefaultSearchField ? important

2009-07-15 Thread Otis Gospodnetic
Jörg, you can use copyField to copy all fields into a single field that holds all the other fields' content. Then you can set that catch-all field as the default search field. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Jörg Agatz
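
What the catch-all field ends up containing can be pictured with a few lines of Python. This is an illustration of the idea only, not Solr code: in Solr the copying is declared with copyField in schema.xml. The destination name all_text, the TITLE field and the sample values are assumptions; CUPS_EBENE1_EBENE2_TASKS_CATEGORIE is the field name from the original question.

```python
def add_catch_all(doc, dest="all_text"):
    """Return a copy of doc with every field value gathered into dest."""
    values = [str(v) for k, v in doc.items() if k != dest]
    merged = dict(doc)
    merged[dest] = values
    return merged


# Sample document; the values are made up.
doc = {"CUPS_EBENE1_EBENE2_TASKS_CATEGORIE": "printing", "TITLE": "Queue setup"}
print(add_catch_all(doc)["all_text"])
```

Searching the single destination field then covers every source field, at the cost of losing per-field distinctions.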

Re: Segments_2 and segments.gen under Index folder and spellchecker1, spellchecker2, spellcheckerFile folder

2009-07-15 Thread Shalin Shekhar Mangar
On Wed, Jul 15, 2009 at 4:45 PM, Michael McCandless luc...@mikemccandless.com wrote: Actually, segments.gen is created by current Lucene versions. It records the current generation (the _N in segments_N) in the index, as a fallback in case directory listing of the files fails to locate the

Re: DefaultSearchField ? important

2009-07-15 Thread Mark Miller
Jörg Agatz wrote: Hello users, and good morning (in Germany it is morning) :-) I have a really important problem. My field names are really bad, like CUPS_EBENE1_EBENE2_TASKS_CATEGORIE. I have no content field or anything like that, so when I want to search for something, I need to search in ALL

Re: case-sensitive

2009-07-15 Thread Jörg Agatz
Yes, that's right. I have used string the whole time, so I changed nothing except this field: <field name="DOCUMENTTYPE_SPEC_CTC" type="string" indexed="true" stored="true" omitNorms="true" /> Now I am testing what you suggested: <field name="DOCUMENTTYPE_SPEC_CTC" type="text" indexed="true" stored="true"

Re: case-sensitive

2009-07-15 Thread Avlesh Singh
1. Did you re-index after changing the field type? Changes are not reflected until you do. 2. This is highly unlikely, but please make sure that you have a fieldType called text defined in your schema.xml. Cheers, Avlesh On Wed, Jul 15, 2009 at 6:00 PM, Jörg Agatz

issue inquiry: unterminated index lock after optimize update command

2009-07-15 Thread Brian Klippel
Hello, I'm using Solr build 2009-06-16_08-06-14 in a multicore configuration. When I issue the update command optimize to a core, the index files are locked and never released. Calling the coreAdmin unload method on the core unloads the core but does not unlock the underlying index files. The

java.io.FileNotFoundException: ..index/segments_3ee (No such file or directory)

2009-07-15 Thread Sebastian Böttger
Hi, I get an error while indexing some XML files: the segments_3ee file does not exist. The odd thing is that if I index the same XML file a couple of times, sometimes it works and sometimes it doesn't. Does anybody have an idea what is going wrong here? Jul 15, 2009 5:11:32 PM org.apache.solr.core.SolrCore

Re: Modifying a stored field after analyzing it?

2009-07-15 Thread solrcoder
markrmiller wrote: Yonik's patch makes it so that you can supply the TokenStream straight to the field and still store an *independent* text value in a stored field. When building the Lucene Document, when adding the field, you would add the raw TokenStream and then use setValue to set

Different structure of standard generated query for CJK vs. Western query

2009-07-15 Thread Mark Bennett
(resending to solr-usr per Chris H; all Asian characters removed from examples to avoid filter) I'm getting phrase queries instead of implicit OR queries with Asian text. I first noticed it with the Dismax query handler, but it also happens with the Standard query. Of course Asian text is broken

Re: Boosting for most recent documents

2009-07-15 Thread vivek sar
Hi, does anyone know if Solr supports sorting by internal document ids, i.e. like Sort.INDEXORDER in Lucene? If so, how? Also, does anyone have any insight into whether a function query loads unique terms into memory (like field sorts do) or not? Thanks, -vivek On Fri, Jul 10, 2009 at 10:26 AM, vivek

Dedicated Slave Master

2009-07-15 Thread wojtekpia
I'm building a high load system that will require several search slaves (at least 2, but this may grow to 5-10+ in the near future). I plan to have a single indexer that replicates to the search slaves. I want indexing to be as fast as possible, so I've considered adding another machine between

Chrome Web Browser doesn't render properly

2009-07-15 Thread Jason Rutherglen
From the Solr admin page, solr/admin/file/?file=schema.xml and /solr/select/?q=solr&version=2.2&start=0&rows=10&indent=on render improperly (meaning the XML isn't formatted). Maybe Chrome doesn't support XML?

Re: Wikipedia or reuters like index for testing facets?

2009-07-15 Thread Jason Rutherglen
Yeah that's what I was thinking of as an alternative, use enwiki and randomly generate facet data along with it. However for consistent benchmarking the random data would need to stay the same so that people could execute the same benchmark consistently in their own environment. On Tue, Jul 14,

RE: Chrome Web Browser doesn't render properly

2009-07-15 Thread Brian Klippel
Nope, chrome treats xml as html. Either view source or use another browser. -Original Message- From: Jason Rutherglen [mailto:jason.rutherg...@gmail.com] Sent: Wednesday, July 15, 2009 2:15 PM To: solr-user@lucene.apache.org Subject: Chrome Web Browser doesn't render properly From the

Re: DefaultSearchField ? important

2009-07-15 Thread Mani Kumar
@mark, @otis: Yeah, copying all the fields to one text field will work, but what if I want to assign a specific weight to specific fields? E.g. I have three fields: 1) title 2) tags 3) description. I copied all of them to a new field called all_text. Now I want to search in all the fields with

Re: DefaultSearchField ? important

2009-07-15 Thread Erik Hatcher
On Jul 15, 2009, at 2:59 PM, Mani Kumar wrote: @mark, @otis: Can I answer too? :) Yeah, copying all the fields to one text field will work, but what if I want to assign a specific weight to specific fields? E.g. I have three fields: 1) title 2) tags 3) description. I copied all of them
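
One common way to weight fields differently is the dismax query parser's qf parameter, which takes field names with optional ^boost suffixes. The sketch below only builds the request parameters in Python. The boost values and the search term are assumptions, and nothing here is taken from the truncated reply above.

```python
from urllib.parse import urlencode, parse_qs


def dismax_params(query, boosts):
    """Build select parameters that search several fields with weights."""
    qf = " ".join(f"{field}^{boost}" for field, boost in boosts.items())
    return urlencode({"q": query, "defType": "dismax", "qf": qf})


# Field names come from the thread; the boosts are illustrative only.
params = dismax_params("solr", {"title": 3, "tags": 2, "description": 1})
print(params)
```

The resulting string would be appended to the select URL of the core being queried. Unlike a single catch-all field, this keeps the fields separate so each can carry its own weight.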

DIH: On import (full or delta) commit=false seems to not take effect

2009-07-15 Thread Jay Hill
I am trying to run full and delta imports with the commit=false option, but it doesn't seem to take effect - after the import a commit always happens no matter what params I send. I've looked at the source and unless I'm missing something it doesn't seem to process the commit param. Here's the

Multivalued fields and scoring/sorting

2009-07-15 Thread Matt Schraeder
I am trying to come up with a way to sort (or score, and then sort by the score) on a multivalued field. I was looking at FunctionQueries and saw fieldvalue, but as that only works on single-valued fields, it doesn't help me. The field is as follows: <fieldType name="keyword"

Re: support for Payload Feature of lucene in solr

2009-07-15 Thread Grant Ingersoll
Note, also that there are factories for some of the other TokenFilters in the payloads package from contrib/analyzers. On Jul 14, 2009, at 5:55 PM, Grant Ingersoll wrote: The TokenFilterFactory side is trivial for the DelimitedPayloadTokenFilter. That could be in for 1.4. In fact, there

Question on solr 1.4 Replication

2009-07-15 Thread Gurjot Singh
Hi, I am using the data import handler to do full and delta imports. I want to use the replication feature of Solr 1.4. For that I wanted to understand 2 scenarios: 1. What happens when the slave Solr server tries to poll the master while a delta import is running on the master? Does the slave only

Re: Question on solr 1.4 Replication

2009-07-15 Thread Jeff Newburn
After each commit the index effectively locks down for replication in a way. So if your slave grabs from the master while the master is reindexing it will be updated to the point of last commit/optimize (depending on your configuration). Same is true while the slave is replicating. So if the

Re: spellcheck with misspelled words in index

2009-07-15 Thread Jay Hill
We had the same thing to deal with recently, and a great solution was posted to the list. Create a stopwords filter on the field you're using for your spell checking, and then populate a custom stopwords file with known misspelled words: <fieldType name="textSpell" class="solr.TextField"
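
The effect of such a stopwords filter can be sketched in Python: known misspellings are dropped from the terms before they reach the spelling dictionary. This is an illustration, not Solr code. The titles are made up, and cusine is the misspelling mentioned in the original question.

```python
def dictionary_terms(titles, misspellings):
    """Collect lowercase terms from titles, skipping known misspellings."""
    blocked = {w.lower() for w in misspellings}
    terms = set()
    for title in titles:
        for word in title.lower().split():
            if word not in blocked:
                terms.add(word)
    return sorted(terms)


# Made-up product titles; one contains the known misspelling.
titles = ["French Cusine Basics", "Italian Cuisine Today"]
print(dictionary_terms(titles, ["cusine"]))
```

Because the misspelled term never enters the dictionary, it cannot be offered as a suggestion, while the products themselves stay searchable through their regular fields.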

Re: DIH: On import (full or delta) commit=false seems to not take effect

2009-07-15 Thread Jay Hill
My bad, I had a configuration setting overriding this value. Sorry for the mistake. -Jay On Wed, Jul 15, 2009 at 12:07 PM, Jay Hill jayallenh...@gmail.com wrote: I am trying to run full and delta imports with the commit=false option, but it doesn't seem to take effect - after the import a

Re: DIH: On import (full or delta) commit=false seems to not take effect

2009-07-15 Thread Jay Hill
Actually, my good after all. The parameter does not take effect: if commit=false is passed in, a commit still happens. Will open a JIRA and supply a patch shortly. -Jay On Wed, Jul 15, 2009 at 5:50 PM, Jay Hill jayallenh...@gmail.com wrote: My bad, I had a configuration setting overriding

Re: Multivalued fields and scoring/sorting

2009-07-15 Thread Avlesh Singh
The harsh reality of life is that you cannot sort on multivalued fields. If you can explain your domain problem (the significance of numbers 818, 2 etc), maybe people can come up with an alternate index design which fits into your use cases. Cheers Avlesh On Thu, Jul 16, 2009 at 1:18 AM, Matt