Hallo Users...
And good Morning, in germany it is morning :-)
I have a realy important Prroblem...
My Fields are realy Bad.. Like
CUPS_EBENE1_EBENE2_TASKS_CATEGORIE
I have no Content field ore somthing like this...
So when i will search somthing, i need to search in ALL fields, but when i
Hi,
I'm having some trouble getting the correct results from the
spellcheck component. I'd like to use it to suggest correct product
titles on our site, however some of our products have misspellings in
them outside of our control. For example, there's 2 products with the
misspelled word cusine
Some ideas:
- Use copyField to copy fields to the field designated as the uniqueKey (not
sure if this will work)
- Create the field from existing data before sending docs to Solr
- Create a custom UpdateRequestProcessor that adds a field for each document
it processes and stuffs it with other
I search for the Otions for case-*sensitive ore *Key* sensitive to some
or all Fields..
Exist a Option for that? in the Solr Schema.xml?
Problem:
I search in a Field for ri* but the real Name is RI-700690
RI* brings Results, but sumtimes it is NI, and sumetimes it is ni or
something like that.
Even though I could not completely understand your question, my guess is
that you want to do case-insensitive searches. If yes, you may use a text
type for this field or add the LowerCaseFilterFactory as a filter for this
field in your schema.xml
Cheers
Avlesh
On Wed, Jul 15, 2009 at 1:42 PM,
Hi,
How is possible to search for max values e.g.
doc1 has revision number 1
doc2 has revision number 2
doc3 has revision number 3
doc1, doc2 and doc3 have all same root revision id e.g. 1
I want search result with doc's with only highest revision number?
Gr, Reza
On Wed, Jul 15, 2009 at 3:19 PM, Reza Safari r.saf...@lukkien.com wrote:
Hi,
How is possible to search for max values e.g.
doc1 has revision number 1
doc2 has revision number 2
doc3 has revision number 3
doc1, doc2 and doc3 have all same root revision id e.g. 1
I want search result
Is it possible to have Solr highlight an arbitrary text that is posted at
request time?
Currently, we are storing an unindexed HTML field in Solr, just to have it
highlighted. We would prefer to generate the HTML from the database at
presentation time, in order to keep the Solr index smaller and
Actually, segments.gen is created by current Lucene versions. It
records the current generation (the _N in segments_N) in the index, as
a fallback in case directory listing of the files fails to locate the
segments_N file (eg on filesystems, like NFS, where the directory
listing may come from a
Revision is a field.
Sorting is not an option because then I sort all document! I want to
filter a subset of documents with the same root version number (field)
and get only one document of the subset with highest revision number.
In other words the root revision number of all documents
Jörg,
You can use copyField to copy all fields into a single fields that has all
other fields' content.
Then you can put that catch all field as the default search field.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
From: Jörg Agatz
On Wed, Jul 15, 2009 at 4:45 PM, Michael McCandless
luc...@mikemccandless.com wrote:
Actually, segments.gen is created by current Lucene versions. It
records the current generation (the _N in segments_N) in the index, as
a fallback in case directory listing of the files fails to locate the
Jörg Agatz wrote:
Hallo Users...
And good Morning, in germany it is morning :-)
I have a realy important Prroblem...
My Fields are realy Bad.. Like
CUPS_EBENE1_EBENE2_TASKS_CATEGORIE
I have no Content field ore somthing like this...
So when i will search somthing, i need to search in ALL
yes, thats right...
i have use in al Time
string So i dont change something, without the field
like this:
field name=DOCUMENTTYPE_SPEC_CTC type=string indexed=true
stored=true omitNorms=true /
now i test what you say...
field name=DOCUMENTTYPE_SPEC_CTC type=text indexed=true stored=true
1. Did you re-index after changing the field type? Changes would not
reflect until you do this.
2. This is highly unlikely, but please make sure that you have
fieldType called text defined in your schema.xml
Cheers
Avlesh
On Wed, Jul 15, 2009 at 6:00 PM, Jörg Agatz
Hello,
I'm using solr build 2009-06-16_08-06-14, in multicore configuration.
When I issue the update command optimize to a core, the index files
are locked and never released. Calling the coreAdmin unload method on
the core unload the core but does not unlock the underlying index files.
The
Hi,
I get an error while indexing some xml files.
The segments_3ee file is not existent. The point is, if i index the same
xml file a couple of times: sometimes it works, sometimes not.
Does anybody got an idea whats going wrong here?
Jul 15, 2009 5:11:32 PM org.apache.solr.core.SolrCore
markrmiller wrote:
Yonik's patch makes it so that you can supply the TokenStream straight to
the field and still store an *independent* text value in a stored field.
When building the Lucene Document, when adding the field, you would add
the
raw TokenStream and then use setValue to set
(resending to solr-usr per Chris H; all Asian characters removed from
examples to avoid filter)
I'm getting phrase queries instead of implicit OR queries with Asian
text. I first noticed it with the Dismax query handler, but it also happens
with the Standard query.
Of course Asian text is broken
Hi,
Does anyone know if Solr supports sorting by internal document ids,
i.e, like Sort.INDEXORDER in Lucene? If so, how?
Also, if anyone have any insight on if function query loads up unique
terms (like field sorts) in memory or not.
Thanks,
-vivek
On Fri, Jul 10, 2009 at 10:26 AM, vivek
I'm building a high load system that will require several search slaves (at
least 2, but this may grow to 5-10+ in the near future). I plan to have a
single indexer that replicates to the search slaves. I want indexing to be
as fast as possible, so I've considered adding another machine between
From the Solr admin page, solr/admin/file/?file=schema.xml and
/solr/select/?q=solrversion=2.2start=0rows=10indent=on
renders improperly (meaning the XML isn't formatted). Maybe
Chrome doesn't support XML?
Yeah that's what I was thinking of as an alternative, use enwiki
and randomly generate facet data along with it. However for
consistent benchmarking the random data would need to stay the
same so that people could execute the same benchmark
consistently in their own environment.
On Tue, Jul 14,
Nope, chrome treats xml as html. Either view source or use another
browser.
-Original Message-
From: Jason Rutherglen [mailto:jason.rutherg...@gmail.com]
Sent: Wednesday, July 15, 2009 2:15 PM
To: solr-user@lucene.apache.org
Subject: Chrome Web Browser doesn't render properly
From the
@mark, @otis:
yeah copying all the fields to one text field will work but what if i want
to assign specific weightage to specific fields?
e.g. i have a three fields
1) title
2) tags
3) description
i copied all of them to a new field called all_text.
now i want to search in all the fields with
On Jul 15, 2009, at 2:59 PM, Mani Kumar wrote:
@mark, @otis:
Can I answer too? :)
yeah copying all the fields to one text field will work but what if
i want
to assign specific weightage to specific fields?
e.g. i have a three fields
1) title
2) tags
3) description
i copied all of them
I am trying to run full and delta imports with the commit=false option, but
it doesn't seem to take effect - after the import a commit always happens no
matter what params I send. I've looked at the source and unless I'm missing
something it doesn't seem to process the commit param.
Here's the
I am trying to come up with a way to sort (or score, and sort based on
the score) of a multivalued field. I was looking at FunctionQueries and
saw fieldvalue, but as that only works on single valued fields that
doesn't help me.
The field is as follows:
fieldType name=keyword
Note, also that there are factories for some of the other TokenFilters
in the payloads package from contrib/analyzers.
On Jul 14, 2009, at 5:55 PM, Grant Ingersoll wrote:
The TokenFilterFactory side is trivial for the
DelimitedPayloadTokenFilter. That could be in for 1.4. In fact,
there
Hi,
I am using data import handler to do full and delta import. I want to use
the replication feature of solr 1.4
For that I wanted to understand 2 scenarios
1. What happens when the slave solr server tries to poll the master at the
time delta import is running on master. Does the slave only
After each commit the index effectively locks down for replication in a way.
So if your slave grabs from the master while the master is reindexing it
will be updated to the point of last commit/optimize (depending on your
configuration). Same is true while the slave is replicating. So if the
We had the same thing to deal with recently, and a great solution was posted
to the list. Create a stopwords filter on the field your using for your
spell checking, and then populate a custom stopwords file with known
misspelled words:
fieldType name=textSpell class=solr.TextField
My bad, I had a configuration setting overriding this value. Sorry for the
mistake.
-Jay
On Wed, Jul 15, 2009 at 12:07 PM, Jay Hill jayallenh...@gmail.com wrote:
I am trying to run full and delta imports with the commit=false option, but
it doesn't seem to take effect - after the import a
Actually, my good after all. The parameter does not take effect. If
commit=false is passed in a commit still happens.
Will open and JIRA and supply a patch shortly.
-Jay
On Wed, Jul 15, 2009 at 5:50 PM, Jay Hill jayallenh...@gmail.com wrote:
My bad, I had a configuration setting overriding
The harsh reality of life is that you cannot sort on multivalued fields.
If you can explain your domain problem (the significance of numbers 818,
2 etc), maybe people can come up with an alternate index design which fits
into your use cases.
Cheers
Avlesh
On Thu, Jul 16, 2009 at 1:18 AM, Matt
35 matches
Mail list logo