This is an interesting discussion and I have a few questions:
1) My apologies but I haven't been following the NRT patch beyond what was
presented at a meetup some months back and the wiki but what is the status
of it in Solr?
2) What are typical/accepted definitions of "Real Time" vs "Near Real Ti
I am just using the sor.war file that came with the Solr 1.4 download on
weblogic.
did not add any jar or remove any jar
On Tue, May 25, 2010 at 9:54 PM, Chris Hostetter
wrote:
>
> : yes i am running 1.5, Any idea how we can run Solr 1.4 using Java 1.5
>
> Solr 1.4 works just fine with Java 1.5
: yes i am running 1.5, Any idea how we can run Solr 1.4 using Java 1.5
Solr 1.4 works just fine with Java 1.5 -- even when Using the
DataImportHandler.
there are some features of DIH like the ScriptTransformer that requires
java 1.6, but that's not your issue...
: > Last I encountered that
: Actually, its not as much a Solr problem as a Lucene one, as it turns
: out, the WeightedSpanTermExtractor is in Lucene and not Solr.
:
: Why they decided to only highlight queries that are in Lucene I don't
: know, but what I did to solve this problem was simply to make my queries
: extends
: Is it possible to use solr caches such as query cache , filter cache
: and document cache from external caching system like memcached as it
: has several advantages such as centralized caching system and reducing the
: pause time of JVM 's garbage collection as we can assign less
:
: For some reason when doing delta indexing via DIH, this field is not being
updated.
:
: Are timestamp fields updated during DELTA updates?
timestamp fields aren't treated any differnetly then any other field -- as
far as Solr is concerned this just date field that happens to have a
defa
Don't forget to re-index after you make the change Lance suggested...
Erick
On Tue, May 25, 2010 at 4:51 PM, Lance Norskog wrote:
> Change type="string" to type="text". This causes the field to be
> analyzed and then searching on words finds the document.
>
>
>
> On Tue, May 25, 2010 at 8:34 AM
: I can't seem to get solr cell to index password protected pdf files.
: I can't figure out how to pass the password to tika and looking at
: ExtractingDocumentLoader,
: it doesn't seem to pass any pdf password related metadata to the tika parser.
I suspect you are correct, i don't think anyone h
: Subject: Debugging - DIH Delta Queries-
: References:
: <1659766275.5213.1274376509278.javamail.r...@vicenza.dmz.lexum.pri>
: In-Reply-To:
: <1659766275.5213.1274376509278.javamail.r...@vicenza.dmz.lexum.pri>
http://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing L
You may also want to look at:
ClientUtils.escapeQueryChars( String s )
http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/util/ClientUtils.html#escapeQueryChars%28java.lang.String%29
this will escape any lucene query chars, then pass it to URLEncoder
and you should be good to go.
On
Hi all,
I have a suggestion for improving relevance functions in Solr by way of
providing access to a set of pre-defined constants in Solr queries.
Specifically, the number of documents indexed, the number of unique terms in
a field, the total number of terms in a field, etc. are some of the
query
Change type="string" to type="text". This causes the field to be
analyzed and then searching on words finds the document.
On Tue, May 25, 2010 at 8:34 AM, Jörg Agatz wrote:
> i create a new Index, but nothing Change.
>
> multiValued="true"/>
>
>
>
>
>
> >
>
> I search for :
>
> " *:* "
> I fo
I was assuming that I needed to leave the special characters in the http get,
but running the solr admin it looks like it converts them the same way that
URLEncoder.encode does. What is the need to preserve special characters?
http://localhost:8983/solr/select?indent=on&version=2.2&q=%22mr.+bill
My motivation is more from the performance prospective than functional
prospective. I was hoping by opening the Solr index/core read-only,
underlying Lucene IndexReader can be opened in read-only mode for optimum
query performance (removing the overhead of multi-thread management).
--
View this m
The stats.jsp page walks the internal JMX beans. It prints out the
numbers of documents among other things. I would look at how that
works instead of writing your own thing for the internal APIs.
They may have changed from Solr 1.3 to 1.4 and will change further for
1.5 (4.0 is the new name?).
On
Hi,
I'd guess there are two ways in doing this but i've never seen any
solrconfig.xml file having any directives that explicitly do not allow for
updates.
You'd either have a proxy in front that simply won't allow any other HTTP
method than GET and HEAD, or you could remove the update re
Is there a way to open a Solr index/core in read-only mode?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-read-only-core-tp843049p843049.html
Sent from the Solr - User mailing list archive at Nabble.com.
This sounds like you have the same solrconfig for the slave and the
master? You should turn off autoCommit on the slave. Only the master
should autoCommit.
You should set up the ReplicationHandler. This moves index updates
from the indexer to the query server.
http://www.lucidimagination.com/sear
Chris,
Please ignore the repeated response header due to typo in the previous
message.
~Umesh
--
View this message in context:
http://lucene.472066.n3.nabble.com/Re-SOLR-343-date-facet-mincount-patch-tp789556p842863.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hoss,
I was able to successfully apply the path Solr-343 and even after applying
the patch, date facet minCount does not work. Appropriate part of response
are as given below:
["responseHeader"] => object(SolrObject)#107 (3) {
["status"] => int(0)
["QTime"] => int(4)
["params"] =>
Thanks Sean, that was exactly what I need. One question though...
How to correctly retain the Solr specific characters.
I tried adding escape chars but URLEncoder doesn't seem to care about that:
Example:
String s1 = "\"mr. bill\" oh n?";
String s2 = "\\\"mr. bill\\\" oh n\\?";
String encoded1
Ah, the issue was explicitly specifying components...
query
I don't remember changing this during default install, commenting this
out enabled faceted search component.
Thanks all for the help.
Ilya
On Tue, May 25, 2010 at 10:38 AM, Sascha Szott wrote:
> Hi,
>
> please note, that the Face
Sascha thanks for the response, here is the output...
0
0
xml
title:*
title
Baseball game
Soccer game
Football game
On Mon, May 24, 2010 at 5:39 PM, Sascha Szott wrote:
> Hi Ilya,
>
> Ilya Sterin
:Is there any way to get all the fields (irrespective of whether
: it contains a value or null) in solrDocument.
no. a document only has "Field" instances for the fields which it has
values for. it's also not a feature that would even be theoretically
posisbly to add, becuase of d
My documents are all quite small if not down right tiny, there is not much
analysis to do. I plan to mainly use Solr for indexing application
configuration data which there is a lot of and I have all pre-formated. Since
it is a music application there are many score templates, scale and rhythm
Java provides one. You probably want to use utf-8 as the encoding scheme.
http://java.sun.com/javase/6/docs/api/java/net/URLEncoder.html
Note you also will want to strip or escape character that are meaningful
in the Solr/Lucene query syntax.
http://lucene.apache.org/java/2_4_0/queryparsersyn
I would like to leverage on whatever SOLR provides to properly url-encode a
search string.
For example a user enters:
"mr. bill" oh no
The URL submitted by the admin page is:
http://localhost:8983/solr/select?indent=on&version=2.2&q=%22mr.+bill%22+oh+no&fq=&start=0&rows=10&fl=*%2Cscore&qt=standa
: I'm running edismax (on both a 1.4 with patch and a branch_3x version) and
: I'm seeing something I don't expect.
...
: dog cat -trilogy
: dog cat -trilogy
: allfields:dog allfields:cat
: -allfields:trilogi
: allfields:dog allfields:cat
: -allfields:trilogi
Hmmm... something is reall
The main issue is if you're using facets, which are currently
inefficient for the realtime use case because they're created on the
entire set of segment/readers. Field caches in Lucene are per segment
and so don't have this problem.
On Tue, May 25, 2010 at 4:09 AM, Grant Ingersoll wrote:
> How m
Any idea why this query returns 0 records:
"sexual assault" AND (-obama)
while this one returns 1400 ?
"sexual assault" AND -(obama)
Some debug info:
"sexual assault" AND (-obama), translates to: +text:"sexual assault"
+(-text:obama), returns 0 records
"sexual assault" AND -(obama), tr
: Pretty much every one of my queries is going to be unique. However, the
: query is fairly complex and also contains both unique and non-unique
: data. In the query, some fields will be unique (e.g description), but
: other fields will be fairly common (e.g. category). If we could use
: those
i create a new Index, but nothing Change.
I search for :
" *:* "
I fond it
i search vor "hallo" "Hallo" "hallo*" "Hallo*"or some other content from the
CDATA field i dosent.
You have to provide more details than that. We need to know the field
definition for that named field, the corresponding field type
definition, and the exact request you're making to Solr that you think
should find this document.
And most importantly, did you :)
Erik
On May 25,
ok, done..
But now i dosent find any word in the CDATA field.
i make :
it is a string field Multivalued..
King
Well, you'll just have to create valid XML, either encoding some
characters or using CDATA sections.
Erik
On May 25, 2010, at 10:06 AM, Jörg Agatz wrote:
I have a work!,
i musst indexing a lot of E-Mails, so i will create a Script to
generate me
a xml of the Mails.
Now is the que
Hi,
please note, that the FacetComponent is one of the six search components
that are automatically associated with solr.SearchHandler (this holds
also for the QueryComponent).
Another note: By using name="components" all default components will be
replaced by the components you explicitly m
Chris,
I am using SolrIndexSearcher to get a handle to the total number of records
in the index. I am doing it like this :
int num =
Integer.parseInt((String)solrSearcher.getStatistics().get("numDocs").toString());
Please let me know if there is a better way to do this.
Mark,
I can tell you what I
I have a work!,
i musst indexing a lot of E-Mails, so i will create a Script to generate me
a xml of the Mails.
Now is the question, what happens when i creade a field "body" and in this
field comes a lot of "<" or ">" like this:
Confidentiality Caution: This message and all its included content a
Is the FacetComponent loaded at all?
query
facet
On 2010-05-25, at 3:32 AM, Sascha Szott wrote:
> Hi Birger,
>
> Birger Lie wrote:
>> I don't think the bolean fields is mapped to "on" and "off" :)
> You can use true and on interchangeably.
>
> -Sascha
>
>>
>>
>> -birg
Ah, I may have misunderstood, I somehow got it in my mind
you were talking about the length of each term (as in string length).
But if you're looking at the field length as the count of terms, that's
another question, sorry for the confusion...
I have to ask, though, why you want to sort this way
On 25.05.2010, at 08:55, Lukas Kahwe Smith wrote:
> Now when I deselect one of the checkboxes I add an fq parameters:
> facet=true&fl=*,score&sort=score+desc&start=0&q=(tag_ids:("23"))&facet.field={!ex%3Ddt}organisation_id&facet.field={!ex%3Ddt}tag_ids&facet.field={!ex%3Ddt}addressee_ids&facet.fi
If a field doesn't have a value, You will get NULL on retrieving it. How
could you expect a value for a field which is not provided?
You have two options, choose either one..
1. If the fieldvalue is returned NULL then display a proper error / user
defined message. Handle the error.
2. Add a dummy
Hi Aditya,
i can retrieve all documents. but cannot retrieve all the fields
in a document(if it does not hv any value).
For example i get a list of documents, some of the documents have some value
for title field, and others mite not contain a value for title field. in
anycase i need to
Hi all,
I did some further investigation and (after turning of some filters in
yourkit) found that is was actually the machine sending the files to
solr that was slowing things down.
At first I couldn't find this as it turned out that yourkit hides
org.apache.* classes. When I removed this f
Resending it as there is a typo error.
To reterive all documents, You need to use the query/filter FieldName:*:* .
Regards
Aditya
www.findbestopensource.com
On Tue, May 25, 2010 at 4:29 PM, findbestopensource <
findbestopensou...@gmail.com> wrote:
> To reterive all documents, You need to use
To reterive all documents, You need to use the query/filter *FieldName:*:**
Regards
Aditya
www.findbestopensource.com
On Tue, May 25, 2010 at 4:14 PM, Rakhi Khatwani wrote:
> Hi,
> Is there any way to get all the fields (irrespective of whether
> it contains a value or null) in solrDocu
How many docs are in the batch you are pulling down? How many docs/second do
you expect on the index size? How big are the docs? What do you expect in
terms of queries per second? How fast do new documents need to be available on
the local server? How much analysis do you have to do? Also,
To reterive all documents, You need to use the query/filter *FieldName:*:**
Regards
Aditya
www.findbestopensource.com
On Tue, May 25, 2010 at 4:14 PM, Rakhi Khatwani wrote:
> Hi,
> Is there any way to get all the fields (irrespective of whether
> it contains a value or null) in solrDo
Hi,
Is there any way to get all the fields (irrespective of whether
it contains a value or null) in solrDocument.
or
Is there any way to get all the fields in schema.xml of the url link (
http://localhost:8983/solr/core0/)??
Regards,
Raakhi
Thanks much all
I am using following looks good to me
Regards
Prakash
-Original Message-
From: Sascha Szott [mailto:sz...@zib.de]
Sent: Tuesday, May 25, 2010 1:16 PM
To: solr-user@lucene.apache.org
Subject: Re: Highlighting is not happening
Hi,
to accomplish that, use the highligh
Since Solr 1.4 I think the uninverted method is on by default. Anyway, you
can choose wich to use with the method param:
facet.method=fc/enum (where fc is the uninverted one)
http://wiki.apache.org/solr/SimpleFacetParameters
--
View this message in context:
http://lucene.472066.n3.nabble.com/How
This looks like a case where the extended dismax parser is creating a
Lucene QueryParser parsed query rather than a disjunction maximum query.
A case of "too much magic" maybe? Looks like this one should be
parsed quite differently. Try dismax and see what you get, it'll be
quite differe
Thanks.
Do I need to configure Solr to use the uninverted algorithm or is it the
default algorithm?
--- On Tue, 5/25/10, Marc Sturlese wrote:
> From: Marc Sturlese
> Subject: Re: How well does Solr scale over large number of facet values?
> To: solr-user@lucene.apache.org
> Date: Tuesday, Ma
With the uninverted algorithm it will be very fast whatever is the number of
unique terms. But be careful with the memory because it uses quite a lot.
Using the oldest facet algorithm, if you have a lot of different terms it
will be slow.
--
View this message in context:
http://lucene.472066.n3.
Thanks Paul, I shall continue doing some more R&D with your inputs.
Best Regards,
Kranti K K Parisa
On Tue, May 25, 2010 at 12:54 PM, Paul Dhaliwal wrote:
> It depends on what kind of load you are talking about and what your
> expertise is.
>
> NGINX does perform better than apache for most p
Hi,
to accomplish that, use the highlighting parameters hl.simple.pre and
hl.simple.post.
By the way, there are a plenty of other parameters that affect
highlighting. Take a look at:
http://wiki.apache.org/solr/HighlightingParameters
-Sascha
Doddamani, Prakash wrote:
Hey,
I thought the
Hi Erick,
Erick Erickson wrote:
Are you sure you want to recompute the length when sorting?
It's the classic time/space tradeoff, but I'd suggest that when
your index is big enough to make taking up some more space
a problem, it's far too big to spend the cycles calculating each
term length for
Hi Birger,
Birger Lie wrote:
I don't think the bolean fields is mapped to "on" and "off" :)
You can use true and on interchangeably.
-Sascha
-birger
-Original Message-
From: Ilya Sterin [mailto:ster...@gmail.com]
Sent: 24. mai 2010 23:11
To: solr-user@lucene.apache.org
Subject: Fa
Hey,
I thought the Highlights would happen in the field of the documents
returned from SOLR J
But it gives new list of Highlighting at below, sorry for the confusion
I was wondering is there a way that the fields returned itself contains
bold characters
Eg : if searched for "query"
re
It depends on what kind of load you are talking about and what your
expertise is.
NGINX does perform better than apache for most people, however less people
know about NGINX than apache. If you have more than 100K searchers a day
doing a few searches each, you will benefits from NGINX. If your tra
I want to facet over a field "group".
Since "group" is created by users, potentially there can be a huge number of
values for "group".
- Would Solr be able to handle a use case like this? Or is Solr not really
appropriate for facet fields with a large number of values?
- I understand that I ca
61 matches
Mail list logo