Hi Mike,
Actually, the example I gave is the document in this case. So there was no
"ships", only "drilling".
Dmitry
On Wed, Sep 14, 2011 at 1:59 PM, Michael Sokolov wrote:
> The highlighter gives you snippets of text surrounding words (terms) drawn
> from the query. The whole document should
Koji,
This looks strange to me, because I would assume, that highlighter also
applies boolean logic same way as a query parser. In this way of thinking
"drilling" should be highlighted if "ships" occurred together in the same
document. Which wasn't the case in the example.
Dmitry
On Wed, Sep 14,
Hi Erick Erickson,
Now, we have many files format(doc, ppt, pdf, ...), File's purpose serve to
search details content of education in that files. Because i am new solr, so
maybe i understand not enough depth about Apache Tika. At the moment i can't
index pdf files from http, with one file is ok. T
I have written simple java code to index my data. i am creating xml
documents & adding in the index. sorry, but due to company's policy could
not share the configuration details here.
On Wed, Sep 14, 2011 at 10:42 PM, Rahul Warawdekar <
rahul.warawde...@gmail.com> wrote:
> Hi Pawan,
>
> Can you p
I am commiting but not doing replication now. Mine sort order also includes
last login timestamp. the new profiles are being reflected in my SOLR admin
& db. but its not listed on my website.
On Thu, Sep 15, 2011 at 4:25 AM, Chris Hostetter
wrote:
>
> : I am using Solr 3.2 on a live website. i ge
Hi all,
I have a quick question about how complex fields (subFields etc) interact with
indexing and storage.
Let's say I have this (partial) schema:
…
(slightly modified from the example/ project).
If my goals were to:
1) Always query the location by "lat,lng" (never the subfields dir
When I retrieve the value the lat/lon pair that comes out is not
exactly the same as what I indexed, which made be think it was
actually stored as the hash and then transformed back?
Anyhow - I'm trying to understand the actual use case for the field as
it exists - essentially you are saying I cou
Nevermind I just discovered faceting which does exactly what I want.
Sorry about that.
On Thu, Sep 15, 2011 at 11:31 AM, Tomek Rej wrote:
> Hi there
>
> I'm using Solr to do some category mapping, and part of this process
> consists of finding frequently occuring terms for each category id.
> My
Hi there
I'm using Solr to do some category mapping, and part of this process
consists of finding frequently occuring terms for each category id.
My index consists of a number of documents (mostly containing between 1 and
4 tokens), and a category id that this document belongs to.
Ideally I'd like
: I am using Solr 3.2 on a live website. i get live user's data of about 2000
: per day. I do an incremental index every 8 hours. but my search results
: always show the same result with same sorting order. when i check the same
Are you commiting?
Are you using replication?
Are you using a sort
: Is the parameter facet.method=fc still needed ?
https://wiki.apache.org/solr/SimpleFacetParameters#facet.method
"The default value is fc (except for BoolField) since it tends to use less
memory and is faster when a field has many unique terms in the index. "
-Hoss
Is the parameter facet.method=fc still needed ?
Thank you.
Patrick.
: &q=photo_id:* AND gender:true AND country:MALAWI AND online:false
photo_id:* does not mean what you probably think it means. you most
likely want photo_id:[* TO *] given your current schema, but i would
recommend adding a new "has_photo" boolean field and using that instead.
thta alone shou
: References: <41dfe0136ddf091e98d45dea9f0da1ab@localhost>
:
: Message-ID: <1316011545.626.yahoomail...@web110411.mail.gq1.yahoo.com>
: Subject: glassfish, solrconfig.xml and SolrException: Error loading
: DataImportHandler
: In-Reply-To:
:
https://people.apache.org/~hossman/#threadhijac
September 14 2011, Apache Solr™ 3.4.0 available
The Lucene PMC is pleased to announce the release of Apache Solr 3.4.0.
Apache Solr is the popular, blazing fast open source enterprise search
platform from the Apache Lucene project. Its major features include
powerful full-text search, hit highlig
Hi All,
I hope someone could shed some light on the issue I'm facing with solr
3.1.0. It looks like it's computing diferrent fieldNorm values despite my
configuration that aims to ignore it.
I also have a custom class that extends DefaultSimilarity to override the
idf method.
Query:
Thanks a bunch, got it working with a reluctant qualifier and the use
of " as the escaped representation of double qoutes within the
regex value so that the config file doesn't crash & burn:
Cheers,
- Pulkit
On Wed, Sep 14, 2011 at 2:24 PM, Pulkit Singhal wrote:
> Hello,
>
> Feel f
: I try to get a diff of today and an dateField. from this diff, i want do a
: modulo from another field with values of 1,3,6,12
...
: (DIFF(Month of Today - Month of Search) MOD interval) = 0
a) it looks like modulus was never implemneted as a function ... probably
overlooked before it
: phrase1 - solr is the best fts ever
: phrase2 - let us all contribute to open source for a better world
:
: now I want to perform the next query:
:
: field1:( phrase1) AND field2:(phrase2)
:
: my default operator is AND, but I want to search within field1 with AND
: operator between the token
Hello,
Feel free to point me to alternate sources of information if you deem
this question unworthy of the Solr list :)
But until then please hear me out!
When my config is something like:
I don't get any data.
But when my config is like:
I get the following data as th
You can use to put data from separate fields into a common
search field.
This page will help you get started on what mods you'd need to make on
a
to analyze it as you wish:
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters
But at a start think about WhitespaceTokenizer followed by
Low
What is the machine used for? Was your user looking at
a master? Slave? Something used for both?
Measuring the size of all the files in the index? Or looking
at memory?
The index files shouldn't be getting bigger unless there
were indexing operations going on. Is it at all possible that
DIH was c
How about this: Start with just what you had in your query (q) without the
filter queries. Then add the fq's back in one at a time to see what is giving
you problems -- leaving the birth filter query to the very last.
Others on the list more experienced with filter queries might have a more
d
On Wed, Sep 14, 2011 at 9:56 PM, Vazquez, Maria (STM)
wrote:
> Thanks Rahul
> That sounds like a good solution, I will change the code to support different
> timezones. Maybe this could be included in next release of Solr since a few
> people mentioned this problem too.
[...]
If it was indeed a
I've been reading the information on the new join feature and am not quite
sure how I would use it given my schema structure. I have "User" docs and
"BlogPost" docs and I want to return all BlogPosts that match the fulltext
title "cool" that belong to Users that match the description "solr".
Here
The solution that I am currently using is converting the last_index_time to UTC
before comparing to the LastModified field in the DB.
LastModified > DATEADD(Hour, DATEDIFF(Hour, GETDATE(), GETUTCDATE()),
'${dateimporter.last_index_time}')
This may be another option if the LastModified date in
Hi Pawan,
Can you please share more details on the indexing mechanism ? (DIH, SolrJ
or any other)
Please let us know the configuration details.
On Wed, Sep 14, 2011 at 12:48 PM, Pawan Darira wrote:
> Hi
>
> I am using Solr 3.2 on a live website. i get live user's data of about 2000
> per day.
On Sep 13, 2011, at 5:05 PM, Jamie Johnson wrote:
> I know this isn't a solr specific question but I was wondering what
> folks do in regards to managing the machines in their solr cluster?
> Are there any recommendations for how to start/stop/manage these
> machines? Any suggestions would be app
Hi
I am using Solr 3.2 on a live website. i get live user's data of about 2000
per day. I do an incremental index every 8 hours. but my search results
always show the same result with same sorting order. when i check the same
search from corresponding db, it gives me different results always (as n
If you don't need date-specific functions and/or faceting, you can store it
as a int, like 20110914 and parse it in your application
but I don't recommend... as a rule of thumb, dates should be stored as
dates, the millenium bug (Y2K bug) was all about 'saving some space'
remember?
I tried moving age query from filter query to normal query but nothing
really changed.
But when i try to move everything into query itself ( removed all filter
queries) QTimes slowed much more.
I don't have problem with memory or cpu usage, my problem is query response
times.
When i send only one
Thanks Rahul
That sounds like a good solution, I will change the code to support different
timezones. Maybe this could be included in next release of Solr since a few
people mentioned this problem too.
Thanks again
Maria
Sent from my Motorola ATRIX™ 4G on AT&T
-Original message-
From:
What we did was get the date from db, and stored it in a string fieldType in
the format mmdd. It works fine for us, as range query works just fine.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Schema-fieldType-y-m-d-tp3335359p3336309.html
Sent from the Solr - User maili
After making another try, I found it worked with
I leave this to here in case someone may need this too.
Thanks
From: "dar...@ontrenet.com"
To: Xue-Feng Yang
Cc: solr-user@lucene.apache.org
Sent: Wednesday, September 14, 2011 10:52:55 AM
Subject: Re: gla
: Is there a way to use BigDecimal as a data type in solr? I am using solr
: 3.3.
if you just want to *store* BigDecimals in a solr index, then just use
StrField with the canonical representation -- but if you want to sort or
do range queries on the values, then no.
Given that BigDecimal value
Thanks for your reply.
Actually, some of the cores are working perfectly. So it's not the
solr.solr.home problem.
From: "Jaeger, Jay - DOT"
To: "solr-user@lucene.apache.org" ; 'Xue-Feng
Yang'
Sent: Wednesday, September 14, 2011 11:21:18 AM
Subject: RE: glas
Rahul is right.
You may add a script to change the date in data-import.properties to half
hour before the last modi time before each delta-import.
--
View this message in context:
http://lucene.472066.n3.nabble.com/DIH-delta-last-index-time-tp3334992p3336203.html
Sent from the Solr - User mail
Keep the default Search Operator as OR
And for phrase1, on splitting on whitespace just add "AND" instead of "+".
Hopefully this should work. Please do confirm.
--
View this message in context:
http://lucene.472066.n3.nabble.com/query-part-default-OR-and-part-default-AND-tp3335851p3336194.html
S
Some things to think about:
When solr starts up, solr should report for the location of solr home. Is it
what you expect?
Is there any security on the "dist" directory that would prevent solr from
accessing it?
Is there a classloader policy set on glassfish that could be getting in the way?
(y
I don't have enough experience with filter queries to advise well on when to
use fq vs. putting it in the query itself, but I do know that we are not using
filter queries, and with index sizes ranging from 7 Million to 27+ Million we
have not seen this kind of issue.
Maybe keeping 16,384 filter
Thanks for a quick reply.
I just tested as you suggested. The error is still there.
The setup line
is actually coming with solr 3.3 release not by me.
From: "dar...@ontrenet.com"
To: Xue-Feng Yang
Cc: solr-user@lucene.apache.org
Sent: Wednesday, Septe
Thank you for your reply.
I tried to give most of the information i can but obviously i missed some.
1. Just what does your "test script" do? Is it doing updates, or just
queries of the sort you mentioned below?
Test script only sends random queries.
2. If the test script is doing updates, how
Here's a thought.
If dist is under solr.solr.home but your lib dir is set to be
"../../dist".
Wouldn't the lib dir be relative to solr.solr.home and therefore should
just be "dist"?
On Wed, 14 Sep 2011 07:45:45 -0700 (PDT), Xue-Feng Yang
wrote:
> Hi all,
>
>
> I am trying set up solr 3.3 with
Hi all,
I am trying set up solr 3.3 with multicores in glassfish 3.1.1 and eclipse
indigo. I have the following error:
SEVERE: org.apache.solr.common.SolrException: Error loading class
'org.apache.solr.handler.dataimport.DataImportHandler'
However, I have a line
in solrconfig.xml and dis
There is nothing between the client app and the solr server, its on the same
machine and on the same app server, only going through the loopback interface.
Unfortunatly, I cannot reproduce it, but I see it in the server log.
Thanks
Michael
--
Michael Szalay
Senior Software Engineer
basis06 AG,
I think folks are going to need a *lot* more information. Particularly
1. Just what does your "test script" do? Is it doing updates, or just
queries of the sort you mentioned below?
2. If the test script is doing updates, how are those updates being fed to
Solr?
3. What version of Solr
Thank you. Should be awesome when its ready!
On Wed, 14 Sep 2011 10:25:26 -0400, Yonik Seeley
wrote:
> On Wed, Sep 14, 2011 at 10:17 AM, wrote:
>>
>> Hi,
>> I am very excited to see this direction for Solr. I realize its early
>> still,
>> but is there any thought as to what the target releas
On Wed, Sep 14, 2011 at 10:17 AM, wrote:
>
> Hi,
> I am very excited to see this direction for Solr. I realize its early
> still,
> but is there any thought as to what the target release date might be (this
> year? next?).
We've started to work on the new functionallity now, but an official
rel
I have not used SolrJ, but it probably is worth considering as a possible
suspect.
Also, do you have anything in between the client and the Solr server (a
firewall, load balancer, etc.?) that might play games with HTTP connections?
You might want to start up a network trace on the server or net
Just wanted to follow up and say thanks for all the valuable replies. I'm
in the process of testing everything.
Thanks,
Mike
On Mon, Sep 12, 2011 at 1:20 PM, Chris Hostetter
wrote:
>
> Beyond the suggestions already made, i would add:
>
> a) being really aggressive about stop words can help kee
Hi, i'm having performance troubles with solr. I don't know if i'm expection
too much from solr or i missconfigured solr.
When i run a single query its QTime is 500-1000~ ms (without any use of
caches).
When i run my test script (with use of caches) QTime increases
exponentially, reaching 8000~ to
Hi,
I am very excited to see this direction for Solr. I realize its early
still,
but is there any thought as to what the target release date might be (this
year? next?).
Also, will the new solr cloud support all query types including all forms
of faceting,
distributed IDF, ranging, sorting, pag
Hi Maria/Gora,
I see this as more of a problem with the timezones in which the Solr server
and the database server are located.
Is this true ?
If yes, one more possibility of handling this scenario would be to customize
DataImportHandler code as follows
1. Add one more configuration property name
Hi All,
I have two fields in my schema: field1, field2 , for the sake of the example
I'll define to phrases:
phrase1 - solr is the best fts ever
phrase2 - let us all contribute to open source for a better world
now I want to perform the next query:
field1:( phrase1) AND field2:(phrase2)
my defa
"auto*" is not a leading wildcard query, a leading wildcard query would be
"*car". Wildcard queries in general will take more time than regular
queries, the more close the wildcard is to the first character, the more
expensive the query is.
With a regular field type, Solr will allow wildcards (not
Just add a bogus 0 timestamp after it when you index it. That is what we did.
Dates are not stored or indexed as characters, anyway, so space would not be
any different one way or the other.
JRJ
-Original Message-
From: stockii [mailto:stock.jo...@googlemail.com]
Sent: Wednesday, Sep
> changed the configuration to point it to my solr dir and started it again
You might look in your logs to see where Solr thinks the Solr home directory is
and/or if it complains about not being able to find it. As a guess, it can't
find it, perhaps because solr.solr.home does not point to the
Hello.
i am fighting with the FunctionQuery of Solr.
I try to get a diff of today and an dateField. from this diff, i want do a
modulo from another field with values of 1,3,6,12
in a function somthing like this. ( i know that some functions are not
available in solr)
q={!func}$v2=0&v1=(NOW - $
On 9/13/2011 6:05 PM, Jamie Johnson wrote:
I know this isn't a solr specific question but I was wondering what
folks do in regards to managing the machines in their solr cluster?
Are there any recommendations for how to start/stop/manage these
machines? Any suggestions would be appreciated.
Wh
We are using SolrJ 3.1 as our http client...
So it may be a bug in there?
Regards Michael
--
Michael Szalay
Senior Software Engineer
basis06 AG, Birkenweg 61, CH-3013 Bern - Fon +41 31 311 32 22
http://www.basis06.ch - source of smart business
- Ursprüngliche Mail -
Von: "Jay Jaeger - D
Hi,
Is there a way to use BigDecimal as a data type in solr? I am using solr
3.3.
Thanks.
Looking at the source for Jetty, line 149 in Jetty's HttpOutput java file looks
like this:
if (_closed)
throw new IOException("Closed"); <
[http://www.jarvana.com/jarvana/view/org/eclipse/jetty/aggregate/jetty-all/7.1.0.RC0/jetty-all-7.1.0.RC0-sources.jar!/org/ec
Hi Ahmad,
While Solr is starting it writes the path to SOLR_HOME to the log. The
message looks something like:
Sep 14, 2011 9:14:53 AM org.apache.solr.core.SolrResourceLoader
>
INFO: Solr home set to 'solr/'
>
If you're running the example, SOLR_HOME is usually
apache-solr-3.3.0/example/solr
S
>Also, a question for Peter, at which stage do you use lucene analyzers
>on the query? After it was parsed into the tree, or before we start
>processing the query string?
I do the analysis before creating the tree. I'm pretty sure Lucene
QueryParser does this, too.
Peter
On Wed, Sep 14, 2011 at
FileListEntityProcessor pre-supposes it's looking at files on disk. it
doesn't know anything about the web. So, as the stack trace
indicates, it tries to open a directory called http://. and fails.
What is it you're really trying to do here? Perhaps if you explain
your higher-level problem we
Thanks Dmirty for the offer to help, I am using some caching in one of the
cores not. Earlier I was using on other cores too, but now I have commented
them out because of frequent OOM, also some warming up in one of the core. I
have share the links for my config files for all the 4 cores,
http:
Well, what is the "average of latitude and longitude"? If you're asking
for the average of all the docs that match, or the average of all the
docs in the corpus, no, I don't think you can unless you write a custom
plugin.
Something like this has been talked about, see:
https://issues.apache.org/j
Erick,
Also, we had our solrconfig where we have tried increasing the cache
making the below value for autowarm count as 0 helps returning the commit call
within the second, but that will slow us down on searches
-Original Message-
From: Tirthankar Cha
Erick,
Here is the answer to your questions:
Our index is 267 GB
We are not optimizing...
No we have not profiled yet to check the bottleneck, but logs indicate opening
the searchers is taking time...
Nothing except SOLR
Total memory is 16GB tomcat has 8GB allocated
Everything 64 bit OS and JVM
(11/09/14 15:54), Dmitry Kan wrote:
Hello list,
Not sure how many of you are still using solr 1.4 in production, but here is
an issue with highlighting, that we've noticed:
The query is:
(drill AND ships) OR rigs
Excerpt from the highlighting list:
Within the fleet of 27 floatingrigs<
The highlighter gives you snippets of text surrounding words (terms)
drawn from the query. The whole document should satisfy the query (ie
it probably has ships/s somewhere else in it), but each snippet won't
generally have all the terms.
-Mike
On 9/14/2011 2:54 AM, Dmitry Kan wrote:
Hello
Hi,
OK 64GB fits into one shard quite nicely in our setup. But I have never used
multicore setup. In total you have 79,9 GB. We try to have 70-100GB per
shard with caching on. Do you do warming up of your index on starting? Also,
there was a setting of pre-populating the cache.
It could also help
is it possible to index a datefield in the format of "y-m-d" ? i dont need
the timestamp. so i can save me some space.
which ways exists to search with a complex date-filter !?
-
--- System
One Server, 12 GB RAM, 2 Solr I
I found a partial solution.
Using ReverseStringFilterFactory instead ReverseWildcardFilterFactory and
searching after "rac*" will find "autocar" for example.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Shouldn-t-ReversedWildcardFilterFactory-resolve-leadingWildcard-tp33352
Hi Erick,
I have not done anything different. I downloaded the solr tar
from one of the mirror and then extracted it in the home directory started
jetty and it works fine.
For tomcat I copied the war file in my webapps folder and restarted
tomcat changed the configuration to poin
Hi Peter,
Yes, with the tree it is pretty straightforward. I'd prefer to do it
that way, but what is the purpose of the new qParser then? Is it just
that the qParser was built with a different paradigms in mind where
the parse tree was not in the equation? Anybody knows if there is any
advantage?
Hi Ahson,
http://wiki.apache.org/solr/FieldCollapsing
group.ngroups seems to be added as an parameter, so you may not be
needed to apply any patches.
Solr 3.3 had released the grouping feature with it, so I presume it
should already be included in it.
Regards,
Jayendra
On Wed, Sep 14, 2011 at 4
Hi,
I use the next fieldType:
What I want is to find autocar when I'm
Hi Dimtry,
To answer your questions,
-Do you use caching?
I do user caching, but will disable it and give it a go.
-How big is your index in size on the disk?
These are the size of the data folder for each of the cores.
Core1 : 64GB
Core2 : 6.1GB
Core3 : 7.9GB
Core4 : 1.9GB
Will try attaching a
On Wed, Sep 14, 2011 at 11:23 AM, Maria Vazquez
wrote:
> Hi,
> How do you handle the situation where the time on the server running Solr
> doesn¹t match the time in the database?
Firstly, why is that the case? NTP is pretty universal
these days.
> I¹m using the last_index_time saved by Solr in t
Hi Jayendra
Thanks a lot for your response, now i have two questions one that to get the
count of groups is it must to apply the specified patch, if so can you help me
a little how i can apply that patch in steps as i am new to solr/java.
Regards
Ahsan
- Original Message -
From: Jay
Hi Rohit,
Do you use caching?
How big is your index in size on the disk?
What is the stack trace contents?
The OOM problems that we have seen so far were related to the
index physical size and usage of caching. I don't think we have ever found
the exact cause of these problems, but sharding has h
Hi Chris,
Thanks for taking this. Sorry for my confusing explanation. Since you
requested a bigger picture, I'll give some more detail. In short: we don't
do date facets, and sorting by date in reverse order happens naturally by
design.
All the data is split to shards. We use logical sharding, no
Thank you a lot for your answers! They help me to understand better how query
parser works.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Weird-behaviors-with-not-operators-tp3323065p3335087.html
Sent from the Solr - User mailing list archive at Nabble.com.
84 matches
Mail list logo