Thanks much !!
Explorer -- Internet Explorer :) Sorry for the miscommunication. Yeah let me
check it once again.
appreciate all the help :)
krn
--
View this message in context:
http://lucene.472066.n3.nabble.com/Need-Help-in-Patching-OPENNLP-tp4052362p4053094.html
Sent from the Solr - User
OK - so I have my SOLR instance running on AWS.
Any suggestions on how to safely share the link? Right now, the whole SOLR
instance is totally open.
Gagandeep singh gagan.g...@gmail.com wrote:
say debugQuery=true&mlt=true and see the scores for the MLT query, not a
sample query. You can use
Hi
Recently solr crashed. I've found this in the error log.
My commit settings look like this:
<autoCommit>
  <maxTime>1</maxTime>
  <openSearcher>false</openSearcher>
</autoCommit>
<autoSoftCommit>
  <maxTime>2000</maxTime>
</autoSoftCommit>
The machine has 10GB
On some queries I get out of memory errors:
{error:{msg:java.lang.OutOfMemoryError: Java heap
space,trace:java.lang.RuntimeException:
java.lang.OutOfMemoryError: Java heap space\n\tat
org.apache.solr.servlet.SolrDispatchFilter.sendError(SolrDispatchFilter.java:462)\n\tat
Hi Arkadi,
this error usually indicates that virtual memory is not sufficient (should be
unlimited).
Please see http://comments.gmane.org/gmane.comp.jakarta.lucene.solr.user/69168
Regards,
André
From: Arkadi Colson [ark...@smartbit.be]
Sent:
Hmmm I checked it and it seems to be ok:
root@solr01-dcg:~# ulimit -v
unlimited
Any other tips or do you need more debug info?
BR
On 04/02/2013 11:15 AM, André Widhani wrote:
Hi Arkadi,
this error usually indicates that virtual memory is not sufficient (should be
unlimited).
Please see
The output is from the root user. Are you running Solr as root?
If not, please try again using the operating system user that runs Solr.
André
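A minimal sketch of how the limit could be checked for whichever user actually runs the servlet container, assuming a Unix-like host with Python 3 available. The function name virtual_memory_limit is made up for this illustration. Note that getrlimit reports bytes, whereas ulimit -v reports KiB, and that this inspects the limits of the script's own process, not those of an already running JVM.

```python
import os
import resource

def virtual_memory_limit():
    """Return (soft, hard) address-space limits as strings for the current process."""
    soft, hard = resource.getrlimit(resource.RLIMIT_AS)

    def fmt(value):
        # RLIM_INFINITY is the value the OS reports for "unlimited"
        return "unlimited" if value == resource.RLIM_INFINITY else str(value)

    return fmt(soft), fmt(hard)

soft, hard = virtual_memory_limit()
print("effective uid:", os.geteuid())
print("virtual memory soft limit:", soft)
print("virtual memory hard limit:", hard)
```

Running the script under the account in question (for example through sudo -u with that user name) shows the limits that account inherits.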
From: Arkadi Colson [ark...@smartbit.be]
Sent: Tuesday, April 2, 2013 11:26
To: solr-user@lucene.apache.org
It is running as root:
root@solr01-dcg:~# ps aux | grep tom
root 1809 10.2 67.5 49460420 6931232 ? Sl Mar28 706:29
/usr/bin/java
-Djava.util.logging.config.file=/usr/local/tomcat/conf/logging.properties -server
-Xms2048m -Xmx6144m -XX:PermSize=64m -XX:MaxPermSize=128m -XX:+UseG1GC
On Tue, 2013-04-02 at 11:09 +0200, Dotan Cohen wrote:
On some queries I get out of memory errors:
{error:{msg:java.lang.OutOfMemoryError: Java heap
[...]
org.apache.lucene.index.DocTermOrds.uninvert(DocTermOrds.java:273)\n\tat
On Tue, Apr 2, 2013 at 12:59 PM, Toke Eskildsen t...@statsbiblioteket.dk
wrote:
How many documents does your index have, how many fields do you facet on
and approximately how many unique values do your facet fields have?
8971763 documents, growing at a rate of about 500 per minute. We
I have seen exactly the same on Ubuntu Server 12.04. Adding some
swap space helped, but I do not understand why this is necessary, since the OS
ought to just use the actual memory mapped files if there is not room in
(virtual) memory, swapping pages in and out on demand. Note that I saw
this for
On Tue, 2013-04-02 at 12:16 +0200, Dotan Cohen wrote:
8971763 documents, growing at a rate of about 500 per minute. We
actually expect that to be ~5 per minute once we get out of
testing.
9M documents in a heavily updated index with faceting. Maybe you are
committing faster than the
Hello,
I'm using the Solr Collections API to create a collection.
http://127.0.0.1:8983/solr/admin/collections?action=CREATE&name=test2&numShards=1&replicationFactor=2&collection.configName=default
I'm expecting the new collection to be named test2; what I get instead is
test2_shard1_replica2. I don't want
The Collections API is a wrapper for the Core API.
If you don't want the API to define the name for you, then use the Core API;
with it you can define the collection name and the shard id.
curl
'http://localhost:8983/solr/admin/cores?action=CREATE&name=corename&collection=collection1&shard=XX'
--
Yago
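To avoid hand-assembling the query string, the Core API call above could be built programmatically. This is only a sketch: the helper core_create_url and the values test2_core, test2 and shard1 are hypothetical placeholders, and only the parameters shown in the curl example (action, name, collection, shard) are used.

```python
from urllib.parse import urlencode

def core_create_url(base, name, collection, shard):
    """Build a Core API CREATE URL with properly separated and escaped parameters."""
    params = {
        "action": "CREATE",
        "name": name,              # core name chosen by the caller
        "collection": collection,  # collection the core should belong to
        "shard": shard,            # shard id within that collection
    }
    return f"{base}/admin/cores?{urlencode(params)}"

url = core_create_url("http://localhost:8983/solr", "test2_core", "test2", "shard1")
print(url)
```

The resulting string can be passed to curl or any HTTP client in place of the hand-written URL.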
Also, I am assuming that the collection name in this case should be
'test2'. The replica names would be on the lines of what you've mentioned.
Is that not the case?
On Tue, Apr 2, 2013 at 5:31 PM, Lukasz Kujawa luk...@php.net wrote:
Hello,
I'm using Solr collections API to create a
Hi
I want to query documents which match certain dynamic criteria.
For example, how do I get all documents where sub(field1,field2) > 0?
I tried _val_: sub(field1,field2) and used fq:[_val_:[0 TO *]
But it doesn't work.
./Zahoor
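One approach that is often used for filtering on a function value is Solr's function range query parser, frange. The sketch below only builds the request URL; the helper names frange_filter and select_url, the base URL and the match-all main query are assumptions for illustration, and the filter itself is not something proposed in this thread.

```python
from urllib.parse import urlencode, parse_qs

def frange_filter(func, lower="0", include_lower=False):
    """Build an frange filter that keeps documents whose function value is above `lower`."""
    incl = "true" if include_lower else "false"
    return "{!frange l=%s incl=%s}%s" % (lower, incl, func)

def select_url(base, q, fq):
    """Assemble a /select URL; urlencode takes care of escaping the braces and spaces."""
    return base + "/select?" + urlencode({"q": q, "fq": fq})

fq = frange_filter("sub(field1,field2)")
url = select_url("http://localhost:8983/solr", "*:*", fq)
print(fq)
print(url)
```

With include_lower=True the filter would also keep documents where the difference is exactly 0.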
This link explains what is what:
http://wiki.apache.org/solr/SolrCloud#Glossary
A collection represents a single logical index. A SolrCore (a.k.a. core) encapsulates
a single physical index; one or more cores make up a logical shard, and one or more
shards make up a collection.
You can have a collection with the same
Oh, I see, essentially you want to get the sum of the term frequencies for
every term in a subset of documents (instead of the document frequency as
the FacetComponent would give you). I don't know of an easy/out of the box
solution for this. I know the TermVectorComponent will give you the tf for
On Tue, Apr 2, 2013 at 2:41 PM, Toke Eskildsen t...@statsbiblioteket.dk wrote:
9M documents in a heavily updated index with faceting. Maybe you are
committing faster than the faceting can be prepared?
I moved from Solr 4.1 to Solr 4.2 on one of the slave servers. Earlier my index
directory had index.timestamp, but now it has only an index folder with no
timestamp. Is this a bug? The size of the index is the same as on the master, though. The
dashboard shows replication running, with both master and slave versions.
what
A key problem with those approaches as well as Lucene's HighFreqTerms class
(
http://lucene.apache.org/core/4_2_0/misc/org/apache/lucene/misc/HighFreqTerms.html)
is that none of them seem to have the ability to combine with a date range
query...which is key in my scenario. I'm kinda thinking that
In this thread about performance on concurrent search requests, Otis said:
http://lucene.472066.n3.nabble.com/how-to-improve-concurrent-request-performance-and-stress-testing-td496411.html
Imagine this type of code:
synchronized (someGlobalObject) {
// search
}
What happens when 100
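The effect of such a global lock can be reproduced with a small experiment. The sketch below is a Python analogue rather than the Java from the quote; peak_concurrency is a made-up helper, and the short sleep merely stands in for real search work.

```python
import threading
import time

def peak_concurrency(n_threads, use_global_lock):
    """Return the highest number of 'searches' that ran at the same time."""
    global_lock = threading.Lock()   # stands in for someGlobalObject
    counter_lock = threading.Lock()  # protects the bookkeeping below
    state = {"active": 0, "peak": 0}

    def search():
        if use_global_lock:
            global_lock.acquire()
        try:
            with counter_lock:
                state["active"] += 1
                state["peak"] = max(state["peak"], state["active"])
            time.sleep(0.01)  # placeholder for the actual search work
            with counter_lock:
                state["active"] -= 1
        finally:
            if use_global_lock:
                global_lock.release()

    threads = [threading.Thread(target=search) for _ in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return state["peak"]

print(peak_concurrency(20, use_global_lock=True))   # never more than one search at a time
print(peak_concurrency(20, use_global_lock=False))  # typically several at once
```

With the global lock every request waits for the one before it, so adding threads adds queueing time rather than throughput.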
(13/04/02 21:45), Furkan KAMACI wrote:
Is there any documentation like a flow chart of Solr? I.e.
documents come into Solr (maybe indicating which classes receive the documents),
go through the parsing process (stemming etc.), and then the inverted
indexes are built, and so on and so forth?
There
On Tue, 2013-04-02 at 15:55 +0200, Dotan Cohen wrote:
[Toke: maxWarmingSearchers limit exceeded?]
Thank you Toke, this is exactly on my list of things to learn about
Solr. We do get the error mentioned and we cannot reduce the amount
of commits. Also, I do believe that we have the necessary
On Tue, Apr 2, 2013 at 5:33 PM, Toke Eskildsen t...@statsbiblioteket.dk wrote:
On Tue, 2013-04-02 at 15:55 +0200, Dotan Cohen wrote:
[Toke: maxWarmingSearchers limit exceeded?]
Thank you Toke, this is exactly on my list of things to learn about
Solr. We do get the error mentioned and we
On 04/02/2013 04:20 PM, Koji Sekiguchi wrote:
(13/04/02 21:45), Furkan KAMACI wrote:
Is there any documentation like a flow chart of Solr? I.e.
documents come into Solr (maybe indicating which classes receive the documents),
go through the parsing process (stemming etc.), and then
Actually, maybe the most important core part is the Analysis part in the
last diagram, but there is nothing about it (i.e. stemming, lemmatizing, etc.)
in any of them.
2013/4/2 Andre Bois-Crettez andre.b...@kelkoo.com
On 04/02/2013 04:20 PM, Koji Sekiguchi wrote:
(13/04/02 21:45), Furkan
The index folder is indeed gone but it seems to work. Maybe just a
structural change...
Kind regards
Arkadi Colson
Smartbit bvba • Hoogstraat 13 • 3670 Meeuwen
T +32 11 64 08 80 • F +32 11 64 08 81
On 04/02/2013 04:08 PM, yayati wrote:
I moved solr 4.1 to solr 4.2 on one of
On Tue, Apr 2, 2013 at 5:33 PM, Toke Eskildsen t...@statsbiblioteket.dk wrote:
Memory does not help you if you commit too frequently. If you commit
each X seconds and warming takes X+Y seconds, then you will run out of
memory at some point.
How might I time the warming? I've been googling
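The quoted point about commit interval versus warming time can be illustrated with a toy model. It assumes, purely for illustration, that warm-ups are processed one at a time in commit order and that each takes a fixed time; this is not how Solr actually schedules warming, and pending_warmups is a made-up helper.

```python
def pending_warmups(commit_interval, warm_cost, n_commits):
    """Count searchers still waiting or warming right after each commit (FIFO toy model)."""
    free_at = 0          # time at which the single warm-up slot becomes free
    finish_times = []    # completion time of each searcher's warm-up
    pending = []
    for i in range(n_commits):
        t = i * commit_interval
        start = max(t, free_at)      # a warm-up cannot start before the previous one ends
        free_at = start + warm_cost
        finish_times.append(free_at)
        pending.append(sum(1 for f in finish_times if f > t))
    return pending

print(pending_warmups(2, 3, 6))  # warming slower than commits: [1, 2, 2, 2, 3, 3]
print(pending_warmups(2, 1, 6))  # warming faster than commits: [1, 1, 1, 1, 1, 1]
```

In this model the backlog grows without bound whenever warm_cost exceeds commit_interval, and each pending searcher holds on to memory while it waits.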
How often do you commit and how many unique values do your facet
fields have?
Most of the time I facet on one field that has about twenty unique
values. However, once per day I would like to facet on the text field,
which is a free-text field usually around 1 KiB (about 100 words), in
order
For beginners it is complicated to understand the complexity of Solr / Lucene. I'm
trying to develop a custom search component and it's too hard to keep in mind the flow,
inheritance and interaction between classes. I think that there is a gap between
software doc and user doc, or maybe I haven't searched enough
You are right to mention developer doc and user doc. Users differ
in this respect. Some of them use Solr for indexing and monitoring via the admin interface
and that is quite enough for them; however, some people want to modify it,
so it would be nice if there were some documentation for developer
Please add RyanErnst to the contributors group. Thanks!
On Mon, Apr 1, 2013 at 7:04 PM, Steve Rowe sar...@gmail.com wrote:
On Apr 1, 2013, at 9:40 PM, Vaillancourt, Tim tvaillanco...@ea.com
wrote:
I would also like to contribute to SolrCloud's wiki where possible.
Please add myself
On Apr 2, 2013, at 11:23 AM, Ryan Ernst r...@iernst.net wrote:
Please add RyanErnst to the contributors group. Thanks!
Added to solr wiki ContributorsGroup.
On 04/02/2013 05:04 PM, Dotan Cohen wrote:
How might I time the warming? I've been googling warming since your
earlier message but there does not seem to be any really good
documentation on the subject. If there is anything that you feel I
should be reading I would appreciate a link or a keyword
Hi;
Please add FurkanKAMACI to the group.
Thanks;
Furkan KAMACI
2013/4/2 Steve Rowe sar...@gmail.com
On Apr 2, 2013, at 11:23 AM, Ryan Ernst r...@iernst.net wrote:
Please add RyanErnst to the contributors group. Thanks!
Added to solr wiki ContributorsGroup.
We have openings for Middleware architects (Apache solr)
Locations: Mountain View, California; New York City, NY; Houston, Texas
Mail me your resumes to jess...@kudukisgroup.com.
We can discuss more over the phone.
Thanks,
Jessica
On Apr 2, 2013, at 11:28 AM, Furkan KAMACI furkankam...@gmail.com wrote:
Please add FurkanKAMACI to the group.
Added to solr wiki ContributorsGroup.
Hello,
I am using the new collection alias feature, and it seems the
CloudSolrServer class (solrj 4.2.0) does not allow using it, either for
update or select.
When I query the CloudSolrServer with a collection alias name, I
get the error:
org.apache.solr.common.SolrException: Collection
If I use the admin API instead of the collections API then, according to my understanding,
the new core will only be available on that server. If I query a
different Solr server I will get an error. If I use the collections API and
query a server which doesn't physically hold the data, I will still get
results.
The Solr URL in Solr 4.2 for my localhost installation looks like this:
http://localhost:8883/solr/#/development_shard1_replica1
This URL, when constructed dynamically in Ruby, will not validate with the
Ruby URI::HTTP class because of the # sign in the path. This is a
non-standard URL as per
: The Solr URL in Solr 4.2 for my localhost installation looks like this:
: http://localhost:8883/solr/#/development_shard1_replica1
:
: This URL, when constructed dynamically in Ruby, will not validate with the
: Ruby URI::HTTP class because of the # sign in the path. This is a
: non-standard URL
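For reference, a generic URL parser treats everything after the # as a fragment, which a client does not send to the server. The sketch below uses Python's standard library rather than Ruby to show how the quoted URL splits; core_base_url is a made-up helper, and it assumes the fragment starts with the core name.

```python
from urllib.parse import urlsplit

admin_url = "http://localhost:8883/solr/#/development_shard1_replica1"
parts = urlsplit(admin_url)
print(parts.path)      # /solr/
print(parts.fragment)  # /development_shard1_replica1

def core_base_url(admin_ui_url):
    """Derive a plain core URL from an admin UI URL (assumes fragment is '/<core>[/...]')."""
    p = urlsplit(admin_ui_url)
    core = p.fragment.strip("/").split("/")[0]
    return f"{p.scheme}://{p.netloc}{p.path.rstrip('/')}/{core}"

print(core_base_url(admin_url))
# http://localhost:8883/solr/development_shard1_replica1
```

The admin UI address is meant for a browser; programmatic requests would normally go to the core URL without the # part.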
I think there is a gap in the support of one's path of learning Solr. I'll
try to describe it based on my own experience. Hopefully, it is helpful.
At first, there is a "Solr is a black box" stage, where the person may not
know Java and is just using out of the box components. Wiki is reasonably
Solr 4.2 implements a feature to proxy requests if the core does not exist on the
requested node. https://issues.apache.org/jira/browse/SOLR-4210
There is currently a bug in this mechanism:
https://issues.apache.org/jira/browse/SOLR-4584
Without the proxy feature, creating the cores using manually or on
Thank you for your answers Yriveiro. I'm trying to use Solr for a big SaaS
platform. The reason why I want everything dynamic is that each user will get their own
Solr collection. It looks like there are still many issues with the
distributed computing. I hope 4.3 will arrive soon ;-) Anyway.. once again
Alexandre,
You describe the normal path when a beginner tries to use source code that
he doesn't understand: black box, reading code, hacking, "ok, now I know 10% of the
project", with luck :p.
First of all, the Solr community is fantastic and always helps when I need it.
IMHO the devel
I use Solr for a similar purpose; I understand that you want to have control
over how the sharding is done :)
Regards.
--
Yago Riveiro
Sent with Sparrow (http://www.sparrowmailapp.com/?sig)
On Tuesday, April 2, 2013 at 5:54 PM, Lukasz Kujawa wrote:
Thank you for you answers Yriveiro. I'm
Answers inline:
On Apr 2, 2013, at 11:45 AM, Elodie Sannier elodie.sann...@kelkoo.fr wrote:
Hello,
I am using the new collection alias feature, and it seems
CloudSolrServer class (solrj 4.2.0) does not allow to use it, either for
update or select.
When I'm requesting the CloudSolrServer
I am thinking about trying to structure a problem as a Solr plugin. The
nature of the plugin is that it would need to read and write the lucene
index to do its work. It could not be cleanly split into URP 'over here'
and a Search Component 'over there'.
Are there invariants of Solr that would
I think the current tests probably build the cloudsolrserver before creating
the aliases - sounds like we need to do some creating the cloudsolrserver after.
- Mark
On Apr 2, 2013, at 1:31 PM, Mark Miller markrmil...@gmail.com wrote:
Answers inline:
On Apr 2, 2013, at 11:45 AM, Elodie
Yago,
My point - perhaps lost in too much text - was that Solr is presented - and
can function - as a black box. That makes it different from more
traditional open-source projects. So, stage 2 happens exactly when the
non-programmers have to cross the boundary from the black-box into
Hello,
does a WADL exist for the REST service of Solr?
Ciao
Peter Schütt
I've created https://issues.apache.org/jira/browse/SOLR-4664
- Mark
On Apr 2, 2013, at 2:07 PM, Mark Miller markrmil...@gmail.com wrote:
I think the current tests probably build the cloudsolrserver before creating
the aliases - sounds like we need to do some creating the cloudsolrserver
Thanks Koji, this helped with some of our problems, but it is still not perfect.
This query, for example, returns no highlighting:
?q=id:abc123&hl.q=text_it_IT:l'assieme&hl.fl=text_it_IT&hl=true&defType=edismax
But this one does (when it is, in effect, the same query):
Take myself as an example. I started researching
Solr just a few weeks ago. I have learned about Solr and its related projects. My
next step is writing down the main steps of Solr. We have separated the learning
curve of Solr into two main categories.
The first one is those who are using it as out
I am currently looking at moving our Solr cluster to 4.2 and noticed a
strange issue while testing today. Specifically the replica has a higher
version than the master which is causing the index to not replicate.
Because of this the replica has fewer documents than the master. What
could cause
I don't think the versions you are thinking of apply here. Peersync does not
look at that - it looks at version numbers for updates in the transaction log -
it compares the last 100 of them on leader and replica. What it's saying is
that the replica seems to have versions that the leader does
Note that the pf field already parses this syntax as of 4.0, but then it is
used as a phrase-slop value. You could probably use the same parsing code for qf.
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Solr Training - www.solrtraining.com
29 March 2013 at 18:33, wrote
If you want to highlight, you need to turn on highlighting for the actual field
you search, and that field needs to be stored, i.e. hl.fl=ContentSearchPhonetic
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Solr Training - www.solrtraining.com
1 Apr 2013 at 14:16
Looking at the master it looks like at some point there were shards that
went down. I am seeing things like what is below.
INFO: A cluster state change: WatchedEvent state:SyncConnected
type:NodeChildrenChanged path:/live_nodes, has occurred - updating... (live
nodes size: 12)
Apr 2, 2013 8:12:52
here is another one that looks interesting
Apr 2, 2013 7:27:14 PM org.apache.solr.common.SolrException log
SEVERE: org.apache.solr.common.SolrException: ClusterState says we are the
leader, but locally we don't think so
at
Hi, I have a field that is defined to be of type text_en. Occasionally, I
notice that lengthy strings are converted to hash symbols. Here is a
snippet of my field type:
<fieldType name="text_en" class="solr.TextField" positionIncrementGap="100">
<analyzer type="index">
<tokenizer
sorry for spamming here
shard5-core2 is the instance we're having issues with...
Apr 2, 2013 7:27:14 PM org.apache.solr.common.SolrException log
SEVERE: shard update error StdNode:
http://10.38.33.17:7577/solr/dsc-shard5-core2/:org.apache.solr.common.SolrException:
Server at
Can you enter the text on the Solr Admin UI Analysis page? Then you could
tell at which stage the issue occurs.
StandardTokenizer has a default token length limit of 255. You can override it
with the maxTokenLength attribute:
<tokenizer class="solr.StandardTokenizerFactory"
: Here is an example of the field's value:
: str
:
Sorry I didn't ask the obvious question. Is there anything else that I
should be looking for here and is this a bug? I'd be happy to troll
through the logs further if more information is needed, just let me know.
Also what is the most appropriate mechanism to fix this. Is it required to
kill
It would appear it's a bug given what you have said.
Any other exceptions would be useful. Might be best to start tracking in a JIRA
issue as well.
To fix, I'd bring the behind node down and back again.
Unfortunately, I'm pressed for time, but we really need to get to the bottom of
this and
In our use cases, for certain query terms, we want to redirect the query
processing to an external system;
for the rest of the keywords, we want to continue with the query component,
facets, etc.
Is it possible to skip some components in a request handler based on some
condition?
Isn't this an AWS security groups question? You should probably post this
question on the AWS forums, but for the moment, here's the basic reading
material - go set up your EC2 security groups and lock down your systems.
(13/04/03 5:27), Van Tassell, Kristian wrote:
Thanks Koji, this helped with some of our problems, but it is still not
perfect.
This query, for example, returns no highlighting:
?q=id:abc123&hl.q=text_it_IT:l'assieme&hl.fl=text_it_IT&hl=true&defType=edismax
But this one does (when it is, in
I brought the bad one down and back up and it did nothing. I can clear the
index and try 4.2.1. I will save off the logs and see if there is anything
else odd
On Apr 2, 2013 9:13 PM, Mark Miller markrmil...@gmail.com wrote:
It would appear it's a bug given what you have said.
Any other
Mark
Is there a particular JIRA issue that you think may address this? I read
through it quickly but didn't see one that jumped out
On Apr 2, 2013 10:07 PM, Jamie Johnson jej2...@gmail.com wrote:
I brought the bad one down and back up and it did nothing. I can clear
the index and try 4.2.1. I
Hi Peter,
I'm afraid we don't have anything that formal... almost empty:
http://search-lucene.com/?q=wadl&fc_project=Solr
Otis
--
Solr & ElasticSearch Support
http://sematext.com/
On Tue, Apr 2, 2013 at 6:38 AM, Peter Schütt newsgro...@pstt.de wrote:
Hallo,
does a WADL exists for the REST
Below is my query
http://localhost:8983/solr/select/?q=subject:session management in
php&fq=category:[*%20TO%20*]&fl=category,score,subject
The result is like below
<?xml version="1.0" encoding="UTF-8"?>
<response>
<lst name="responseHeader">
<int name="status">0</int>
<int name="QTime">983</int>
Thanks a lot Erick for trying this out.
Will wait for a reply from your end.
Thanks & Regards,
Soumya.
-----Original Message-----
From: Erick Erickson [mailto:erickerick...@gmail.com]
Sent: 01 April 2013 05:46 PM
To: solr-user@lucene.apache.org
Subject: Re: Solr Phonetic Search Highlight