You can test our UIMA-to-Solr CAS consumer.
It is based on JulieLab Lucas and uses their CAS.
but transformed to generate XML which can be saved to a file or posted
directly to Solr
In the map file you can define which information is generated for each
token, and how it is concatenated, allowing the
Hi,
This is probably a really dumb question, but how can I find out which version
of Solr is currently running on my (Windows) system? I can't seem to find
anything in the Solr Admin interface or the Tomcat Manager.
Thanks,
Marc
Go to the Solr admin and click on Info; right in the first line you see the
Solr version.
-Original Message-
From: Marc Wilson [mailto:wo...@fancydressoutfitters.co.uk]
Sent: Tuesday, March 2, 2010 09:55
To: Solr
Subject: Solr Version
Hi,
This is probably a really
little question: what's the difference between a MustHavePermission and a
protected
document?
At the moment we are developing a new search for our intranet using Solr.
We also have some protected documents and implemented the same kind of filter as
you.
I am just thinking about using a true filter
Thank you very much! But I have a problem with the URL :) If I send the request using
the GET method, I get:
http://localhost/russian/result.php?search=%EF%F0%E8%E2%B3%F2
I use the PHP function urldecode! If I print the result, I get привіт! But if I
send the request to Solr, my q param = пїЅпїЅпїЅпїЅпїЅ!
--
View
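The bytes in that URL look like Windows-1251 rather than UTF-8, which would explain the garbled q parameter. Here is a minimal Python sketch of the re-encoding step, assuming the form was submitted as Windows-1251 and that Solr (and the servlet container in front of it) expects UTF-8 in the query string. The function name is made up for illustration.

```python
from urllib.parse import quote, unquote


def reencode_for_solr(raw_value, source_charset="cp1251"):
    """Decode a percent-encoded value from source_charset and
    re-encode it as UTF-8 percent-escapes."""
    # Interpret the escaped bytes in the charset the browser really used.
    text = unquote(raw_value, encoding=source_charset)
    # quote() encodes str values as UTF-8 by default.
    return text, quote(text, safe="")


text, utf8_value = reencode_for_solr("%EF%F0%E8%E2%B3%F2")
print(text)        # привіт
print(utf8_value)  # %D0%BF%D1%80%D0%B8%D0%B2%D1%96%D1%82
```

In PHP the equivalent step would be a charset conversion (for example with iconv or mb_convert_encoding) after urldecode and before building the Solr request.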
doesn't sound like you need to add the complexity of breaking it up into 500
record chunks
plenty of memory and a quad-core+ system and you should be fine with the kind
of load you are talking about
after all, should load test it first before you try any optimization tricks
like this
We have 2 different options in our acl:
Someone has access using group1 OR group1,...
Or someone has access using role1: group1 AND group2,...
I could solve this problem by resolving the roles when the user logs in,
so the session knows which roles (group1 AND group2,...) the user has:
Hi,
I am planning to develop an application in which users can update
their account data after login; this is on top of the search facility users
have. The basic workflow is
1) user logs in
2) searches for some data
3) gets the results from solr index
4) save some of the search results
Hi,
How can I search using stopwords? My queries look like this:
This - 0 results because it is a stopword
is - 0 results because it is a stopword
that - 0 results because it is a stopword
If I search for This is that, it must give the result.
For that I need
Hi Andy,
It sounds like you may want to have a look at tree faceting:
https://issues.apache.org/jira/browse/SOLR-792
Date: Mon, 1 Mar 2010 18:23:51 -0800
From: angelf...@yahoo.com
Subject: Implementing hierarchical facet
To: solr-user@lucene.apache.org
I read that a simple
As long as the document id is unique, concurrent writes are fine.
If for some reason the same doc id is used, then it is overwritten, so the last one in
will be the one that is in the index.
Ron
- Original Message -
From: Kranti™ K K Parisa kranti.par...@gmail.com
To:
Hi Ron,
Thanks for the reply. So does this mean that the writer lock has nothing to do
with concurrent writes?
Best Regards,
Kranti K K Parisa
On Tue, Mar 2, 2010 at 4:19 PM, Ron Chan rc...@i-tao.com wrote:
as long as the document id is unique, concurrent writes is fine
if for same reason the
Hi All
Is there a post request method to clean the index?
I have removed my index folder and restarted Solr, and it's still showing
documents in the stats.
I have run this post request:
http://localhost:8983/solr/core1/update?optimize=true
I get no errors, but the stats still show my 4
Hi,
I'm using the default text field type that comes with the example.
When searching for simple words as 'HP' or 'TCS' solr is returning
results that contains 'HP1' or 'TCS'
Is there a way to avoid this?
Thanks,
Frederico
Hi,
I need to implement a search where i should count the number of times
the string appears on the search field,
ie: only return articles that mention the word 'HP' at least 2x.
I'm currently doing this after the SOLR search with my own methods.
Is there a way that SOLR does this type of
Thanks for the responses. This is exactly what I had to resort to. I will
definitely put in a feature request to get the generated ID back from the
extract request.
I am doing this with PHP cURL for extraction and pecl php solr for
querying. I am then saving the unique id and dupe hash in a
This is a classic problem with Stopword removal. Have you tried
just removing stopwords from the indexing definition and the
query definition and reindexing?
You can't search on them no matter what you do if they've
been removed, they just aren't there
HTH
Erick
On Tue, Mar 2, 2010 at 5:47
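To make the point concrete, here is a toy Python model of an analyzer with and without stopword removal. It is only a sketch: the stopword list and the whitespace tokenizer are assumptions for illustration and are much simpler than Solr's real analysis chain.

```python
STOPWORDS = {"this", "is", "that"}  # hypothetical stopword list


def analyze(text, remove_stopwords):
    """Lowercase, split on whitespace, optionally drop stopwords."""
    tokens = text.lower().split()
    if remove_stopwords:
        tokens = [t for t in tokens if t not in STOPWORDS]
    return tokens


# With stopword removal nothing is left to index or to search for.
print(analyze("This", remove_stopwords=True))            # []
print(analyze("This is that", remove_stopwords=True))    # []
# Without it, every word survives and can be matched.
print(analyze("This is that", remove_stopwords=False))   # ['this', 'is', 'that']
```

The same analysis has to be applied at index time and at query time, which is why a reindex is needed after changing the field definition.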
hello
I use Solr in my cakePHP Framework.
How can I get status information for my Solr cores?
I don't want to analyze the response XML every time.
Does anybody know a nice way to get status messages from Solr?
thx ;) Jonas
I think that's because of the internal tokenization that Solr does. If a
document contains HP1, and you're using the default text field type, Solr
would tokenize that to HP and 1, so that document figures in the list of
documents containing HP, and hence that document appears in the search
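The splitting described above can be imitated in a few lines of Python. This is only a rough approximation of the letter/digit splitting, written for illustration; it is not Solr's actual filter code, and the function name is made up.

```python
import re


def split_on_letter_digit_boundaries(token):
    """Split a token into runs of letters and runs of digits."""
    return re.findall(r"[A-Za-z]+|[0-9]+", token)


print(split_on_letter_digit_boundaries("HP1"))  # ['HP', '1']
print(split_on_letter_digit_boundaries("HP"))   # ['HP']

# A query for HP therefore matches a document that only contains HP1.
print("HP" in split_on_letter_digit_boundaries("HP1"))  # True
```

If that behaviour is unwanted, the usual route is to adjust or remove the splitting filter in the field type and reindex.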
My very first guess would be that you're removing an index that isn't
the one your SOLR configuration points at.
Second guess would be that your browser is caching the results of
your first query and not going to SOLR at all. Stranger things have
happened G.
Third guess is you've mis-identified
Expanding on Siddant's comment, look carefully at
WordDelimiterFilterFactory, as I remember it's in the default
schema definition.
This page helps:
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters
Erick
On Tue, Mar 2, 2010 at
Hi,
How do I search for an exact match such as The Books of Three? If I search for
this, I get the exact result plus
some results related to Books. In my schema.xml file I have changed the field type
to String instead of Text, but it makes no difference.
Ideally Solr would be aware of the hierarchy structure and
send back responses accordingly.
If I understand it correctly, SOLR-64 supports them I think?
So at level 1 Solr will send back facet values based on country (100
or so values).
facet=on&facet.depth=1 ?
Level 2 the facet values
Hi
Have you tried the php_solr extension from PECL? It has a handy
SolrPingResponse class.
Or you could just call the CORENAME/admin/ping?wt=phps URL and unserialize
it.
Regards,
--
I N S T A N T | L U X E - 44 rue de Montmorency | 75003 Paris | France
Tél. : 01 80 50 52 51 | Mob. : 06 09 96
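For readers not on PHP, the same ping idea can be sketched in Python using the JSON response writer instead of wt=phps. With wt=phps the response body is a PHP-serialized array, which is why PHP's unserialize() turns it straight into a native array. The host, port and core name below are taken from another message in this digest, and the sample response body is a hypothetical stand-in, not captured output.

```python
import json
from urllib.parse import urlencode


def ping_url(base_url, core):
    """Build the ping URL for one core, asking for a JSON response."""
    return f"{base_url}/{core}/admin/ping?{urlencode({'wt': 'json'})}"


def ping_ok(body):
    """Return True if a ping response body reports status OK."""
    return json.loads(body).get("status") == "OK"


# Hypothetical response body, shaped like a typical ping answer.
sample = '{"responseHeader": {"status": 0, "QTime": 1}, "status": "OK"}'

print(ping_url("http://localhost:8983/solr", "core1"))
print(ping_ok(sample))
```

Fetching the URL with any HTTP client and passing the body to ping_ok gives a simple up/down check per core without parsing XML.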
Don't remove stopwords if you want to search on them. --wunder
On Mar 2, 2010, at 5:43 AM, Erick Erickson wrote:
This is a classic problem with Stopword removal. Have you tried
just removing stopwords from the indexing definition and the
query definition and reindexing?
You can't search on
Hey-
No, I use the SolrPHPClient http://code.google.com/p/solr-php-client/
I don't really want to use two different PHP libs. ^^
What do you mean by unserialize? XD
Guillaume Rossolini-2 wrote:
Hi
Have you tried the php_solr extension from PECL? It has a handy
SolrPingResponse
Ha
Now I feel stupid !!
I had a misspelling in the data path and you were correct.
Can I ask, Erick, was the command correct though?
Thank you
Lee
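On the question of the command: optimize merges index segments but does not remove documents, so it would not empty the index. The usual way to clear an index over HTTP is a delete-by-query for all documents followed by a commit. Below is a minimal Python sketch that builds both requests against the update URL quoted in this thread; the helper name is made up, and the actual sending is left commented out so nothing is deleted by accident.

```python
import urllib.request

UPDATE_URL = "http://localhost:8983/solr/core1/update"  # URL from the thread


def build_update_request(xml_body, url=UPDATE_URL):
    """Wrap an XML update message in a POST request object."""
    return urllib.request.Request(
        url,
        data=xml_body.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )


# Delete every document, then make the deletion visible with a commit.
delete_all = build_update_request("<delete><query>*:*</query></delete>")
commit = build_update_request("<commit/>")

# To really send them to a running Solr instance:
# for request in (delete_all, commit):
#     urllib.request.urlopen(request).read()

print(delete_all.get_method(), delete_all.full_url)
print(delete_all.data)
```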
On 2 Mar 2010, at 13:54, Erick Erickson wrote:
My very first guess would be that you're removing an index that isn't
the one your SOLR
Hi, how do I properly index HTML documents? All the documents are HTML, some
containing characters encoded like &#x17E;&#xED; ... Is there a character
filter for filtering these codes? Is there a way to strip the HTML tags out?
Does Solr weight the terms in the document based on where they appear?
There is an HTML filter documented here, which might be of some help -
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripCharFilterFactory
Control characters can be eliminated using code like this -
or you can try the commongrams filter that combines tokens next to a stopword
On Tue, Mar 2, 2010 at 6:56 AM, Walter Underwood wun...@wunderwood.org wrote:
Don't remove stopwords if you want to search on them. --wunder
On Mar 2, 2010, at 5:43 AM, Erick Erickson wrote:
This is a classic
Hi Paul
Thank you for your answer
I did put all the directory structure on /raid ... /raid/solr_env/solr ... ,
/raid/solr_env/jetty ...
And it still didn't work even after I applied patch SOLR-1736
I am investigating if this is because tempDir and data dir are not on the same
partition
matt
Hi Mani,
Mani EZZAT wrote:
I'm dynamically creating cores with a new index, using the same schema
and solrconfig.xml
Does the problem occur if you use the same configuration in a single, static
core?
Tom
The last time I tried using SolrPHPClient for this stuff, it did not really
handle the response very well because of the JSON response generated on the
server side.
I am not sure if anything has changed since then.
The JSON code generated could not be parsed properly.
If you do not want to
I think this issue is not related to patch SOLR-1736
Here is the error I get ... Thank you for any help
[2010-03-02 19:07:26] [pool-3-thread-1] ERROR(ReplicationHandler.java:266) -
SnapPull failed
org.apache.solr.common.SolrException: Unable to download _7bre.fdt completely.
Downloaded
Hello all,
I'm having a hard time trying to change Solr queries logging level.
I've tried a lot of things I've found on the internet, on this mailing list and
in the Solr docs.
What I've found so far:
- Solr Embedded Server uses the slf4j lib as a logging intermediary. Here I'm
using Log4j as my logging
Hi, guys,
I have a Solr index, and I need it to ignore accents and special characters.
E.g.: São Paulo = Sao Paulo, cadarço = cadarco. I know we could use a
synonym, but I guess Solr already has a filter or plugin for these cases.
Does anyone know how to do it?
Att,
Paulo Marinho
I have a Solr index, and I need it to ignore accents and
special characters.
E.g.: São Paulo = Sao Paulo, cadarço = cadarco. I
know we could use a
synonym, but I guess Solr already has a filter or plugin
for these cases.
Does anyone know how to do it?
ASCIIFoldingFilterFactory[1] or charFilter
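To show what accent folding does to the two examples from the question, here is a small Python sketch. It approximates the idea with Unicode decomposition; it is not the Solr filter itself, which also maps characters that have no decomposition, and the function name is made up.

```python
import unicodedata


def fold_accents(text):
    """Strip combining marks after decomposing accented characters."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(fold_accents("São Paulo"))  # Sao Paulo
print(fold_accents("cadarço"))    # cadarco
```

As with any analysis change, the folding has to be applied both when indexing and when querying, so that São Paulo and Sao Paulo end up as the same terms.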
The action CREATE creates a new core based on preexisting
instanceDir/solrconfig.xml/schema.xml, and registers it.
That's what the documentation is stating.
Is there a way to instruct Solr to create the instanceDir if it does not exist?
I'm trying to create a new core based on an existing
I've noticed that fields that I define as index=false in the
schema.xml are still searchable. Here's the definition of the field:
<field name="object_id" type="string" index="false" stored="true"
multiValued="false"/>
or
<field name="object_id" type="string" index="false" stored="false"
multiValued="false"/>
I
I've noticed that fields that I
define as index=false in the
schema.xml are still searchable.
indexed=false defined fields are neither searchable nor sortable.
Did you re-start servlet container and re-index your documents after changing
this attribute in schema.xml?
Is there a way to return a function value in search results besides using
score?
Hi everyone,
I'm new to Solr and just getting it set up and testing it out. I'd
like to know if there's a way to give a different weight to different
data fields.
For an example, I'm going to be storing song information. I have the
fields: Artist, Title, Description, and Tags. I'd
Hi,
I would like Solr to return the record from /exampledocs/hd.xml when I
search for the value 6H500F0 (which is the ID field for the 2nd record in
that file).
I know there is a setting that I should change to get this done, but I can't
locate it.
Field name ID is already included in schema.xml
My schema has always had index=false for that field. I only stopped and
restarted the servlet container when I added a document to the index using the
Lucene API instead of Solr.
-Original Message-
From: Ahmet Arslan [mailto:iori...@yahoo.com]
Sent: Tuesday, March 02, 2010 1:01 PM
To:
I'm new to Solr and just getting it set up
and testing it out. I'd like to know if there's a way
to give a different weight to different data fields.
For an example, I'm going to be storing song
information. I have the fields: Artist, Title,
Description, and Tags. I'd like
Hi,
I would like Solr to return the record from
/exampledocs/hd.xml when I
search for the value 6H500F0 (which is the ID field for
the 2nd record in
that file).
I know there is a setting that I should change to get this
done, but I can't
locate it.
Field name ID is already included in
My schema has always had
index=false for that field. I only stopped and
restarted the servlet container when I added a document to
the index using the Lucene API instead of Solr.
Is there a special reason/use-case for adding documents using the Lucene API?
Thanks for the reply.
Is there a place in the config file where I can set it to explicitly search
the fields I want?
On Tue, Mar 2, 2010 at 3:17 PM, Ahmet Arslan iori...@yahoo.com wrote:
Hi,
I would like Solr to return the record from
/exampledocs/hd.xml when I
search for the value
For testing purposes. I just wanted to see if unindexed fields in documents
added by Lucene API were searchable by Solr. This is after discovering that
the unindexed fields in documents added by Solr are searchable.
-Original Message-
From: Ahmet Arslan [mailto:iori...@yahoo.com]
Sent:
Thanks for the reply.
Is there a place in the config file where I can set it to
explicitly search
the fields I want?
If you don't want to specify your fields at query time (and you also want to query
more than one field at the same time), you can use DisMaxRequestHandler[1].
There are two
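As an illustration of the DisMax approach, the sketch below builds the request parameters for a query that searches several fields with different weights. The lowercase field names mirror the song fields mentioned in the question, but the boost values and the helper name are assumptions chosen for the example.

```python
from urllib.parse import urlencode


def dismax_params(user_query, field_boosts):
    """Build dismax query parameters with per-field boosts in qf."""
    # qf is a space-separated list of field^boost entries.
    qf = " ".join(f"{field}^{boost}" for field, boost in field_boosts.items())
    return urlencode({"defType": "dismax", "q": user_query, "qf": qf})


# Hypothetical weights: artist matters most, description least.
boosts = {"artist": 3, "title": 2, "tags": 1.5, "description": 1}
print(dismax_params("beatles", boosts))
```

The same qf value can instead be set as a default in the request handler configuration, so clients only need to send q.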
If you get the PACKT Solr 1.4 book, there are extensive examples of this
very thing.
It's *well* worth the time it'll save you...
Erick
On Tue, Mar 2, 2010 at 4:11 PM, Ahmet Arslan iori...@yahoo.com wrote:
I'm new to Solr and just getting it set up
and testing it out. I'd like to
Not sure if it will solve your specific problem. We use Solr as a WAR as well
as Solrj. So the main solr distribution comes with slf4j-jdk-1.5.5.jar. I just
deleted that and replaced it with slf4j-log4j12-1.5.5.jar. And then it used my
existing log4j.properties file.
Again, note that it should be index_ed_=false. ed - very
important! If you're saying index=false, Solr is not reading that
attribute at all, and going with the default for the field type.
Erik
On Mar 2, 2010, at 4:31 PM, Thomas Nguyen wrote:
For testing purposes. I just wanted
The replication does not work for me
I have a big master solr and I want to start replicating it. I can see that the
slave is downloading data from the master... I see a directory
index.20100302093000 gets created in data/ next to index... I can see its size
growing but then the directory
That's great information. Thanks!
-Alex
Alex Thurlow
Blastro Networks
http://www.blastro.com
http://www.roxwel.com
http://www.yallwire.com
On 3/2/2010 3:11 PM, Ahmet Arslan wrote:
I'm new to Solr and just getting it set up
and testing it out. I'd like to know if there's a way
Again, note that it should be
index_ed_=false. ed - very
important! If you're saying index=false,
Solr is not reading that attribute at all, and going with
the default for the field type.
Perfect catch :)
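A typo like this is easy to miss because the unknown attribute is silently ignored. One way to catch it is a small check over the field definitions; the Python sketch below flags attributes outside a known set. The set is a deliberately short assumption covering only the attributes used in this thread, not the full list Solr accepts.

```python
import xml.etree.ElementTree as ET

# Assumption: only the attributes that appear in this thread.
KNOWN_ATTRS = {"name", "type", "indexed", "stored", "multiValued"}


def unknown_field_attributes(field_xml):
    """Return attribute names on a <field> element that are not recognised."""
    field = ET.fromstring(field_xml)
    return sorted(set(field.attrib) - KNOWN_ATTRS)


bad = ('<field name="object_id" type="string" index="false" '
       'stored="true" multiValued="false"/>')
good = bad.replace("index=", "indexed=")

print(unknown_field_attributes(bad))   # ['index']
print(unknown_field_attributes(good))  # []
```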
One More information
I deleted the index on the master and I restarted the master and restarted the
slave and now the replication works
Would it be possible that the replication doesn't work well when started against
an already existing big index ?
Thank you
--- On Tue, 3/2/10, Matthieu Labour
Hi Matthieu,
Does this happen over and over?
Is this with Solr 1.4 or some other version?
Is there anything unusual about _7h0y.fdx?
Does _7h0y.fdx still exist on the master when the replication fails?
...
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Hadoop ecosystem
Great catch! Thanks for spotting my error :)
-Original Message-
From: Ahmet Arslan [mailto:iori...@yahoo.com]
Sent: Tuesday, March 02, 2010 2:07 PM
To: solr-user@lucene.apache.org
Subject: Re: Unindexed Fields Are Searchable?
Again, note that it should be
index_ed_=false. ed - very
Otis
Thank you for your response. I apologize for not being specific enough
-- yes, it happened over and over.
-- apache-solr-1.4.0
-- I restarted the indexing+replication from scratch. Before I did that, I
backed up the master index directory. I don't see _7h0y.fdx in it
What could have
If it's a requirement to let Solr handle the facet-hierarchy please
disregard this post, but
an alternative would be to have your App control when to ask for which
'facet-level' (e.g: country, state, city) in the hierarchy.
as follows,
each doc has 3 separate fields (indexed=true, stored=false):
Using Solr 1.4: even less changes to the frontend:
facet=on&facet.field={!key=Location}countryid
...
facet=on&facet.field={!key=Location}cityid&fq=countryid:somecountryid
etc.
will consistently render the resulting facet under the name Location .
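The drill-down described above could be driven from the application roughly like this. It is a sketch only: the helper name is made up, and q=*:* is an assumption added so the parameter string forms a complete query.

```python
from urllib.parse import urlencode


def location_facet_params(facet_field, filters=None):
    """Build facet parameters that always label the facet 'Location'."""
    params = [
        ("q", "*:*"),  # assumption: match everything, narrow with fq
        ("facet", "on"),
        ("facet.field", "{!key=Location}" + facet_field),
    ]
    # Each level already chosen by the user becomes a filter query.
    for field, value in (filters or {}).items():
        params.append(("fq", f"{field}:{value}"))
    return urlencode(params)


# Level 1: facet on country.
print(location_facet_params("countryid"))
# Level 2: a country was chosen, so facet on city within it.
print(location_facet_params("cityid", {"countryid": "somecountryid"}))
```

Because the key stays the same at every level, the frontend can always read the facet counts from the Location entry in the response.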
2010/3/3 Geert-Jan Brits gbr...@gmail.com
If
Hi,
Am new to solr.
I am trying location aware search with spatial lucene in solr1.5 nightly
build.
My table in MySQL has just lat, lng and some text. I want to add geohash,
lat_rad(lat in radian) and lng_rad field into the document before indexing.
I have used dataimport to get my table to solr.
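The three derived values can be computed from lat and lng alone. Below is a Python sketch of the standard geohash encoding plus the radian conversion. It only illustrates the arithmetic: whether the spatial code in the nightly build expects exactly this geohash format and these field names is an assumption, and in a dataimport setup the computation would presumably live in a custom transformer rather than in a standalone script.

```python
import math

_BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"


def geohash(lat, lng, precision=12):
    """Encode a latitude/longitude pair with the standard geohash scheme."""
    lat_range = [-90.0, 90.0]
    lng_range = [-180.0, 180.0]
    chars = []
    bits, bit_count = 0, 0
    use_lng = True  # bits alternate, starting with longitude
    while len(chars) < precision:
        rng, value = (lng_range, lng) if use_lng else (lat_range, lat)
        mid = (rng[0] + rng[1]) / 2
        if value >= mid:
            bits = (bits << 1) | 1
            rng[0] = mid
        else:
            bits <<= 1
            rng[1] = mid
        use_lng = not use_lng
        bit_count += 1
        if bit_count == 5:  # every 5 bits become one base32 character
            chars.append(_BASE32[bits])
            bits, bit_count = 0, 0
    return "".join(chars)


def enrich(row):
    """Add lat_rad, lng_rad and geohash to a row holding lat and lng."""
    lat, lng = row["lat"], row["lng"]
    return {
        **row,
        "lat_rad": math.radians(lat),
        "lng_rad": math.radians(lng),
        "geohash": geohash(lat, lng),
    }


doc = enrich({"lat": 57.64911, "lng": 10.40744, "text": "example"})
print(doc["geohash"][:3])  # u4p
print(doc["lat_rad"], doc["lng_rad"])
```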
Hi,
I want a way to get the total term count per document. I am using
solr1.4.
My query looks something like this
http://192.168.1.50:8080/solr1/core_SFS/select/?q=content%3Apresident%0D%0A&version=2.2&start=0&rows=10&indent=on
I tried to use TermVectorComponent but it just gives me the
Thanks. I didn't know about the {!key=Location} trick.
Thanks everyone for your help. From what I could gather, there're 3 approaches:
1) SOLR-64
Pros:
- can have arbitrary levels of hierarchy without modifying schema
Cons:
- each combination of all the levels in the hierarchy will result in a
The first 2 queries have 'electORnics' instead of 'electROnics'.
The third query shows the situation. The first clause has 1 out of 2
matches, and the second has 1 out of 3 matches. Look for the two
'coord' entries. They are 1/2 and 1/3.
<str name="SP2514N">
0.61808145 = (MATCH) sum of:
0.16856766 =
Locking is at a lower level than indexing and queries. Solr
coordinates multi-threaded indexing and query operations in memory and
a separate thread writes data to disk. There are no performance
problems with multiple searches and indexes happening at the same
time.
2010/3/2 Kranti™ K K Parisa
Hi all,
I am using Solr 1.5 from trunk. I am getting the below error on a full
load, and it is causing the import to fail and rollback. I am not
concerned about the error but rather that I cannot seem to tell the
indexing to continue. I have two entities, and I have tried all (4)
combinations
It is a bug. I just filed this. It is just a unit test that displays
the behavior.
http://issues.apache.org/jira/browse/SOLR-1803
On Tue, Mar 2, 2010 at 9:07 AM, Mark Roberts mark.robe...@red-gate.com wrote:
Hi,
I have a schema with a multivalued field like so:
field name=product
I don't know, I didn't try because I have the need to create a different
core each time.
I'll do some tests with the default config and will report back to all
of you
Thank you for your time
Tom Hill. wrote:
Hi Mani,
Mani EZZAT wrote:
I'm dynamically creating cores with a new index,
70 matches