Re: Solr and UIMA

2010-03-02 Thread JCodina
You can test our UIMA-to-Solr CAS consumer. It is based on JulieLab's Lucas and uses their CAS, but has been transformed to generate XML which can be saved to a file or posted directly to Solr. In the map file you can define which information is generated for each token and how it is concatenated, allowing the

Solr Version

2010-03-02 Thread Marc Wilson
Hi, This is probably a really dumb question, but how can I find out which version of Solr is currently running on my (Windows) system? I can't seem to find anything in the Solr Admin interface or the Tomcat Manager. Thanks, Marc

AW: Solr Version

2010-03-02 Thread Markus.Rietzler
Go to the Solr admin and click on Info; the first line shows the Solr version. -Original Message- From: Marc Wilson [mailto:wo...@fancydressoutfitters.co.uk] Sent: Tuesday, 2 March 2010 09:55 To: Solr Subject: Solr Version Hi, This is probably a really

AW: Query from User Session to Documents with Must-Have Permissions

2010-03-02 Thread Markus.Rietzler
Little question: what's the difference between a MustHavePermission and a protected document? At the moment we are developing a new search for our intranet using Solr. We also have some protected documents and implemented this kind of filter like you. I am just thinking of using a true filter

Re: Cyrillic problem

2010-03-02 Thread michaelnazaruk
Thank you very much! But I have a problem with the URL :) If I send the request using the GET method, I get: http://localhost/russian/result.php?search=%EF%F0%E8%E2%B3%F2 I use the PHP function urldecode! If I print the result, I get привіт! But if I send the request to Solr, my q param = пїЅпїЅпїЅпїЅпїЅ! -- View
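The bytes in that URL (%EF%F0%E8%E2%B3%F2) decode to привіт under Windows-1251, so one plausible reading is that the form is submitted in Windows-1251 while the Solr container expects UTF-8. A minimal sketch of the re-encoding step, written in Python rather than the poster's PHP; the function name `reencode_query` and the choice of cp1251 as the source encoding are assumptions for illustration:

```python
from urllib.parse import quote, unquote


def reencode_query(raw_value: str, source_encoding: str = "cp1251") -> str:
    """Decode a percent-encoded value from a legacy encoding, then
    percent-encode it again as UTF-8 (assumed to be what Solr expects)."""
    text = unquote(raw_value, encoding=source_encoding)
    return quote(text, encoding="utf-8")


# The value from the URL in the message above.
legacy = "%EF%F0%E8%E2%B3%F2"
print(unquote(legacy, encoding="cp1251"))
print(reencode_query(legacy))
```

If the page itself is served and submitted as UTF-8, this conversion step should not be needed at all.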

Re: solr for reporting purposes

2010-03-02 Thread Ron Chan
Doesn't sound like you need to add the complexity of breaking it up into 500-record chunks. With plenty of memory and a quad-core+ system you should be fine with the kind of load you are talking about. After all, you should load test it first before you try any optimization tricks like this

Re: AW: Query from User Session to Documents with Must-Have Permissions

2010-03-02 Thread _jochen
We have 2 different options in our ACL: someone has access using group1 OR group1,... Or someone has access using role1: group1 AND group2,... I could solve this problem by resolving the roles when the user logs in, so the session knows which roles (group1 AND group2,...) the user has:

Simultaneous Writes to Index

2010-03-02 Thread Kranti™ K K Parisa
Hi, I am planning to develop an application in which users can update their account data after login; this is on top of the search facility users have. The basic workflow is 1) user logs in 2) searches for some data 3) gets the results from the Solr index 4) saves some of the search results

Issue on stopword list

2010-03-02 Thread Suram
Hi, How can I search using stopwords? My queries look like this: This - 0 results because it is a stopword; is - 0 results because it is a stopword; that - 0 results because it is a stopword. If I search like This is that - it must give the result for that I need

RE: Implementing hierarchical facet

2010-03-02 Thread Peter S
Hi Andy, It sounds like you may want to have a look at tree faceting: https://issues.apache.org/jira/browse/SOLR-792 Date: Mon, 1 Mar 2010 18:23:51 -0800 From: angelf...@yahoo.com Subject: Implementing hierarchical facet To: solr-user@lucene.apache.org I read that a simple

Re: Simultaneous Writes to Index

2010-03-02 Thread Ron Chan
As long as the document id is unique, concurrent writes are fine. If for some reason the same doc id is used then it is overwritten, so the last one in will be the one that is in the index. Ron - Original Message - From: Kranti™ K K Parisa kranti.par...@gmail.com To:

Re: Simultaneous Writes to Index

2010-03-02 Thread Kranti™ K K Parisa
Hi Ron, Thanks for the reply. So does this mean that the writer lock has nothing to do with concurrent writes? Best Regards, Kranti K K Parisa On Tue, Mar 2, 2010 at 4:19 PM, Ron Chan rc...@i-tao.com wrote: as long as the document id is unique, concurrent writes are fine. If for some reason the

Optimize Index

2010-03-02 Thread Lee Smith
Hi All, Is there a POST request method to clean the index? I have removed my index folder and restarted Solr, and it's still showing documents in the stats. I have run this POST request: http://localhost:8983/solr/core1/update?optimize=true I get no errors but the stats still show my 4
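For context on the question above: optimize merges index segments and is not meant to remove documents. The usual way to empty an index over HTTP is to POST a delete-by-query XML message to the update handler, followed by a commit. A minimal sketch that only builds the two requests; the core URL is taken from the message, while the helper name `build_update_request` is an assumption for illustration:

```python
from urllib.request import Request


def build_update_request(core_url: str, xml_body: str) -> Request:
    """Build (but do not send) a POST to the core's /update handler."""
    return Request(
        url=core_url.rstrip("/") + "/update",
        data=xml_body.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )


DELETE_ALL = "<delete><query>*:*</query></delete>"  # matches every document
COMMIT = "<commit/>"                                # makes the delete visible

requests_to_send = [
    build_update_request("http://localhost:8983/solr/core1", body)
    for body in (DELETE_ALL, COMMIT)
]
# To actually send them: urllib.request.urlopen(req) for each req, in order.
```

The requests are only constructed here so the sketch can be run without a Solr instance.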

fieldType text

2010-03-02 Thread Frederico Azeiteiro
Hi, I'm using the default text field type that comes with the example. When searching for simple words such as 'HP' or 'TCS', Solr is returning results that contain 'HP1' or 'TCS'. Is there a way to avoid this? Thanks, Frederico

search and count occurrences

2010-03-02 Thread Frederico Azeiteiro
Hi, I need to implement a search where I count the number of times the string appears in the search field, i.e. only return articles that mention the word 'HP' at least 2x. I'm currently doing this after the Solr search with my own methods. Is there a way that Solr does this type of

Re: Solr Cell and Deduplication - Get ID of doc

2010-03-02 Thread Bill Engle
Thanks for the responses. This is exactly what I had to resort to. I will definitely put in a feature request to get the generated ID back from the extract request. I am doing this with PHP cURL for extraction and pecl php solr for querying. I am then saving the unique id and dupe hash in a

Re: Issue on stopword list

2010-03-02 Thread Erick Erickson
This is a classic problem with Stopword removal. Have you tried just removing stopwords from the indexing definition and the query definition and reindexing? You can't search on them no matter what you do if they've been removed, they just aren't there HTH Erick On Tue, Mar 2, 2010 at 5:47
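The point about removed stopwords "just not being there" can be illustrated with a toy model. This is not Solr's analysis chain; the whitespace tokenizer, the stopword set and the function names are all simplifying assumptions:

```python
STOPWORDS = {"this", "is", "that"}  # illustrative list, not Solr's stopwords.txt


def analyze(text, remove_stopwords):
    """Lowercase, split on whitespace, optionally drop stopwords."""
    tokens = text.lower().split()
    if remove_stopwords:
        tokens = [t for t in tokens if t not in STOPWORDS]
    return tokens


def matches(doc, query, remove_stopwords):
    """True if every analyzed query term occurs among the analyzed doc terms.
    A query that analyzes to nothing matches nothing."""
    doc_terms = set(analyze(doc, remove_stopwords))
    query_terms = analyze(query, remove_stopwords)
    return bool(query_terms) and all(t in doc_terms for t in query_terms)


doc = "This is that book"
print(matches(doc, "This", remove_stopwords=True))
print(matches(doc, "This", remove_stopwords=False))
```

With removal enabled the query analyzes to an empty term list, which is why a search for a lone stopword returns nothing until the filter is dropped and the documents are reindexed.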

get Server Status, TotalDocCount .... PHP !

2010-03-02 Thread stocki
Hello, I use Solr in my CakePHP framework. How can I get status information about my Solr cores? I don't want to analyze the response XML every time. Does anybody know a nice way to get status messages from Solr? Thx ;) Jonas -- View this message in context:

Re: fieldType text

2010-03-02 Thread Siddhant Goel
I think that's because of the internal tokenization that Solr does. If a document contains HP1, and you're using the default text field type, Solr would tokenize that to HP and 1, so that document figures in the list of documents containing HP, and hence that document appears in the search
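A rough illustration of the splitting described above. This is a simplified stand-in, not the actual filter: it only splits at letter/digit boundaries, and the function name is an assumption for illustration:

```python
import re


def split_on_letter_digit_boundaries(token):
    """Split a token into runs of ASCII letters and runs of digits."""
    return re.findall(r"[A-Za-z]+|[0-9]+", token)


print(split_on_letter_digit_boundaries("HP1"))
print(split_on_letter_digit_boundaries("HP"))
```

Because the indexed token HP1 yields the sub-token HP, a query for HP finds that document. Using a field type without this kind of splitting would likely avoid the match.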

Re: Optimize Index

2010-03-02 Thread Erick Erickson
My very first guess would be that you're removing an index that isn't the one your SOLR configuration points at. Second guess would be that your browser is caching the results of your first query and not going to SOLR at all. Stranger things have happened G. Third guess is you've mis-identified

Re: fieldType text

2010-03-02 Thread Erick Erickson
Expanding on Siddhant's comment, look carefully at WordDelimiterFilterFactory; as I remember, it's in the default schema definition. This page helps: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters Erick On Tue, Mar 2, 2010 at

exact search

2010-03-02 Thread Suram
Hi, How do I search for an exact match like this: The Books of Three? If I give this, it finds the exact result plus some results related to Books. In my schema.xml file I have changed the field type to String instead of Text, but I am not seeing any change -- View this message in context:

Re: Implementing hierarchical facet

2010-03-02 Thread Koji Sekiguchi
Ideally Solr would be aware of the hierarchy structure and send back responses accordingly. If I understand it correctly, SOLR-64 supports them I think? So at level 1 Solr will send back facet values based on country (100 or so values). facet=on&facet.depth=1 ? Level 2 the facet values

Re: get Server Status, TotalDocCount .... PHP !

2010-03-02 Thread Guillaume Rossolini
Hi Have you tried the php_solr extension from PECL? It has a handy SolrPingResponse class. Or you could just call the CORENAME/admin/ping?wt=phps URL and unserialize it. Regards, -- I N S T A N T | L U X E - 44 rue de Montmorency | 75003 Paris | France Tél. : 01 80 50 52 51 | Mob. : 06 09 96

Re: Issue on stopword list

2010-03-02 Thread Walter Underwood
Don't remove stopwords if you want to search on them. --wunder On Mar 2, 2010, at 5:43 AM, Erick Erickson wrote: This is a classic problem with Stopword removal. Have you tried just removing stopwords from the indexing definition and the query definition and reindexing? You can't search on

Re: get Server Status, TotalDocCount .... PHP !

2010-03-02 Thread stocki
Hey- No, I use the SolrPHPClient http://code.google.com/p/solr-php-client/ I don't really want to use two different PHP libs. ^^ What do you mean by unserialize? XD Guillaume Rossolini-2 wrote: Hi Have you tried the php_solr extension from PECL? It has a handy SolrPingResponse

Re: Optimize Index

2010-03-02 Thread Lee Smith
Ha Now I feel stupid !! I had a misspelling in the data path and you were correct. Can I ask, Erick, was the command correct though? Thank you Lee On 2 Mar 2010, at 13:54, Erick Erickson wrote: My very first guess would be that you're removing an index that isn't the one your SOLR

Indexing HTML document

2010-03-02 Thread György Frivolt
Hi, How do I properly index HTML documents? All the documents are HTML, some containing characters encoded like &#x17E;&#xED; ... Is there a character filter for filtering these codes? Is there a way to strip the HTML tags out? Does Solr weight the terms in the document based on where they appear?..
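Independently of any filter configured inside Solr, the character references and tags asked about above can also be handled on the client before the document is posted. A minimal sketch with the Python standard library; the class and function names are assumptions for illustration, and the sample markup is made up:

```python
from html import unescape
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect only the text nodes of an HTML fragment."""

    def __init__(self):
        # convert_charrefs=True turns &#x17E; style references into characters.
        super().__init__(convert_charrefs=True)
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def strip_html(markup):
    """Return the text content of markup with tags removed."""
    parser = TextExtractor()
    parser.feed(markup)
    parser.close()
    return "".join(parser.chunks)


print(unescape("&#x17E;&#xED;"))
print(strip_html("<p>&#x17E;&#xED;t <b>dobre</b></p>"))
```

This does nothing about weighting terms by position; it only covers decoding the references and dropping the tags.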

Re: Indexing HTML document

2010-03-02 Thread Siddhant Goel
There is an HTML filter documented here, which might be of some help - http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripCharFilterFactory Control characters can be eliminated using code like this -

Re: Issue on stopword list

2010-03-02 Thread Joe Calderon
or you can try the commongrams filter that combines tokens next to a stopword On Tue, Mar 2, 2010 at 6:56 AM, Walter Underwood wun...@wunderwood.org wrote: Don't remove stopwords if you want to search on them. --wunder On Mar 2, 2010, at 5:43 AM, Erick Erickson wrote: This is a classic

Re: replication issue

2010-03-02 Thread Matthieu Labour
Hi Paul, Thank you for your answer. I did put all the directory structure on /raid ... /raid/solr_env/solr ... , /raid/solr_env/jetty ... And it still didn't work even after I applied patch SOLR-1736. I am investigating if this is because tempDir and data dir are not on the same partition. matt

Re: Warning : no lockType configured for...

2010-03-02 Thread Tom Hill.
Hi Mani, Mani EZZAT wrote: I'm dynamically creating cores with a new index, using the same schema and solrconfig.xml Does the problem occur if you use the same configuration in a single, static core? Tom -- View this message in context:

Re: get Server Status, TotalDocCount .... PHP !

2010-03-02 Thread Israel Ekpo
The last time I tried using SolrPHPClient for this stuff, it did not really handle the response very well because of the JSON response generated on the server side. I am not sure if anything has changed since then. The JSON code generated could not be parsed properly. If you do not want to

Re: replication issue

2010-03-02 Thread Matthieu Labour
I think this issue is not related to patch SOLR-1736. Here is the error I get ... Thank you for any help [2010-03-02 19:07:26] [pool-3-thread-1] ERROR(ReplicationHandler.java:266) - SnapPull failed org.apache.solr.common.SolrException: Unable to download _7bre.fdt completely. Downloaded

Logging in Embedded SolrServer - What a nightmare.

2010-03-02 Thread Lucas F. A. Teixeira
Hello all, I'm having a hard time trying to change the Solr query logging level. I've tried a lot of things I've found on the internet, this mailing list and the Solr docs. What I've found so far: - Solr Embedded Server uses the slf4j lib for intermediating logging. Here I'm using Log4j as my logging

Ignore accents

2010-03-02 Thread Tommy Molto
Hi, guys, I have a Solr index, and I need it to ignore accents and special characters. E.g.: São Paulo = Sao Paulo, cadarço = cadarco. I know we could use a synonym, but I guess Solr already has a filter or plugin for these cases. Does anyone know how to do it? Att, Paulo Marinho

Re: Ignore accents

2010-03-02 Thread Ahmet Arslan
I have a Solr index, and I need it to ignore accents and special characters. E.g.: São Paulo = Sao Paulo, cadarço = cadarco. I know we could use a synonym, but I guess Solr already has a filter or plugin for these cases. Does anyone know how to do it? ASCIIFoldingFilterFactory[1] or charFilter
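To show the effect being asked for, here is an approximation of accent folding using Unicode decomposition. It is not the algorithm of the Solr filter named above, which, as far as I know, uses its own mapping table and covers characters that have no decomposition; the function name is an assumption for illustration:

```python
import unicodedata


def fold_accents(text):
    """Decompose characters (NFKD) and drop the combining marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(fold_accents("São Paulo"))
print(fold_accents("cadarço"))
```

For matching to work, the same folding has to be applied both at index time and at query time.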

CoreAdminHandler question

2010-03-02 Thread Leonardo Souza
The action CREATE creates a new core based on preexisting instanceDir/solrconfig.xml/schema.xml, and registers it. That's what the documentation is stating. Is there a way to instruct Solr to create the instanceDir if it does not exist? I'm trying to create a new core based on an existing

Unindexed Fields Are Searchable?

2010-03-02 Thread Thomas Nguyen
I've noticed that fields that I define as index=false in the schema.xml are still searchable. Here's the definition of the field: <field name="object_id" type="string" index="false" stored="true" multiValued="false"/> or <field name="object_id" type="string" index="false" stored="false" multiValued="false"/> I

Re: Unindexed Fields Are Searchable?

2010-03-02 Thread Ahmet Arslan
I've noticed that fields that I define as index=false in the schema.xml are still searchable.  indexed=false defined fields are neither searchable nor sortable. Did you re-start servlet container and re-index your documents after changing this attribute in schema.xml?

Returning function result in results

2010-03-02 Thread Dragisa Krsmanovic
Is there a way to return a function value in search results besides using score?

Different weights to different fields

2010-03-02 Thread Alex Thurlow
Hi everyone, I'm new to Solr and just getting it set up and testing it out. I'd like to know if there's a way to give a different weight to different data fields. For an example, I'm going to be storing song information. I have the fields: Artist, Title, Description, and Tags. I'd

Setting the return query fields

2010-03-02 Thread Dhanushka Samarakoon
Hi, I would like Solr to return the record from /exampledocs/hd.xml when I search for the value 6H500F0 (which is the ID field for the 2nd record in that file). I know there is a setting that I should change to get this done, but I can't locate it. Field name ID is already included in schema.xml

RE: Unindexed Fields Are Searchable?

2010-03-02 Thread Thomas Nguyen
My schema has always had index=false for that field. I only stopped and restarted the servlet container when I added a document to the index using the Lucene API instead of Solr. -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent: Tuesday, March 02, 2010 1:01 PM To:

Re: Different weights to different fields

2010-03-02 Thread Ahmet Arslan
    I'm new to Solr and just getting it set up and testing it out.  I'd like to know if there's a way to give a different weight to different data fields. For an example, I'm going to be storing song information.  I have the fields: Artist, Title, Description, and Tags.  I'd like

Re: Setting the return query fields

2010-03-02 Thread Ahmet Arslan
Hi, I would like Solr to return the record from /exampledocs/hd.xml when I search for the value 6H500F0 (which is the ID field for the 2nd record in that file). I know there is a setting that I should change to get this done, but I can't locate it. Field name ID is already included in

RE: Unindexed Fields Are Searchable?

2010-03-02 Thread Ahmet Arslan
My schema has always had index=false for that field. I only stopped and restarted the servlet container when I added a document to the index using the Lucene API instead of Solr. Is there a special reason/use case for adding documents using the Lucene API?

Re: Setting the return query fields

2010-03-02 Thread Dhanushka Samarakoon
Thanks for the reply. Is there a place in the config file where I can set it to explicitly search the fields I want? On Tue, Mar 2, 2010 at 3:17 PM, Ahmet Arslan iori...@yahoo.com wrote: Hi, I would like Solr to return the record from /exampledocs/hd.xml when I search for the value

RE: Unindexed Fields Are Searchable?

2010-03-02 Thread Thomas Nguyen
For testing purposes. I just wanted to see if unindexed fields in documents added by the Lucene API were searchable by Solr. This is after discovering that the unindexed fields in documents added by Solr are searchable. -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent:

Re: Setting the return query fields

2010-03-02 Thread Ahmet Arslan
Thanks for the reply. Is there a place in the config file where I can set it to explicitly search the fields I want? If you don't want to specify your fields at query time (also you want to query more than one fields at the same time) you can use DisMaxRequestHandler[1]. There are two
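As a sketch of what a dismax request could look like when the searched fields are given per request rather than in the config: the `qf` parameter lists the fields to search, optionally with boosts. The field names `id` and `name` and the boost values below are assumptions for illustration, as is selecting the parser with `defType=dismax`; the helper name is hypothetical.

```python
from urllib.parse import parse_qs, urlencode


def dismax_params(user_query, field_boosts):
    """Build a query string that searches several fields with boosts."""
    qf = " ".join(f"{field}^{boost}" for field, boost in field_boosts.items())
    return urlencode({"defType": "dismax", "q": user_query, "qf": qf})


# Hypothetical fields and weights: a match on id counts twice as much as on name.
query_string = dismax_params("6H500F0", {"id": 2.0, "name": 1.0})
print(query_string)
print(parse_qs(query_string)["qf"])
```

The same `qf` value can usually be placed in the request handler's defaults in solrconfig.xml so that clients do not have to send it each time.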

Re: Different weights to different fields

2010-03-02 Thread Erick Erickson
If you get the PACKT Solr 1.4 book, there are extensive examples of this very thing. It's *well* worth the time it'll save you... Erick On Tue, Mar 2, 2010 at 4:11 PM, Ahmet Arslan iori...@yahoo.com wrote: I'm new to Solr and just getting it set up and testing it out. I'd like to

Re: Logging in Embedded SolrServer - What a nightmare.

2010-03-02 Thread Kevin Osborn
Not sure if it will solve your specific problem. We use Solr as a WAR as well as Solrj. So the main solr distribution comes with slf4j-jdk-1.5.5.jar. I just deleted that and replaced it with slf4j-log4j12-1.5.5.jar. And then it used my existing log4j.properties file.

Re: Unindexed Fields Are Searchable?

2010-03-02 Thread Erik Hatcher
Again, note that it should be index_ed_=false. ed - very important! If you're saying index=false, Solr is not reading that attribute at all, and going with the default for the field type. Erik On Mar 2, 2010, at 4:31 PM, Thomas Nguyen wrote: For testing purposes. I just wanted
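Since a misspelled attribute is silently ignored, a small check over the schema can catch this kind of slip. A minimal sketch; the set of known attributes is deliberately partial and is an assumption, the second field in the sample is made up, and the function name is hypothetical:

```python
import xml.etree.ElementTree as ET

# Partial list for illustration only; a real schema accepts more attributes.
KNOWN_FIELD_ATTRIBUTES = {"name", "type", "indexed", "stored", "multiValued"}


def unknown_field_attributes(schema_xml):
    """Map field name -> sorted list of attributes not in the known set."""
    root = ET.fromstring(schema_xml)
    problems = {}
    for field in root.iter("field"):
        unknown = sorted(set(field.attrib) - KNOWN_FIELD_ATTRIBUTES)
        if unknown:
            problems[field.get("name")] = unknown
    return problems


SAMPLE = """<fields>
  <field name="object_id" type="string" index="false" stored="true" multiValued="false"/>
  <field name="title" type="text" indexed="true" stored="true"/>
</fields>"""

print(unknown_field_attributes(SAMPLE))
```

After correcting the attribute to indexed, the documents still need to be reindexed for the change to take effect.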

Re: replication issue

2010-03-02 Thread Matthieu Labour
The replication does not work for me I have a big master solr and I want to start replicating it. I can see that the slave is downloading data from the master... I see a directory index.20100302093000 gets created in data/ next to index... I can see its size growing but then the directory

Re: Different weights to different fields

2010-03-02 Thread Alex Thurlow
That's great information. Thanks! -Alex Alex Thurlow Blastro Networks http://www.blastro.com http://www.roxwel.com http://www.yallwire.com On 3/2/2010 3:11 PM, Ahmet Arslan wrote: I'm new to Solr and just getting it set up and testing it out. I'd like to know if there's a way

Re: Unindexed Fields Are Searchable?

2010-03-02 Thread Ahmet Arslan
Again, note that it should be index_ed_=false.  ed - very important!   If you're saying index=false, Solr is not reading that attribute at all, and going with the default for the field type. Perfect catch :)

Re: replication issue

2010-03-02 Thread Matthieu Labour
One more piece of information: I deleted the index on the master, restarted the master and restarted the slave, and now the replication works. Would it be possible that the replication doesn't work well when started against an already existing big index? Thank you --- On Tue, 3/2/10, Matthieu Labour

Re: replication issue

2010-03-02 Thread Otis Gospodnetic
Hi Matthieu, Does this happen over and over? Is this with Solr 1.4 or some other version? Is there anything unusual about _7h0y.fdx? Does _7h0y.fdx still exist on the master when the replication fails? ... Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Hadoop ecosystem

RE: Unindexed Fields Are Searchable?

2010-03-02 Thread Thomas Nguyen
Great catch! Thanks for spotting my error :) -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent: Tuesday, March 02, 2010 2:07 PM To: solr-user@lucene.apache.org Subject: Re: Unindexed Fields Are Searchable? Again, note that it should be index_ed_=false.  ed - very

Re: replication issue

2010-03-02 Thread Matthieu Labour
Otis, Thank you for your response. I apologize for not being specific enough -- yes it happened over and over. -- apache-solr-1.4.0 -- I restarted the indexing+replication from scratch. Before I did that, I backed up the master index directory. I don't see _7h0y.fdx in it. What could have

Re: Implementing hierarchical facet

2010-03-02 Thread Geert-Jan Brits
If it's a requirement to let Solr handle the facet hierarchy, please disregard this post, but an alternative would be to have your app control when to ask for which 'facet-level' (e.g. country, state, city) in the hierarchy, as follows: each doc has 3 separate fields (indexed=true, stored=false):

Re: Implementing hierarchical facet

2010-03-02 Thread Geert-Jan Brits
Using Solr 1.4: even fewer changes to the frontend: facet=on&facet.field={!key=Location}countryid ... facet=on&facet.field={!key=Location}cityid&fq=countryid:somecountryid etc. will consistently render the resulting facet under the name Location. 2010/3/3 Geert-Jan Brits gbr...@gmail.com If
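A small sketch of how an application might assemble the requests described above, one per hierarchy level. The field names and the `{!key=Location}` local parameter come from the message; the helper name `location_facet_params` is an assumption for illustration:

```python
from urllib.parse import parse_qs, urlencode


def location_facet_params(level_field, filters=None):
    """Facet on one level of the hierarchy, always labelled 'Location',
    restricted by the values already chosen at the levels above."""
    params = [
        ("facet", "on"),
        ("facet.field", f"{{!key=Location}}{level_field}"),
    ]
    for field, value in (filters or {}).items():
        params.append(("fq", f"{field}:{value}"))
    return urlencode(params)


# Level 1: countries. Level 2: cities within the chosen country.
print(location_facet_params("countryid"))
print(location_facet_params("cityid", {"countryid": "somecountryid"}))
```

Because the facet is always returned under the same key, the frontend code that renders it does not need to know which level it is showing.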

Need suggestion regarding custom transformer

2010-03-02 Thread KshamaPai
Hi, I am new to Solr. I am trying location-aware search with spatial Lucene in a Solr 1.5 nightly build. My table in MySQL has just lat, lng and some text. I want to add geohash, lat_rad (lat in radians) and lng_rad fields to the document before indexing. I have used dataimport to get my table into Solr.

Getting total term count

2010-03-02 Thread Akash Sahu
Hi, I want a way to get the total term count per document. I am using Solr 1.4. My query looks something like this http://192.168.1.50:8080/solr1/core_SFS/select/?q=content%3Apresident%0D%0A&version=2.2&start=0&rows=10&indent=on I tried to use TermVectorComponent but it just gives me the

Re: Implementing hierarchical facet

2010-03-02 Thread Andy
Thanks. I didn't know about the {!key=Location} trick. Thanks everyone for your help. From what I could gather, there're 3 approaches: 1) SOLR-64 Pros: - can have arbitrary levels of hierarchy without modifying schema Cons: - each combination of all the levels in the hierarchy will result in a

Re: question regarding coord() value

2010-03-02 Thread Lance Norskog
The first 2 queries have 'electORnics' instead of 'electROnics'. The third query shows the situation. The first clause has 1 out of 2 matches, and the second has 1 out of 3 matches. Look for the two 'coord' entries. They are 1/2 and 1/3. str name=SP2514N 0.61808145 = (MATCH) sum of: 0.16856766 =
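The two coord values mentioned above follow from the coordination factor used in Lucene's default scoring: the number of query clauses that matched divided by the total number of clauses. A tiny sketch of that arithmetic; the function name is an assumption for illustration:

```python
from fractions import Fraction


def coord(overlap, max_overlap):
    """Coordination factor: matched clauses over total clauses."""
    return Fraction(overlap, max_overlap)


first_clause = coord(1, 2)   # 1 of 2 terms matched
second_clause = coord(1, 3)  # 1 of 3 terms matched
print(first_clause, float(first_clause))
print(second_clause, float(second_clause))
```

Each factor multiplies the score of its clause, so a clause that matches only some of its terms contributes proportionally less.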

Re: Simultaneous Writes to Index

2010-03-02 Thread Lance Norskog
Locking is at a lower level than indexing and queries. Solr coordinates multi-threaded indexing and query operations in memory and a separate thread writes data to disk. There are no performance problems with multiple searches and indexes happening at the same time. 2010/3/2 Kranti™ K K Parisa

DIH onError question

2010-03-02 Thread Shah, Nirmal
Hi all, I am using Solr 1.5 from trunk. I am getting the below error on a full load, and it is causing the import to fail and rollback. I am not concerned about the error but rather that I cannot seem to tell the indexing to continue. I have two entities, and I have tried all (4) combinations

Re: How can I get Solr-Cell to extract to multi-valued fields?

2010-03-02 Thread Lance Norskog
It is a bug. I just filed this. It is just a unit test that displays the behavior. http://issues.apache.org/jira/browse/SOLR-1803 On Tue, Mar 2, 2010 at 9:07 AM, Mark Roberts mark.robe...@red-gate.com wrote: Hi, I have a schema with a multivalued field like so: field name=product

Re: Warning : no lockType configured for...

2010-03-02 Thread Mani EZZAT
I don't know, I didn't try because I have the need to create a different core each time. I'll do some tests with the default config and will report back to all of you Thank you for your time Tom Hill. wrote: Hi Mani, Mani EZZAT wrote: I'm dynamically creating cores with a new index,