Working Chef Cookbook for Solr

2010-12-10 Thread György Frivolt
Hi, I tried to setup Solr by chef and so far found only the opscode one, but this one setup only the group and the user for solr, not the solr engine. Does anyone know about a maintained solr chef cookbook? Thanks for suggestion! Georg

Re: Working Chef Cookbook for Solr

2010-12-10 Thread György Frivolt
when I'll get the time, though, I'm afraid. Upayavira On Fri, 10 Dec 2010 10:45 +0100, György Frivolt gyorgy.friv...@gmail.com wrote: Hi,      I tried to setup Solr by chef and so far found only the opscode one, but this one setup only the group and the user for solr, not the solr engine

Solr crashing while extracting from very simple text file

2010-03-22 Thread György Frivolt
Hi, I had problem with indexing documents some months ago as well. I found that there were XML control characters in the documents and these were not handled by Solr. Maybe it is the case for you as well. Regards, Georg On Sun, Mar 21, 2010 at 5:58 PM, Ross tetr...@gmail.com wrote:

Re: Indexing HTML document

2010-03-03 Thread György Frivolt
://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripCharFilterFactory Control characters can be eliminated using code like this - http://bitbucket.org/cogtree/python-solr/src/tip/pythonsolr/pysolr.py#cl-449 On Tue, Mar 2, 2010 at 9:37 PM, György Frivolt gyorgy.friv...@gmail.com

Indexing HTML document

2010-03-02 Thread György Frivolt
Hi, How to index properly HTML documents? All the documents are HTML, some containing charaters encodid like #x17E;#xED; ... Is there a character filter for filtering these codes? Is there a way to strip the HTML tags out? Does solr weight the terms in the document based on where they appear?..

Re: unexpected result using OR in query

2010-02-27 Thread György Frivolt
: --- On Thu, 2/25/10, György Frivolt gyorgy.friv...@gmail.com wrote: I run into an unexpected behaviour for Solr with query parsing. I need to fetch article which contain several expressions. However, I noticed the following behaviour: - when I fetch results for query A I get

unexpected result using OR in query

2010-02-25 Thread György Frivolt
Hi, I run into an unexpected behaviour for Solr with query parsing. I need to fetch article which contain several expressions. However, I noticed the following behaviour: - when I fetch results for query A I get a number of results X - for query B I get a number of results Y - for query A B

Re: SolrException caused by illegal character

2009-11-27 Thread György Frivolt
://en.wikipedia.org/wiki/ASCII Otis -- Sematext is hiring -- http://sematext.com/about/jobs.html?mls Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR - Original Message From: György Frivolt gyorgy.friv...@gmail.com To: solr-user solr-user@lucene.apache.org Sent: Thu

SolrException caused by illegal character

2009-11-26 Thread György Frivolt
Hi, I upgradeed to Solr 1.4 and tried to reindex the data. After few thousand of reindexed documents an exception is thrown, I did not meet this using 1.3 before. Do you have any idea what caused the problem? Thanks. SEVERE: org.apache.solr.common.SolrException: Illegal character ((CTRL-CHAR,

Re: Solr and LSA

2009-10-30 Thread György Frivolt
Libbrecht p...@activemath.org wrote: Not with solr but with Lucene, there is the project called semanticvectors. It would be cute to make it a solr module. paul Le 30-oct.-09 à 09:17, György Frivolt a écrit : Hi, Does anyone of you have experiences with using LSA, Latent Semantic Analysis

Searching with or without diacritics

2009-08-27 Thread György Frivolt
Hello, I started to use solr only recently using the ruby/rails sunspot-solr client. I use solr on a slovak/czech data set and realized one not wanted behaviour of the search. When the user searches an expression or word which contains dicritics, letters like š, č, ť, ä, ô,... usually the