Deploying Solr 1.4.1 in JbossAs 6

2010-08-31 Thread Bruno Adam Osiek
Has anyone managed to deploy Solr 1.4.1 into Jboss AS 6? If yes could you provide the required steps for deployment? Thanks, Bruno

How to use protwords.txt

2010-08-31 Thread Shuai Weng
Hey, Currently we have indexed some biological fulltext files. I was wondering how to config the schema.xml such that the gene names (eg, 'met1', 'met2', 'met3' etc) won't be stemmed into the same word ('met'). I added these gene names into the protwords.txt file but it doesn't seem to work.

Re: anybody using solr with Cassandra?

2010-08-31 Thread Siju George
Thanks a million Nick, We are currently debating whether we should use cassandra or membase or hbase with solr. Do you have anything to contribute as advice to us? Thanks again :-) --Siju On Tue, Aug 31, 2010 at 5:15 AM, nickdos n...@remedios-cole.id.au wrote: Yes, we are Cassandra. There

Re: anybody using solr with Cassandra?

2010-08-31 Thread Amit Nithian
I am curious about this too.. are you talking about using HBase/Cassandra as an aux store of large data or using Cassandra to store the actual lucene index (as in LuCandra)? On Mon, Aug 30, 2010 at 11:06 PM, Siju George sgeorge...@gmail.com wrote: Thanks a million Nick, We are currently

Re: anybody using solr with Cassandra?

2010-08-31 Thread Siju George
We will be suing Solr for indexing and Cassandra/Membase/Hbase instead of a database. That is the idea now unless some body gives a better solution :-) thanks --Siju On Tue, Aug 31, 2010 at 11:39 AM, Amit Nithian anith...@gmail.com wrote: I am curious about this too.. are you talking about

Re: Implementing synonym NewBie

2010-08-31 Thread Grijesh.singh
Index time synonym explore is the better way if u do not have the disk space concern. -- View this message in context: http://lucene.472066.n3.nabble.com/Implementing-synonym-NewBie-tp1379269p1392927.html Sent from the Solr - User mailing list archive at Nabble.com.

Solr, c/s type ?

2010-08-31 Thread M.S. Kim
i'm wondering c/s type is possible (not http web type). if possible, could i get the material about it? Thanks,

Re: Hardware Specs Question

2010-08-31 Thread 朱炎詹
In our current lab project, we already built a Chinese newspaper index with 18 millions documents. The index size is around 51GB. So I am very concerned about the memory issue you guys mentioned. I also look up the Hathitrust report on SolrPerformanceData page:

Re: How to use protwords.txt

2010-08-31 Thread Tomas
Shaui, are you using a WordDelimiterFilterFactory in the analysis? That's the filter that might be transforming met1 into met and 1 and not the steamer. Check de Analysis page on Solr admin. De: Shuai Weng sh...@genome.stanford.edu Para:

Alphanumeric wildcard search problem

2010-08-31 Thread Hasnain
I have gone through all the of the related posts, but could not find a proper answer that works, so Im writing this post Is there anyway of using wilcard searches on alphanumeric text like...R-1* ? let me share relevent information fieldType name=textShoaib class=solr.TextField

Spellcheck in multilanguage search

2010-08-31 Thread Grijesh.singh
How can be spellcheck configured for multilanguage search,I have to index 17 languages in my indexes and search on them also wants to use spellcheck for that -- View this message in context: http://lucene.472066.n3.nabble.com/Spellcheck-in-multilanguage-search-tp1393357p1393357.html Sent from

RE: Spellcheck in multilanguage search

2010-08-31 Thread Markus Jelsma
Configure language specific fields and spellcheckers just as you would for a single language index, so multiple content_LANG fields and spell_LANG field. This will, of course, only work if you know in what language the search operates.   -Original message- From: Grijesh.singh

Re: Solr creates whitespace in dismax query

2010-08-31 Thread Johann Höchtl
No, it didn't solve the problem, bit I got a different solution. I make a second field in schema.xml and copy the content. This field gets analyzed by the keywordtokenizer factory. Thanks, Johann Am 24.08.2010 21:53, schrieb MitchK: Johann, try to remove the wordDelimiterFilter from the

Re: Highlighting, return the matched terms only

2010-08-31 Thread Xavier Schepler
Chris Hostetter wrote: : how could I have the highlighting component return only the terms that were : matched, without any surrounding text ? I'm not a Highlighter expert, but this is somethign that certainly *sounds* like it should be easy. I took a shot at it and this is hte best i could

Lucene Revolution Update

2010-08-31 Thread Grant Ingersoll
Hi, (apologies for the cross-post) Just a quick update on Lucene Revolution - coming up in Boston, October 7-8 (see http://lucenerevolution.org). - Marten Mickos, CEO Eucalyptus Systems, ex-MySQL CEO will be giving a keynot on How Open Source Leads Infrastructure Innovation - Bill Press from

questions about synonyms

2010-08-31 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Hello, I have an couple of questions about synonyms. 1. I got a very big text file of synonyms. How I can use it? Do I need to index this text file first? 2. Is there a way to do synonyms' highlight in search result? 3. Does anyone use WordNet to solr? Thanks so much in advance,

solr

2010-08-31 Thread ankita shinde
hello all, I have indexed database using DIH. But I am not able to search the data using each field.

**SPAM** solr

2010-08-31 Thread Andrea Gazzarini
eh eh eh...it's a little bit hard to answer...could you provide some detail? cheers, Andrea hello all, I have indexed database using DIH. But I am not able to search the data using each field.

Stripping leading/trailing punctuation with SOLR-1653

2010-08-31 Thread Shawn Heisey
I am trying to use PatternReplaceCharFilterFactory (SOLR-1653) to strip leading and trailing punctuation from terms. It's not working. This was previously discussed here as part of something I was trying with WordDelimiterFilterFactory, but I think it needs its own thread now. I seem to be

Re: Stripping leading/trailing punctuation with SOLR-1653

2010-08-31 Thread Shawn Heisey
I didn't give any particulars about my setup, sorry about that. This is branch_3x rev 990625, downloaded two days ago. It passed all unit tests. Linux idxst9-b 2.6.32-bpo.5-amd64 #1 SMP Fri Jun 11 08:42:31 UTC 2010 x86_64 GNU/Linux Java(TM) SE Runtime Environment (build 1.6.0_20-b02) Java

Re: solr

2010-08-31 Thread kenf_nc
We would really need to see more information, but some first things to look for are: are your field definitions in the schema.xml set to indexed=true (if you want to search it) and stored=true (if you want to see it in the return results)? is the case of the field names the same in schema.xml

Memcache for Solr

2010-08-31 Thread Hitendra Molleti
Hi, We were looking at implementing Memcache for Solr. Can someone who has already implemented this let us know if it is a good option to go for i.e. how effective is using memcache compared to Solr's internal cache. Also, are there any down sides to it and difficult to implement. Thanks

Re: Stripping leading/trailing punctuation with SOLR-1653

2010-08-31 Thread Shawn Heisey
I believe I may have solved this. After a more careful reading of SOLR-1653, I noticed that they referred to another filter. I changed my configuration from /solr/.PatternReplaceCharFilterFactory to /solr/.PatternReplaceFilterFactory and updated the XML syntax appropriately, and it looks OK

RE: Memcache for Solr

2010-08-31 Thread Markus Jelsma
Hi,   In a restaurant index website, we have used Memcache only for storing the generated HTML facet list when q=*. This cached object was only used when no additional search parameters were specified. It was quite useful because the facet list was always present and only changed if real

Meeting of Solr users in Delhi/NCR, India?

2010-08-31 Thread Gora Mohanty
Hi, From past postings, I understand that discussions on meetings related to Solr are acceptable on this list. We have been talking about arranging a meeting in Delhi/National Capital region (NCR) of people using Solr, or those interested in search and related technologies. This first

Re: solr

2010-08-31 Thread Gora Mohanty
On Tue, 31 Aug 2010 07:03:08 -0700 ankita shinde ankitashinde...@gmail.com wrote: hello all, I have indexed database using DIH. But I am not able to search the data using each field. Dear Ankita, Do not mean to offend, but did you not post about this earlier, and Peter Karich followed up

Re: how to deal with virtual collection in solr?

2010-08-31 Thread Jan Høydahl / Cominvent
Hi, If you have multiple cores defined in your solr.xml you need to issue your queries to one of the cores. Below it seems as if you are lacking core name. Try instead: http://localhost:8983/solr/aapublic/select?shards=localhost:8983/solr/aaprivate,localhost:8983/solr/aapublic/ And

about some examples

2010-08-31 Thread kajal patil
We are working on solrJS.but we didnt get much more information about it.so please provide us some useful links on which we will get information.also provide some solrJS examples and demos... regards, kajal

Re: Memcache for Solr

2010-08-31 Thread Chris Hostetter
: References: 4c7d1071.8000...@elyograg.org : In-Reply-To: 4c7d1071.8000...@elyograg.org : Subject: Memcache for Solr http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message,

Re: questions about synonyms

2010-08-31 Thread Chris Hostetter
: Subject: questions about synonyms : References: b28a6774-1ccc-4c2a-8d7b-0ee2b07a5...@apache.org : In-Reply-To: b28a6774-1ccc-4c2a-8d7b-0ee2b07a5...@apache.org http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list,

Re: Stripping leading/trailing punctuation with SOLR-1653

2010-08-31 Thread Shawn Heisey
On 8/31/2010 8:49 AM, Shawn Heisey wrote: I believe I may have solved this. After a more careful reading of SOLR-1653, I noticed that they referred to another filter. I changed my configuration from /solr/.PatternReplaceCharFilterFactory to /solr/.PatternReplaceFilterFactory and updated

[ANN] Webinar, Sep 8: From RDBMS to Apache Solr/Lucene - Open Source Search for database developers

2010-08-31 Thread Chris Hostetter
An upcoming Webinar sponsored by my employeer that folks might be interested in... From RDBMS to Apache Solr/Lucene Open Source Search for database developers September 8, 2010 - 09:00 PDT, 12:00 EDT, 16:00 GMT Hosted by Bitpipe.com and sponsored by Lucid

questions about synonyms

2010-08-31 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Hello, I have an couple of questions about synonyms. 1. I got a very big text file of synonyms. How I can use it? Do I need to index this text file first? 2. Is there a way to do synonyms' highlight in search result? 3. Does anyone use WordNet to solr? Thanks so much in advance,

Re: Resume Solr indexing CSV after exception

2010-08-31 Thread romiawasthy
How do I use this feature, is there some parameter that I need to specify in the update request? curl http://localhost:8983/solr/update/csv?stream.file=exampledocs/books.csvstream.contentType=text/plain;charset=utf-8 -- View this message in context:

Re: questions about synonyms

2010-08-31 Thread Geert-Jan Brits
concerning: . I got a very big text file of synonyms. How I can use it? Do I need to index this text file first? have you seen http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#SynonymFilter ? Cheers, Geert-Jan http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#SynonymFilter

Job Announcement: Web Developer with Solr and JSP / JSTL Experience

2010-08-31 Thread John Roberts
Hello, We need an experienced Solr and JSP / JSTL developer to create web applications for data retrieval. You'll write Java code to transform data from its native format into the format required for Solr indexing. You'll then build a web application to enable users to search the data and view

Re: How to use protwords.txt

2010-08-31 Thread Erick Erickson
In addition to Tomas' question, be aware that if you're already indexed data, the stemming has already been done, you'll have to re-index to get the right tokens in there. Best Erick On Tue, Aug 31, 2010 at 6:08 AM, Tomas tomasflo...@yahoo.com.ar wrote: Shaui, are you using a

Re: Alphanumeric wildcard search problem

2010-08-31 Thread Erick Erickson
Really look at the analysis page in solr admin for how your analyzer chain handles things, or you'll spend time until you're really old having trouble :). Here's what I see on a quick scan: StandardTokenizer tries to, among other things, preserve email addresses. The kinds of strings you're

Re: Solr creates whitespace in dismax query

2010-08-31 Thread Erick Erickson
keywordtokenizerfactory interprets the entire input as a single token, so this could be a problem for you. For instance, the text: bloom2000.de is some text will get indexed as a single token. Seaches on some or text won't match. This may be what you're looking for, but I really think Mitch

Problems indexing spatial field - undefined subField

2010-08-31 Thread Simon Wistow
I'm trying to index a latLon field. I have a fieldType in my schema.xml that looks like fieldType name=latLon class=solr.LatLonType subFieldSuffix=_latLon/ and a field that looks like field name=location type=latLon indexed=true stored=true/ I'm trying upload via the JSON update handler but

Re: Problems indexing spatial field - undefined subField

2010-08-31 Thread Simon Wistow
On Wed, Sep 01, 2010 at 01:05:47AM +0100, me said: I'm trying to index a latLon field. fieldType name=latLon class=solr.LatLonType subFieldSuffix=_latLon/ field name=location type=latLon indexed=true stored=true/ Turns out changing it to fieldType name=latLon class=solr.LatLonType

Re: Memcache for Solr

2010-08-31 Thread Glen Newton
? The second post was relevant to the original post. And even dealt with some of the questions asked in the original: Q are there any down sides to it and difficult to implement A We found it wasn't feasible to cache arbitrary result sets... ? -glen On 31 August 2010 15:11, Chris Hostetter

Re: Memcache for Solr

2010-08-31 Thread Chris Hostetter
: ? : The second post was relevant to the original post. : And even dealt with some of the questions asked in the original: The first msg with subject Memcache for Solr was a thread-jack of an existing thread Stripping leading/trailing punctuation with SOLR-1653

Re: Memcache for Solr

2010-08-31 Thread Glen Newton
Apologies Chris: my mistake. -Glen On 31 August 2010 23:27, Chris Hostetter hossman_luc...@fucit.org wrote: : ? : The second post was relevant to the original post. : And even dealt with some of the questions asked in the original: The first msg with subject Memcache for Solr was a

Re: about some examples

2010-08-31 Thread Pavan Gupta
Hi Kajal, You can refer to the example given at this link: http://wiki.apache.org/solr/SolrJS#Creating_the_reuters_example http://wiki.apache.org/solr/SolrJS#Creating_the_reuters_exampleRegards, Pavan On Tue, Aug 31, 2010 at 10:47 PM, kajal patil kpp0...@gmail.com wrote: We are working on