Regex in Stopword.xml

2013-07-22 Thread Scatman
Hi, 

I was looking for an issue, in order to put some regular expression in the
StopWord.xml, but it seems that we can only have words in the file.
I'm just wondering if there is a feature which will be done in this way or
if someone got a tip it will help me a lot :) 

Best,
Scatman.




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Regex-in-Stopword-xml-tp4079412.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Regex in Stopword.xml

2013-07-22 Thread Manuel Le Normand
Use the pattern replace filter factory




This will do exactly what you asked for


http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.PatternReplaceFilterFactory




On Mon, Jul 22, 2013 at 12:22 PM, Scatman  wrote:

> Hi,
>
> I was looking for an issue, in order to put some regular expression in the
> StopWord.xml, but it seems that we can only have words in the file.
> I'm just wondering if there is a feature which will be done in this way or
> if someone got a tip it will help me a lot :)
>
> Best,
> Scatman.
>
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Regex-in-Stopword-xml-tp4079412.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: Regex in Stopword.xml

2013-07-22 Thread Scatman
Thank for reply but it's not a solution that i'm looking for, and i should
better explained myself, because i got like 100 hundred regex to put in the
config. In order to manage easiest Solr, i think the better way is to put
regex in a file... I know that GSA from google do it, so i'd just hoped that
it will the case for Solr :)  

Best,
Scatman. 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Regex-in-Stopword-xml-tp4079412p4079438.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Regex in Stopword.xml

2013-07-22 Thread Jack Krupansky
How did you get the impression that GSA supports regex stop words? GSA seems 
to follow the same rules as Solr.


See the doc:
http://www.google.com/support/enterprise/static/gsa/docs/admin/70/gsa_doc_set/admin_searchexp/ce_improving_search.html#1050255

As with GSA, the stop words are a simple .TXT file.

In any case, Solr and Lucene do not support "stop words" that are regular 
expressions, although a regex filter can simulate them to a limited degree.


-- Jack Krupansky

-Original Message- 
From: Scatman

Sent: Monday, July 22, 2013 7:48 AM
To: solr-user@lucene.apache.org
Subject: Re: Regex in Stopword.xml

Thank for reply but it's not a solution that i'm looking for, and i should
better explained myself, because i got like 100 hundred regex to put in the
config. In order to manage easiest Solr, i think the better way is to put
regex in a file... I know that GSA from google do it, so i'd just hoped that
it will the case for Solr :)

Best,
Scatman.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Regex-in-Stopword-xml-tp4079412p4079438.html
Sent from the Solr - User mailing list archive at Nabble.com. 



Re: Regex in Stopword.xml

2013-07-22 Thread Scatman
I know it because i actually want to change GSA with Solr who his much better
in the enterprise's situation :) 

Thank's for reply anyway !

Best,
Scatman.





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Regex-in-Stopword-xml-tp4079412p4079491.html
Sent from the Solr - User mailing list archive at Nabble.com.