Re: charfilter doesn't do anything

2013-09-11 Thread Andreas Owen
. -- Jack Krupansky -Original Message- From: Andreas Owen Sent: Tuesday, September 10, 2013 7:07 AM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything ok i am getting there now but if there are newlines involved the regex stops as soon as it reaches a \r\n

Re: charfilter doesn't do anything

2013-09-10 Thread Andreas Owen
: Monday, September 09, 2013 7:05 PM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything i tried but that isn't working either, it want a data-stream, i'll have to check how to post json instead of xml On 10. Sep 2013, at 12:52 AM, Jack Krupansky wrote: Did you at least

Re: charfilter doesn't do anything

2013-09-10 Thread Jack Krupansky
. -- Jack Krupansky -Original Message- From: Andreas Owen Sent: Tuesday, September 10, 2013 7:07 AM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything ok i am getting there now but if there are newlines involved the regex stops as soon as it reaches a \r\n

Re: charfilter doesn't do anything

2013-09-09 Thread Jack Krupansky
Did you in fact try my suggested example? If not, please do so. -- Jack Krupansky -Original Message- From: Andreas Owen Sent: Monday, September 09, 2013 4:42 PM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything i index html pages with a lot of lines

Re: charfilter doesn't do anything

2013-09-09 Thread Andreas Owen
, HTML tag stripped In your original query, you didn't show us what your default field, df parameter, was. -- Jack Krupansky -Original Message- From: Andreas Owen Sent: Sunday, September 08, 2013 5:21 AM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything

Re: charfilter doesn't do anything

2013-09-09 Thread Andreas Owen
the standard Solr simple post tool. -- Jack Krupansky -Original Message- From: Andreas Owen Sent: Monday, September 09, 2013 6:40 PM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything i've downloaded curl and tried it in the comman prompt and power shell

Re: charfilter doesn't do anything

2013-09-09 Thread Andreas Owen
? If not, please do so. -- Jack Krupansky -Original Message- From: Andreas Owen Sent: Monday, September 09, 2013 4:42 PM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything i index html pages with a lot of lines and not just a string with the body-tag

Re: charfilter doesn't do anything

2013-09-09 Thread Jack Krupansky
@lucene.apache.org Subject: Re: charfilter doesn't do anything i've downloaded curl and tried it in the comman prompt and power shell on my win 2008r2 server, thats why i used my dataimporter with a single line html file and copy/pastet the lines into schema.xml On 9. Sep 2013, at 11:20 PM, Jack

Re: charfilter doesn't do anything

2013-09-09 Thread Jack Krupansky
, September 09, 2013 7:05 PM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything i tried but that isn't working either, it want a data-stream, i'll have to check how to post json instead of xml On 10. Sep 2013, at 12:52 AM, Jack Krupansky wrote: Did you at least try

Re: charfilter doesn't do anything

2013-09-08 Thread Andreas Owen
yes but that filter html and not the specific tag i want. On 7. Sep 2013, at 7:51 PM, Erick Erickson wrote: Hmmm, have you looked at: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripCharFilterFactory Not quite the body, perhaps, but might it help? On Fri, Sep

Re: charfilter doesn't do anything

2013-09-08 Thread Jack Krupansky
; Shows nothing, HTML tag stripped In your original query, you didn't show us what your default field, df parameter, was. -- Jack Krupansky -Original Message- From: Andreas Owen Sent: Sunday, September 08, 2013 5:21 AM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do

Re: charfilter doesn't do anything

2013-09-07 Thread Erick Erickson
Hmmm, have you looked at: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripCharFilterFactory Not quite the body, perhaps, but might it help? On Fri, Sep 6, 2013 at 11:33 AM, Andreas Owen a...@conx.ch wrote: ok i have html pages with html.!--body--content i

Re: charfilter doesn't do anything

2013-09-07 Thread Jack Krupansky
: charfilter doesn't do anything i would like to filter / replace a word during indexing but it doesn't do anything and i dont get a error. in schema.xml i have the following: field name=text_html type=text_cutHtml indexed=true stored=true multiValued=true/ fieldType name=text_cutHtml class

Re: charfilter doesn't do anything

2013-09-06 Thread Andreas Owen
Sent: Thursday, September 05, 2013 2:41 PM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything On 9/5/2013 10:03 AM, Andreas Owen wrote: i would like to filter / replace a word during indexing but it doesn't do anything and i dont get a error. in schema.xml i have

Re: charfilter doesn't do anything

2013-09-06 Thread Jack Krupansky
and a sample of your actual Solr input (Solr XML or JSON?) so that we can see what fields are being populated. -- Jack Krupansky -Original Message- From: Andreas Owen Sent: Friday, September 06, 2013 4:01 AM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything

Re: charfilter doesn't do anything

2013-09-06 Thread Andreas Owen
fields are being populated. -- Jack Krupansky -Original Message- From: Andreas Owen Sent: Friday, September 06, 2013 4:01 AM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything the input string is a normal html page with the word Zahlungsverkehr in it and my

Re: charfilter doesn't do anything

2013-09-06 Thread Shawn Heisey
On 9/6/2013 7:09 AM, Andreas Owen wrote: i've managed to get it working if i use the regexTransformer and string is on the same line in my tika entity. but when the string is multilined it isn't working even though i tried ?s to set the flag dotall. entity name=tika

Re: charfilter doesn't do anything

2013-09-06 Thread Andreas Owen
ok i have html pages with html.!--body--content i want!--/body--./html. i want to extract (index, store) only that between the body-comments. i thought regexTransformer would be the best because xpath doesn't work in tika and i cant nest a xpathEntetyProcessor to use xpath. what i

charfilter doesn't do anything

2013-09-05 Thread Andreas Owen
i would like to filter / replace a word during indexing but it doesn't do anything and i dont get a error. in schema.xml i have the following: field name=text_html type=text_cutHtml indexed=true stored=true multiValued=true/ fieldType name=text_cutHtml class=solr.TextField analyzer

Re: charfilter doesn't do anything

2013-09-05 Thread Jack Krupansky
And show us an input string and a query that fail. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Thursday, September 05, 2013 2:41 PM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything On 9/5/2013 10:03 AM, Andreas Owen wrote: i would like

Re: charfilter doesn't do anything

2013-09-05 Thread Shawn Heisey
On 9/5/2013 10:03 AM, Andreas Owen wrote: i would like to filter / replace a word during indexing but it doesn't do anything and i dont get a error. in schema.xml i have the following: field name=text_html type=text_cutHtml indexed=true stored=true multiValued=true/ fieldType