Re: DIH regex remove email + extract url

2011-02-25 Thread Rosa (Anuncios)
Hi Koji, My question was more about the solr DIH syntax. It doesn't work either with the new regex. Especially the syntax for this: field column=source xpath=/product/url regex=http:\/\/(.*?)\/(.*) / --- Is it correct? (not the regex, the syntax)? Example:

Re: DIH regex remove email + extract url

2011-02-25 Thread Koji Sekiguchi
Hi Rosa, Are you sure you have transformer=RegexTransformer in your entity/? My question was more about the solr DIH syntax. It doesn't work either with the new regex. Especially the syntax for this: field column=source xpath=/product/url regex=http:\/\/(.*?)\/(.*) / --- Is it correct?

Re: DIH regex remove email + extract url

2011-02-25 Thread Rosa (Anuncios)
Hi Koji, Yes of course i have RegexTransformer in my entity/. What i'm not sure is the syntax of this field column=source xpath=/product/url regex= / i don't need any other parameter here? Rosa Le 25/02/2011 12:21, Koji Sekiguchi a écrit : Hi Rosa, Are you sure you have

Re: DIH regex remove email + extract url

2011-02-25 Thread Koji Sekiguchi
(11/02/26 5:24), Rosa (Anuncios) wrote: Hi Koji, Yes of course i have RegexTransformer in my entity/. What i'm not sure is the syntax of this field column=source xpath=/product/url regex= / i don't need any other parameter here? Hi Rosa, So I've mentioned groupNames attribute for field

Re: DIH regex remove email + extract url

2011-02-24 Thread Koji Sekiguchi
Hi Rosa, field column=description xpath=/product/content regex=[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[A-Z]{2,4} replaceWith= / Shouldn't it be regex=[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-z]{2,4}? field column=source xpath=/product/url regex=http://(.*?)\\/(.*) / Example: