Hi,

I'm trying to remove all email address in my content field with following line:

<field column="description" xpath="/product/content" regex="[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[A-Z]{2,4}" replaceWith="" />

But it doesn't seem to remove emails? Is the syntax right?

Second thing:

I would like to extract domain name from url via a regex:

<field column="source" xpath="/product/url" regex="http://(.*?)\\/(.*)"  />

Example: url=http://www.abcd.com/product.php?id=324 --> i want to index source = abcd.com

What the syntax for this one?

Thanks for your help

Rosa

Reply via email to