Hi,
I'm trying to remove all email address in my content field with
following line:
<field column="description" xpath="/product/content"
regex="[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[A-Z]{2,4}" replaceWith="" />
But it doesn't seem to remove emails? Is the syntax right?
Second thing:
I would like to extract domain name from url via a regex:
<field column="source" xpath="/product/url" regex="http://(.*?)\\/(.*)" />
Example: url=http://www.abcd.com/product.php?id=324 --> i want to
index source = abcd.com
What the syntax for this one?
Thanks for your help
Rosa