Hi Koji,
My question was more about the Solr DIH syntax. It doesn't work with
the new regex either.
In particular, the syntax of this:
<field column="source" xpath="/product/url" regex="http:\/\/(.*?)\/(.*)"
/> <--- Is this correct (the syntax, not the regex)?
Example: url=http://www.abcd.com/product.php?id=324 --> I want to index
source = abcd.com
Thanks for your help.
On 25/02/2011 01:43, Koji Sekiguchi wrote:
Hi Rosa,
<field column="description" xpath="/product/content"
regex="[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[A-Z]{2,4}" replaceWith="" />
Shouldn't it be regex="[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-z]{2,4}"?
<field column="source" xpath="/product/url"
regex="http://(.*?)\\/(.*)" />
Example: url=http://www.abcd.com/product.php?id=324 --> I want to
index source = abcd.com
Probably it could be regex="http:\/\/(.*?)\/(.*)"
I use a regex web tool:
http://www.regexplanet.com/simple/index.html
Koji
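
For anyone reading this thread later, here is a small sketch for checking the two patterns outside Solr. It is only an illustration: DIH uses Java regular expressions, and this uses Python's re module on the assumption that these particular constructs behave the same in both. The names HOST_PATTERN and extract_source are made up for the example, and the alternative pattern is a suggestion, not something proposed in the thread. The pattern from the thread has two capture groups, and its first group still includes "www.". As far as I know, the regex attribute also only takes effect when the enclosing entity declares transformer="RegexTransformer", so that is worth checking first.

```python
import re

# Pattern quoted in the thread: two capture groups, the first one is lazy.
THREAD_PATTERN = re.compile(r"http:\/\/(.*?)\/(.*)")

# Hypothetical alternative: skip an optional "www." and capture everything
# up to the next "/" in a single group.
HOST_PATTERN = re.compile(r"https?://(?:www\.)?([^/]+)")


def extract_source(url):
    """Return the captured host, or None when the pattern does not match."""
    match = HOST_PATTERN.search(url)
    return match.group(1) if match else None


if __name__ == "__main__":
    url = "http://www.abcd.com/product.php?id=324"
    # Shows both groups captured by the pattern from the thread.
    print(THREAD_PATTERN.search(url).groups())
    # Shows the single group captured by the alternative pattern.
    print(extract_source(url))
```

With the example URL, the thread's pattern captures "www.abcd.com" and "product.php?id=324", while the single-group pattern captures "abcd.com". If the pattern holds up, it should be usable as-is in the regex attribute, since "/" needs no escaping there.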