Hi Koji,

My question was more about the solr DIH syntax. It doesn't work either with the new regex.

Especially the syntax for this:

<field column="source" xpath="/product/url" regex="http:\/\/(.*?)\/(.*)" /> <--- Is it correct? (not the regex, the syntax)?

Example: url=http://www.abcd.com/product.php?id=324 --> i want to index source = abcd.com

thanks for your help


Le 25/02/2011 01:43, Koji Sekiguchi a écrit :
Hi Rosa,

<field column="description" xpath="/product/content"
regex="[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[A-Z]{2,4}" replaceWith="" />

Shouldn't it be regex="[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-z]{2,4}"?

<field column="source" xpath="/product/url" regex="http://(.*?)\\/(.*)" />

Example: url=http://www.abcd.com/product.php?id=324 --> i want to index source = abcd.com

Probably it could be regex="http:\/\/(.*?)\/(.*)"

I use a regex web tool:

http://www.regexplanet.com/simple/index.html

Koji

Reply via email to