Philip Brown wrote:
I am new to regex. What will the "$1$3" reproduce in the following element. What values are $1$3?

<regex>
 <pattern>(.*)(;jsessionid=[a-zA-Z0-9]{32})(.*)</pattern>
 <substitution>$1$3</substitution>
</regex>

if I leave substitution as <substitution></substitution> will this just get rid of ;jsessionid=123456789...

thanks


.

Am I wrong to assume $1 = (.*) & $3 = (.*). Which would make $2 the following (;jsessionid=[a-zA-Z0-9]{32}) from (.*)(;jsessionid=[a-zA-Z0-9]{32})(.*)

I have run a crawl with this value:

<regex>
<pattern>(.*)(;jsessionid=[a-zA-Z0-9]{32})(.*)</pattern>
<substitution>$1$3</substitution>
</regex>

added to my regex-normalizer.xml

however it did not change db. does bin/nutch crawl need a special command to get this to run?

Reply via email to