Philip Brown wrote:
I am new to regex. What will the "$1$3" reproduce in the following
element. What values are $1$3?
<regex>
<pattern>(.*)(;jsessionid=[a-zA-Z0-9]{32})(.*)</pattern>
<substitution>$1$3</substitution>
</regex>
if I leave substitution as <substitution></substitution> will this
just get rid of ;jsessionid=123456789...
thanks
.
Am I wrong to assume $1 = (.*) & $3 = (.*). Which would make $2 the
following (;jsessionid=[a-zA-Z0-9]{32}) from
(.*)(;jsessionid=[a-zA-Z0-9]{32})(.*)
I have run a crawl with this value:
<regex>
<pattern>(.*)(;jsessionid=[a-zA-Z0-9]{32})(.*)</pattern>
<substitution>$1$3</substitution>
</regex>
added to my regex-normalizer.xml
however it did not change db. does bin/nutch crawl need a special
command to get this to run?