Ken Krugler wrote:
I'm wondering whether it would also make sense to remove anchor text from URLs. For example, currently these two URLs are treated as different:http://www.dina.kvl.dk/~sestoft/gcsharp/index.html#wordindexandhttp://www.dina.kvl.dk/~sestoft/gcsharp/index.html Is it safe to always strip # followed by (valid anchor characters) at the end of a URL?
Yes, I think so. Please submit a patch. Are there other common session ids that we should remove in this file? Doug