hi,all:
in the crawl log, i found a lot of url like this: Indexing [ http://app02.laopdr.gov.la/ePortal/news/search.action;jsessionid=1g6GJ1LBlkVh9xnQJTzr39qCtCTM2w8c70jzmnmn72GwJ4yTVlRl!1869151030?d-2558428-p=2] with analyzer org.apache.nutch.analysis.nutchdocumentanaly...@6937f9 (null) Indexing [ http://app02.laopdr.gov.la/ePortal/news/search.action;jsessionid=1g6GJ1LBlkVh9xnQJTzr39qCtCTM2w8c70jzmnmn72GwJ4yTVlRl!1869151030?d-2558428-p=3] with analyzer org.apache.nutch.analysis.nutchdocumentanaly...@6937f9 (null) Indexing [ http://app02.laopdr.gov.la/ePortal/news/search.action;jsessionid=1gQ3J1QJGJBz6Rxj5pJGZGTQLppQWpQyW1f7XnGM2W21LTsnbF6L!1869151030?d-2558428-p=2] with analyzer org.apache.nutch.analysis.nutchdocumentanaly...@6937f9 (null) Indexing [ http://app02.laopdr.gov.la/ePortal/news/search.action;jsessionid=1gQ3J1QJGJBz6Rxj5pJGZGTQLppQWpQyW1f7XnGM2W21LTsnbF6L!1869151030?d-2558428-p=3] with analyzer org.apache.nutch.analysis.nutchdocumentanaly...@6937f9 (null) Indexing [ http://app02.laopdr.gov.la/ePortal/news/search.action;jsessionid=1gn8J1yLDfFCcTLkSWVg1WdKpc6wXGXG531vyBGnCnpcnpWjknPR!1869151030?d-2558428-p=2] with analyzer org.apache.nutch.analysis.nutchdocumentanaly...@6937f9 (null) Indexing [ http://app02.laopdr.gov.la/ePortal/news/search.action;jsessionid=1gn8J1yLDfFCcTLkSWVg1WdKpc6wXGXG531vyBGnCnpcnpWjknPR!1869151030?d-2558428-p=3] with analyzer org.apache.nutch.analysis.nutchdocumentanaly...@6937f9 (null) Indexing [ http://app02.laopdr.gov.la/ePortal/news/search.action;jsessionid=1kNWJ19TFq16pdQMyf61Rv2LzQlvxRKxRjLTcnnkB27qZ9XQ5HGj!1869151030?d-2558428-p=2] with analyzer org.apache.nutch.analysis.nutchdocumentanaly...@6937f9 (null) Indexing [ http://app02.laopdr.gov.la/ePortal/news/search.action;jsessionid=1kNWJ19TFq16pdQMyf61Rv2LzQlvxRKxRjLTcnnkB27qZ9XQ5HGj!1869151030?d-2558428-p=3] with analyzer org.apache.nutch.analysis.nutchdocumentanaly...@6937f9 (null) Indexing [ http://app02.laopdr.gov.la/ePortal/news/search.action;jsessionid=1njPJ1THhvJRGLLtSfTWbBLL252N6hTD4hyj9rp8YNQTfRnm96cJ!1869151030?d-2558428-p=2] with analyzer org.apache.nutch.analysis.nutchdocumentanaly...@6937f9 (null) Indexing [ http://app02.laopdr.gov.la/ePortal/news/search.action;jsessionid=1njPJ1THhvJRGLLtSfTWbBLL252N6hTD4hyj9rp8YNQTfRnm96cJ!1869151030?d-2558428-p=3] with analyzer org.apache.nutch.analysis.nutchdocumentanaly...@6937f9 (null) The url have a lot of "jsessionid", is it right? it has any impact?(like topN)