Hi, I am trying to dump all of the URLs from a segment to a text file. I was able to do this successfully under Nutch 0.6 but am not able to do so under 0.7.1.
Please take a look at the command below and let me know if you can figure out why I'm getting an error. Perhaps it is due to a change from version 0.6 to 0.7.1, or maybe I just have the wrong syntax.

Note: segments/20051107233629 is a valid segment directory, as shown by the "ls" output below.

_______________________________________________________________________________________________________________________

-bash-2.05b$ bin/nutch net.nutch.pagedb.FetchListEntry -dumpurls segments/20051107233629 >foo.txt
Exception in thread "main" java.lang.NoClassDefFoundError: net/nutch/pagedb/FetchListEntry

-bash-2.05b$ ls -la segments/20051107233629
total 8
drwxr-xr-x 8 bryan bryan 1024 Nov 7 23:36 .
drwxr-xr-x 3 bryan bryan 1024 Nov 7 23:36 ..
drwxr-xr-x 2 bryan bryan 1024 Nov 7 23:36 content
drwxr-xr-x 2 bryan bryan 1024 Nov 7 23:36 fetcher
drwxr-xr-x 2 bryan bryan 1024 Nov 7 23:36 fetchlist
drwxr-xr-x 2 bryan bryan 1024 Nov 7 23:36 index
-rw-r--r-- 1 bryan bryan    0 Nov 7 23:36 index.done
drwxr-xr-x 2 bryan bryan 1024 Nov 7 23:36 parse_data
drwxr-xr-x 2 bryan bryan 1024 Nov 7 23:36 parse_text
