//svn.apache.org/repos/asf/lucene/nutch/branches/mapred/
Doug Cutting> Doug
Doug Cutting> Egor Chernodarov wrote:
>> Hello!
>>
>> I want to test NDFS on my nutch installation, but I have some problem.
>> I have started from wiki, where is quick demo for NDFS:
>> ht
e mapred branch and retry.
Doug Cutting> svn co
Doug Cutting> https://svn.apache.org/repos/asf/lucene/nutch/branches/mapred/
Doug Cutting> Doug
Doug Cutting> Egor Chernodarov wrote:
>> Hello!
>>
>> I want to test NDFS on my nutch installation, but I have some probl
Hello!
I want to test NDFS on my nutch installation, but I have some problem.
I have started from wiki, where is quick demo for NDFS:
http://wiki.apache.org/nutch/NutchDistributedFileSystem
On "$ nutch ndfs -put local_file /test/testfile"(or ./nutch admin db
-create and etc.) I always have except
egards,
Egor Chernodarov
---
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into yo
Hello, all!
I have killed crawl process and now I want to use fetched data.
I have try to "nutch segread -fix -dir dbdir/segments", but process is
frozen. No any output, cpu usage is high.
Last output:
--
run java in /usr/local/jdk1.4.2/
expr: syntax error
050417 023311 No Nutc
Hello!
Monday, February 28, 2005, 1:53:07 AM, you wrote:
Y> 2. How to get the number of pages in DB?
try "bin/nutch readdb dbpath -stats"
--
Best regards, mailto:[EMAIL PROTECTED]
Chernodarov Egor
--
Hello all!
I do not see any replies on my previous posts...;-/ Maybe this
details can help.
Now I can precisely tell that parse stops at URL:
http://www5a.biglobe.ne.jp/~wakers/top/
I'm poorly familiar with JAVA...
I have tried to find where in sou
Hello, all!
I have made more tests and think that parser always stops on entry 16037
Last output (specified 1 thread for parse):
050210 085448 Entry: 16037 3360 wait=0 read=0 parse=4 wait=0 write=0ms
050210 085448 Read in entry 16038
050210 085448 parsing http://www5a.biglobe.ne.jp/~wakers/top
Hello, all!
I have wrote earlier about problem with dmoz db fetch. After
unsuccessful retries I have started fetch without parse by
"-noParsing" option. All links has been fetched successfully and
now I trying to make "nutch parse" and again see that the parse
stops after ~146