Hi Pal, hi all,
The merge-output appears while merging indexes, not merge segments. I attach 
the recrawl script, for clear.
In several first time of recrawl, it works without problem, but after that, the 
merge-output dir appears  and it says

        ...merge-output already exists...

Anyone know exactly to resolve this ?
Thanks a lot,

-----Original Message-----
From: Susam Pal [mailto:[EMAIL PROTECTED] 
Sent: 21 tháng sáu 2007 5:00 Chiều
To: [EMAIL PROTECTED]
Subject: Re: Problem with merge-output

This is something I usually do:-

$NUTCH_HOME/bin/nutch mergesegs crawl/MERGEDsegments crawl/segments/*
rm -rf crawl/segments/*
mv crawl/MERGEDsegments/* crawl/segments

You might want to replace the second statement with a 'mv' statement
to backup the segments.

Regards,
Susam Pal
http://susam.in/

On 6/21/07, Phạm Hải Thanh <[EMAIL PROTECTED]> wrote:
> Hi all,
>
> After recrawl several times, I have problem with the directory: merge-output. 
> I have digged into mail archive and found some clue: you should use a new dir 
> name for the new merge, e.g., merge-output_new, then mv merge-output_new to 
> merge-output.
>
>
>
> Anyone can show me exactly how to do this ?
>
> Thanks a lot
>
>
>
> ============================================================================
>
> After refetching database during index merging I get following error.
>
>
>
> 2007-04-27 15:58:37,787 FATAL indexer.IndexMerger - IndexMerger:
>
> java.io.IOException: Target /usr/local/nutch/nutchdb/index/merge-output 
> already
>
> exists
>
>         at org.apache.hadoop.fs.FileUtil.checkDest(FileUtil.java:230)
>
>         at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:70)
>
>         at
>
> org.apache.hadoop.fs.LocalFileSystem.copyFromLocalFile(LocalFileSystem.java:49)
>
>         at
>
> org.apache.hadoop.fs.FileSystem.moveFromLocalFile(FileSystem.java:750)
>
>         at
>
> org.apache.hadoop.fs.ChecksumFileSystem.completeLocalOutput(ChecksumFileSystem.java:622)
>
>         at org.apache.nutch.indexer.IndexMerger.merge(IndexMerger.java:104)
>
>         at org.apache.nutch.indexer.IndexMerger.run(IndexMerger.java:150)
>
>         at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
>
>         at org.apache.nutch.indexer.IndexMerger.main(IndexMerger.java:113)
>
>
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to