[jira] Issue Comment Edited: (NUTCH-664) Possibility to update already stored documents.

2008-12-02 Thread Sergey Khilkov (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12651458#action_12651458 ] skhil edited comment on NUTCH-664 at 12/2/08 1:29 AM: --- Good ne

Re: Pending Commits for Nutch Issues

2008-12-02 Thread Susam Pal
I agree with John too. Probably you meant $0.02, since 0.02 cents is too little. It is usually 2 cents. :-P Regards, Susam Pal On Tue, Dec 2, 2008 at 6:09 PM, John Martyniak <[EMAIL PROTECTED]> wrote: > Is NUTCH-442 going to be part of the 1.0 release? I hope so, Nutch/Solr > integration would b

Re: Pending Commits for Nutch Issues

2008-12-02 Thread Julien Nioche
I agree with John. NUTCH-442 is by far the most popular/watched item in JIRA and, I think, has been already used by quite a lot of different people to be deemed reliable. Julien 2008/12/2 John Martyniak <[EMAIL PROTECTED]> > Is NUTCH-442 going to be part of the 1.0 release? I hope so, Nutch/So

[jira] Closed: (NUTCH-662) Upgrade Nutch to use Lucene 2.4

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes closed NUTCH-662. -- closed > Upgrade Nutch to use Lucene 2.4 > --- > > Key: NUTCH-6

[jira] Resolved: (NUTCH-662) Upgrade Nutch to use Lucene 2.4

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes resolved NUTCH-662. Resolution: Fixed Committed with revision 722475 > Upgrade Nutch to use Lucene 2.4 > --

[jira] Closed: (NUTCH-663) Upgrade Nutch to use Hadoop 0.19

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes closed NUTCH-663. -- > Upgrade Nutch to use Hadoop 0.19 > > > Key: NUTCH-663 >

[jira] Resolved: (NUTCH-663) Upgrade Nutch to use Hadoop 0.19

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes resolved NUTCH-663. Resolution: Fixed Committed with revision 722477 > Upgrade Nutch to use Hadoop 0.19 > -

[jira] Closed: (NUTCH-647) Resolve URLs tool

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes closed NUTCH-647. -- > Resolve URLs tool > - > > Key: NUTCH-647 > URL: https://is

[jira] Resolved: (NUTCH-647) Resolve URLs tool

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes resolved NUTCH-647. Resolution: Fixed Fix Version/s: 1.0.0 Committed with revision 722478 > Resolve URLs tool >

[jira] Resolved: (NUTCH-665) Search Load Testing Tool

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes resolved NUTCH-665. Resolution: Fixed Committed with revision 722481 > Search Load Testing Tool > -

[jira] Closed: (NUTCH-665) Search Load Testing Tool

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes closed NUTCH-665. -- > Search Load Testing Tool > > > Key: NUTCH-665 > U

[jira] Closed: (NUTCH-667) Input Format for working with Content in Hadoop Streaming

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes closed NUTCH-667. -- > Input Format for working with Content in Hadoop Streaming > --

[jira] Resolved: (NUTCH-667) Input Format for working with Content in Hadoop Streaming

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes resolved NUTCH-667. Resolution: Fixed Committed with revision 722483 > Input Format for working with Content in Hadoop

Re: Pending Commits for Nutch Issues

2008-12-02 Thread John Martyniak
Is NUTCH-442 going to be part of the 1.0 release? I hope so, Nutch/Solr integration would be huge. Just my .02 cents. -John On Nov 27, 2008, at 12:10 PM, Doğacan Güney wrote: And here is a list of issues from me that need more discussion/review: NUTCH-442 - Integrate Nutch/Solr: If N

named parameters in crawl command

2008-12-02 Thread Koch Martina
Hi all, I've defined a couple of custom parameters for use with bin/nutch, for example a "-conf" parameter to set the conf dir from the command line. To be able to use the crawl command, I have to adjust the for-loop and if/else statements that handle the command line arguments args[] in the c
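As a rough illustration of the kind of argument handling described in this message, here is a minimal Java sketch that collects "-name value" pairs into a map instead of matching each flag in an if/else chain. It is not the Nutch crawl code: the class name NamedArgs, its fields, and the "-depth" flag used in main are hypothetical, and the sketch assumes every named parameter takes exactly one value.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Nutch: splits args[] into named and positional parts.
public class NamedArgs {
    // Named parameters, keyed without the leading dash (e.g. "conf").
    final Map<String, String> named = new LinkedHashMap<>();
    // Everything that is not a "-name value" pair, in order of appearance.
    final List<String> positional = new ArrayList<>();

    static NamedArgs parse(String[] args) {
        NamedArgs result = new NamedArgs();
        for (int i = 0; i < args.length; i++) {
            String arg = args[i];
            if (arg.startsWith("-") && arg.length() > 1) {
                // Assumption: each named parameter is followed by exactly one value.
                if (i + 1 >= args.length) {
                    throw new IllegalArgumentException("Missing value for " + arg);
                }
                result.named.put(arg.substring(1), args[++i]);
            } else {
                result.positional.add(arg);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Example invocation; "-depth" is only an illustrative flag here.
        NamedArgs parsed = parse(new String[] {"urls", "-conf", "/tmp/conf", "-depth", "3"});
        System.out.println("named=" + parsed.named + " positional=" + parsed.positional);
    }
}
```

With this shape, a new parameter such as "-conf" needs no change to the loop itself; the caller just looks up the key it cares about and ignores the rest.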

[jira] Created: (NUTCH-668) Domain URL Filter

2008-12-02 Thread Dennis Kubes (JIRA)
Domain URL Filter - Key: NUTCH-668 URL: https://issues.apache.org/jira/browse/NUTCH-668 Project: Nutch Issue Type: Improvement Affects Versions: 1.0.0 Environment: All Reporter: Dennis Kubes

[jira] Updated: (NUTCH-668) Domain URL Filter

2008-12-02 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes updated NUTCH-668: --- Attachment: NUTCH-668-1-20081202.patch Includes the DomainURLFilter and test files. Domains can

Build failed in Hudson: Nutch-trunk #649

2008-12-02 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Nutch-trunk/649/changes Changes: [kubes] NUTCH-667: Input Format for working with Content in Hadoop Streaming [kubes] NUTCH-665: Search Load Testing Tool [kubes] NUTCH-647: Resolve URLs tool [kubes] NUTCH-647: Resolve URLs tool [kubes] NUTCH-663: