[Nutch-dev] [jira] Updated: (NUTCH-52) Parser plugin for MS Excel files

2005-04-25 Thread Rohit Kulkarni (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-52?page=all ] Rohit Kulkarni updated NUTCH-52: Attachment: parse-msexcel.zip The plugin is tested with the latest nutch SVN and seems to work fine. Currently only STRING and NUMERIC Excel cell data types are b

[Nutch-dev] [jira] Created: (NUTCH-53) Parser plugin for Zip files

2005-04-25 Thread Rohit Kulkarni (JIRA)
Parser plugin for Zip files --- Key: NUTCH-53 URL: http://issues.apache.org/jira/browse/NUTCH-53 Project: Nutch Type: Improvement Components: fetcher Reporter: Rohit Kulkarni Priority: Trivial Nutch plugin to parse Zip files (usi

[Nutch-dev] [PATCH] - NDFS TestClient command line handling

2005-04-25 Thread Piotr Kosiorowski
Hello, I am attaching a really small patch for TestClient that fixes display of command line usage adding supported but not mentioned in usage information parameters (-report and -mkdir) . It also allows to invoke TestClient with one parameter (eg. -report when filesystem information is read f

[Nutch-dev] Bug: Nutch indexer crashed

2005-04-25 Thread John Doe
I spent about 30 minutes trying to figure out how to submit a bug via JIRA. There must be a way, but it's not shown on any of the JIRA pages I clicked on. Anyway, here's the bug report: Component: indexer Priority: major After running for several hours on the intranet, the Nutch indexer crashe

Re: [Nutch-dev] Getting HTML source

2005-04-25 Thread Hasan Diwan
On 23/04/05, rajat swarup <[EMAIL PROTECTED]> wrote: > Most of the methods in the code return Page objects in the code. But > looking at the Page class definitions I found that there were no > fields in the Page class that would give me access to the actual HTML > source code or the parsed data ins

[Nutch-dev] Re: To get Nutch to print debug messages

2005-04-25 Thread rajat swarup
On 4/25/05, Stefan Groschupf <[EMAIL PROTECTED]> wrote: > Try to setup the logging.properties file in your jre/lib folder, there > you can setup the logging level of the java logging api that is used by > nutch. Thanks Stefan! Now the logging is better than before. > On Apr 25, 2005 4:51 AM, Raja

[Nutch-dev] [jira] Commented: (NUTCH-51) Removing a plugin after fetch but before indexing causes errors

2005-04-25 Thread Piotr Kosiorowski (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-51?page=comments#action_63652 ] Piotr Kosiorowski commented on NUTCH-51: Looking at initialization log I cannot find any kind of IndexingFilter plugin included. index-basic plugin is not included. Pr

[Nutch-dev] Re: To get Nutch to print debug messages

2005-04-25 Thread Stefan Groschupf
Try to setup the logging.properties file in your jre/lib folder, there you can setup the logging level of the java logging api that is used by nutch. Am 25.04.2005 um 13:51 schrieb rajat swarup: Hi, I'm trying to get some debug messages to be printed on the screen while the crawl is being done