nutch-dev
Thread
Date
Earlier messages
Later messages
Messages by Thread
Re: [DISCUSS] contents of nutch release artifact
Sami Siren
Re: [DISCUSS] contents of nutch release artifact
Eric J. Christeson
Re: [DISCUSS] contents of nutch release artifact
Bartosz Gadzimski
Re: [DISCUSS] contents of nutch release artifact
Andrzej Bialecki
Re: [DISCUSS] contents of nutch release artifact
Sami Siren
Re: [DISCUSS] contents of nutch release artifact
Doğacan Güney
Re: [DISCUSS] contents of nutch release artifact
Andrzej Bialecki
Re: [DISCUSS] contents of nutch release artifact
Jukka Zitting
Re: [DISCUSS] contents of nutch release artifact
Jukka Zitting
Re: [DISCUSS] contents of nutch release artifact
Eric J. Christeson
[jira] Created: (NUTCH-727) Add KEYS file to release artifact
Sami Siren (JIRA)
[jira] Resolved: (NUTCH-727) Add KEYS file to release artifact
Sami Siren (JIRA)
[jira] Commented: (NUTCH-727) Add KEYS file to release artifact
Hudson (JIRA)
[jira] Created: (NUTCH-726) README.txt is lacking info that should be there
Sami Siren (JIRA)
[jira] Resolved: (NUTCH-726) README.txt is lacking info that should be there
Sami Siren (JIRA)
[jira] Commented: (NUTCH-726) README.txt is lacking info that should be there
Hudson (JIRA)
[jira] Commented: (NUTCH-525) DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment
minhthucpham (JIRA)
[jira] Created: (NUTCH-724) Drop the JAI libraries
Jukka Zitting (JIRA)
[jira] Resolved: (NUTCH-724) Drop the JAI libraries
Sami Siren (JIRA)
[jira] Created: (NUTCH-725) NOTICE.txt is lacking info that should be there
Sami Siren (JIRA)
[jira] Resolved: (NUTCH-725) NOTICE.txt is lacking info that should be there
Sami Siren (JIRA)
[jira] Commented: (NUTCH-725) NOTICE.txt is lacking info that should be there
Jukka Zitting (JIRA)
[jira] Commented: (NUTCH-725) NOTICE.txt is lacking info that should be there
Hudson (JIRA)
[jira] Created: (NUTCH-723) LICENCE.txt is lacking info that should be there
Sami Siren (JIRA)
[jira] Resolved: (NUTCH-723) LICENCE.txt is lacking info that should be there
Sami Siren (JIRA)
[jira] Issue Comment Edited: (NUTCH-723) LICENCE.txt is lacking info that should be there
Sami Siren (JIRA)
[jira] Commented: (NUTCH-723) LICENCE.txt is lacking info that should be there
Jukka Zitting (JIRA)
[jira] Commented: (NUTCH-723) LICENCE.txt is lacking info that should be there
Hudson (JIRA)
[jira] Commented: (NUTCH-723) LICENCE.txt is lacking info that should be there
Sami Siren (JIRA)
[jira] Created: (NUTCH-722) Nutch contains jars that we cannot redistribute
Sami Siren (JIRA)
[jira] Commented: (NUTCH-722) Nutch contains jars that we cannot redistribute
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-722) Nutch contains jars that we cannot redistribute
Jukka Zitting (JIRA)
[jira] Commented: (NUTCH-722) Nutch contains jars that we cannot redistribute
Jukka Zitting (JIRA)
[jira] Commented: (NUTCH-722) Nutch contains jars that we cannot redistribute
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-722) Nutch contains jars that we cannot redistribute
Sami Siren (JIRA)
[jira] Commented: (NUTCH-722) Nutch contains jars that we cannot redistribute
Sami Siren (JIRA)
[jira] Resolved: (NUTCH-722) Nutch contains jars that we cannot redistribute
Sami Siren (JIRA)
[jira] Commented: (NUTCH-722) Nutch contains jars that we cannot redistribute
Hudson (JIRA)
MergeSegments Error.
Armando Gonçalves
[jira] Created: (NUTCH-721) Fetcher2 Slow
Roger Dunk (JIRA)
[jira] Updated: (NUTCH-721) Fetcher2 Slow
Roger Dunk (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
JIRA
[jira] Commented: (NUTCH-721) Fetcher2 Slow
JIRA
[jira] Issue Comment Edited: (NUTCH-721) Fetcher2 Slow
JIRA
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Roger Dunk (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Hudson (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
JIRA
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Roger Dunk (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Julien Nioche (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Roger Dunk (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Otis Gospodnetic (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Otis Gospodnetic (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Roger Dunk (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Steven Denny (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
JIRA
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Julien Nioche (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-721) Fetcher2 Slow
JIRA
[jira] Updated: (NUTCH-721) Fetcher2 Slow
Julien Nioche (JIRA)
[jira] Closed: (NUTCH-721) Fetcher2 Slow
JIRA
[Nutch Wiki] Update of "NutchTutorial" by FrankMcCown
Apache Wiki
[jira] Created: (NUTCH-720) site: search operator with no query term
Frank McCown (JIRA)
[jira] Resolved: (NUTCH-720) site: search operator with no query term
Frank McCown (JIRA)
[jira] Created: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Julien Nioche (JIRA)
[jira] Issue Comment Edited: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Steven Denny (JIRA)
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Steven Denny (JIRA)
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Steven Denny (JIRA)
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
JIRA
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Steven Denny (JIRA)
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Steven Denny (JIRA)
[jira] Assigned: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Julien Nioche (JIRA)
[jira] Resolved: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Julien Nioche (JIRA)
[jira] Closed: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Julien Nioche (JIRA)
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Hudson (JIRA)
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Euan Clark (JIRA)
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Julien Nioche (JIRA)
[jira] Created: (NUTCH-718) urlfilter-subnets plugin
Dmitry Lihachev (JIRA)
[jira] Updated: (NUTCH-718) urlfilter-subnets plugin
Dmitry Lihachev (JIRA)
[jira] Updated: (NUTCH-718) urlfilter-subnets plugin
Dmitry Lihachev (JIRA)
PowerPoint Parsing Exception
Bullard, Luke
[no subject]
Agnieszka Zbrzezny
Use of gene...@l.a.o for...
Grant Ingersoll
[jira] Created: (NUTCH-717) Make Nutch Solr integration easier
Sami Siren (JIRA)
[jira] Commented: (NUTCH-717) Make Nutch Solr integration easier
Alex McLintock (JIRA)
[jira] Commented: (NUTCH-717) Make Nutch Solr integration easier
JIRA
[jira] Updated: (NUTCH-717) Make Nutch Solr integration easier
Chris A. Mattmann (JIRA)
Moving Nutch parsers to Tika
Andrzej Bialecki
Re: Moving Nutch parsers to Tika
Sami Siren
Re: Moving Nutch parsers to Tika
Otis Gospodnetic
[jira] Created: (NUTCH-716) Make subcollection index filed multivalued
Dmitry Lihachev (JIRA)
[jira] Updated: (NUTCH-716) Make subcollection index filed multivalued
Dmitry Lihachev (JIRA)
[jira] Updated: (NUTCH-716) Make subcollection index filed multivalued
Dmitry Lihachev (JIRA)
[jira] Updated: (NUTCH-716) Make subcollection index filed multivalued
Chris A. Mattmann (JIRA)
[jira] Created: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file
Dmitry Lihachev (JIRA)
[jira] Updated: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file
Dmitry Lihachev (JIRA)
[jira] Updated: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file
Dmitry Lihachev (JIRA)
[jira] Updated: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file
Dmitry Lihachev (JIRA)
[jira] Updated: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file
Dmitry Lihachev (JIRA)
[jira] Assigned: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file
Sami Siren (JIRA)
[jira] Resolved: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file
Sami Siren (JIRA)
[jira] Commented: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file
Hudson (JIRA)
[jira] Created: (NUTCH-714) Need a SFTP and SCP Protocol Handler
Sanjoy Ghosh (JIRA)
[jira] Commented: (NUTCH-714) Need a SFTP and SCP Protocol Handler
Chris A. Mattmann (JIRA)
[jira] Assigned: (NUTCH-714) Need a SFTP and SCP Protocol Handler
Chris A. Mattmann (JIRA)
[jira] Updated: (NUTCH-714) Need a SFTP and SCP Protocol Handler
Sanjoy Ghosh (JIRA)
RE: [jira] Updated: (NUTCH-714) Need a SFTP and SCP Protocol Handler
sanjoy.ghosh
[jira] Updated: (NUTCH-714) Need a SFTP and SCP Protocol Handler
Julien Nioche (JIRA)
Nutch ML cleanup
Otis Gospodnetic
Re: Nutch ML cleanup
Sami Siren
Re: Nutch ML cleanup
Sami Siren
Re: Nutch ML cleanup
Andrzej Bialecki
Re: Nutch ML cleanup
Doug Cutting
Re: Nutch ML cleanup
Otis Gospodnetic
[jira] Created: (NUTCH-713) Config options for webgraph Scoring not documented
Eric J. Christeson (JIRA)
[jira] Updated: (NUTCH-713) Config options for webgraph Scoring not documented
Eric J. Christeson (JIRA)
NUTCH-684 [was: Re: [VOTE] Release Apache Nutch 1.0]
Sami Siren
Re: NUTCH-684 [was: Re: [VOTE] Release Apache Nutch 1.0]
Doğacan Güney
Re: NUTCH-684 [was: Re: [VOTE] Release Apache Nutch 1.0]
Sami Siren
Re: NUTCH-684 [was: Re: [VOTE] Release Apache Nutch 1.0]
Doğacan Güney
[VOTE] Release Apache Nutch 1.0
Sami Siren
Re: [VOTE] Release Apache Nutch 1.0
Doğacan Güney
Re: [VOTE] Release Apache Nutch 1.0
Dennis Kubes
Re: [VOTE] Release Apache Nutch 1.0
Marko Bauhardt
Re: [VOTE] Release Apache Nutch 1.0
Eric J. Christeson
Re: [VOTE] Release Apache Nutch 1.0
Sami Siren
[VOTE] Release Apache Nutch 1.0
Sami Siren
Re: [VOTE] Release Apache Nutch 1.0
Marko Bauhardt
Re: [VOTE] Release Apache Nutch 1.0
Doğacan Güney
Re: [VOTE] Release Apache Nutch 1.0
Sami Siren
Re: [VOTE] Release Apache Nutch 1.0
Mattmann, Chris A
Re: [VOTE] Release Apache Nutch 1.0
Andrzej Bialecki
Re: [VOTE] Release Apache Nutch 1.0
Grant Ingersoll
Re: [VOTE] Release Apache Nutch 1.0
Sami Siren
Re: [VOTE] Release Apache Nutch 1.0
Grant Ingersoll
Re: [VOTE] Release Apache Nutch 1.0
Sami Siren
Re: [VOTE] Release Apache Nutch 1.0
Sami Siren
Re: [VOTE] Release Apache Nutch 1.0
Jukka Zitting
Re: [VOTE] Release Apache Nutch 1.0
Sami Siren
Re: [VOTE] Release Apache Nutch 1.0
Jukka Zitting
Re: [VOTE] Release Apache Nutch 1.0
buddha1021
[VOTE] Release Apache Nutch 1.0
Sami Siren
Re: [VOTE] Release Apache Nutch 1.0
Doğacan Güney
Re: [VOTE] Release Apache Nutch 1.0
Dennis Kubes
Re: [VOTE] Release Apache Nutch 1.0
Doğacan Güney
Re: [VOTE] Release Apache Nutch 1.0
Techie
Re: [VOTE] Release Apache Nutch 1.0
Andrzej Bialecki
Re: [VOTE] Release Apache Nutch 1.0
Sami Siren
Re: [VOTE] Release Apache Nutch 1.0
Cosmin Lehene
Re: [VOTE] Release Apache Nutch 1.0
Roger Dunk
Re: [VOTE] Release Apache Nutch 1.0
Cosmin Lehene
[Nutch Wiki] Update of "NewScoringIndexingExample" by DennisKubes
Apache Wiki
[Nutch Wiki] Update of "NewScoringIndexingExample" by DennisKubes
Apache Wiki
[Nutch Wiki] Update of "NewScoringIndexingExample" by DennisKubes
Apache Wiki
[jira] Created: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers
Julien Nioche (JIRA)
[jira] Updated: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers
Julien Nioche (JIRA)
[jira] Commented: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers
Andrzej Bialecki (JIRA)
[jira] Updated: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers
Julien Nioche (JIRA)
[jira] Updated: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers
Julien Nioche (JIRA)
[jira] Closed: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-712) ParseOutputFormat should catch java.net.MalformedURLException coming from normalizers
Hudson (JIRA)
[jira] Created: (NUTCH-711) Indexer failing after upgrade to Hadoop 0.19.1
Andrzej Bialecki (JIRA)
[jira] Updated: (NUTCH-711) Indexer failing after upgrade to Hadoop 0.19.1
Andrzej Bialecki (JIRA)
[jira] Updated: (NUTCH-711) Indexer failing after upgrade to Hadoop 0.19.1
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-711) Indexer failing after upgrade to Hadoop 0.19.1
Sami Siren (JIRA)
[jira] Resolved: (NUTCH-711) Indexer failing after upgrade to Hadoop 0.19.1
Andrzej Bialecki (JIRA)
Re: [jira] Resolved: (NUTCH-711) Indexer failing after upgrade to Hadoop 0.19.1
Sami Siren
[jira] Updated: (NUTCH-711) Indexer failing after upgrade to Hadoop 0.19.1
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-711) Indexer failing after upgrade to Hadoop 0.19.1
Hudson (JIRA)
[jira] Created: (NUTCH-710) Support for rel="canonical" attribute
Frank McCown (JIRA)
[jira] Updated: (NUTCH-710) Support for rel="canonical" attribute
Julien Nioche (JIRA)
[jira] Commented: (NUTCH-710) Support for rel="canonical" attribute
Julien Nioche (JIRA)
site: operator with no query term
Frank McCown
Re: site: operator with no query term
Otis Gospodnetic
Re: site: operator with no query term
John Martyniak
[jira] Created: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Tim Hawkins (JIRA)
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Julien Nioche (JIRA)
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Tim Hawkins (JIRA)
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Martina Koch (JIRA)
[jira] Updated: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Julien Nioche (JIRA)
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Julien Nioche (JIRA)
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Tim Hawkins (JIRA)
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Tim Hawkins (JIRA)
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Tim Hawkins (JIRA)
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Julien Nioche (JIRA)
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Jeff Shafer (JIRA)
[jira] Issue Comment Edited: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
Jeff Shafer (JIRA)
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
JIRA
Build failed in Hudson: Nutch-trunk #741
Apache Hudson Server
Hudson build is back to normal: Nutch-trunk #742
Apache Hudson Server
Parsing, Indexing multiple values (of same type) per document - Nutch-0.9
Stefan Dlugolinsky
Job offer for Nutch-Lucene Programmer
Wolfgang Sander-Beuermann
[jira] Closed: (NUTCH-419) unavailable robots.txt kills fetch
Andrzej Bialecki (JIRA)
How to make parse-xml plugin (NUTCH-185) compatible with the latest trunk ?
Gopikrishnan Kookkal
[jira] Created: (NUTCH-708) NutchBean: OOM due to searcher.max.hits and dedup.
Aaron Binns (JIRA)
[jira] Updated: (NUTCH-419) unavailable robots.txt kills fetch
Doug Cook (JIRA)
[jira] Commented: (NUTCH-419) unavailable robots.txt kills fetch
Doug Cook (JIRA)
[jira] Commented: (NUTCH-419) unavailable robots.txt kills fetch
Andrzej Bialecki (JIRA)
[jira] Commented: (NUTCH-419) unavailable robots.txt kills fetch
Hudson (JIRA)
[jira] Created: (NUTCH-707) Generation of multiple segments in multiple runs returns only 1 segment
Michael Chan (JIRA)
Earlier messages
Later messages