Messages by Date
-
2009/07/29
[Nutch Wiki] Update of "FrontPage" by AlexMc
Apache Wiki
-
2009/07/29
Re: Nutch dev. plans
Kirby Bohling
-
2009/07/29
[Nutch Wiki] Update of "CommandLineOptions" by AlexMc
Apache Wiki
-
2009/07/29
Re: New Extension Points?
Andrzej Bialecki
-
2009/07/29
New Extension Points?
Marko Bauhardt
-
2009/07/29
[jira] Updated: (NUTCH-249) black- white list url filtering
Marko Bauhardt (JIRA)
-
2009/07/29
Re: Nutch dev. plans
Andrzej Bialecki
-
2009/07/29
Re: Nutch dev. plans
Doğacan Güney
-
2009/07/27
RE: Running the Crawl without using bin/nutch in side a scala program
Sailaja Dhiviti
-
2009/07/27
Wiki errors?
Alex McLintock
-
2009/07/27
[Nutch Wiki] Trivial Update of "bin/nutch readdb" by AlexMc
Apache Wiki
-
2009/07/27
[Nutch Wiki] Update of "bin/nutch readdb" by AlexMc
Apache Wiki
-
2009/07/27
Re: Running the Crawl without using bin/nutch in side a scala program
Doğacan Güney
-
2009/07/27
Running the Crawl without using bin/nutch in side a scala program
Sailaja Dhiviti
-
2009/07/26
[jira] Updated: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.
Kirby Bohling (JIRA)
-
2009/07/26
[jira] Updated: (NUTCH-738) Close SegmentUpdater when FetchedSegments is closed
Kirby Bohling (JIRA)
-
2009/07/26
[jira] Created: (NUTCH-746) NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container.
Kirby Bohling (JIRA)
-
2009/07/26
Re: Nutch dev. plans
Andrzej Bialecki
-
2009/07/25
Re: Nutch dev. plans
Kirby Bohling
-
2009/07/25
Re: Server suggestion
Dennis Kubes
-
2009/07/25
Re: Nutch dev. plans
Andrzej Bialecki
-
2009/07/24
Re: Server suggestion
Doğacan Güney
-
2009/07/24
Re: Server suggestion
Dennis Kubes
-
2009/07/24
Server suggestion
fredericoagent
-
2009/07/22
Re: Nutch dev. plans
Enis Soztutar
-
2009/07/22
[ApacheCon US] Travel Assistance
Grant Ingersoll
-
2009/07/22
How to test searcher of nutch 1.0?
xiao yang
-
2009/07/20
Re: Nutch dev. plans
Ken Krugler
-
2009/07/17
Re: Nutch dev. plans
Kirby Bohling
-
2009/07/17
Re: Nutch dev. plans
Andrzej Bialecki
-
2009/07/17
Re: Nutch dev. plans
Dennis Kubes
-
2009/07/17
Re: Nutch dev. plans
Doğacan Güney
-
2009/07/17
Re: Nutch dev. plans
Andrzej Bialecki
-
2009/07/17
Re: Nutch dev. plans
Doğacan Güney
-
2009/07/17
Nutch dev. plans
Andrzej Bialecki
-
2009/07/16
[jira] Commented: (NUTCH-650) Hbase Integration
Andrzej Bialecki (JIRA)
-
2009/07/16
[jira] Commented: (NUTCH-650) Hbase Integration
JIRA
-
2009/07/16
[jira] Commented: (NUTCH-650) Hbase Integration
Andrzej Bialecki (JIRA)
-
2009/07/16
[Nutch Wiki] Update of "FrontPage" by DanielZhou
Apache Wiki
-
2009/07/13
[jira] Commented: (NUTCH-721) Fetcher2 Slow
JIRA
-
2009/07/13
[jira] Issue Comment Edited: (NUTCH-650) Hbase Integration
JIRA
-
2009/07/13
[jira] Commented: (NUTCH-650) Hbase Integration
JIRA
-
2009/07/13
[jira] Commented: (NUTCH-650) Hbase Integration
JIRA
-
2009/07/13
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Steven Denny (JIRA)
-
2009/07/13
[jira] Commented: (NUTCH-721) Fetcher2 Slow
Steven Denny (JIRA)
-
2009/07/13
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Steven Denny (JIRA)
-
2009/07/10
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
JIRA
-
2009/07/10
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Steven Denny (JIRA)
-
2009/07/09
[jira] Created: (NUTCH-745) MyHtmlParser getParse return not null,so all Analyzer-(zh|fr) cannot run
jcore_XiaTian (JIRA)
-
2009/07/09
[jira] Commented: (NUTCH-744) indexing items in rss-feed in seperate page
Tarun (JIRA)
-
2009/07/09
[jira] Closed: (NUTCH-744) indexing items in rss-feed in seperate page
JIRA
-
2009/07/09
[jira] Created: (NUTCH-744) indexing items in rss-feed in seperate page
Tarun Agrawal (JIRA)
-
2009/07/09
[jira] Commented: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Steven Denny (JIRA)
-
2009/07/09
[jira] Issue Comment Edited: (NUTCH-719) fetchQueues.totalSize incorrect in Fetcher2
Steven Denny (JIRA)
-
2009/07/09
[jira] Commented: (NUTCH-717) Make Nutch Solr integration easier
JIRA
-
2009/07/09
Re: Upgrade to hadoop 0.20?
Doğacan Güney
-
2009/07/09
Test Mail <EOM>
Sailaja Dhiviti
-
2009/07/08
[jira] Commented: (NUTCH-717) Make Nutch Solr integration easier
Alex McLintock (JIRA)
-
2009/07/08
[jira] Commented: (NUTCH-422) index-extra plugin creates additional fields in the index, based on configurable logic
Alex McLintock (JIRA)
-
2009/07/08
Re: Upgrade to hadoop 0.20?
Julien Nioche
-
2009/07/07
Upgrade to hadoop 0.20?
Doğacan Güney
-
2009/07/07
adding fields to index
Beats
-
2009/07/06
what is Non DFS Used in cluster summary? how to delete Non DFS Used data
Pravin Karne
-
2009/07/03
[jira] Commented: (NUTCH-743) Site search powered by Lucene/Solr
Hudson (JIRA)
-
2009/07/02
[jira] Resolved: (NUTCH-743) Site search powered by Lucene/Solr
Sami Siren (JIRA)
-
2009/07/02
what is diff between "mapred.map.tasks" and "mapred.tasktracker.map.tasks.maximum"
Pravin Karne
-
2009/07/01
Nutch is very slow....what does following graph shows
Pravin Karne
-
2009/07/01
test mail
Pravin Karne
-
2009/06/30
Getting Crawl Depth During Runtime
MyD
-
2009/06/30
Hudson build is back to normal: Nutch-trunk #861
Apache Hudson Server
-
2009/06/30
How to optimize nutch's fetch perfotmance
Pravin Karne
-
2009/06/29
Build failed in Hudson: Nutch-trunk #860
Apache Hudson Server
-
2009/06/28
Build failed in Hudson: Nutch-trunk #859
Apache Hudson Server
-
2009/06/27
Build failed in Hudson: Nutch-trunk #858
Apache Hudson Server
-
2009/06/27
Re: Build failed in Hudson: Nutch-trunk #857
Dennis Kubes
-
2009/06/27
Re: Build failed in Hudson: Nutch-trunk #857
Doğacan Güney
-
2009/06/26
Build failed in Hudson: Nutch-trunk #857
Apache Hudson Server
-
2009/06/25
Build failed in Hudson: Nutch-trunk #856
Apache Hudson Server
-
2009/06/24
Build failed in Hudson: Nutch-trunk #855
Apache Hudson Server
-
2009/06/24
Re: Per-host fetch-interval
Sandeep Tata
-
2009/06/24
Re: Per-host fetch-interval
Andrzej Bialecki
-
2009/06/23
Build failed in Hudson: Nutch-trunk #854
Apache Hudson Server
-
2009/06/23
[jira] Commented: (NUTCH-729) NPE in FieldIndexer when BasicFields url doesn't exist
Tadesse Sefer (JIRA)
-
2009/06/23
Per-host fetch-interval
Sandeep Tata
-
2009/06/23
[jira] Commented: (NUTCH-743) Site search powered by Lucene/Solr
Andrzej Bialecki (JIRA)
-
2009/06/23
[jira] Updated: (NUTCH-743) Site search powered by Lucene/Solr
Sami Siren (JIRA)
-
2009/06/23
[jira] Created: (NUTCH-743) Site search powered by Lucene/Solr
Sami Siren (JIRA)
-
2009/06/22
Build failed in Hudson: Nutch-trunk #853
Apache Hudson Server
-
2009/06/21
Build failed in Hudson: Nutch-trunk #852
Apache Hudson Server
-
2009/06/20
Build failed in Hudson: Nutch-trunk #851
Apache Hudson Server
-
2009/06/20
[jira] Resolved: (NUTCH-742) Checksum Error
Otis Gospodnetic (JIRA)
-
2009/06/20
[jira] Updated: (NUTCH-731) Redirection of robots.txt in RobotRulesParser
Otis Gospodnetic (JIRA)
-
2009/06/20
[Nutch Wiki] Update of "AddingNewLocalization" by Mike Dawson
Apache Wiki
-
2009/06/20
[Nutch Wiki] Update of "AddingNewLocalization" by Mike Dawson
Apache Wiki
-
2009/06/20
[jira] Commented: (NUTCH-731) Redirection of robots.txt in RobotRulesParser
Ken Krugler (JIRA)
-
2009/06/20
[Nutch Wiki] Update of "AddingNewLocalization" by Mike Dawson
Apache Wiki
-
2009/06/20
[jira] Commented: (NUTCH-731) Redirection of robots.txt in RobotRulesParser
Julien Nioche (JIRA)
-
2009/06/20
[Nutch Wiki] Update of "AddingNewLocalization" by Mike Dawson
Apache Wiki
-
2009/06/20
[jira] Created: (NUTCH-742) Checksum Error
mawanqiang (JIRA)
-
2009/06/19
[jira] Resolved: (NUTCH-101) RobotRulesParser
Otis Gospodnetic (JIRA)
-
2009/06/19
Build failed in Hudson: Nutch-trunk #850
Apache Hudson Server
-
2009/06/19
[jira] Commented: (NUTCH-101) RobotRulesParser
Ken Krugler (JIRA)
-
2009/06/19
Re: Plugins: when to perform web service requests, on fetch or on index?
caezar
-
2009/06/18
Re: Plugins: when to perform web service requests, on fetch or on index?
Kirby Bohling
-
2009/06/18
Build failed in Hudson: Nutch-trunk #849
Apache Hudson Server
-
2009/06/18
Language plugin tokenizers in Indexer?
Aaron Binns
-
2009/06/18
Re: Plugins: when to perform web service requests, on fetch or on index?
caezar
-
2009/06/18
Re: Plugins: when to perform web service requests, on fetch or on index?
joel gump
-
2009/06/18
Re: Plugins: when to perform web service requests, on fetch or on index?
Stefan Dlugolinsky
-
2009/06/18
Re: Plugins: when to perform web service requests, on fetch or on index?
caezar
-
2009/06/18
Re: Plugins: when to perform web service requests, on fetch or on index?
caezar
-
2009/06/18
Re: Plugins: when to perform web service requests, on fetch or on index?
Stefan Dlugolinsky
-
2009/06/18
Re: Plugins: when to perform web service requests, on fetch or on index?
joel gump
-
2009/06/18
Plugins: when to perform web service requests, on fetch or on index?
caezar
-
2009/06/17
Build failed in Hudson: Nutch-trunk #848
Apache Hudson Server
-
2009/06/17
[Nutch Wiki] Update of "HttpAuthenticationSchemes" by susam
Apache Wiki
-
2009/06/17
[Nutch Wiki] Update of "HttpAuthenticationSchemes" by wobbet
Apache Wiki
-
2009/06/17
[Nutch Wiki] Update of "Support" by Justin Gilbreath
Apache Wiki
-
2009/06/17
[Nutch Wiki] Update of "Support" by Justin Gilbreath
Apache Wiki
-
2009/06/16
Build failed in Hudson: Nutch-trunk #847
Apache Hudson Server
-
2009/06/16
Re: a nutch Chinese language processing problem
joel.gump
-
2009/06/16
a nutch Chinese language processing problem
fashengliu
-
2009/06/16
Re: Antwort: Re: Why does TestNodeWalker keep failing?
Andrzej Bialecki
-
2009/06/15
Antwort: Re: Why does TestNodeWalker keep failing?
marcel.schnippe
-
2009/06/15
Build failed in Hudson: Nutch-trunk #846
Apache Hudson Server
-
2009/06/14
Build failed in Hudson: Nutch-trunk #845
Apache Hudson Server
-
2009/06/13
Build failed in Hudson: Nutch-trunk #844
Apache Hudson Server
-
2009/06/13
Re: Why does TestNodeWalker keep failing?
Doğacan Güney
-
2009/06/12
Build failed in Hudson: Nutch-trunk #843
Apache Hudson Server
-
2009/06/12
Re: Why does TestNodeWalker keep failing?
Andrzej Bialecki
-
2009/06/12
Why does TestNodeWalker keep failing?
Doğacan Güney
-
2009/06/11
Build failed in Hudson: Nutch-trunk #842
Apache Hudson Server
-
2009/06/10
Build failed in Hudson: Nutch-trunk #841
Apache Hudson Server
-
2009/06/09
Build failed in Hudson: Nutch-trunk #840
Apache Hudson Server
-
2009/06/09
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages.
Marcin Okraszewski (JIRA)
-
2009/06/09
[Nutch Wiki] Update of "IntranetRecrawl" by susam
Apache Wiki
-
2009/06/09
[Nutch Wiki] Update of "IntranetRecrawl" by susam
Apache Wiki
-
2009/06/07
[jira] Commented: (NUTCH-735) crawl-tool.xml must be read before nutch-site.xml when invoked using crawl command
Hudson (JIRA)
-
2009/06/07
[jira] Commented: (NUTCH-740) Configuration option to override default language for fetched pages.
JIRA
-
2009/06/07
[jira] Updated: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum
JIRA
-
2009/06/07
[jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19
JIRA
-
2009/06/07
[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack
JIRA
-
2009/06/07
[jira] Commented: (NUTCH-733) plain text view of cached files ignores HTML encoding
JIRA
-
2009/06/07
[jira] Closed: (NUTCH-735) crawl-tool.xml must be read before nutch-site.xml when invoked using crawl command
JIRA
-
2009/06/07
[jira] Commented: (NUTCH-650) Hbase Integration
JIRA
-
2009/06/06
Software to Evaluate Algorithms
kloc4mif
-
2009/06/04
org.apache.nutch.protocol.file.FileError: File Error: 404
Mr Shore
-
2009/06/04
Extending Nutch to create HTML text summaries
Rodrigo Reyes C.
-
2009/06/03
Re: IOException in dedup
Nic M
-
2009/06/03
anyone sucessfully debug nutch1.0 in ecli...@windows?
Mr Shore
-
2009/06/03
Re: IOException in dedup
Doğacan Güney
-
2009/06/02
[Nutch Wiki] Update of "GettingNutchRunningWithWindows" by JohnWhelan
Apache Wiki
-
2009/06/02
[Nutch Wiki] Update of "FrontPage" by JohnWhelan
Apache Wiki
-
2009/06/02
Re: IOException in dedup
MyD
-
2009/06/02
Re: IOException in dedup
Ken Krugler
-
2009/06/02
Re: IOException in dedup
Nic M
-
2009/06/02
Re: IOException in dedup
Ken Krugler
-
2009/06/02
IOException in dedup
Nic M
-
2009/06/02
[Nutch Wiki] Update of "Support" by JulienNioche
Apache Wiki
-
2009/06/01
[jira] Updated: (NUTCH-663) Upgrade Nutch to use Hadoop 0.19
buddha1021 (JIRA)
-
2009/06/01
debugging problem of nutch10
Mr Shore
-
2009/06/01
Re: How can I get startted with Nutch 1.0
Susam Pal
-
2009/06/01
How can I get startted with Nutch 1.0
逐鹿
-
2009/05/31
Re: Ranking & Scoring Algorithm Pseudocode
Dennis Kubes
-
2009/05/31
Ranking & Scoring Algorithm Pseudocode
atencorps
-
2009/05/29
Re: Remove duplicate nutch conf files from .job file
Kirby Bohling
-
2009/05/29
[jira] Updated: (NUTCH-741) Job file includes multiple copies of nutch config files.
Kirby Bohling (JIRA)
-
2009/05/29
[jira] Created: (NUTCH-741) Job file includes multiple copies of nutch config files.
Kirby Bohling (JIRA)
-
2009/05/29
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Otis Gospodnetic (JIRA)
-
2009/05/29
Re: Eclipse Nutch1.0 IOException
Georg Kirschner
-
2009/05/29
Re: Eclipse Nutch1.0 IOException
Frank McCown
-
2009/05/29
Re: Eclipse Nutch1.0 IOException
Marko Bauhardt
-
2009/05/29
Eclipse Nutch1.0 IOException
Georg Kirschner
-
2009/05/29
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Dmitry Lihachev (JIRA)
-
2009/05/29
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Dmitry Lihachev (JIRA)
-
2009/05/29
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Dmitry Lihachev (JIRA)
-
2009/05/28
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
JIRA
-
2009/05/28
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Dmitry Lihachev (JIRA)
-
2009/05/28
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Dmitry Lihachev (JIRA)
-
2009/05/28
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Dmitry Lihachev (JIRA)
-
2009/05/28
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Otis Gospodnetic (JIRA)
-
2009/05/28
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Ken Krugler (JIRA)
-
2009/05/28
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Dmitry Lihachev (JIRA)
-
2009/05/28
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages.
Otis Gospodnetic (JIRA)
-
2009/05/28
Re: Remove duplicate nutch conf files from .job file
Otis Gospodnetic
-
2009/05/28
[jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages.
Marcin Okraszewski (JIRA)
-
2009/05/28
[jira] Created: (NUTCH-740) Configuration option to override default language for fetched pages.
Marcin Okraszewski (JIRA)
-
2009/05/28
Remove duplicate nutch conf files from .job file
Kirby Bohling
-
2009/05/28
[jira] Commented: (NUTCH-677) Segment merge filering based on segment content
Otis Gospodnetic (JIRA)
-
2009/05/28
[jira] Commented: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Otis Gospodnetic (JIRA)
-
2009/05/28
[jira] Updated: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Dmitry Lihachev (JIRA)
-
2009/05/27
[jira] Updated: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Dmitry Lihachev (JIRA)
-
2009/05/27
[jira] Created: (NUTCH-739) SolrDeleteDuplications too slow when using hadoop
Dmitry Lihachev (JIRA)
-
2009/05/27
[jira] Commented: (NUTCH-650) Hbase Integration
Otis Gospodnetic (JIRA)
-
2009/05/27
[jira] Commented: (NUTCH-693) Add configurable option for treating nofollow behaviour.
Otis Gospodnetic (JIRA)
-
2009/05/27
[jira] Assigned: (NUTCH-693) Add configurable option for treating nofollow behaviour.
Otis Gospodnetic (JIRA)
-
2009/05/27
[jira] Updated: (NUTCH-490) Extension point with filters for Neko HTML parser (with patch)
Marcin Okraszewski (JIRA)
-
2009/05/27
[jira] Updated: (NUTCH-677) Segment merge filering based on segment content
Marcin Okraszewski (JIRA)
-
2009/05/27
[jira] Updated: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum
Julien Nioche (JIRA)
-
2009/05/26
[jira] Commented: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum
Dmitry Lihachev (JIRA)