Build failed in Jenkins: Nutch-nutchgora #796

2013-10-22 Thread Apache Jenkins Server
See -- [...truncated 3528 lines...] [ivy:resolve] :: loading settings :: file = compile: jar: deps-test: deploy: copy-genera

Re: About ParseMetadata

2013-10-22 Thread feng lu
On Tue, Oct 22, 2013 at 7:34 PM, Talat UYARER wrote: > ORIGINAL_CHAR_ENCODING > yes, in nutch 2.x , it not use parseMeta and contentMeta in Parse Object. one way is to clean this code block and another way is to add parseMeta in Parse Object. and another parser may will use this meta data. I agre

Re: Alternative to Forrest for Nutch website

2013-10-22 Thread Julien Nioche
Thanks Chris. Interesting indeed. see that Mahout and Lucene use the Apache CMS, whereas Tika uses Maven and Hadoop is on Forrest. Definitely an option to keep in mind when one of us has the time to do some work on that Julien On 22 October 2013 14:07, Markus Jelsma wrote: > Sounds great! Forr

RE: Alternative to Forrest for Nutch website

2013-10-22 Thread Markus Jelsma
Sounds great! Forrest is a bit tedious to work with. -Original message- > From:Chris Mattmann > Sent: Tuesday 22nd October 2013 15:02 > To: dev@nutch.apache.org > Subject: Re: Alternative to Forrest for Nutch website > > Hey Jul, > > A lot are using the Apache CMS: > > http://www.ap

Re: Alternative to Forrest for Nutch website

2013-10-22 Thread Chris Mattmann
Hey Jul, A lot are using the Apache CMS: http://www.apache.org/dev/cms.html That's infra recommended. Besides that some are using Confluence; some use Maven; others use Markdown via CMS, etc. My +1 would be for the CMS, but I don't have time to set it up (luckily infra can help and we can reu

Alternative to Forrest for Nutch website

2013-10-22 Thread Julien Nioche
Hi guys I am about to modify the list of committers on the website and realised that I had forgotten how frustrating it is to have to install Forrest etc... Any idea of what is used by other Apache projects for their websites? Thanks Julien -- * *Open Source Solutions for Text Engineering h

About ParseMetadata

2013-10-22 Thread Talat UYARER
Hi, When I try to port ATLANTBH's filter-xpath pluigns. I saw a parsemetadata object. I think this used from 1.x. I do little search in 2.x I found in HTMLParser.java. it created but it is not set any every. Can you explain this is need us in 2.x or we can clean this code block ? If this is u

[jira] [Commented] (NUTCH-1477) NPE when injecting with DataFileAvroStore

2013-10-22 Thread Alfonso Nishikawa (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13801603#comment-13801603 ] Alfonso Nishikawa commented on NUTCH-1477: -- [~alexmc] : Nutch Persistent classes