[jira] [Commented] (NUTCH-2081) outseq and vectors directories pollute $NUTCH_HOME

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14702414#comment-14702414 ] Lewis John McGibbney commented on NUTCH-2081: - Sounds good to me. > outseq an

[jira] [Commented] (NUTCH-1486) Upgrade to Solr 4.10.2

2015-08-18 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14702383#comment-14702383 ] Hudson commented on NUTCH-1486: --- SUCCESS: Integrated in Nutch-trunk #3258 (See [https://bui

[jira] [Commented] (NUTCH-2081) outseq and vectors directories pollute $NUTCH_HOME

2015-08-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14702314#comment-14702314 ] Chris A. Mattmann commented on NUTCH-2081: -- I would suggest: model/bayes/

[jira] [Updated] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2049: Attachment: (was: TEST-org.apache.nutch.tika.TestPdfParser.txt) > Upgrade Trunk

[jira] [Updated] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2049: Attachment: NUTCH-2049v3.patch New v3 which includes parsefilter-naivebayes and pass

[jira] [Updated] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2049: Attachment: (was: NUTCH-2049v3.patch) > Upgrade Trunk to Hadoop > 2.4 stable > -

[jira] [Created] (NUTCH-2082) Upgrade to Apache Tika 1.10

2015-08-18 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2082: --- Summary: Upgrade to Apache Tika 1.10 Key: NUTCH-2082 URL: https://issues.apache.org/jira/browse/NUTCH-2082 Project: Nutch Issue Type: Improveme

[jira] [Commented] (NUTCH-2081) outseq and vectors directories pollute $NUTCH_HOME

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14702221#comment-14702221 ] Lewis John McGibbney commented on NUTCH-2081: - [~asitang] FYI > outseq and ve

[jira] [Created] (NUTCH-2081) outseq and vectors directories pollute $NUTCH_HOME

2015-08-18 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-2081: --- Summary: outseq and vectors directories pollute $NUTCH_HOME Key: NUTCH-2081 URL: https://issues.apache.org/jira/browse/NUTCH-2081 Project: Nutch

[jira] [Updated] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2049: Attachment: TEST-org.apache.nutch.tika.TestPdfParser.txt NUTCH-2049v3

[jira] [Updated] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2049: Attachment: (was: NUTCH-2049v3.patch) > Upgrade Trunk to Hadoop > 2.4 stable > -

[jira] [Commented] (NUTCH-1712) Use MultipleInputs in Injector to make it a single mapreduce job

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14702028#comment-14702028 ] Lewis John McGibbney commented on NUTCH-1712: - [~tejasp] we are in the process

[jira] [Updated] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2049: Attachment: NUTCH-2049v3.patch Patch for trunk rebased post NUTCH-1486 > Upgrade Tr

[jira] [Resolved] (NUTCH-1486) Upgrade to Solr 4.10.2

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1486. - Resolution: Fixed Committed @revision 1696506 in trunk. Thank you to all who helpe

[jira] [Commented] (NUTCH-1486) Upgrade to Solr 4.10.2

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701997#comment-14701997 ] Lewis John McGibbney commented on NUTCH-1486: - Hi Asitang, if you check the pa

[jira] [Commented] (NUTCH-1486) Upgrade to Solr 4.10.2

2015-08-18 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701978#comment-14701978 ] Sebastian Nagel commented on NUTCH-1486: +1: successfully run a small crawl and So

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Michael Joyce (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701707#comment-14701707 ] Michael Joyce commented on NUTCH-2049: -- Great stuff Lewis. Builds and runs cleanly lo

[jira] [Commented] (NUTCH-1486) Upgrade to Solr 4.10.2

2015-08-18 Thread Asitang Mishra (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701696#comment-14701696 ] Asitang Mishra commented on NUTCH-1486: --- Hey Lewis, Just noticed when I was applyin

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Asitang Mishra (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701514#comment-14701514 ] Asitang Mishra commented on NUTCH-2049: --- Ack!! > Upgrade Trunk to Hadoop > 2.4 stab

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701497#comment-14701497 ] Chris A. Mattmann commented on NUTCH-2049: -- Asitang, if you recall, we discussed

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Asitang Mishra (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701498#comment-14701498 ] Asitang Mishra commented on NUTCH-2049: --- Hi Lewis, Had some issues applying your pa

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Asitang Mishra (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701495#comment-14701495 ] Asitang Mishra commented on NUTCH-2049: --- Hi Chris, The Naive Bayes plugin, since ha

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14701315#comment-14701315 ] Chris A. Mattmann commented on NUTCH-2049: -- Great, thanks Lewis. The introduction

[jira] [Updated] (NUTCH-1486) Upgrade to Solr 4.10.2

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1486: Flags: Patch,Important > Upgrade to Solr 4.10.2 > -- > >

[jira] [Updated] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2049: Patch Info: Patch Available > Upgrade Trunk to Hadoop > 2.4 stable > ---

[jira] [Updated] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2049: Labels: memex (was: ) > Upgrade Trunk to Hadoop > 2.4 stable >

[jira] [Comment Edited] (NUTCH-1486) Upgrade to Solr 4.10.2

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14699858#comment-14699858 ] Lewis John McGibbney edited comment on NUTCH-1486 at 8/18/15 1:45 PM: --

[jira] [Commented] (NUTCH-2049) Upgrade Trunk to Hadoop > 2.4 stable

2015-08-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14700895#comment-14700895 ] Lewis John McGibbney commented on NUTCH-2049: - Hi [~chrismattmann] please see