Re: [DISCUSS] Release Apache Nutch 1.10

2015-01-31 Thread Tyler Palsulich
I'd like to see TIKA-1925, but I'm having trouble building Nutch on my machine. So, no update yet. Tyler On Thu, Jan 29, 2015 at 10:13 AM, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: Thanks Lewis. ++ Chris

RE: [DISCUSS] Release Apache Nutch 1.10

2015-01-31 Thread Markus Jelsma
Ah yes indeed. NUTCH-1925 should be a blocker. The new Tika carries an updated PDFBox 1.8.8 with significant memory handling improvements on PDFs. -Original message- From: Tyler Palsulich tpalsul...@gmail.com Sent: Saturday 31st January 2015 22:35 To: dev@nutch.apache.org Subject: Re:

[jira] [Updated] (NUTCH-1925) Upgrade Tika to version 1.7

2015-01-31 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1925: Priority: Blocker (was: Major) Upgrade Tika to version 1.7

[jira] [Updated] (NUTCH-1925) Upgrade Tika to version 1.7

2015-01-31 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1925: Fix Version/s: 1.10 Upgrade Tika to version 1.7

[jira] [Commented] (NUTCH-1925) Upgrade Tika to version 1.7

2015-01-31 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1426#comment-1426 ] Markus Jelsma commented on NUTCH-1925: Tyler, can you attempt to provide a patch for

[jira] [Assigned] (NUTCH-1925) Upgrade Tika to version 1.7

2015-01-31 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma reassigned NUTCH-1925: Assignee: Markus Jelsma Upgrade Tika to version 1.7

[jira] [Updated] (NUTCH-1925) Upgrade Tika to version 1.7

2015-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1925: Fix Version/s: 2.4 Upgrade Tika to version 1.7

[jira] [Updated] (NUTCH-1925) Upgrade Tika to version 1.7

2015-01-31 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1925: Priority: Blocker (was: Major) Upgrade Tika to version 1.7

[jira] [Updated] (NUTCH-1922) DbUpdater overwrites fetch status for URLs from previous batches, causes repeated re-fetches

2015-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1922: Fix Version/s: (was: 2.4) 2.3.1 DbUpdater overwrites fetch

[jira] [Updated] (NUTCH-1925) Upgrade Tika to version 1.7

2015-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1925: Fix Version/s: (was: 2.4) 2.3.1 Upgrade Tika to version 1.7

[jira] [Updated] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2015-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1679: Fix Version/s: (was: 2.4) 2.3.1 UpdateDb using batchId,

RE: [DISCUSS] Release Apache Nutch 1.10

2015-01-31 Thread Tyler Palsulich
Doh! NUTCH-1925, you're right. Tyler On Jan 31, 2015 4:37 PM, Markus Jelsma markus.jel...@openindex.io wrote: Ah yes indeed. NUTCH-1925 should be a blocker. The new Tika carries an updated PDFBox 1.8.8 with significant memory handling improvements on PDFs. -Original message- From:

[jira] [Resolved] (NUTCH-1104) Port issues from trunk NutchGora branch

2015-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1104. Resolution: Won't Fix Fix Version/s: (was: 2.4) This issue represents

[jira] [Assigned] (NUTCH-827) HTTP POST Authentication

2015-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-827: Assignee: Lewis John McGibbney HTTP POST Authentication

[jira] [Updated] (NUTCH-827) HTTP POST Authentication

2015-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-827: Fix Version/s: 2.4 HTTP POST Authentication

[jira] [Commented] (NUTCH-827) HTTP POST Authentication

2015-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14300071#comment-14300071 ] Lewis John McGibbney commented on NUTCH-827: I am working on this issue as I

[jira] [Commented] (NUTCH-1925) Upgrade Tika to version 1.7

2015-01-31 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14300060#comment-14300060 ] Tyler Palsulich commented on NUTCH-1925: I'd be happy to, [~markus17]! But, I

[jira] [Updated] (NUTCH-1924) Nutch + HBase Docker

2015-01-31 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1924: Fix Version/s: (was: 2.4) 2.3.1 Nutch + HBase Docker