[jira] [Updated] (NUTCH-1994) Upgrade to Apache Tika 1.8

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-1994: - Attachment: NUTCH-1994-Mattmann.042515.patch.txt > Upgrade to Apache Tika 1.8 > --

[jira] [Commented] (NUTCH-1994) Upgrade to Apache Tika 1.8

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512910#comment-14512910 ] Chris A. Mattmann commented on NUTCH-1994: -- OK, so here's some more info. I print

Re: [ANNOUNCE] New Nutch committer and PMC - Guiseppe Totaro

2015-04-25 Thread Julien Nioche
Congrats and welcome Giuseppe! On 25 April 2015 at 22:43, Giuseppe Totaro wrote: > Thanks a lot Sebastian. > I am very proud to be part of this project as committer and member of the > Nutch PMC. > > I am working on Information Retrieval at scale under the supervision of > Professor Chris Mattma

Build failed in Jenkins: Nutch-trunk #3090

2015-04-25 Thread Apache Jenkins Server
See -- [...truncated 5385 lines...] [echo] Testing plugin: urlfilter-validator [junit] WARNING: multiple versions of ant detected in path for junit [junit] jar:file:/home/jenkins/tools

[jira] [Commented] (NUTCH-1994) Upgrade to Apache Tika 1.8

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512858#comment-14512858 ] Chris A. Mattmann commented on NUTCH-1994: -- Hey [~jorgelbg] I thought it was NUTC

[jira] [Commented] (NUTCH-1994) Upgrade to Apache Tika 1.8

2015-04-25 Thread Jorge Luis Betancourt Gonzalez (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512856#comment-14512856 ] Jorge Luis Betancourt Gonzalez commented on NUTCH-1994: --- This is due

[jira] [Commented] (NUTCH-1994) Upgrade to Apache Tika 1.8

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512846#comment-14512846 ] Chris A. Mattmann commented on NUTCH-1994: -- https://builds.apache.org/job/Nutch-t

[jira] [Commented] (NUTCH-1994) Upgrade to Apache Tika 1.8

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512845#comment-14512845 ] Chris A. Mattmann commented on NUTCH-1994: -- So, for whatever reason, this is brea

[jira] [Commented] (NUTCH-1991) Tika mime detection not using Nutch supplied tika-mimetypes.xml for content based detection

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512844#comment-14512844 ] Chris A. Mattmann commented on NUTCH-1991: -- This was a red herring and not the ca

Re: Unsubscribe

2015-04-25 Thread Gioele Zanzico
Me too, Thank you ! Sent from my iPhone On 23 Apr 2015, at 22:23, "Zhaohui Zhang" mailto:happy...@gmail.com>> wrote: Hi, I want to unsubscribe the email list. Best, Zhaohui -- Zhaohui Zhang Dept. of Chemical Engineering, University of Southern California Addr: 2611 Portland Street, Los Ange

[GitHub] nutch pull request: Branch 1.6

2015-04-25 Thread isAbird
GitHub user isAbird opened a pull request: https://github.com/apache/nutch/pull/22 Branch 1.6 You can merge this pull request into a Git repository by running: $ git pull https://github.com/apache/nutch branch-1.6 Alternatively you can review and apply these changes as the pa

Re: [ANNOUNCE] New Nutch committer and PMC - Guiseppe Totaro

2015-04-25 Thread Michael Joyce
Congrats Guiseppe! -- Jimmy On Fri, Apr 24, 2015 at 1:00 PM, Sebastian Nagel wrote: > Dear all, > > it is my pleasure to announce that Guiseppe Totaro has joined us > as committer and member of the Nutch PMC. Congratulations on your > new role within the Apache Nutch community! > > Guiseppe,

Re: [ANNOUNCE] New Nutch committer and PMC - Guiseppe Totaro

2015-04-25 Thread Giuseppe Totaro
Thanks a lot Sebastian. I am very proud to be part of this project as committer and member of the Nutch PMC. I am working on Information Retrieval at scale under the supervision of Professor Chris Mattmann at NASA JPL. I developed the CommonCrawlDataDumper

[jira] [Commented] (NUTCH-1991) Tika mime detection not using Nutch supplied tika-mimetypes.xml for content based detection

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512691#comment-14512691 ] Chris A. Mattmann commented on NUTCH-1991: -- So, the problem here is that tika.det

[jira] [Commented] (NUTCH-1991) Tika mime detection not using Nutch supplied tika-mimetypes.xml for content based detection

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512627#comment-14512627 ] Chris A. Mattmann commented on NUTCH-1991: -- Darn, so this seems to have broke the

[jira] [Commented] (NUTCH-1991) Tika mime detection not using Nutch supplied tika-mimetypes.xml for content based detection

2015-04-25 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512588#comment-14512588 ] Hudson commented on NUTCH-1991: --- FAILURE: Integrated in Nutch-trunk #3089 (See [https://bui

[jira] [Commented] (NUTCH-1997) Add CBOR "magic header" to CommonCrawlDataDumper output

2015-04-25 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512589#comment-14512589 ] Hudson commented on NUTCH-1997: --- FAILURE: Integrated in Nutch-trunk #3089 (See [https://bui

Build failed in Jenkins: Nutch-trunk #3089

2015-04-25 Thread Apache Jenkins Server
See Changes: [mattmann] NUTCH-1997: Fix for Add CBOR magic header to CommonCrawlDataDumper output contributed by Giuseppe Totaro, and Luke Sh. [mattmann] Fix for NUTCH-1991 Tika mime detection not using Nutch supplied tika-mimetypes.xml

[jira] [Resolved] (NUTCH-1997) Add CBOR "magic header" to CommonCrawlDataDumper output

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-1997. -- Resolution: Fixed thanks [~gostep] and [~Lukeliush]! {noformat} [chipotle:~/tmp/nutch-1

[jira] [Updated] (NUTCH-1997) Add CBOR "magic header" to CommonCrawlDataDumper output

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated NUTCH-1997: - Fix Version/s: 1.10 > Add CBOR "magic header" to CommonCrawlDataDumper output > --

[jira] [Assigned] (NUTCH-1997) Add CBOR "magic header" to CommonCrawlDataDumper output

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-1997: Assignee: Chris A. Mattmann > Add CBOR "magic header" to CommonCrawlDataDumper outp

[jira] [Work started] (NUTCH-1997) Add CBOR "magic header" to CommonCrawlDataDumper output

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-1997 started by Chris A. Mattmann. > Add CBOR "magic header" to CommonCrawlDataDumper output > -

[jira] [Resolved] (NUTCH-1991) Tika mime detection not using Nutch supplied tika-mimetypes.xml for content based detection

2015-04-25 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-1991. -- Resolution: Fixed Fix Version/s: 1.10 Committed! {noformat} [chipotle:~/tmp/nutc