Preparations for release (Re: [jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage)
Hudson (JIRA) wrote: [ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12635790#action_12635790 ] Hudson commented on NUTCH-621: -- Integrated in Nutch-trunk #585 (See [http://hudson.zones.apache.org/hudson/job/Nutch-trunk/585/]) This is out of the way now, so we can realistically start thinking about the release. :) Committers (myself included), please review the outstanding issues in JIRA and see if they can be acted upon. Thanks! -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12635790#action_12635790 ] Hudson commented on NUTCH-621: -- Integrated in Nutch-trunk #585 (See [http://hudson.zones.apache.org/hudson/job/Nutch-trunk/585/]) - Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Affects Versions: 0.7, 0.7.1, 0.7.2, 0.8, 0.8.1, 0.9.0 Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Fix For: 1.0.0 Attachments: NUTCH-621.Mattmann.091008.step3.txt, NUTCH-621.step1.Mattmann.090408.patch.txt, NUTCH-621.step1.Mattmann.091008.patch.txt Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12635359#action_12635359 ] Grant Ingersoll commented on NUTCH-621: --- Looks good! Thanks, Chris! Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Attachments: NUTCH-621.Mattmann.091008.step3.txt, NUTCH-621.step1.Mattmann.090408.patch.txt, NUTCH-621.step1.Mattmann.091008.patch.txt Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12635218#action_12635218 ] Jukka Zitting commented on NUTCH-621: - [...] get back the email from the govt [...] It's a one-way notification, AFAIK the government never responds to the crypto notifications. I guess it's just for archival purposes. So it's better not to wait for a response. :-) Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Attachments: NUTCH-621.Mattmann.091008.step3.txt, NUTCH-621.step1.Mattmann.090408.patch.txt, NUTCH-621.step1.Mattmann.091008.patch.txt Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12635241#action_12635241 ] Chris A. Mattmann commented on NUTCH-621: - Folks, Based on Jukka's comments, I've ahead and updated Nutch's README file and completed step 4/4 of the crypto usage for Nutch: http://svn.apache.org/viewvc?rev=699866view=rev Nutch is now fully compliant with Apache crypto reqts! Grant, if this is satisfactory, and you are +1, I will go ahead and close this issue. Thanks for everyone's help! Cheers, Chris Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Attachments: NUTCH-621.Mattmann.091008.step3.txt, NUTCH-621.step1.Mattmann.090408.patch.txt, NUTCH-621.step1.Mattmann.091008.patch.txt Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12630271#action_12630271 ] Grant Ingersoll commented on NUTCH-621: --- I'll take care of it. Doug actually resigned his chair duties a while ago. :-) http://www.apache.org/foundation/ Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Attachments: NUTCH-621.Mattmann.091008.step3.txt, NUTCH-621.step1.Mattmann.090408.patch.txt, NUTCH-621.step1.Mattmann.091008.patch.txt Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12630273#action_12630273 ] Grant Ingersoll commented on NUTCH-621: --- Done. Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Attachments: NUTCH-621.Mattmann.091008.step3.txt, NUTCH-621.step1.Mattmann.090408.patch.txt, NUTCH-621.step1.Mattmann.091008.patch.txt Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12630445#action_12630445 ] Chris A. Mattmann commented on NUTCH-621: - Grant: Great, thanks. Okay, once you get back the email from the govt (which hopefully we will since perhaps they will CC nutch-dev@ on the reply), I will proceed with step 4: http://www.apache.org/dev/crypto.html#inform And update the appropriate Nutch README file here: http://svn.apache.org/repos/asf/lucene/nutch/trunk/README.txt with the crypto notice and then I think we're done! Cheers, Chris Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Attachments: NUTCH-621.Mattmann.091008.step3.txt, NUTCH-621.step1.Mattmann.090408.patch.txt, NUTCH-621.step1.Mattmann.091008.patch.txt Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12628340#action_12628340 ] Grant Ingersoll commented on NUTCH-621: --- Hmmm, it's PMC board report time again and still no progress on this one. Can we get this wrapped up by the 17th of September? Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12604409#action_12604409 ] Chris A. Mattmann commented on NUTCH-621: - Hi Grant: Thanks. The code does exist in nutch, in the parse-pdf plugin. It seems to be using PDFBox's decrypt functionality: http://svn.apache.org/viewvc/lucene/nutch/trunk/src/plugin/parse-pdf/src/java/org/apache/nutch/parse/pdf/PdfParser.java?view=markup Judging by your comment, it sounds like this makes Nutch have to declare its crypto usage. I will work to move Nutch towards this. Thanks for the clarification. Cheers, Chris Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12603878#action_12603878 ] Grant Ingersoll commented on NUTCH-621: --- Hi Chris, I'm sure you have lots on your plate, but I would like to have this wrapped up for our next board report, which is due on the 16th of June. Is that feasible given your load or does it make sense to try to get some help from other Nutch members? Thanks, Grant Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12603890#action_12603890 ] Sami Siren commented on NUTCH-621: -- I agree, seem to me that we're in same situation as jackrabbit ? I think we do not provide bc libraries with nutch, only pdfbox. Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12603884#action_12603884 ] Chris A. Mattmann commented on NUTCH-621: - Hi Grant: Thanks for the poke on this. I was speaking with Jukka Zitting about this. Tika requires the crypto declaration because of its transitive Maven dependencies in its Parsing framework on the Bountycastle libraries. Nutch, on the other hand, is using Tika at this point for mime detection only, and Nutch achieves its usage of Tika (0.1-incubating) by CM'ing only the Apache Tika 0.1 jar, and not making use of any of its transitive dependencies (which are inherently Parsing specific, and not Mime Detection specific). In addition, there was a similar thread discussed here: http://markmail.org/message/u7sjfzt7naknsv34 where the consensus was you don't need crypto notifications if you don't include any crypto libraries or use the related functionality in an included other library that has an optional dependency on a crypto library. So, I think that Nutch falls within that category. Would you agree? Thanks for your help and guidance. Cheers, Chris Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-621) Nutch needs to declare it's crypto usage
[ https://issues.apache.org/jira/browse/NUTCH-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12603898#action_12603898 ] Grant Ingersoll commented on NUTCH-621: --- My understanding is (and I haven't looked at the code) that Nutch has/had the following lines of code somewhere in it: if (pdf.isEncrypted()) { DocumentEncryption decryptor = new DocumentEncryption(pdf); //Just try using the default password and move on decryptor.decryptDocument(); } We discussed this at the PMC level a while back and felt that this, unfortunately, was enough to qualify Nutch as having crypto capabilities at some point in time since it explicitly refers to PDFBox's API for decrypting. Note, also, that it doesn't matter whether it is removed going forward, the code is out there already, as I understand it. I can't speak to Jackrabbit's assessment. Nutch needs to declare it's crypto usage Key: NUTCH-621 URL: https://issues.apache.org/jira/browse/NUTCH-621 Project: Nutch Issue Type: Task Reporter: Grant Ingersoll Assignee: Chris A. Mattmann Priority: Blocker Per the ASF board direction outlined at http://www.apache.org/dev/crypto.html, Nutch needs to declare it's use of crypto libraries (i.e. BouncyCastle, via PDFBox/Tika). See TIKA-118. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.