[ 
https://issues.apache.org/jira/browse/PDFBOX-3906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16165073#comment-16165073
 ] 

Andreas Lehmkühler commented on PDFBOX-3906:
--------------------------------------------

I did some more research. Here is the current status of the test files:

+official and public records from the US+
- 002.jb2, 006.jb2
- 2012311000*.jb2 (11 files in total)

Representations of public U.S. government documents should be in the public 
domain and could be included.

+Lewinsky+
- 003.jb2

One page from an [official 
report|https://www.gpo.gov/fdsys/pkg/GPO-CDOC-106sdoc3/pdf/GPO-CDOC-106sdoc3-2.pdf]
 concerning the Lewinsky scandal. It should be in the public domain as well.

+University of British Columbia+
- 007.jb2, 042_*.jb2 (23 files in total; all files contain the same single 
page from the CCITT spec)
- amb_1.jb2, amb_2.jb2: a dithered low-resolution image of Ally McBeal, aka 
Calista Flockhart

We have written permission from the generator of these files to use them. 
All files contain some copyrighted material. As we are not interested in the 
"visual content" at all and IMHO no one else is likely to find a use case that 
benefits from the "visual content" itself, it should be safe to claim fair use 
for those files.
I'm going to create a JIRA ticket to clarify that.

+Constitution+
- 005.jb2

Seems to be one page from an older version of "The Constitution of the United 
States of America, Analysis and Interpretation".
[Here|https://www.gpo.gov/fdsys/pkg/GPO-CONAN-REV-2016/content-detail.html] is 
a more recent version.

+ITU+
- sampledata_page1.jb2
- sampledata_page2.jb2
- sampledata_page3.jb2

Bitstreams that are reproduced as hex dumps within the T.88 specification 
document. The ITU grants us a license to use them, but with a non-commercial 
restriction.
I'm going to create a JIRA ticket to see if that is an issue or if we can make 
an exception, as these files are used for tests only.

+Libertarianism: A Primer+
- 004.jb2

Two pages from "Libertarianism: A Primer" by David Boaz, a copyrighted work from 
1997. We might claim fair use here as well, as these are just two pages from a 
300-page book and we are not interested in the content itself. I'm going to 
create a JIRA ticket to clarify that.

In the end, there might be some files that must not be included in our repo. We 
have to decide whether we host them somewhere outside, like the Isartor files, 
or simply omit them.

> Contributing the JBig2 ImageIO Plugin to PDFBox
> ------------------------------------------------
>
>                 Key: PDFBOX-3906
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3906
>             Project: PDFBox
>          Issue Type: Task
>            Reporter: Jörg Henne
>         Attachments: jbig2-imageio.tgz, Re_JBIG2bitstreamTestfiles.eml
>
>
> Levigo solutions GmbH donates the Java ImageIO plugin for JBIG2 to the 
> PDFBox project. The plugin is currently hosted at 
> https://github.com/levigo/jbig2-imageio and has already been prepared for 
> integration. 
> The steps completed so far are:
> - IP vetting for contributions by non-levigo developers
> - Merging/application of all pending pull requests
> - Update of the project structure in anticipation of the new home:
> -- package names
> -- license headers
> -- license files
> -- README.md
> -- release notes
> -- Maven project information
> A tgz containing the source code has been attached:
> - It is based on commit 483aab3eb9bbc02f6995a637155adf6b922ed0c0 
> (https://github.com/levigo/jbig2-imageio/commit/483aab3eb9bbc02f6995a637155adf6b922ed0c0).
>  
> - Its SHA1 is 0e07111b4bf7f5a51bf0fdd903f02f082ea3bf65



