Hi,

On Sat, Mar 8, 2008 at 4:27 PM, Jukka Zitting <[EMAIL PROTECTED]> wrote:
>  On Fri, Feb 29, 2008 at 8:06 AM, Jukka Zitting <[EMAIL PROTECTED]> wrote:
>  >  For 3 I guess manual copying is easiest, but there are 300+ open
>  >  issues in PDFBox. Can we get some sort of an export from SourceForge?
>
>  I found an export feature in SourceForge and exported all PDFBox,
>  FontBox, and JempBox issues (both open and closed) to
>  http://svn.apache.org/repos/asf/incubator/pdfbox/trunk/migration/.
>
>  The data probably needs some massaging before it can be imported to Jira.

I cleaned all control characters from the exports and wrote an XSL
transformation (sf2asf.xslt) that turns the exported issues into
something that can be imported into Jira.
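
For reference, the control character cleanup could look roughly like
the sketch below. This is not necessarily the exact command I used. It
assumes the goal is to drop the characters that XML 1.0 does not allow
while keeping tab, line feed, and carriage return. The function name
strip_control_chars is made up for illustration.

```shell
# Sketch only: remove control characters that XML 1.0 forbids.
# Kept on purpose: tab (011), line feed (012), carriage return (015).
strip_control_chars() {
    LC_ALL=C tr -d '\000-\010\013\014\016-\037'
}

# Hypothetical usage: clean every export in place via a temporary file.
# for f in */*.xml; do
#     strip_control_chars < "$f" > "$f.tmp" && mv "$f.tmp" "$f"
# done
```

Setting LC_ALL=C makes tr work on raw bytes, so the result does not
depend on the locale of the machine running it.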

See the resulting CSV file at
http://people.apache.org/~jukka/pdfbox.csv. I generated the CSV file
with the following command and added the column headers manually:

    for f in */*.xml; do java -jar xalan.jar -in $f -xsl sf2asf.xslt; done
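
If the export needs to be regenerated, the manual header step could be
folded into the loop along these lines. This is only a sketch: the
HEADER value is a placeholder I made up (the real column names are
whatever the Jira import expects), and build_csv and transform_one are
illustrative names, not part of the migration directory.

```shell
# Placeholder header: the actual column names are an assumption here.
HEADER='Date,Type,Component,Summary,Description'

# Wrapper around the XSLT step, same Xalan invocation as in the loop above.
transform_one() {
    java -jar xalan.jar -in "$1" -xsl sf2asf.xslt
}

# Print the header once, then one transformed chunk per exported XML file.
build_csv() {
    printf '%s\n' "$HEADER"
    for f in */*.xml; do
        [ -e "$f" ] || continue   # skip the literal pattern if nothing matches
        transform_one "$f"
    done
}

# Hypothetical usage, run from the directory holding the exports:
# build_csv > pdfbox.csv
```

Quoting "$f" also keeps the loop working if an export file name ever
contains a space.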

The generated CSV file contains only the issues that are open in
SourceForge. It maps the submission date, issue type, component, and
summary metadata reasonably well, and tries to put all other relevant
information into the description field, with links to the original
SourceForge issue.

Any comments or wishes about the import? Unless anyone objects, I plan
to move forward with the import early next week.

>  I also set up separate pdfbox, fontbox, and jempbox components for the
>  PDFBOX project in Jira.

I further updated the list of components based on the SourceForge
settings. We now have:

    FontBox
    JempBox
    Lucene
    Parsing
    PDFReader
    PDModel
    PDModel.AcroForm
    Text extraction
    Utilities
    Writing

BR,

Jukka Zitting
