Hi, On Sat, Mar 8, 2008 at 4:27 PM, Jukka Zitting <[EMAIL PROTECTED]> wrote: > On Fri, Feb 29, 2008 at 8:06 AM, Jukka Zitting <[EMAIL PROTECTED]> wrote: > > For 3 I guess manual copying is easiest, but there are 300+ open > > issues in PDFBox. Can we get some sort of an export from SourceForge? > > I found an export feature in SourceForge and exported all PDFBox, > FontBox, and JempBox issues (both open and closed) to > http://svn.apache.org/repos/asf/incubator/pdfbox/trunk/migration/. > > The data probably needs some massaging before it can be imported to Jira.
I cleaned all control characters from the exports and wrote an XSL transformation (sf2asf.xslt) for turning the exported issues to something that can be imported to Jira. See the resulting CSV file at http://people.apache.org/~jukka/pdfbox.csv. I generated the CSV file with "for f in */*.xml; do java -jar xalan.jar -in $f -xsl sf2asf.xslt; done" and added the column headers manually. The generated CSV import only contains the open issues in SourceForge. It maps the submission date, issue type, component, and summary metadata reasonably well, and tries to put all other relevant information to the description field with links to the original SourceForge issue. Any comments or wishes about the import? Unless anyone objects, I plan to move forward with the import early next week. > I also set up separate pdfbox, fontbox, and jempbox components for the > PDFBOX project in Jira. I further updated the list of components based on the SourceForge settings. We now have: FontBox JempBox Lucene Parsing PDFReader PDModel PDModel.AcroForm Text extraction Utilities Writing BR, Jukka Zitting
